This study reports the results of a genome-wide screening for alternative TSS usage between normal and neoplastic colorectal samples. In total, we identified nine genes with alternative TSS usage and validated them in an independent cohort. A subset of the candidates was validated by qRT-PCR on the RNA used for array analysis and on RNA from LCM samples. TF binding analysis suggested that p53 and SOX9 might be responsible for altering the TSS use as they are both upregulated in cancer and have enriched binding sites in the promoters associated with the TSS preferred by the neoplastic cells. Disruption of Wnt signaling using two different CRC cell lines further showed that perturbed Wnt-signaling may also play a role in regulating alternative TSS usage of
OSBPL1A and
TRAK1, maybe through SOX9 which is known to be a Wnt-regulated TF [
15].
The vast majority of scientific reports commonly refer to individual genes as one entity encoding a predominant transcript. This is a drastic oversimplification considering that experimental evidence supports the existence of at least two functional promoters for > 50% of all genes [
25,
26], and, furthermore, it has the unfortunate consequence that potentially important transcriptional variants may be ignored. Such transcript diversity is generated by various mechanisms such as alternative splicing, alternative TSS usage and alternative polyadenylation usage, which generate families of transcripts from a single gene locus. Additionally, the use of DNA methylation appears to contribute to differential usage of alternative promoters in multiple tissue types [
27] as well as in cancer [
28]. We have previously detected cancer specific alternative splicing in colorectal, prostate and bladder cancer [
29], and also shown that disruption of Wnt signaling can change the splicing pattern in colorectal, lung and gastric cancer [
6]. The findings in this paper extend these previous observations by providing specific examples of transcript isoforms that arise through tumor-specific alternative TSS usage that occur not only in colorectal, but also in a wide range of other cancers. We have not addressed potential alternative splicing of
TCF12, OSBPLA1 and
TRAK1 in this study, but have shown that the exon expression pattern and qPCR validation is consistent with alternative TSS. In
TCF12 and
OSBPL1A, the expression follows the pattern of already described short and long transcript isoforms, and for alternative splicing to explain this pattern, novel extreme 5' start exons would have to be included in the transcript of both genes followed by alternative splicing of at least eight otherwise constituve spliced exons, a scenario we consider highly unlikely.
For the transcriptional changes to impact tumor cell growth, it is crucial that they translate into protein changes. We used two different strategies to address this issue. First, IHC analysis showed a significant difference in the total protein level of TCF12 and OSBPL1A between normal and cancer samples consistent with the mRNA expression pattern we observed. Furthermore, we found a correlation between the total TCF12 protein expression level and progression free survival. We did not perform IHC analysis of TRAK1 as a recent study already reported that elevated levels of TRAK1 are associated with poor prognosis in CRC patients [
30]. Second,
in silico analysis was used to predict the impact of an altered TSS on protein function, and we found that OSBPL1A has 31 phosphorylation sites and protein interaction domains that are unique to the long isoform, indicating a loss of function for the short isoforms. The long TCF12 variant encodes 10 glycosylation and Yin-Yang sites along with 27 phosphorylation sites not present in the short isoform. However, both isoforms contain DNA binding and dimerization domains, indicating separate functions of the isoforms, and not a simple dominant negative function of the short isoform. Indeed, both the long and short isoform of the TCF12 have previously been described as having distinct functions in thymocyte development [
31,
32]. Deregulation of
Tcf12 has recently been causally implicated in CRC in a DNA transposon based forward genetic screen in a mouse model. Interestingly, in this study,
Tcf12 was the fourth most commonly mutated gene locus (
Apc being the most commonly mutated gene) [
33]. This clearly indicates that
TCF12 could have an important effect in CRC development.
In silico predictions for the remaining candidate genes indicated diverse functional differences such as altered protein localization, changes in protein-protein interaction domains and differences in translational regulation and efficiency.
The selective use of alternative TSSs has been observed in different cell types, tissue types and developmental stages [
8], however, the mechanism regulating the alternative TSS usage is often less well described. Aberrant TSS usage has also been linked to cancer, and
in vitro studies of hypoxia in a CRC cell line revealed hundreds of genes with altered TSSs [
34]. To elaborate on the possible mechanisms regulating the alternative TSS usage in CRC, we combined TF binding site enrichment analysis and TF expression analysis and showed that several TFs were dysregulated in tumors along with having enriched binding sites in the TSS regions. Furthermore, we used cell line models to demonstrate that abrogation of Wnt signalling leads to altered TSS usage for
OSBPL1A and
TRAK1, showing that the major pathway in CRC development could be a key regulator of TSS usage.