Our study provides the first comprehensive insight into the transcriptome of brain tissue affected by Alzheimer's disease. Using a whole transcriptome sequencing technique (RNA-Seq), we were able to identify the levels of differentially expressed genes and establish genes with alternative promoter usage and splicing patterns that changed in association with neurodegeneration. Moreover, comparative analysis of samples derived from different brain regions produced an increased molecular resolution for our analysis. This revealed that the frontal and temporal lobes of AD brains not only differed in the quantitative composition of the genes expressed but also showed lobe-specific alternations in transcript assembly.
For whole transcriptome sequencing, we used an Illumina Genome Analyser II with 36 bp sequence reads length. We obtained ~14×10
6 sequence reads per sample, which has been previously reported to deliver sufficient sequence coverage for transcriptome profiling
[13]. Our rate of 90-92% of reads that map to the reference genome met quality standards of the RNA-Seq technique
[29]. An estimation of the number of reads covering chromosome 1 (1,937,546 reads on average) was approximately 12.9% of all reads generated per transcriptome (14,974,824 reads on average). Human chromosome 1 comprises 8% of the human genome and contains 3,141 genes, or 13.6% of all annotated genes
[30]. Hence, we conclude that our mRNA-Seq data provide good representation of expressed genes in the human genome.
Cufflinks analysis of gene isoform expression levels, alternative splicing and alternative promoter usage revealed significant differences in transcriptome profiles between frontal and temporal lobe of the AD brain. These variations might reflect temporal and spatial differences in the progression of AD neuropathology across the aging brain. Widespread neuronal loss and a presence of the intraneuronal neurofibrillary tangles (NFTs) and the extracellular neuritic or senile plaques (NPs) are key features of the AD neuropathology. The main components of NPs are peptides of varying length collectively described as beta-amyloid whereas NFTs are mainly composed of paired helical filaments of a hyperphosphorylated form of the microtubule-associated protein tau (MAPT)
[31],
[32]. NFTs first arise in the entorhinal cortex of the medial temporal lobe and then spread toward the hippocampal CA1 region. NFTs formation then progresses to the temporal and frontal neocortices, and finally affects primary cortices
[33]. Thus the temporal and frontal lobe samples used in this study might approximately represent brain regions at distinct stages of the neurodegeneration process, with the temporal lobe affected first, followed by the frontal lobe of the brain.
The tissue-specific enrichment for gene ontology processes suggest region-specific, sequential progression of brain tissue neurodegeneration, with the temporal lobe being affected earlier than the frontal part of the cortex
[33]. Consequently, neuronal activity in the frontal lobe may be more vigorous at the time of sample donation. This might count for over-representation of GO terms such as regulation of synaptic plasticity and negative regulation of neuronal apoptosis. In contrast, neurons of the temporal lobe might exist in a more advanced phase of functional deterioration. This in turn is reflected by the more non-neuronally specific transcriptome patterns seen in samples derived from the total brain in this study. We do observe an over-representation of genes related to apoptosis that is consistent with previous reports, however there was no evidence in our analysis for AD-associated changes in the immune response
[34].
Many of the changes we observed in gene expression between normal and AD brains were similar to those reported previously. However, some differences were noted. This lack of concordance among our RNA-Seq transcriptome data set and previously reported gene expression profiles is likely to stem from inherent limitations in microarray systems. For example, background levels of hybridization (i.e. hybridization to a probe that occurs irrespective of the corresponding transcript's expression level) limit the accuracy of microarray expression measurements, particularly for transcripts present at low abundance. Furthermore, probes differ considerably in their hybridization properties
[35]. Thus, although comparing hybridization results across arrays can identify gene expression differences among samples
[36], hybridization results from a single sample may not provide a reliable measure of relative expression for different transcripts. By contrast, the Illumina sequencing data have been described as replicable with relatively little technical variation, thus for many purposes it may suffice to sequence each mRNA sample only once. The information gained from a single lane of Illumina flow cell, as done in the present study, provides a comprehensive analysis of transcripts and enables identification with confidence of differentially expressed genes
[11],
[37].
Moreover, validation techniques such as quantitative PCR (qPCR)
[38],
[39] and spike-in RNA
[29] have demonstrated that RNA-Seq is extremely accurate. Accordingly, a false positive rate <2% has been demonstrated for this technique
[40]. As recently reported by Marioni et al., qPCR results agreed more closely with Illumina sequencing results than with microarrays
[11].
Regarding quantification of gene expression, Cufflinks analysis of RNA-Seq data allowed us to dissect expression of individual genes into quantification of particular mRNA isoforms contributing to the final cumulative value of gene expression. To our knowledge, this is the first report where quantitative information about particular splice variants at a genome-wide scale has been generated for different anatomical segments of normal and AD brains. Thus, our study creates a useful data set supplementing previous microarray-generated information, which lacked isoform-specific resolution of gene expression
[9],
[41].
Despite the magnitude of the
APOE e4 risk effect and a possible mechanistic link with amyloid beta (Aβ) pathology
[34],
[42],
[43], it is still far from clear how
APOE e4 is involved in AD pathogenesis
[44]. Interestingly, the
APOE genotype in the case of AD samples used in this study was e3, which is considered to have no effect on AD onset. This suggests that the observed alternative promoter and TSS usage during
APOE expression in the AD temporal lobe might be independent of the Cys
![[implies]](/corehtml/pmc/pmcents/x21D2.gif)
Arg substitution at position 112. Following this line of reasoning, differential
APOE expression patterns - as indicated in this report - might be independent of the amyloid beta aggregation pathway in the course of Alzheimer's disease. Indeed, previous observations of alternative splicing in AD brains for glutamate transporter
[45],
PIN1 [46], estrogen receptor alpha
[47] and the
APOE receptor
[48] genes strongly suggest that alteration of transcriptional control for genes involved in neuronal physiology is a landmark of ongoing neurodegeneration. In light of our observations of alternative
APOE expression, the previously reported AD-specific splicing pattern of the
APOE receptor further suggests the functional relevance of lipid metabolism in the context of AD pathology
[49]. Moreover, it has previously been proposed that synthesis of ApoE might play a role in regional vulnerability of neurons in AD
[50]. How this might relate to the presence of different transcriptional variants of
APOE remains a subject for future studies.