Variation in the expression level and activity of genes involved in drug disposition and action (“pharmacogenes”) can affect drug response and toxicity, especially when in tissues of pharmacological importance. Previous studies have relied primarily on microarrays to understand gene expression differences, or have focused on a single tissue or small number of samples. The goal of this study was to use RNA-seq to determine the expression levels and alternative splicing of 389 PGRN pharmacogenes across four tissues (liver, kidney, heart and adipose) and lymphoblastoid cell lines (LCLs), which are used widely in pharmacogenomics studies. Analysis of RNA-seq data from 139 different individuals across the 5 tissues (20–45 individuals per tissue type) revealed substantial variation in both expression levels and splicing across samples and tissue types. This in-depth exploration also revealed 183 splicing events in pharmacogenes that were previously not annotated. Overall, this study serves as a rich resource for the research community to inform biomarker and drug discovery and use.
RNA-seq; Pharmacogenes; Splicing; Biomarkers; JuncBASE; Pharmacogenomics of Research Network
CYP2D6 metabolizes nearly 25% of clinically used drugs. Genetic polymorphisms cause large inter-individual variability in CYP2D6 enzyme activity and are currently used as biomarker to predict CYP2D6 metabolizer phenotype. Previously, we had identified a region 115 kb downstream of CYP2D6 as enhancer for CYP2D6, containing two completely linked single nucleotide polymorphisms (SNPs), rs133333 and rs5758550, associated with enhanced transcription. However, the enhancer effect on CYP2D6 expression, and the causative variant, remained to be ascertained. To characterize the CYP2D6 enhancer element, we applied chromatin conformation capture combined with the next-generation sequencing (4C assays) and chromatin immunoprecipitation with P300 antibody, in HepG2 and human primary culture hepatocytes. The results confirmed the role of the previously identified enhancer region in CYP2D6 expression, expanding the number of candidate variants to three highly linked SNPs (rs133333, rs5758550 and rs4822082). Among these, only rs5758550 demonstrated regulating enhancer activity in a reporter gene assay. Use of clustered regularly interspaced short palindromic repeats mediated genome editing in HepG2 cells targeting suspected enhancer regions decreased CYP2D6 mRNA expression by 70%, only upon deletion of the rs5758550 region. These results demonstrate robust effects of both the enhancer element and SNP rs5758550 on CYP2D6 expression, supporting consideration of rs5758550 for CYP2D6 genotyping panels to yield more accurate phenotype prediction.
We used RNA sequencing to analyze transcript profiles of ten autopsy brain regions from ten subjects. RNA sequencing techniques were designed to detect both coding and non-coding RNA, splice isoform composition, and allelic expression. Brain regions were selected from five subjects with a documented history of smoking and five non-smokers. Paired-end RNA sequencing was performed on SOLiD instruments to a depth of >40 million reads, using linearly amplified, ribosomally depleted RNA. Sequencing libraries were prepared with both poly-dT and random hexamer primers to detect all RNA classes, including long non-coding (lncRNA), intronic and intergenic transcripts, and transcripts lacking poly-A tails, providing additional data not previously available. The study was designed to generate a database of the complete transcriptomes in brain region for gene network analyses and discovery of regulatory variants.
Of 20,318 protein coding and 18,080 lncRNA genes annotated from GENCODE and lncipedia, 12 thousand protein coding and 2 thousand lncRNA transcripts were detectable at a conservative threshold. Of the aligned reads, 52 % were exonic, 34 % intronic and 14 % intergenic. A majority of protein coding genes (65 %) was expressed in all regions, whereas ncRNAs displayed a more restricted distribution. Profiles of RNA isoforms varied across brain regions and subjects at multiple gene loci, with neurexin 3 (NRXN3) a prominent example. Allelic RNA ratios deviating from unity were identified in > 400 genes, detectable in both protein-coding and non-coding genes, indicating the presence of cis-acting regulatory variants. Mathematical modeling was used to identify RNAs stably expressed in all brain regions (serving as potential markers for normalizing expression levels), linked to basic cellular functions. An initial analysis of differential expression analysis between smokers and nonsmokers implicated a number of genes, several previously associated with nicotine exposure.
RNA sequencing identifies distinct and consistent differences in gene expression between brain regions, with non-coding RNA displaying greater diversity between brain regions than mRNAs. Numerous RNAs exhibit robust allele selective expression, proving a means for discovery of cis-acting regulatory factors with potential clinical relevance.
Electronic supplementary material
The online version of this article (doi:10.1186/s12864-015-2207-8) contains supplementary material, which is available to authorized users.
RNA sequencing; Brain regions; Differential expression; Allelic expression imbalance; Isoform fraction; Non-coding RNA
Blood-brain barrier; solute carrier transporter; drug transporter; OCT3; organic cation transporter; MATE1; multidrug and toxin extrusion protein; expression profiling
mRNA translation into proteins is highly regulated, but the role of mRNA isoforms, noncoding RNAs (ncRNAs), and genetic variants remains poorly understood. mRNA levels on polysomes have been shown to correlate well with expressed protein levels, pointing to polysomal loading as a critical factor. To study regulation and genetic factors of protein translation we measured levels and allelic ratios of mRNAs and ncRNAs (including microRNAs) in lymphoblast cell lines (LCL) and in polysomal fractions. We first used targeted assays to measure polysomal loading of mRNA alleles, confirming reported genetic effects on translation of OPRM1 and NAT1, and detecting no effect of rs1045642 (3435C>T) in ABCB1 (MDR1) on polysomal loading while supporting previous results showing increased mRNA turnover of the 3435T allele. Use of high-throughput sequencing of complete transcript profiles (RNA-Seq) in three LCLs revealed significant differences in polysomal loading of individual RNA classes and isoforms. Correlated polysomal distribution between protein-coding and non-coding RNAs suggests interactions between them. Allele-selective polysome recruitment revealed strong genetic influence for multiple RNAs, attributable either to differential expression of RNA isoforms or to differential loading onto polysomes, the latter defining a direct genetic effect on translation. Genes identified by different allelic RNA ratios between cytosol and polysomes were enriched with published expression quantitative trait loci (eQTLs) affecting RNA functions, and associations with clinical phenotypes. Polysomal RNA-Seq combined with allelic ratio analysis provides a powerful approach to study polysomal RNA recruitment and regulatory variants affecting protein translation.
Membrane transporters control the influx and efflux of endogenous and xenobiotic substrates, including nutrients and drugs, across cellular membranes.
Whole transcriptome sequencing enables simultaneous analysis of overall and allele-specific mRNA expression, and the detection of multiple RNA isoforms.
Here we characterize variation in RNA transcripts emanating from gene loci encoding transporters based on RNAseq data from 10 human brains (including cocaine overdose and normal brain tissues) and 12 normal livers.
mRNA expression was detected in 65% of transporter genes in either tissue, with many genes generating multiple mRNA transcripts. Single-nucleotide polymorphisms within transporters with previous evidence for pharmacogenomics impact were detected. We also identified noncoding RNAs in the vicinity of transporter genes with potential regulatory functions.
The results obtained with RNAseq provide detailed information on transporter mRNA expression at the molecular level, affording new avenues for the study of membrane transport, with relevance to drug efficacy and toxicity.
alternative splicing; gene expression; RNAseq; transporters
The 5-hydroxytryptamine 2A receptor, encoded by HTR2A, is a major post-synaptic target for serotonin in the human brain and a therapeutic drug target. Despite hundreds of genetic associations investigating HTR2A polymorphisms in neuropsychiatric disorders and therapies, the role of genetic HTR2A variability in health and disease remains uncertain.
To discover and characterize regulatory HTR2A variants, we sequenced whole transcriptomes from ten human brain regions with massively-parallel RNA sequencing and measured allelic expression of multiple HTR2A mRNA transcript variants. Following discovery of functional variants, we further characterized their impact on genetic expression in vitro.
Three polymorphisms modulate the use of novel alternative exons and untranslated regions (UTRs), changing expression of RNA and protein. The frequent promoter variant rs6311, widely implicated in human neuropsychiatric disorders, decreases usage of an upstream transcription start site encoding a longer 5′UTR with greater translation efficiency. rs76665058, located in an extended 3′UTR and unique to individuals of African descent, modulates allelic HTR2A mRNA expression. The third SNP, unannotated and present in only a single subject, directs alternative splicing of exon 2. Targeted analysis of HTR2A in the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) study reveals associations between functional variants and depression severity or citalopram response.
Regulatory polymorphisms modulate HTR2A mRNA expression in an isoform-specific manner, directing the usage of novel untranslated regions and alternative exons. These results provide a foundation for delineating the role of HTR2A and serotonin signaling in CNS disorders.
serotonin; 5-HT2A; HTR2A; schizophrenia; depression; mRNA expression
Measuring allelic RNA expression ratios is a powerful approach for detecting cis-acting regulatory variants, RNA editing, loss of heterozygosity in cancer, copy number variation, and allele-specific epigenetic gene silencing. Whole transcriptome RNA sequencing (RNA-Seq) has emerged as a genome-wide tool for identifying allelic expression imbalance (AEI), but numerous factors bias allelic RNA ratio measurements. Here, we compare RNA-Seq allelic ratios measured in nine different human brain regions with a highly sensitive and accurate SNaPshot measure of allelic RNA ratios, identifying factors affecting reliable allelic ratio measurement. Accounting for these factors, we subsequently surveyed the variability of RNA editing across brain regions and across individuals.
We find that RNA-Seq allelic ratios from standard alignment methods correlate poorly with SNaPshot, but applying alternative alignment strategies and correcting for observed biases significantly improves correlations. Deploying these methods on a transcriptome-wide basis in nine brain regions from a single individual, we identified genes with AEI across all regions (SLC1A3, NHP2L1) and many others with region-specific AEI. In dorsolateral prefrontal cortex (DLPFC) tissues from 14 individuals, we found evidence for frequent regulatory variants affecting RNA expression in tens to hundreds of genes, depending on stringency for assigning AEI. Further, we find that the extent and variability of RNA editing is similar across brain regions and across individuals.
These results identify critical factors affecting allelic ratios measured by RNA-Seq and provide a foundation for using this technology to screen allelic RNA expression on a transcriptome-wide basis. Using this technology as a screening tool reveals tens to hundreds of genes harboring frequent functional variants affecting RNA expression in the human brain. With respect to RNA editing, the similarities within and between individuals leads us to conclude that this post-transcriptional process is under heavy regulatory influence to maintain an optimal degree of editing for normal biological function.
RNA-Seq; Whole transcriptome; Allele expression; mRNA expression; Functional genetics; Regulatory polymorphism; eQTL; Read alignment; Next generation sequencing; Bioinformatics
Sulfamethoxazole (SMX) is a commonly used antibiotic for prevention of infectious diseases associated with HIV/AIDS and immune-compromised states. SMX-induced hypersensitivity is an idiosyncratic cutaneous drug reaction with genetic components. Here, we tested association of candidate genes involved in SMX bioactivation and antioxidant defense with SMX-induced hypersensitivity.
Seventy seven single nucleotide polymorphisms (SNPs) from 14 candidate genes were genotyped and assessed for association with SMX-induced hypersensitivity, in a cohort of 171 HIV/AIDS patients. SNP rs761142 T > G, in glutamate cysteine ligase catalytic subunit (GCLC), was significantly associated with SMX-induced hypersensitivity, with an adjusted p value of 0.045. This result was replicated in a second cohort of 249 patients (p = 0.025). In the combined cohort, heterozygous and homozygous carriers of the minor G allele were at increased risk of developing hypersensitivity (GT vs TT, odds ratio = 2.2, 95% CL 1.4-3.7, p = 0.0014; GG vs TT, odds ratio = 3.3, 95% CL 1.6 – 6.8, p = 0.0010). Each minor allele copy increased risk of developing hypersensitivity 1.9 fold (95% CL 1.4 – 2.6, p = 0.00012). Moreover, in 91 human livers and 84 B-lymphocytes samples, SNP rs761142 homozygous G allele carriers expressed significantly less GCLC mRNA than homozygous TT carriers (p < 0.05).
rs761142 in GCLC was found to be associated with reduced GCLC mRNA expression and with SMX-induced hypersensitivity in HIV/AIDS patients. Catalyzing a critical step in glutathione biosynthesis, GCLC may play a broad role in idiosyncratic drug reactions.
Idiosyncratic drug reaction; Sulfamethoxazole; Hypersensitivity; Glutamate cysteine ligase catalytic subunit (GCLC); Association; HIV/AIDS
Polymorphisms in and around the Cholesteryl Ester Transfer Protein (CETP) gene have been associated with HDL levels, risk for coronary artery disease (CAD), and response to therapy. The mechanism of action of these polymorphisms has yet to be defined. We used mRNA allelic expression and splice isoform measurements in human liver tissues to identify the genetic variants affecting CETP levels. Allelic CETP mRNA expression ratios in 56 human livers were strongly associated with several variants 2.5–7 kb upstream of the transcription start site (e.g., rs247616 p = 6.4×10−5, allele frequency 33%). In addition, a common alternatively spliced CETP isoform lacking exon 9 (Δ9), has been shown to prevent CETP secretion in a dominant-negative manner. The Δ 9 expression ranged from 10 to 48% of total CETP mRNA in 94 livers. Increased formation of this isoform was exclusively associated with an exon 9 polymorphism rs5883-C>T (p = 6.8×10−10) and intron 8 polymorphism rs9930761-T>C (5.6×10−8) (in high linkage disequilibrium with allele frequencies 6–7%). rs9930761 changes a key splicing branch point nucleotide in intron 8, while rs5883 alters an exonic splicing enhancer sequence in exon 9.
The effect of these polymorphisms was evaluated in two clinical studies. In the Whitehall II study of 4745 subjects, both rs247616 and rs5883T/rs9930761C were independently associated with increased HDL-C levels in males with similar effect size (rs247616 p = 9.6×10−28 and rs5883 p = 8.6×10−10, adjusted for rs247616). In an independent multiethnic US cohort of hypertensive subjects with CAD (INVEST-GENE), rs5883T/rs9930761C alone were significantly associated with increased incidence of MI, stroke, and all-cause mortality in males (rs5883: OR 2.36 (CI 1.29–4.30), p = 0.005, n = 866). These variants did not reach significance in females in either study. Similar to earlier results linking low CETP activity with poor outcomes in males, our results suggest genetic, sex-dependent CETP splicing effects on cardiovascular risk by a mechanism independent of circulating HDL-C levels.
The dopamine receptor D2 (encoded by DRD2) is implicated in susceptibility to mental disorders and cocaine abuse, but mechanisms responsible for this relationship remain uncertain. DRD2 mRNA exists in two main splice isoforms with distinct functions: D2 long (D2L) and D2 short (D2S, lacking exon 6), expressed mainly postsynaptically and presynaptically, respectively. Two intronic single-nucleotide polymorphisms (SNPs rs2283265 (intron 5) and rs1076560 (intron 6)) in high linkage disequilibrium (LD) with each other have been reported to alter D2S/D2L splicing and several behavioral traits in human subjects, such as memory processing. To assess the role of DRD2 variants in cocaine abuse, we measured levels of D2S and D2L mRNA in human brain autopsy tissues (prefrontal cortex and putamen) obtained from cocaine abusers and controls, and genotyped a panel of DRD2 SNPs (119 abusers and 95 controls). Robust effects of rs2283265 and rs1076560 on reducing formation of D2S relative to D2L were confirmed. The minor alleles of rs2283265/rs1076560 were considerably more frequent in Caucasians (18%) compared with African Americans (7%). Also, in Caucasians, rs2283265/rs1076560 minor alleles were significantly overrepresented in cocaine abusers compared with controls (rs2283265: 25 to 9%, respectively; p=0.001; OR=3.4 (1.7–7.1)). Several SNPs previously implicated in diverse clinical association studies are in high LD with rs2283265/rs1076560 and could have served as surrogate markers. Our results confirm the role of rs2283265/rs1076560 in D2 alternative splicing and support a strong role in susceptibility to cocaine abuse.
alternative splicing; cocaine; dopamine; DRD2; D2S; human; addiction and substance abuse; dopamine; neurogenetics; psychostimulants; drd2; d2s; human; alternative splicing; cocaine
CHRNA5, encoding the nicotinic α5 subunit, is implicated in multiple disorders, including nicotine addiction and lung cancer. Previous studies demonstrate significant associations between promoter polymorphisms and CHRNA5 mRNA expression, but the responsible sequence variants remain uncertain. To search for cis-regulatory variants, we measured allele-specific mRNA expression of CHRNA5 in human prefrontal cortex autopsy tissues and scanned the CHRNA5 locus for regulatory variants. A cluster of six frequent single nucleotide polymorphisms (rs1979905, rs1979906, rs1979907, rs880395, rs905740, and rs7164030), in complete linkage disequilibrium, fully account for a >2.5-fold allelic expression difference and a fourfold increase in overall CHRNA5 mRNA expression. This proposed enhancer region resides more than 13 kilobases upstream of the CHRNA5 transcription start site. The same upstream variants failed to affect CHRNA5 mRNA expression in peripheral blood lymphocytes, indicating tissue-specific gene regulation. Other promoter polymorphisms were also correlated with overall CHRNA5 mRNA expression in the brain, but were inconsistent with allelic mRNA expression ratios, a robust and proximate measure of cis-regulatory variants. The enhancer region and the nonsynonymous polymorphism rs16969968 generate three main haplotypes that alter the risk of developing nicotine dependence. Ethnic differences in linkage disequilibrium across the CHRNA5 locus require consideration of the upstream enhancer variants when testing clinical associations.
Nicotinic receptor; alpha5 subunit; gene expression; nicotine dependence; lung cancer; enhancer
CHRNA5, encoding the nicotinic α5 subunit, is implicated in multiple disorders, including nicotine addiction and lung cancer. Previous studies demonstrate significant associations between promoter polymorphisms and CHRNA5 mRNA expression, but the responsible sequence variants remain uncertain. To search for cis-regulatory variants, we measured allele-specific mRNA expression of CHRNA5 in human prefrontal cortex autopsy tissues and scanned the CHRNA5 locus for regulatory variants. A cluster of six frequent single-nucleotide polymorphisms (rs1979905, rs1979906, rs1979907, rs880395, rs905740, and rs7164030), in complete linkage disequilibrium (LD), fully account for a >2.5-fold allelic expression difference and a fourfold increase in overall CHRNA5 mRNA expression. This proposed enhancer region resides more than 13 kilobases upstream of the CHRNA5 transcription start site. The same upstream variants failed to affect CHRNA5 mRNA expression in peripheral blood lymphocytes, indicating tissue-specific gene regulation. Other promoter polymorphisms were also correlated with overall CHRNA5 mRNA expression in the brain, but were inconsistent with allelic mRNA expression ratios, a robust and proximate measure of cis-regulatory variants. The enhancer region and the nonsynonymous polymorphism rs16969968 generate three main haplotypes that alter the risk of developing nicotine dependence. Ethnic differences in LD across the CHRNA5 locus require consideration of upstream enhancer variants when testing clinical associations.
nicotinic receptor; α5 subunit; gene expression; nicotine dependence; lung cancer; enhancer
Genetic variation in mRNA expression plays a critical role in human phenotypic diversity, but it has proven difficult to detect regulatory polymorphisms - mostly single nucleotide polymorphisms (rSNPs). Additionally, variants in the transcribed region, termed here ‘structural RNA SNPs’ (srSNPs), can affect mRNA processing and turnover. Both rSNPs and srSNPs cause allelic mRNA expression imbalance (AEI) in heterozygous individuals. We have applied a rapid and accurate AEI methodology for testing 42 genes implicated in human diseases and drug response, specifically cardiovascular and CNS diseases, and affecting drug metabolism and transport. Each gene was analyzed in physiologically relevant human autopsy tissues, including brain, heart, liver, intestines, and lymphocytes. Substantial AEI was observed in ∼55% of the surveyed genes. Focusing on cardiovascular candidate genes in human hearts, AEI analysis revealed frequent cis-acting regulatory factors in SOD2 and ACE mRNA expression, having potential clinical significance. SNP scanning to locate regulatory polymorphisms in a number of genes failed to support several previously proposed promoter SNPs discovered with use of reporter gene assays in heterologous tissues, while srSNPs appear more frequent than expected. Computational analysis of mRNA folding indicates that ∼90% of srSNPs affects mRNA folding, and hence potentially function. Our results indicate that both rSNPs and srSNPs represent a still largely untapped reservoir of variants that contribute to human phenotypic diversity.
With recent advances in understanding of the neuroscience of risk taking, attention is now turning to genetic factors that may contribute to individual heterogeneity in risk attitudes. In this paper we test for genetic associations with risk attitude measures derived from both the psychology and economics literature. To develop a long-term prospective study, we first evaluate both types of risk attitudes and find that the economic and psychological measures are poorly correlated, suggesting that different genetic factors may underlie human response to risk faced in different behavioral domains. We then examine polymorphisms in a spectrum of candidate genes that affect neurotransmitter systems influencing dopamine regulation or are thought to be associated with risk attitudes or impulsive disorders. Analysis of the genotyping data identified two single nucleotide polymorphisms (SNPs) in the gene encoding the alpha 4 nicotine receptor (CHRNA4, rs4603829 and rs4522666) that are significantly associated with harm avoidance, a risk attitude measurement drawn from the psychology literature. Novelty seeking, another risk attitude measure from the psychology literature, is associated with several COMT (catechol-O-methyl transferase) SNPs while economic risk attitude measures are associated with several VMAT2 (vesicular monoamine transporter) SNPs, but the significance of these associations did not withstand statistical adjustment for multiple testing and requires larger cohorts. These exploratory results provide a starting point for understanding the genetic basis of risk attitudes by considering the range of methods available for measuring risk attitudes and by searching beyond the traditional direct focus on dopamine and serotonin receptor and transporter genes.
The voltage-dependent L-type calcium channel α-subunit 1c (Cav1.2, CACNA1C) undergoes extensive mRNA splicing, leading to numerous isoforms with different functions. L-type calcium channel blockers are used in the treatment of hypertension and arrhythmias, but response varies between individuals. We have studied the interindividual variability in mRNA expression and splicing of CACNA1C, in 65 heart tissue samples, taken from heart transplant recipients.
Splice variants were measured quantitatively by polymerase chain reaction in 12 splicing loci of CACNA1C mRNA. To search for functional cis-acting polymorphisms, we determined allelic expression ratios for total CACNA1C mRNA and several splice variants using marker single nucleotide polymorphisms in exon 4 and exon 30.
Total CACNA1C mRNA levels varied ∼50-fold. Substantial splicing occurred in six loci generating two or more splice variants, some with known functional differences. Splice patterns varied broadly between individuals. Two heart tissues expressed predominantly the dihydropyridine-sensitive smooth muscle isoform of CACNA1C (containing exon 8), rather than the cardiac isoform (containing exon 8a). Lack of significant allelic expression imbalance, observed with total mRNA and several splice variants, argued against CACNA1C polymorphisms as a cause of variability. Taken together, highly variable splicing can cause profound phenotypic variations of CACNA1C function, potentially associated with disease susceptibility and response to L-type calcium channel blockers.
cis-acting polymorphism; L-type calcium channel α-subunit 1c; mRNA splicing
Variants in numerous genes are thought to affect the success or failure of cancer chemotherapy. Interindividual variability can result from genes involved in drug metabolism and transport, drug targets (receptors, enzymes, etc), and proteins relevant to cell survival (e.g., cell cycle, DNA repair, and apoptosis). The purpose of the current study is to establish a flexible, cost-effective, high-throughput genotyping platform for candidate genes involved in chemoresistance and -sensitivity, and treatment outcomes.
We have adopted SNPlex for genotyping 432 single nucleotide polymorphisms (SNPs) in 160 candidate genes implicated in response to anticancer chemotherapy.
The genotyping panels were applied to 39 patients with chronic lymphocytic leukemia undergoing flavopiridol chemotherapy, and 90 patients with colorectal cancer. 408 SNPs (94%) produced successful genotyping results. Additional genotyping methods were established for polymorphisms undetectable by SNPlex, including multiplexed SNaPshot for CYP2D6 SNPs, and PCR amplification with fluorescently labeled primers for the UGT1A1 promoter (TA)nTAA repeat polymorphism.
This genotyping panel is useful for supporting clinical anticancer drug trials to identify polymorphisms that contribute to interindividual variability in drug response. Availability of population genetic data across multiple studies has the potential to yield genetic biomarkers for optimizing anticancer therapy.