PMCC PMCC

Search tips
Search criteria

Advanced
Results 1-9 (9)
 

Clipboard (0)
None
Journals
Year of Publication
Document Types
1.  Mapping cis- and trans-regulatory effects across multiple tissues in twins 
Nature genetics  2012;44(10):1084-1089.
Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many eQTL studies typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis-effect on expression cannot be accounted for by common cis-variants, a finding which exposes the contribution of low frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene and identify several replicating trans-variants which act predominantly in a tissue-restricted manner and may regulate the transcription of many genes.
doi:10.1038/ng.2394
PMCID: PMC3784328  PMID: 22941192
2.  Variants in MTNR1B influence fasting glucose levels 
Prokopenko, Inga | Langenberg, Claudia | Florez, Jose C | Saxena, Richa | Soranzo, Nicole | Thorleifsson, Gudmar | Loos, Ruth J F | Manning, Alisa K | Jackson, Anne U | Aulchenko, Yurii | Potter, Simon C | Erdos, Michael R | Sanna, Serena | Hottenga, Jouke-Jan | Wheeler, Eleanor | Kaakinen, Marika | Lyssenko, Valeriya | Chen, Wei-Min | Ahmadi, Kourosh | Beckmann, Jacques S | Bergman, Richard N | Bochud, Murielle | Bonnycastle, Lori L | Buchanan, Thomas A | Cao, Antonio | Cervino, Alessandra | Coin, Lachlan | Collins, Francis S | Crisponi, Laura | de Geus, Eco J C | Dehghan, Abbas | Deloukas, Panos | Doney, Alex S F | Elliott, Paul | Freimer, Nelson | Gateva, Vesela | Herder, Christian | Hofman, Albert | Hughes, Thomas E | Hunt, Sarah | Illig, Thomas | Inouye, Michael | Isomaa, Bo | Johnson, Toby | Kong, Augustine | Krestyaninova, Maria | Kuusisto, Johanna | Laakso, Markku | Lim, Noha | Lindblad, Ulf | Lindgren, Cecilia M | McCann, Owen T | Mohlke, Karen L | Morris, Andrew D | Naitza, Silvia | Orrù, Marco | Palmer, Colin N A | Pouta, Anneli | Randall, Joshua | Rathmann, Wolfgang | Saramies, Jouko | Scheet, Paul | Scott, Laura J | Scuteri, Angelo | Sharp, Stephen | Sijbrands, Eric | Smit, Jan H | Song, Kijoung | Steinthorsdottir, Valgerdur | Stringham, Heather M | Tuomi, Tiinamaija | Tuomilehto, Jaakko | Uitterlinden, André G | Voight, Benjamin F | Waterworth, Dawn | Wichmann, H-Erich | Willemsen, Gonneke | Witteman, Jacqueline C M | Yuan, Xin | Zhao, Jing Hua | Zeggini, Eleftheria | Schlessinger, David | Sandhu, Manjinder | Boomsma, Dorret I | Uda, Manuela | Spector, Tim D | Penninx, Brenda WJH | Altshuler, David | Vollenweider, Peter | Jarvelin, Marjo Riitta | Lakatta, Edward | Waeber, Gerard | Fox, Caroline S | Peltonen, Leena | Groop, Leif C | Mooser, Vincent | Cupples, L Adrienne | Thorsteinsdottir, Unnur | Boehnke, Michael | Barroso, Inês | Van Duijn, Cornelia | Dupuis, Josée | Watanabe, Richard M | Stefansson, Kari | McCarthy, Mark I | Wareham, Nicholas J | Meigs, James B | Abecasis, Gonçalo R
Nature genetics  2008;41(1):77-81.
To identify previously unknown genetic loci associated with fasting glucose concentrations, we examined the leading association signals in ten genome-wide association scans involving a total of 36,610 individuals of European descent. Variants in the gene encoding melatonin receptor 1B (MTNR1B) were consistently associated with fasting glucose across all ten studies. The strongest signal was observed at rs10830963, where each G allele (frequency 0.30 in HapMap CEU) was associated with an increase of 0.07 (95% CI = 0.06-0.08) mmol/l in fasting glucose levels (P = 3.2 = × 10−50) and reduced beta-cell function as measured by homeostasis model assessment (HOMA-B, P = 1.1 × 10−15). The same allele was associated with an increased risk of type 2 diabetes (odds ratio = 1.09 (1.05-1.12), per G allele P = 3.3 × 10−7) in a meta-analysis of 13 case-control studies totaling 18,236 cases and 64,453 controls. Our analyses also confirm previous associations of fasting glucose with variants at the G6PC2 (rs560887, P = 1.1 × 10−57) and GCK (rs4607517, P = 1.0 × 10−25) loci.
doi:10.1038/ng.290
PMCID: PMC2682768  PMID: 19060907
3.  A Genome-Wide Metabolic QTL Analysis in Europeans Implicates Two Loci Shaped by Recent Positive Selection 
PLoS Genetics  2011;7(9):e1002270.
We have performed a metabolite quantitative trait locus (mQTL) study of the 1H nuclear magnetic resonance spectroscopy (1H NMR) metabolome in humans, building on recent targeted knowledge of genetic drivers of metabolic regulation. Urine and plasma samples were collected from two cohorts of individuals of European descent, with one cohort comprised of female twins donating samples longitudinally. Sample metabolite concentrations were quantified by 1H NMR and tested for association with genome-wide single-nucleotide polymorphisms (SNPs). Four metabolites' concentrations exhibited significant, replicable association with SNP variation (8.6×10−11
Author Summary
Physiological concentrations of metabolites—small molecules involved in biochemical processes in living systems—can be measured and used to diagnose and predict disease states. A common goal is to detect and clinically exploit statistical differences in metabolite concentrations between diseased and healthy individuals. As a basis for the design and interpretation of case-control studies, it is useful to have a characterization of metabolic diversity amongst healthy individuals, some of which stems from inter-individual genetic variation. When a single genetic locus has a sufficiently strong effect on metabolism, its genomic position can be determined by collecting metabolite concentration data and genome-wide genotype data on a set of individuals and searching for associations between the two data sets—a so-called metabolite quantitative trait locus (mQTL) study. By so tracing mQTLs, we can identify the genetic drivers of metabolism, characterize how the nature or quantity of the corresponding expressed protein(s) feeds forward to influence metabolite levels, and specify disease-predictive models that incorporate mutual dependence amongst genetics, environment, and metabolism.
doi:10.1371/journal.pgen.1002270
PMCID: PMC3169529  PMID: 21931564
A comprehensive variation map of the human metabolome identifies genetic and stable-environmental sources as major drivers of metabolite concentrations. The data suggest that sample sizes of a few thousand are sufficient to detect metabolite biomarkers predictive of disease.
We designed a longitudinal twin study to characterize the genetic, stable-environmental, and longitudinally fluctuating influences on metabolite concentrations in two human biofluids—urine and plasma—focusing specifically on the representative subset of metabolites detectable by 1H nuclear magnetic resonance (1H NMR) spectroscopy.We identified widespread genetic and stable-environmental influences on the (urine and plasma) metabolomes, with (30 and 42%) attributable on average to familial sources, and (47 and 60%) attributable to longitudinally stable sources.Ten of the metabolites annotated in the study are estimated to have >60% familial contribution to their variation in concentration.Our findings have implications for the design and interpretation of 1H NMR-based molecular epidemiology studies. On the basis of the stable component of variation quantified in the current paper, we specified a model of disease association under which we inferred that sample sizes of a few thousand should be sufficient to detect disease-predictive metabolite biomarkers.
Metabolites are small molecules involved in biochemical processes in living systems. Their concentration in biofluids, such as urine and plasma, can offer insights into the functional status of biological pathways within an organism, and reflect input from multiple levels of biological organization—genetic, epigenetic, transcriptomic, and proteomic—as well as from environmental and lifestyle factors. Metabolite levels have the potential to indicate a broad variety of deviations from the ‘normal' physiological state, such as those that accompany a disease, or an increased susceptibility to disease. A number of recent studies have demonstrated that metabolite concentrations can be used to diagnose disease states accurately. A more ambitious goal is to identify metabolite biomarkers that are predictive of future disease onset, providing the possibility of intervention in susceptible individuals.
If an extreme concentration of a metabolite is to serve as an indicator of disease status, it is usually important to know the distribution of metabolite levels among healthy individuals. It is also useful to characterize the sources of that observed variation in the healthy population. A proportion of that variation—the heritable component—is attributable to genetic differences between individuals, potentially at many genetic loci. An effective, molecular indicator of a heritable, complex disease is likely to have a substantive heritable component. Non-heritable biological variation in metabolite concentrations can arise from a variety of environmental influences, such as dietary intake, lifestyle choices, general physical condition, composition of gut microflora, and use of medication. Variation across a population in stable-environmental influences leads to long-term differences between individuals in their baseline metabolite levels. Dynamic environmental pressures lead to short-term fluctuations within an individual about their baseline level. A metabolite whose concentration changes substantially in response to short-term pressures is relatively unlikely to offer long-term prediction of disease. In summary, the potential suitability of a metabolite to predict disease is reflected by the relative contributions of heritable and stable/unstable-environmental factors to its variation in concentration across the healthy population.
Studies involving twins are an established technique for quantifying the heritable component of phenotypes in human populations. Monozygotic (MZ) twins share the same DNA genome-wide, while dizygotic (DZ) twins share approximately half their inherited DNA, as do ordinary siblings. By comparing the average extent of phenotypic concordance within MZ pairs to that within DZ pairs, it is possible to quantify the heritability of a trait, and also to quantify the familiality, which refers to the combination of heritable and common-environmental effects (i.e., environmental influences shared by twins in a pair). In addition to incorporating twins into the study design, it is useful to quantify the phenotype in some individuals at multiple time points. The longitudinal aspect of such a study allows environmental effects to be decomposed into those that affect the phenotype over the short term and those that exert stable influence.
For the current study, urine and blood samples were collected from a cohort of MZ and DZ twins, with some twins donating samples on two occasions several months apart. Samples were analysed by 1H nuclear magnetic resonance (1H NMR) spectroscopy—an untargeted, discovery-driven technique for quantifying metabolite concentrations in biological samples. The application of 1H NMR to a biological sample creates a spectrum, made up of multiple peaks, with each peak's size quantitatively representing the concentration of its corresponding hydrogen-containing metabolite.
In each biological sample in our study, we extracted a full set of peaks, and thereby quantified the concentrations of all common plasma and urine metabolites detectable by 1H NMR. We developed bespoke statistical methods to decompose the observed concentration variation at each metabolite peak into that originating from familial, individual-environmental, and unstable-environmental sources.
We quantified the variability landscape across all common metabolite peaks in the urine and plasma 1H NMR metabolomes. We annotated a subset of peaks with a total of 65 metabolites; the variance decompositions for these are shown in Figure 1. Ten metabolites' concentrations were estimated to have familial contributions in excess of 60%. The average proportion of stable variation across all extracted metabolite peaks was estimated to be 47% in the urine samples and 60% in the plasma samples; the average estimated familiality was 30% for urine and 42% for plasma. These results comprise the first quantitative variation map of the 1H NMR metabolome. The identification and quantification of substantive widespread stability provides support for the use of these biofluids in molecular epidemiology studies. On the basis of our findings, we performed power calculations for a hypothetical study searching for predictive disease biomarkers among 1H NMR-detectable urine and plasma metabolites. Our calculations suggest that sample sizes of 2000–5000 should allow reliable identification of disease-predictive metabolite concentrations explaining 5–10% of disease risk, while greater sample sizes of 5000–20 000 would be required to identify metabolite concentrations explaining 1–2% of disease risk.
1H Nuclear Magnetic Resonance spectroscopy (1H NMR) is increasingly used to measure metabolite concentrations in sets of biological samples for top-down systems biology and molecular epidemiology. For such purposes, knowledge of the sources of human variation in metabolite concentrations is valuable, but currently sparse. We conducted and analysed a study to create such a resource. In our unique design, identical and non-identical twin pairs donated plasma and urine samples longitudinally. We acquired 1H NMR spectra on the samples, and statistically decomposed variation in metabolite concentration into familial (genetic and common-environmental), individual-environmental, and longitudinally unstable components. We estimate that stable variation, comprising familial and individual-environmental factors, accounts on average for 60% (plasma) and 47% (urine) of biological variation in 1H NMR-detectable metabolite concentrations. Clinically predictive metabolic variation is likely nested within this stable component, so our results have implications for the effective design of biomarker-discovery studies. We provide a power-calculation method which reveals that sample sizes of a few thousand should offer sufficient statistical precision to detect 1H NMR-based biomarkers quantifying predisposition to disease.
doi:10.1038/msb.2011.57
PMCID: PMC3202796  PMID: 21878913
biomarker; 1H nuclear magnetic resonance spectroscopy; metabolome-wide association study; top-down systems biology; variance decomposition
Thorgeirsson, Thorgeir E. | Gudbjartsson, Daniel F. | Surakka, Ida | Vink, Jacqueline M. | Amin, Najaf | Geller, Frank | Sulem, Patrick | Rafnar, Thorunn | Esko, Tõnu | Walter, Stefan | Gieger, Christian | Rawal, Rajesh | Mangino, Massimo | Prokopenko, Inga | Mägi, Reedik | Keskitalo, Kaisu | Gudjonsdottir, Iris H. | Gretarsdottir, Solveig | Stefansson, Hreinn | Thompson, John R. | Aulchenko, Yurii S. | Nelis, Mari | Aben, Katja K. | den Heijer, Martin | Dirksen, Asger | Ashraf, Haseem | Soranzo, Nicole | Valdes, Ana M | Steves, Claire | Uitterlinden, André G | Hofman, Albert | Tönjes, Anke | Kovacs, Peter | Hottenga, Jouke Jan | Willemsen, Gonneke | Vogelzangs, Nicole | Döring, Angela | Dahmen, Norbert | Nitz, Barbara | Pergadia, Michele L. | Saez, Berta | De Diego, Veronica | Lezcano, Victoria | Garcia-Prats, Maria D. | Ripatti, Samuli | Perola, Markus | Kettunen, Johannes | Hartikainen, Anna-Liisa | Pouta, Anneli | Laitinen, Jaana | Isohanni, Matti | Huei-Yi, Shen | Allen, Maxine | Krestyaninova, Maria | Hall, Alistair S | Jones, Gregory T. | van Rij, Andre M. | Mueller, Thomas | Dieplinger, Benjamin | Haltmayer, Meinhard | Jonsson, Steinn | Matthiasson, Stefan E. | Oskarsson, Hogni | Tyrfingsson, Thorarinn | Kiemeney, Lambertus A. | Mayordomo, Jose I. | Lindholt, Jes S | Pedersen, Jesper Holst | Franklin, Wilbur A. | Wolf, Holly | Montgomery, Grant W. | Heath, Andrew C. | Martin, Nicholas G. | Madden, Pamela A.F. | Giegling, Ina | Rujescu, Dan | Järvelin, Marjo-Riitta | Salomaa, Veikko | Stumvoll, Michael | Spector, Tim D | Wichmann, H-Erich | Metspalu, Andres | Samani, Nilesh J. | Penninx, Brenda W. | Oostra, Ben A. | Boomsma, Dorret I. | Tiemeier, Henning | van Duijn, Cornelia M. | Kaprio, Jaakko | Gulcher, Jeffrey R. | McCarthy, Mark I. | Peltonen, Leena | Thorsteinsdottir, Unnur | Stefansson, Kari
Nature genetics  2010;42(5):448-453.
Smoking is a risk factor for most of the diseases leading in mortality1. We conducted genome-wide association (GWA) meta-analyses of smoking data within the ENGAGE consortium to search for common alleles associating with the number of cigarettes smoked per day (CPD) in smokers (N=31,266) and smoking initiation (N=46,481). We tested selected SNPs in a second stage (N=45,691 smokers), and assessed some in a third sample (N=9,040). Variants in three genomic regions associated with CPD (P< 5·10−8), including previously identified SNPs at 15q25 represented by rs1051730-A (0.80 CPD,P=2.4·10−69), and SNPs at 19q13 and 8p11, represented by rs4105144-C (0.39 CPD, P=2.2·10−12) and rs6474412-T (0.29 CPD,P= 1.4·10−8), respectively. Among the genes at the two novel loci, are genes encoding nicotine-metabolizing enzymes (CYP2A6 and CYP2B6), and nicotinic acetylcholine receptor subunits (CHRNB3 and CHRNA6) highlighted in previous studies of nicotine dependence2-3. Nominal associations with lung cancer were observed at both 8p11 (rs6474412-T,OR=1.09,P=0.04) and 19q13 (rs4105144-C,OR=1.12,P=0.0006).
doi:10.1038/ng.573
PMCID: PMC3080600  PMID: 20418888
Bioinformatics  2010;27(4):589-591.
Summary: The Sample avAILability system—SAIL—is a web based application for searching, browsing and annotating biological sample collections or biobank entries. By providing individual-level information on the availability of specific data types (phenotypes, genetic or genomic data) and samples within a collection, rather than the actual measurement data, resource integration can be facilitated. A flexible data structure enables the collection owners to provide descriptive information on their samples using existing or custom vocabularies. Users can query for the available samples by various parameters combining them via logical expressions. The system can be scaled to hold data from millions of samples with thousands of variables.
Availability: SAIL is available under Aferro-GPL open source license: https://github.com/sail.
Contact: gostev@ebi.ac.uk, support@simbioms.org
Supplementary information: Supplementary data are available at Bioinformatics online and from http://www.simbioms.org.
doi:10.1093/bioinformatics/btq693
PMCID: PMC3035801  PMID: 21169373
Prokopenko, Inga | Langenberg, Claudia | Florez, Jose C. | Saxena, Richa | Soranzo, Nicole | Thorleifsson, Gudmar | Loos, Ruth J.F. | Manning, Alisa K. | Jackson, Anne U. | Aulchenko, Yurii | Potter, Simon C. | Erdos, Michael R. | Sanna, Serena | Hottenga, Jouke-Jan | Wheeler, Eleanor | Kaakinen, Marika | Lyssenko, Valeriya | Chen, Wei-Min | Ahmadi, Kourosh | Beckmann, Jacques S. | Bergman, Richard N. | Bochud, Murielle | Bonnycastle, Lori L. | Buchanan, Thomas A. | Cao, Antonio | Cervino, Alessandra | Coin, Lachlan | Collins, Francis S. | Crisponi, Laura | de Geus, Eco JC | Dehghan, Abbas | Deloukas, Panos | Doney, Alex S F | Elliott, Paul | Freimer, Nelson | Gateva, Vesela | Herder, Christian | Hofman, Albert | Hughes, Thomas E. | Hunt, Sarah | Illig, Thomas | Inouye, Michael | Isomaa, Bo | Johnson, Toby | Kong, Augustine | Krestyaninova, Maria | Kuusisto, Johanna | Laakso, Markku | Lim, Noha | Lindblad, Ulf | Lindgren, Cecilia M. | McCann, Owen T. | Mohlke, Karen L. | Morris, Andrew D | Naitza, Silvia | Orrù, Marco | Palmer, Colin N A | Pouta, Anneli | Randall, Joshua | Rathmann, Wolfgang | Saramies, Jouko | Scheet, Paul | Scott, Laura J. | Scuteri, Angelo | Sharp, Stephen | Sijbrands, Eric | Smit, Jan H. | Song, Kijoung | Steinthorsdottir, Valgerdur | Stringham, Heather M. | Tuomi, Tiinamaija | Tuomilehto, Jaakko | Uitterlinden, André G. | Voight, Benjamin F. | Waterworth, Dawn | Wichmann, H.-Erich | Willemsen, Gonneke | Witteman, Jacqueline CM | Yuan, Xin | Zhao, Jing Hua | Zeggini, Eleftheria | Schlessinger, David | Sandhu, Manjinder | Boomsma, Dorret I | Uda, Manuela | Spector, Tim D. | Penninx, Brenda WJH | Altshuler, David | Vollenweider, Peter | Jarvelin, Marjo Riitta | Lakatta, Edward | Waeber, Gerard | Fox, Caroline S. | Peltonen, Leena | Groop, Leif C. | Mooser, Vincent | Cupples, L. Adrienne | Thorsteinsdottir, Unnur | Boehnke, Michael | Barroso, Inês | Van Duijn, Cornelia | Dupuis, Josée | Watanabe, Richard M. | Stefansson, Kari | McCarthy, Mark I. | Wareham, Nicholas J. | Meigs, James B. | Abecasis, Goncalo R.
Nature genetics  2008;41(1):77-81.
To identify novel genetic loci associated with fasting glucose concentrations, we examined the leading association signals in 10 genome-wide association scans involving a total of 36,610 individuals of European descent. Variants in the gene encoding the melatonin receptor 1B (MTNR1B) were consistently associated with fasting glucose across all ten studies. The strongest signal was observed at rs10830963, where each G-allele (frequency 0.30 in HapMap CEU) was associated with an increase of 0.07 (95%CI 0.06–0.08) mmol/L in fasting glucose levels (P=3.2×10−50) and reduced beta-cell function as measured by homeostasis model assessment (HOMA-B, P=1.1×10−15). The same allele was associated with an increased risk of type 2 diabetes (odds ratio = 1.09 (1.05–1.12), per G allele P=3.3×10−7) in a meta-analysis of thirteen case-control studies totalling 18,236 cases and 64,453 controls. Our analyses also confirm previous associations of fasting glucose with variants at the G6PC2 (rs560887, P=1.1×10−57) and GCK (rs4607517, P=1.0×10−25) loci.
doi:10.1038/ng.290
PMCID: PMC2682768  PMID: 19060907
Bioinformatics  2009;25(20):2768-2769.
Summary: SIMBioMS is a web-based open source software system for managing data and information in biomedical studies. It provides a solution for the collection, storage, management and retrieval of information about research subjects and biomedical samples, as well as experimental data obtained using a range of high-throughput technologies, including gene expression, genotyping, proteomics and metabonomics. The system can easily be customized and has proven to be successful in several large-scale multi-site collaborative projects. It is compatible with emerging functional genomics data standards and provides data import and export in accepted standard formats. Protocols for transferring data to durable archives at the European Bioinformatics Institute have been implemented.
Availability: The source code, documentation and initialization scripts are available at http://simbioms.org.
Contact: support@simbioms.org; mariak@ebi.ac.uk
doi:10.1093/bioinformatics/btp420
PMCID: PMC2759553  PMID: 19633095
BMC Bioinformatics  2007;8:52.
Background
One of the crucial aspects of day-to-day laboratory information management is collection, storage and retrieval of information about research subjects and biomedical samples. An efficient link between sample data and experiment results is absolutely imperative for a successful outcome of a biomedical study. Currently available software solutions are largely limited to large-scale, expensive commercial Laboratory Information Management Systems (LIMS). Acquiring such LIMS indeed can bring laboratory information management to a higher level, but often implies sufficient investment of time, effort and funds, which are not always available. There is a clear need for lightweight open source systems for patient and sample information management.
Results
We present a web-based tool for submission, management and retrieval of sample and research subject data. The system secures confidentiality by separating anonymized sample information from individuals' records. It is simple and generic, and can be customised for various biomedical studies. Information can be both entered and accessed using the same web interface. User groups and their privileges can be defined. The system is open-source and is supplied with an on-line tutorial and necessary documentation. It has proven to be successful in a large international collaborative project.
Conclusion
The presented system closes the gap between the need and the availability of lightweight software solutions for managing information in biomedical studies involving human research subjects.
doi:10.1186/1471-2105-8-52
PMCID: PMC1803798  PMID: 17291344

Results 1-9 (9)