|Home | About | Journals | Submit | Contact Us | Français|
Genetic data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) has been crucial in advancing the understanding of AD pathophysiology. Here we provide an update on sample collection, scientific progress and opportunities, conceptual issues, and future plans.
Lymphoblastoid cell lines and DNA and RNA samples from blood have been collected and banked, and data and biosamples have been widely disseminated. To date, APOE genotyping, genome-wide association study (GWAS), and whole exome and whole genome sequencing (WES, WGS) data have been obtained and disseminated.
ADNI genetic data have been downloaded thousands of times and over 300 publications have resulted, including reports of large scale GWAS by consortia to which ADNI contributed. Many of the first applications of quantitative endophenotype association studies employed ADNI data, including some of the earliest GWAS and pathway-based studies of biospecimen and imaging biomarkers, as well as memory and other clinical/cognitive variables. Other contributions include some of the first WES and WGS data sets and reports in healthy controls, MCI, and AD.
Numerous genetic susceptibility and protective markers for AD and disease biomarkers have been identified and replicated using ADNI data, and have heavily implicated immune, mitochondrial, cell cycle/fate, and other biological processes. Early sequencing studies suggest that rare and structural variants are likely to account for significant additional phenotypic variation. Longitudinal analyses of transcriptomic, proteomic, metabolomic, and epigenomic changes will also further elucidate dynamic processes underlying preclinical and prodromal stages of disease. Integration of this unique collection of multi-omics data within a systems biology framework will help to separate truly informative markers of early disease mechanisms and potential novel therapeutic targets from the vast background of less relevant biological processes. Fortunately, a broad swath of the scientific community has accepted this grand challenge.
The Alzheimer’s Disease Neuroimaging Initiative (ADNI) was initiated in 2003 by the National Institute on Aging (NIA), the National Institute of Biomedical Imaging and Bioengineering (NIBIB), the Food and Drug Administration (FDA), private pharmaceutical companies, and nonprofit organizations as a public-private partnership. The first phase (ADNI-1) launched a cohort study of patients with Alzheimer’s disease (AD), mild cognitive impairment (MCI), and cognitively normal older adult controls (CN). A second, American Recovery and Reinvestment Act of 2009 (ARRA) sponsored, “Grand Opportunity” phase (ADNI-GO) followed, and introduced the concept of early mild cognitive impairment (EMCI). ADNI is now completing its third phase (ADNI-2), which also assesses individuals with early or late MCI (LMCI), mild AD, cognitively normal controls, and a new group of older adults with significant memory concerns (SMC) but whose psychometric performance is within normal limits , consistent with the recent consensus concept of subjective cognitive decline (SCD) . Detailed information on ADNI is found on the study website (http://www.adni-info.org/). The primary goal of ADNI has been to establish the optimal panel of clinical assessments, magnetic resonance imaging (MRI) and positron emission tomography (PET) imaging measures, as well as other biomarkers from blood and cerebrospinal fluid (CSF), to inform clinical trial design for AD therapeutic development. At the same time, ADNI has been highly productive in generating a wealth of data useful for elucidating disease mechanisms occurring during early stages of preclinical and prodromal AD. For a comprehensive summary of ADNI results see  and the other papers in this Special Issue.
Genetics has played an increasingly important role in AD research over the past few decades, starting with the discoveries of rare mutations in APP (amyloid precursor protein), PSEN1 (presenilin-1), and PSEN2 (presenilin-2) that cause early onset autosomal dominant forms of the disease (see  for review) and continuing with the association of the common APOE (apolipoprotein E) ε4 allele with sporadic or late onset AD (LOAD). Until recently, only APOE on chromosome 19 had been robustly replicated as a LOAD susceptibility gene. With the advent of large-scale genome-wide association studies (GWAS) conducted by multi-study consortia, a current “top 20” or more list of replicated genes has emerged, which are now undergoing further investigation to identify the biological basis of their risk or protective roles in LOAD . In ADNI, APOE genotype was determined and DNA and cell lines were banked from the beginning of the study. Genome-wide genotyping was also completed, enabling ADNI data to be included in several large and important GWAS studies. Notably, most of these large well-powered meta-analytic GWAS studies are restricted to binary categorical phenotypes, such as clinical diagnosis, and usually only include AD cases and controls.
A relatively unique aspect of ADNI is the systematic longitudinal collection of biomarker data that can serve as quantitative endophenotypes for genetic association studies. In fact, using quantitative endophenotypes as target measures has been found to improve power relative to case-control techniques, as well as to avoid using arbitrary or potentially error prone cut-offs to define case status [6, 7]. Recognizing the value of these quantitative trait loci (QTL) investigations in AD, the ADNI Genetics Core was established at the beginning of the GO phase  with the goals of: (1) blood sample processing, genotyping, and dissemination; (2) genome-wide analysis of multidimensional phenotypic data collected on the ADNI cohort; and, (3) serving as a central resource, point of contact, and planning group for genetic studies in ADNI. The first GWAS of an ADNI quantitative phenotype was published in 2009  and progress has been rapid as groups worldwide analyzed the publicly available ADNI genetic data. Shen et al. (2014) provided a detailed review of results from ADNI genetic studies through 2012 . Here, we provide an updated review of these results through 2014, summarize the evolving set of ADNI genetic data, and discuss future directions to maximally leverage this data that we hope will be incorporated in the proposed next phase of ADNI.
The main goal of this update is to describe the genetic data and highlight results generated to date as part of the ADNI study. First, we describe the samples, methods, and basic analysis of the different types of genetic data available in ADNI, including APOE, TOMM40 poly-T repeat, GWAS, whole exome sequences, whole genome sequences, and microarray-based RNA gene expression profiles. This documentation of techniques which have evolved over time should assist the many users of ADNI genetic data to better understand the processing pipelines. Next, we discuss how and the extent to which the various ADNI genetic datasets have been used so far. Third, we highlight novel and important findings generated by relating quantitative phenotypes to the ADNI genetic data. Fourth, we discuss the findings from the ADNI data in the context of a systems biology framework. Next, we describe how ADNI genetic findings could inform drug development, as well as clinical trial design and execution. This is followed by a discussion of the potential for understanding disease biology using induced pluripotent stem cells (iPSC). Next, we describe how ADNI and other genetic data relate to ethical, legal, and social implications (ELSI) of genetic testing, e.g., in returning genetic results on disease risk. Finally, we discuss future directions proposed by the Genetics Core that we hope will be investigated in the proposed next phase of the ADNI study.
Blood samples are collected at each visit for all participants (i.e., CN, SMC, EMCI, LMCI and AD) according to ADNI protocol for the particular study phase (http://www.adni-info.org/Scientists/ADNIStudyProcedures.aspx). All collected samples are sent to and processed by the NIA-sponsored National Cell Repository for Alzheimer’s Disease (NCRAD; http://ncrad.iu.edu/) according to Standard Operating Procedures briefly summarized below. Baseline blood samples for ADNI-1 were received and processed by the ADNI Biomarker Core with DNA extraction (described below) contributed by Pfizer and performed by Cogenics (now Beckman Coulter Genomics). DNA was then transferred to NCRAD. All samples of genomic material are inventoried and tracked by NCRAD and the ADNI Genetics Core. In addition, collection of PAXgene blood RNA samples (three 2.5-ml tubes per participant) was initiated in the ADNI-GO/2 phases. As of 03/24/2015, there are 777 ADNI-1, 127 ADNI-GO, and 781 ADNI-2 participants with at least one DNA sample from genomic blood stored at NCRAD. 810 ADNI-1, 125 ADNI-GO, and 772 ADNI-2 participants have at least one cell line DNA sample. Because RNA collection was initiated in ADNI-GO, only ADNI-1 participants who continued to ADNI-GO/2 have RNA samples; 290 ADNI-1, 128 ADNI-GO, and 780 ADNI-2 participants have at least one RNA sample stored at NCRAD. Many ADNI-GO/2 participants also have longitudinal RNA collection.
DNA is extracted from buffy coats using whole blood collected in ethylenediaminetetraacetic acid (EDTA) tubes or buffy coats pre-isolated at the site of collection. The salts of the chelating agent EDTA are used as anticoagulants for genetic testing as they preserve cellular components of blood. Blood samples are centrifuged at 3000 rpm (1500 X g) for 10 minutes at room temperature and the plasma is drawn off and the buffy coat is removed. The buffy coat contains nucleated cells containing DNA. DNA bound to protein is separated using sodium dodecyl sulfate and proteinase K. Then, alcohol is used to precipitate DNA from a high concentration of sodium chloride. Precipitated DNA is collected and transferred into 2-ml or 5-ml tubes following centrifugation and incubation. Once a homogenous solution is obtained and placed inside rotator, DNA concentration is measured on a NanoDrop 1000 Spectrophotometer and DNA tubes are stored in the appropriate −80°C freezer.
The QIAgen PAXgene Blood RNA Kit (Germantown, MD, USA) is used to purify total RNA from 2.5-ml of human whole blood collected in a PAXgene Blood RNA Tube. The procedure is performed using manual or automated procedures. One 2.5-ml tube is processed to extract RNA and the remaining tubes are stored at −80°C. RNA extraction is performed using the Qiagen QIAcube according to the manufacturer’s protocol. One tube is stored in the 4°C refrigerator before any sample preparation and centrifuged at 3000 X g at room temperature followed by discarding supernatant and mixing RNAse-free water or BR1 resuspension buffer. Samples are then microcentrifuged using QIAcube and eluted samples are transferred to thermal screw cap tubes for incubation for 5 minutes using a benchtop oven pre-heated to 65°C. Quality and quantity of incubated RNA samples is measured and RNA samples are aliquoted in 2-ml screw cap tubes at 2μg/aliquot and stored at −80°C.
To establish lymphoblastoid cell lines (LCLs), whole blood samples from the baseline visit are collected in two 8.5-ml tubes coated with acid citrate dextrose. LCLs are created using the white blood cells (WBC) from the buffy coat extracted from whole blood. Cell lines are immortalized by transforming B lymphocytes with the Epstein-Barr virus. WBC are placed in a flask along with a solution that allows permanent cell growth and incubated at 37°C for between 4 weeks and 3 months. The cell-containing solution is then divided and transferred into two larger flasks for further cell growth. Cells are then placed in a vial, along with a preservative, and gradually cooled to freezing temperature to prevent damage to the cell line. The frozen cells are stored in a liquid nitrogen tank at NCRAD.
APOE genotyping was done using DNA from blood samples from 818 ADNI-1, 128 ADNI-GO, and 778 ADNI-2 participants. For ADNI-1, APOE genotyping was carried out by polymerase chain reaction (PCR) amplification, Hhal restriction enzyme digestion, and subsequent standard gel resolution and visualization processes [11, 12]. For ADNI-GO and ADNI-2 DNA samples, genotyping was performed by Prevention Genetics (Marshfield, WI, USA) and LGC Genomics (Beverly, MA, USA). Prevention Genetics employed array processing using allele-specific PCR with universal molecular beacons [13, 14]. At LGC, assays were performed using competitive allele-specific PCR, enabling bi-allelic scoring of single nucleotide polymorphisms (SNPs). Assay kits were added to DNA samples, followed by thermal cycling reaction and an end-point fluorescent read. Genotypes were called using LGC Genomics’ in-house Kraken software (http://www.lgcgroup.com/products/genotyping-software/kraken/#.VRCH6I7F98E) and were returned to the ADNI Genetics Core after manual quality control (QC). All APOE genotype data underwent further QC checks, including sex and identity checks, and potential problems were identified and corrected through communication with NCRAD and other cores. Final quality-controlled data were posted on the ADNI LONI website (http://adni.loni.usc.edu).
The TOMM40 poly-T repeat was assessed in the ADNI-1 cohort because an association had been previously reported between the TOMM40 poly-T repeat and AD . Genotyping was performed by Polymorphic DNA Technologies, Inc. (Alameda, CA, USA) for 757 individuals from ADNI-1, with support for the assays provided by Allen Roses (Duke University). Details of the poly-T assay and bioinformatics analyses using long-range PCR and DNA sequencing are available in .
Genome-wide genotyping was performed using the Illumina Human 610-Quad BeadChip (Illumina, Inc., San Diego, CA, USA) for ADNI-1 individuals by TGen (Phoenix, AZ, USA) and using the OmniExpress BeadChip for ADNI-GO/2 individuals by the Center for Applied Genomics of Children’s Hospital of Philadelphia (Philadelphia, PA, USA). Detailed genotyping protocols were described previously . All assays were performed according to manufacturer protocols. Bead intensity data were used to call genotypes using BeadStudio 3.2 (Illumina) for the first release of ADNI-1 data, GenomeStudio v2009.1 (Illumina) for the second release of ADNI-1 data, and GenomeStudio v2011.1 (Illumina) for ADNI-GO/2. All genotype data was quality controlled, including checks for sex and identity, and the quality-controlled data were released on the ADNI LONI website (http://adni.loni.usc.edu) in Final Report and/or PLINK data formats. Of note, samples from the 818 individuals in the whole genome sequencing (WGS) sub-study described below (Section 2.2.5) underwent genome-wide genotyping using the Illumina Omni 2.5M BeadChip performed by Illumina. Thus, multiple genome-wide genotype array datasets are available for some ADNI participants. After QC, the Omni 2.5M genotype data in PLINK format were posted on the LONI website (http://adni.loni.usc.edu) in February 2014.
Whole exome sequencing (WES) of 18 ADNI-1 participants with a diagnosis of MCI at baseline was performed on blood-derived genomic DNA samples at TGen (www.tgen.org/). Participants were selected for a small extreme-trait design. Specifically, nine age- and education-matched pairs of non-Hispanic Caucasian, male, right-handed participants with amnestic MCI were selected on the basis of rapid versus slow hippocampal volume change on MRI over 2 years (nine participants with rapid rates of atrophy and nine with slow rates of atrophy) [16, 17]. All participants had APOE ε3/ε3 genotype, as one goal of this small study was to identify coding variants other than the APOE ε4 allele that are associated with rate of hippocampal volume loss. Sequences were enriched through hybridization using the Agilent’s SureSelect Human All Exon 50Mb kit following the manufacture’s protocol (www.genomics.agilent.com/). The Agilent kit captured an exome that was approximately 50Mb in size, covering ~21,000 genes. These samples could then be sequenced together on one lane of the flowcell and segregated later for analysis using their molecular bar-codes as tags. Samples were sequenced across multiple flowcell lanes to account for any possible lane effects. These libraries were sequenced on the Illumina HiSeq2000 using paired-end read chemistry and read lengths of 105bp. Each sample was sequenced two times and the resulting fastq files were posted to the ADNI LONI website (http://adni.loni.usc.edu) after QC.
Whole genome sequencing (WGS) was performed on blood-derived genomic DNA samples from 818 ADNI-1/GO/2 participants with support from the Brin-Wojcicki Foundation and the Alzheimer’s Association. This research sequencing was performed in a non-CLIA (see Section 4.4) laboratory at Illumina. 810 of the ADNI WGS datasets were released in August 2013. Samples were sequenced on the Illumina HiSeq2000 using paired-end read chemistry and read-length of 100bp at 30–40X coverage. The resulting BAM and VCF files generated by Illumina using the CASAVA software were archived at three physical locations, including LONI at the University of Southern California, Indiana University, and the Broad Institute. QC steps included participant sex check, participant identity check, and variant quality check of the Illumina-generated VCF files. The participant sex check was conducted by comparing sex estimated using Illumina Omni2.5M GWAS data and self-reported sex information from the ADNI database. Participant identity was cross-checked by determining: (1) concordance of a subset of fingerprint SNPs between prior genotype data and Omni2.5M genotype data; and, (2) concordance of all quality-controlled SNPs between Omni2.5M genotype data and WGS VCF data. The quality of variants passing sequencing QC in the WGS VCF files was assessed by comparing with Omni2.5M genotype data. Since Illumina’s CASAVA software calls variants on a single sample only, the Broad Institute converted the BAM files into original fastq files, a text-based format for storing both sequence reads and their corresponding quality information in Phred format, and re-aligned and re-called the variants using Broad GATK. Briefly, short-read sequences were mapped to the reference human genome using BWA-mem and potential PCR duplicates were removed. After completing initial alignment, the alignment was further refined by locally realigning any suspicious reads. The reported base calling quality scores obtained from the sequencer were recalibrated to account for covariates of base errors such as sequencing technology and machine cycle. Finally, the realigned reads were written to a BAM file for further analysis. The analysis-ready BAM files were assessed to identify all variants with statistical evidence for an alternate allele present among samples using GATK HaplotypeCaller for multi-sample variant callings. Variants that passed recommended variation quality criteria were assessed by comparing with the Illumina Omni 2.5M genotyping data to estimate the concordance rate for each individual. Finally, the resulting VCF files were posted to the ADNI LONI website (http://adni.loni.usc.edu).
Gene expression or RNA profiling is a new type of data for ADNI that has long been planned but was not previously supported by existing funding. Fortunately RNA profiling on selected baseline blood samples was contributed by Bristol-Myers Squibb (BMS) and performed at the BMS laboratories for 811 ADNI participants included in the WGS sub-study. The Affymetrix Human Genome U219 Array (Affymetrix, Santa Clara, CA) was used for expression profiling. Peripheral blood samples were collected using PAXgene tubes for RNA analysis. Total RNA was extracted using the PAXgene Blood RNA Kit, following the protocol provided by the manufacturer. The quantity and quality of extracted RNA were assessed using the NanoDrop and the Agilent Bioanalyzer, respectively. Blood RNA samples from 64 participants did not pass QC and were excluded. Samples were randomized to plates, with checks to ensure sex and diagnosis balance, and hybridized to Affymetrix Human Genome U219 array plate. Array hybridization, washing, staining, and scanning were carried out in an Affymetrix GeneTitan system. The quality of gene expression data, including sample quality and hybridization and overall signal quality, was analyzed using Affymetrix Expression Console software and Partek Genomic Suite 6.6, according to standard QC criteria provided by each software package. Raw expression values obtained directly from CEL files were pre-processed using the Robust Multi-chip Average (RMA) normalization method . The Affymetrix HG U219 Array contains 530,467 probes for 49,293 transcripts. All Affymetrix U219 probe sets were mapped and annotated with reference to the human genome (hg19). The Genetics Core performed several additional QC steps using the RMA normalized expression array data. First, the sex of samples was checked using sex-specific gene expression data, including XIST and USP9Y . Second, sample identity was verified on the basis of expression profiling to Omni2.5M genotype match using a Bayesian method to predict individual SNP genotypes from gene expression data . Briefly, the 1,000 most significant SNP-transcript cis-eQTL pairs from quality-controlled gene expression and genotype data were used to estimate the posterior probability for a match between gene expression and genotype data. The quality-controlled gene expression profiles were completed and released to the ADNI LONI website (http://adni.loni.usc.edu) in late April 2015.
Another new and important data type for ADNI expected to become available in mid-2015 is metabolomics profiling. As a brief background, the metabolome is an organism’s repertoire of metabolites or small molecules present in cells, tissues, body fluids, and organ systems. Metabolomics is the comprehensive study of metabolites and metabolism or biochemical processes at the global or “-omics” level. In contrast to the RNA profiling discussed above that reflects transcription processing and gene products, metabolic profiling assays the end products of cellular processes and can provide a more complete picture of cellular physiology and pathophysiology. This is a relatively new and rapidly expanding field that is recognized as having the potential to significantly impact medical practice [21–24].
Over the past few years, the ADNI Biomarker and Genetics Cores have collaborated with Dr. Rima Kaddurah-Daouk (Duke University) and colleagues to initiate metabolomics profiling of ADNI samples. Several initial studies were conducted using samples from Duke and the University of Pennsylvania [25–27]. Currently, ADNI-1 plasma samples are being profiled, as are additional samples from the Indiana University Memory and Aging Study (IMAS) . Profiling is expected to include broad biochemical coverage with over 500 metabolites measured in each participant to permit analysis of metabolic changes within multiple pathways and networks. A biochemical database will be created and released on the ADNI LONI website (http://adni.loni.usc.edu). We expect that baseline data from ADNI-1 will be profiled and released by end of 2015.
Since the initial GWAS genotyping of ADNI-1 samples by TGen, there were 31 sample requests fulfilled by NCRAD. As of 3/24/2015, 8333 ADNI DNA samples were used for APOE and GWAS genotyping, TOMM40 Poly-T genotyping, WES, WGS, replication of genetic findings from other cohorts, and other studies. 811 RNA PaxGene tubes were used for the RNA expression profiling by BMS. ADNI genetic data dissemination is managed and tracked by the Informatics Core based at LONI. From 2009 through the end of 2014, ADNI GWAS and/or other genetic data were downloaded approximately 107,000 times. For WGS, 12 sets of the extensive (~150 TB) WGS raw data set (BAM files) have been disseminated by hard drive array, which were return shipped to requestors.
Here we briefly summarize the ADNI genetic studies published up to 1/1/2015, where ADNI APOE, GWAS, or sequencing data were used. We searched the PubMed database using the EndNote X7 online search tool with the following three criteria: (1) the “All Fields” contain “Alzheimer’s Disease Neuroimaging Initiative”; (2) the “All Fields” contain “APOE”, “apolipoprotein”, “gene”, “genetic”, “genetics”, “genome”, “genomic”, or “genomics”; and, (3) the “Year” field value is between 2008 and 2014. We integrated the search results with the ADNI publication database maintained by the ADNI Data and Publications Committee. We manually reviewed all the abstracts and identified 313 relevant ADNI genetic publications through this extensive search. These publications include 2 in 2008, 9 in 2009, 38 in 2010, 36 in 2011, 60 in 2012, 69 in 2013, and 99 in 2014; see Supplementary Material (ADNI_Genetics_papers_2008-2014.enlx). In these publications, ADNI genetic data were studied together with one or more of the ADNI multi-dimensional phenotypic data sets, including clinical information, structural MRI (sMRI), advanced MRI such as diffusion tensor imaging (DTI), resting state fMRI, or arterial spin labeled (ASL) perfusion MRI, [18F]Florbetapir (AV-45) or [11C]Pittsburgh Compound B (PiB) PET, [18F]Fluorodeoxyglucose (FDG) PET, CSF or plasma biomarkers, and/or cognitive measures. Figure 1 shows the distribution of ADNI genetic publications grouped by assessed phenotype(s) from 2008 to 2014. Note that papers analyzing more than one phenotype were counted multiple times. Figure 2a shows the word cloud of journal names where these papers were published, where the color and size of a journal name correspond to the number of papers published in that journal. Figures 2b and 2c show the word cloud of gene names appearing in the paper abstracts, where the color and size of a gene name correspond to the number of abstracts mentioning the gene. Note that Figures 2b and 2c represent the results with and without APOE included, respectively.
Since the first ADNI GWAS publication by Potkin et al. , numerous genetic analyses have used the ADNI data alone or in combination with other cohorts. The papers published between 2009 and 2012 were summarized in detail in . Publications including ADNI genetic data continued to increase rapidly in 2013 and 2014. As shown in Figure 1, different types of qualitative and quantitative phenotypes have been studied with genetic data in ADNI. Below, we summarize and selectively review recent findings grouped by phenotypic category.
One of the most prominent case control studies in LOAD was conducted by Lambert et al. , who performed a meta-analysis of 74,046 individuals, including the ADNI cohort, and identified 11 new susceptibility loci for LOAD. Boada et al.  performed another multi-cohort study, including ADNI, and identified a novel LOAD association at the adenosine triphosphate (ATP) synthase, H+ transporting, mitochondrial F0 (ATP5H)/Potassium channel tetramerization domain-containing protein 2 (KCTD2) locus. The ATP5H/KCTD2 locus has been related to mitochondrial energy production and neuronal hyperpolarization during cellular stress conditions. Leduc et al.  performed a three-cohort study, including ADNI, and identified rs3846662 in the 3-hydroxy-3-methylglutaryl-CoA reductase (HMGCR) gene as a potent genetic modifier for AD risk, age of onset, and MCI to AD conversion. This finding is noteworthy since the use of statins (HMGCR inhibitors) during mid-life has been demonstrated to reduce the risk of developing sporadic AD by as much as 70%, although no AD-related benefits of statins have been shown in clinical AD patients. Biffi et al.  performed a targeted genetic association study in ADNI and identified variations in the oxidative phosphorylation gene-set associated with AD status and conversion from CN to MCI. Kim et al.  performed a gene-based analysis of relatively rare alleles (minor allele frequency≤0.03) in ADNI, coupled with several other AD cohorts from dbGaP, and identified an association between ZNF628 (zinc finger protein 628; a transcription factor coding gene) and AD. Finally, Desikan et al. reported on the genetic overlap between AD and Parkinson’s disease (PD) based on large GWAS data sets including ADNI (total n=89,904) and identified a marker in the extended MAPT (microtubule-associated protein tau) region (A allele of rs393152) associated with both AD and PD . Comparisons of common and divergent pathways in neurodegenerative diseases including AD, PD, amyotrophic lateral sclerosis, and other conditions appears to be a promising strategy as discussed in .
Data extracted from structural MRI scans have been the most widely analyzed phenotypes in ADNI genetic studies as was described in Shen et al. . As an example, Nho et al. [16, 17], using an extreme phenotype design discussed above in Section 2.2.4, identified genetic variations within PARP1 (poly (ADP-ribose) polymerase 1) and CARD10 (caspase recruitment domain family, member 10) genes associated with a more rapid rate of hippocampal volume loss in MCI. Recently, the same extreme phenotype imaging genetics design was used to identify REST (RE1-silencing transcription factor) as a protective variant [35, 36], which was consistent with an elegant independent series of analyses in post-mortem tissue and cellular models reported by Lu and colleagues in Nature . Hohman et al. [38, 39] performed two ADNI studies to examine genetic modification of the relationship between other AD biomarkers and MRI-based neurodegeneration. In one study , they identified rs4728029 from the POT1 (protection of telomeres 1) gene, which modifies the relationship between phosphorylated tau and ventricular dilation. In the other study , they identified several SNPs that modify the relationships between amyloid-beta (Aβ) or tau positivity and neurodegeneration. Roussotte et al. performed a series of ADNI MRI-based genetic studies [40–43] and identified a common variant in the DAT1 (dopamine transporter; also known as SLC6A3, solute carrier family 6 (neurotransmitter transporter), member 3) gene associated with faster ventricular expansion , combined effects of LOAD risk variants in the CLU (clusterin) and APOE genes on ventricular expansion , a common variant in the OPRD1 (opioid receptor, delta 1) gene associated with smaller regional brain volumes , and a variant in the RASGRF2 (Ras protein-specific guanine nucleotide-releasing factor 2) gene associated with larger cortical volume but faster longitudinal ventricular expansion . Luis et al.  performed an MRI-based genetic study on the TREM2 (triggering receptor expressed on myeloid cells 2) AD risk variant (rs75932628), which included ADNI and a Spanish cohort, and identified an association between the TREM2 variant and frontobasal grey matter loss. Rajagopalan et al. also found that carriers of the risk-conferring variant in TREM2 had faster atrophy in the temporal lobes compared to non-carriers . Koran et al.  evaluated the effect of genetic interactions within inositol-related pathways on longitudinal MRI measures of the inferior lateral ventricles and identified several genetic interactions associated with longitudinal changes in ventricle size. Andrawis et al.  demonstrated that APOE had a significant effect on longitudinal change in the hippocampus, with greater change in MCI and AD participants who were APOE ε4 positive. Finally, Apostolova et al.  demonstrated that APOE genotype alters the timing and sensitivity of hippocampal volume and CSF biomarkers for predicting MCI to AD conversion, with better prediction using hippocampal volume in APOE ε4 positive MCI participants and better prediction using p-tau in APOE ε4 negative participants.
ADNI-GO/2 introduced several advanced MRI modalities, including DTI, resting-state fMRI, and ASL perfusion MRI. In each case these advanced techniques were limited to sites employing a particular scanner vendor effectively reducing the sample to about one third of the cohort for each scan type and limiting the power for genetic analyses (see Jack et al. MRI Core report in this Special Issue). Given the modest sample size of these data sets, only a few genetic studies have been performed to date, using only the ADNI DTI data. Notably, however, combining data from two cohorts, Jahanshad et al.  performed the first connectome-wide and genome-wide association study and identified an association between a variant in SPON1 (spondin 1; rs2618516) and brain anatomical fiber connectivity. Warstadt et al.  fitted a multi-locus genetic model within white matter areas associated with serum cholesterol and identified that CETP (cholesteryl ester transfer protein, plasma; increased rs5882 G-allele dosage) was associated with higher fractional anisotropy and lower radial and mean diffusivity on DTI. Several publications also proposed new methods to boost the power to detect genetic associations with diffusion imaging measures in ADNI, by using bivariate genetic analysis of MRI and DTI and an approach called “seemingly unrelated regression” which integrates genetic information from different imaging biomarkers [51–53].
An early targeted study using a modest sized ADNI-1 [11C]PiB PET sample examined the amyloid pathway . The number of genetic studies with amyloid PET phenotypes grew rapidly in 2013 and 2014. For example, Bradshaw et al.  examined the CD33 (CD33 molecule) LOAD susceptibility locus (rs3865444) with the ADNI-1 [11C]PiB PET data and identified an association with increased [11C]PiB binding. Ramanan et al.  performed the first GWAS with amyloid PET as the phenotype using the ADNI-GO/2 [18F]Florbetapir scan data and reported that APOE and BCHE (butyrylcholinesterase) were independent modulators of cerebral amyloid deposition, together accounting for nearly 15% of the variance in cross-sectional amyloid burden. Thambisetty et al.  examined the CR1 (complement component (3b/4b) receptor 1) LOAD susceptibility locus (rs3818361) using ADNI-1 [11C]PiB PET data, combined with [11C]PiB PET data from sub-studies of the Baltimore Longitudinal Study of Aging (BLSA), and reported that risk allele carriers had lower brain amyloid burden compared to non-carriers. Swaminathan et al.  examined the association between plasma Aβ from peripheral blood and cortical amyloid deposition on [11C]PiB PET and found that this relationship was modulated by APOE ε4 genotype. Murphy et al.  mapped the effects of APOE ε4 on ADNI [18F]Florbetapir scans and found a significant effect in the four cortical regions examined. Risacher et al.  investigated the role of APOE genotype in EMCI and found that the APOE ε4 allele was associated with increased amyloid accumulation on [18F]Florbetapir PET, lower CSF Aβ1-42, and increased CSF tau levels in EMCI and CN. Recently, Risacher et al.  also examined the role of APOE in an expanded sample that included the new SMC group introduced in ADNI-2. SMC participants who were carriers of at least one APOE ε4 allele showed greater amyloid deposition on [18F]Florbetapir PET, lower CSF Aβ1-42, and increased CSF tau levels compared to those who were APOE ε4 non-carriers. Interestingly, an effect of APOE ε4 was not observed for hypometabolism or medial temporal lobe atrophy in the SMC group consistent with the concept that those markers may be downstream of the amyloid and tau proteinopathies detected by PET and in CSF . Finally, several amyloid PET studies examined various facets of epistasis or gene-gene interactions. Yan et al.  performed a transcriptome-guided amyloid imaging genetic analysis and identified bi-multivariate associations between APOE SNPs and brain-wide amyloid imaging measures. Hohman et al.  examined epistatic relationships between genes involved in amyloid and tau pathophysiology using [18F]Florbetapir PET imaging as the target phenotype. This study identified three significant interactions between a SNP in GSK3β (glycogen synthase kinase 3 beta; rs334543) and SNPs in each of the following genes: APBB2 (amyloid beta (A4) precursor protein-binding, family B, member 2; rs2585590, rs3098914) and APP (amyloid beta (A4) precursor protein; rs457581).
Relatively few recent reports specifically examined genetic associations with [18F]FDG PET as an isolated phenotype. However, several studies performed integrative analyses of genetics, [18F]FDG PET, and other imaging and biomarker data for predicting outcomes of interest. For example, Schraml et al.  proposed a hypometabolic convergence index (HCI) to provide a summarized measure of hypometabolism on [18F]FDG PET and identified a significant association between HCI and APOE ε4 dose using ADNI data. [18F]FDG PET measures have also been analyzed with genetic data such as APOE ε4 status, along with other multidimensional biomarkers, to classify AD status , predict cognitive decline , or predict MCI to AD conversion .
Fluid biomarkers are another frequently analyzed category of quantitative phenotypes used in ADNI genetic studies. Kim et al.  examined the influence of genetic variation on plasma protein levels using the multi-analyte Rules Based Medicine (RBM) panel and identified multiple novel associations. For example, the CFHR1 (complement factor H-related 1; rs7517126) and CFH (complement factor H; rs6677604) genes were discovered to be very highly associated with plasma complement factor H-related protein 1 (CFHR1) level. This study also confirmed previously identified gene-protein associations for the interleukin-6 receptor (IL6R), chemokine C-C motif (CC) ligand 4, angiotensin-converting enzyme (ACE), and angiotensinogen. Cruchaga et al.  performed a GWAS of CSF tau/phosphorylated tau (pTau181) levels and identified three loci: one between GEMC1 (geminin coiled-coil domain containing; also known as GMNC) and OSTN (osteocrin), one within GLIS3 (GLIS family zinc finger 3), and one within the TREM gene cluster. Chouraki et al.  performed a genome-wide meta-analysis of plasma Aβ peptide levels and found a significant association between the CTXN3 (cortexin 3) gene and plasma Aβ1-42 levels. Kauwe et al.  performed a GWAS of CSF levels of 59 AD candidate proteins and identified genetic associations with CSF levels of 5 proteins, including ACE, chemokine CC ligand 2, chemokine CC ligand 4, IL6R, and matrix metalloproteinase-3 (MMP-3). As mentioned earlier, Hohman et al. performed two ADNI studies [38, 39] to examine genetic modification of relationship between CSF biomarkers and MRI-based neurodegeneration and identified rs4728029 from the POT1 gene that modifies the relationship between phosphorylated tau and ventricular dilation  and several SNPs that modify the relationships between amyloid or tau positivity and neurodegeneration . Ramirez et al.  performed a GWAS of CSF Aβ1-42 and pTau181 levels and found an association between Aβ1-42 level and rs62256378 in SUCLG2 (succinate-CoA ligase, GDP-forming, beta subunit), as well as an interaction between APOE ε4 genotype and this SNP. In the above-mentioned study by Risacher and colleagues , individuals with SMC who were APOE ε4 carriers showed lower Aβ1-42 and higher tau and pTau181 than SMC APOE ε4 non-carriers. A recent cross-sectional and longitudinal analysis  of the relationship between CSF and PET amyloid biomarkers by Toledo et al.  found that APOE modified the relationship between these AD biomarkers and suggested that they measure different aspects of Aβ pathology in AD. Finally, as mentioned above, Apostolova et al.  found that p-tau was the best predictor for MCI to AD conversion in APOE ε4 negative participants, while hippocampal volume was the best predictor in APOE ε4 positive MCI participants.
Studies including ADNI data have established that the effect of APOE on cognitive performance is more complex than a simple extension of its association with AD risk [60, 73–76]. For example, one study demonstrated an interactive effect of APOE and amyloid pathology on cognitive decline, characterized by APOE ε4 carriers with high amyloid burden exhibiting greater cognitive decline . This finding was consistent with an earlier analysis in the Mayo Clinic Study of Aging that suggested that APOE isoforms modify the association between amyloid load and cognition .
Other genetic analyses of cognition have expanded the scope beyond APOE to study other known AD risk loci [74, 78], as well as to discover novel associations. Sherva et al. reported that the SPON1 gene was associated with longitudinal global cognitive decline . This SPON1 finding is notable in view of the above-mentioned report of an association of SPON1 with cerebral connectivity in both ADNI and a young twin cohort . Ramanan et al. identified an association between the pro-apoptotic gene FASTKD2 (fas-associated serine/threonine kinase domains 2) and cross-sectional episodic memory performance [80, 81]. The Ramanan et al. [80, 81] memory GWAS is noteworthy for its large sample consisting of data from the United States Health and Retirement Study (HRS), ADNI, AddNeuroMed, the Religious Order Study (ROS), the Rush Memory and Aging Project (MAP), and the Indiana Memory and Aging Study (IMAS) yielding a total discovery and replication sample of 14,781 individuals. The novel SNP discovered in this study exhibited a protective effect on memory performance and hippocampal structure (volume and grey matter density on structural MRI) and was associated with decreased CSF levels of apoptotic mediators, consistent with a neuroprotective effect of this gene variant. In related work, the ADNI dataset contributed to the large scale genome-wide screens of brain measures conducted by the ENIGMA Consortium . In one analysis of GWAS data and MRI measures from over 50 cohorts worldwide, the ENIGMA consortium, in partnership with ADNI, discovered 8 common genetic variants associated with the volume of subcortical brain structures, including the hippocampus; several of the implicated genes are associated in aggregate with disease risk and have known roles in cell guidance, apoptosis, and other physiological processes.
Other studies have related genes to cognition following the discovery of a primary association with a non-cognitive phenotype, including ASTND2 (astrotactin 2) and GRIN2B (glutamate receptor, ionotropic, N-methyl D-aspartate 2B) with hippocampal and temporal lobe atrophy on MRI [83, 84], PSEN1 (presenilin 1) with CSF amyloid levels , and DAT1 with AD case-control status . Alternatively, pathway analysis has been used to elucidate collective effects of multiple genes on memory , executive functioning , and depressive symptoms . Notably, pathway analysis of memory in the ADNI-1 sample (N=742)  displayed substantial overlap of underlying mechanisms (particularly of pathways related to cholinergic neurotransmission, cell adhesion, and inflammation) with a subsequent pathway analysis of memory in the population-based HRS sample (N=6,705) , highlighting the potential utility of pathway-based approaches [34, 89] to yield replicable associations from otherwise diverse studies.
Overall, these findings provide a foundation for meeting the broader ADNI aim of enhancing clinical trial design and interpretation. Cognitive phenotypes are thought to have substantial heritability (up to 80%) based on twin studies, but a substantial portion of this heritability is still uncharacterized, and known genes appear to exhibit modest individual effect sizes [90–92]. Although APOE has shown the most consistent relationship with cognition in AD and non-AD settings, its effects depend on interactions with other factors such as age [73, 93, 94] and biomarker status [60, 75], so accounting for these interactions might facilitate risk enrichment and stratification in clinical trials . The use of genetic profiling for risk stratification to enhance clinical trial design is discussed in greater detail in Section 4.2 below. Other heavily-investigated genes, such as BDNF (brain-derived neurotrophic factor) [96–98] and KIBRA (WWC1; WW and C2 domain containing 1) [99–102], have displayed inconsistent relationships with cognitive outcomes, potentially due to the complexity of the cognitive “phenome” where individual cognitive domains may have distinct genetic and molecular architectures. Further, cognitive tasks vary in their domain specificity and domain overlap [90, 91]. Integrative analytic methods, as discussed below in Section 4.1, may be useful for untangling this complex phenotypic architecture . In addition, low-frequency, rare, and copy number variants may now be readily analyzed using the recently available ADNI WGS data allowing assessment of contributions of these features in cognitive performance.
Besides examining genetic effects of different ADNI phenotypes, some studies have integrated ADNI genetic data with multidimensional phenotypic data sets to jointly predict outcomes of interest. For example, Trzepacz et al.  compared the power of three neuroimaging modalities (sMRI, [11C]PiB PET, and [18F]FDG PET), coupled with APOE genotype, for predicting MCI conversion to AD. Zhang et al.  compared several feature selection and machine learning algorithms, where ADNI multi-dimensional data sets, including genetic data along with sMRI, [18F]FDG PET, and CSF biomarkers, were used to classify CN, MCI, and AD participants. As mentioned above, multimodal ADNI data sets including APOE ε4 status, CSF biomarkers, MRI, and [18F]FDG PET have been used as predictors to classify AD status , predict cognitive decline , or predict MCI to AD conversion .
Overall, the many reports evaluating ADNI genetic data using multidimensional phenotypes from rich ADNI data sets confirmed key findings in the genetics of AD and also identified novel candidate genes that deserve further investigation and replication in independent cohorts. Below we discuss several key themes related to the significance and implications of genetic studies using ADNI data as well as plans for the ADNI Genetics Core going forward.
LOAD is thought to have a complex etiology, with multiple genetic and environmental factors presumed to influence susceptibility . This complex model of susceptibility can pose conceptual challenges to the “one-gene/one-enzyme/one-function” paradigm that has been used to characterize disorders with more simple genetic architectures [107, 108]. In response to such challenges, systems biology approaches have been developed that attempt to model complex and interactive multi-level biological systems using a range of advanced computational and experimental tools. These approaches are based on the hypothesis that complex genetic architectures can only be appropriately characterized through analytical strategies designed to capture this complexity [109–111]. In practice, these approaches rely on two fundamental premises, namely that (1) genes and their variants are not isolated inciting entities, but rather exist within broader functional pathways and networks , and that (2) diseases and their endophenotypes are not isolated endpoints, but rather are collections of perturbations at multiple biological levels including the genome, epigenome, transcriptome, proteome, and metabolome [103, 112]. Figure 3 illustrates the landscape of multiple “-omics” domains relevant to AD. It is striking that ADNI has collected data spanning a broad range of these domains (indicated by asterisks in the figure).
One systems biology approach that has been applied to AD starts with an initial discovery of a gene in one “-omics” realm and progresses to test the effect of this gene in complementary biological systems. This strategy can be particularly effective in characterizing the downstream consequences of a genetic variant to fill in knowledge gaps native to most genotype-phenotype associations. There are emerging proof-of-concept examples of this approach that have utilized ADNI data and it seems likely that this approach could be productively extended to other AD-related genes. One integrative functional genomics report followed up on the immune system gene CD33, which was previously implicated in AD through GWAS [113, 114], by demonstrating SNP-associated changes in CD33 expression (protein), phagocytic performance of CD33-expressing monocytes (cellular function), and amyloid accumulation measured by PET imaging (brain pathology) . It is notable that CD33 is expressed selectively in microglia in AD brain tissue , is a modulator of microglial activation, and also constitutes a druggable target that has been investigated in acute myeloid leukemia and very recently in AD . Another example, discussed in Section 3.3.7, used GWAS to discover a novel association of the pro-apoptotic gene FASTKD2 with better episodic memory performance and subsequently related the memory-associated SNP to lower FASTKD2 mRNA expression (transcript), lower CSF levels of fas-mediated apoptotic factors (protein), and higher hippocampal volume and grey matter density (brain structure) . Like CD33, FASTKD2 also shows promise as a druggable target based on earlier work in oncology . Future extensions of this systems biology strategy might additionally leverage the ADNI WGS data, as AD-associated genes are likely to contain rare variants with functional implications in addition to the common variants detected by GWAS .
Alternatively, systems biology approaches can start with larger gene sets (pathways or networks) identified through “-omics” analysis and subsequently use the known biological relationships within those groups to converge on one or more gene targets. Two recent examples of this approach, including one utilizing ADNI data , identified novel regulatory gene targets through network analysis of post-mortem brain tissue transcriptome data. In one report, a network enriched with immune system and microglia-related genes and upregulated in post-mortem AD brain tissue was found to contain TYROBP (TYRO tyrosine kinase binding protein) as a common regulator of many of the genes in the network . Another report identified networks with perturbed gene expression in AD and APOE ε4 carriers and discovered RNF219 (ring finger protein 219) as a key mediator of these transcriptomic changes . Variants in RNF219 were additionally found to be related to increased amyloid deposition by in vivo PET imaging . Overall, the network strategy is particularly attractive for its ability to highlight broad ensembles of genes serving as effectors that drive key disease mechanisms and thus, may hold strong potential for identification of promising therapeutic targets .
With rich phenotype and genotype data now available for participants spanning the continuum from normal aging to clinical AD, the next phase of ADNI described below (Section 4.4) will provide an ideal setting for systems biology approaches to elucidate missing pieces in the understanding of AD pathophysiology and to potentially expand the scope of AD diagnostic and therapeutic strategies. From a risk prediction standpoint, networks exhibiting coexistent genetic variation and biological perturbation would represent prime targets in the development of personalized, burden-based genetic susceptibility tests . For therapeutic development, pathways and networks displaying multi-omics relationships with AD would reduce the search space for rational drug design and may highlight “hub” genes for therapeutic cocktail approaches, such as in the polypharmacy strategies successfully employed for AIDS and various cancers [34, 112].
The prospect of a personalized or precision medicine for AD, and for its incorporation in therapeutic trial design, is predicated on the ability to use an individual’s genetic profile to refine predisposition to disease, characteristics such as likely rate of progression, and predicted therapeutic and side effect responses to various therapeutic strategies. At present, the utility of genetic data within regards to AD clinical drug development has been largely limited to APOE. Unfortunately, the past decade of clinical drug development programs for AD-modifying therapies has been burdened with failures. Although these failures may be a result of lack of drug efficacy, the inability to measure drug response in homogenous patient populations has contributed an unquestionable source of variability in these programs. APOE’s strong and well-replicated associations with multiple clinical and functional disease endpoints and quantitative traits such as brain volume, age at disease onset, amyloid deposition, and disease risk provide sufficient effect size and predictive value to serve utility in clinical development programs [5, 56, 122, 123]. APOE is commonly used in clinical trial analyses as a stratifying factor or covariate to adjust for heterogeneity, although independently it has failed to show consistent therapeutic effect on drug response within this context [124, 125]. However, APOE did predict dose related side effects (vasogenic edema and stroke). APOE is also being used to select patients for two large drug trials targeting at-risk but cognitively normal individuals, the Alzheimer’s Prevention Initiative (API) APOE and TOMMORROW trials, where this genotype provides an inexpensive and minimally invasive biomarker to identify individuals with high risk of AD. Both APOE and the adjacent TOMM40 gene are being used for patient selection in the TOMMORROW trial. The small effect sizes of the other more than 20 confirmed markers from large GWAS studies has to date limited their utility for trial design and stratification. Polygenic scoring and related risk aggregation strategies may prove useful in addressing the cumulative contributions of multiple genes and pathways [126, 127]. Similarly, analysis of gene-gene and gene-environment interactions, while increasing complexity, is likely to provide important insights relevant for trial design and analysis. As additional information becomes available from multiple sources, the combinatorial roles of genes in targeted pathways are expected to become increasing important for trial design.
Sources of human variability (genotypic and phenotypic) continue to be characterized in extraordinary detail, scale, and speed, presenting unprecedented opportunities for discovery and development of novel and targeted therapeutic hypotheses. This requires robust insights into relevant human biology. Figure 4 provides an overview of the path from genetic signal detection to targeted therapeutics, including methods and tolls, as well as implications for trial design, as discussed above. The value of genetic and related data in medicine development has been exemplified by a new paradigm for novel and targeted drug discovery in oncology [128, 129]. Three critical tipping points appear to have now been reached for the key principles of this paradigm to be extended to drug discovery and development in neurodegeneration: (1) unprecedented technologies to characterize genotypic and phenotypic causes and consequences of human disease variability are; (2) dense genotype-phenotype data being generated by large-scale public and private sector investments; and, (3) translational systems available to derive novel insights into disease biology and pharmacology from genetic signals. Considerable effort is being invested to leverage and translate human genetic and related data into opportunities for target discovery and validation, understanding disease biology, and patient stratification and enrichment.
Retrospective analyses of drug discovery pipelines indicate that novel therapeutic hypotheses supported by robust human genetic data have a greater likelihood of successfully delivering new, impactful medicines . Strategically, premising drug discovery pipelines upon a foundation of robust human data provides an opportunity for smaller, focused research portfolios of higher confidence targets (contrasting with historical “shots on goal” strategies of pursuing a large number of lower confidence targets with a high rate of attrition ). Moreover, advances in approaches to integrate genetic, intermediate trait, and clinical outcome data provide opportunities to distinguish causal relationships from correlative associations based upon the principles of Mendelian Randomization . Within the drug discovery industry, two applications of such integrated data are to identify opportunities for target discovery (“forward genetics”: where a new therapeutic hypothesis is developed subsequent to identification of robust genetic signal) and target qualification (“reverse genetics”: where human genetic data are used to “validate” a therapeutic hypothesis based on non-genetic data, such as pre-clinical animal models).
The growing list of genes identified as being robustly associated with AD has recently identified several potential novel therapeutic targets and target pathways. ADNI data has contributed significantly to such findings. For example, recent genetic association findings highlight autoimmune and inflammation related mechanisms as being potentially causal in AD . Autoimmune mechanisms may provide opportunities for novel therapeutic targets for the treatment or prevention of AD, especially as such mechanisms may provide additional therapeutic value beyond established target pathways such as Aβ and tau . Specific examples of immune-related genes associated with AD risk and/or related intermediate traits include TREM2 and CD33 [55, 135]. The dense genetic, multi-modal phenotypic, and clinical data (including in a longitudinal setting) in ADNI have contributed to better understanding of associations of these genes with AD, including potential pathways and mechanisms involved . As the number of genes and pathways identified as being associated with neurodegenerative diseases grows , ADNI provides an important resource to better understand such associations and help contribute to the translation of such associations into therapeutic hypotheses. This contribution may be enhanced given the potential to incorporate functional genomic and mechanistic data in the future.
Genetic studies are instrumental in elucidating the pathways involved in the etiology of AD and in discovering and validating drug targets. A parallel approach can also be used to identify disease biomarkers. Identification of the TREM2 rare variant and its association with AD risk has sparked renewed interest in the field in understanding the role of inflammation . As this mutation is relatively rare, it likely has little clinical utility as a biomarker of disease heterogeneity. However, in part because of this and other immune related genetic findings, there is renewed appreciation of the need for finding blood biomarkers or imaging methods that track with neuroinflammation. Pathway analysis approaches can be used to identify such biomarker targets, as was demonstrated by Zhang et al . As our understanding of disease complexity in AD matures, genomic science can help to elucidate underlying biology, bringing together multiple levels of “-omics” data to the level of individual variability to uncover novel signals and biomarkers that may segregate with disease subtypes.
Cancer research has benefited from renewable sources of cell lines derived from tumors carrying alterations of major oncogenic pathways to enable drug screening and characterization. By contrast, drug discovery in neurodegenerative diseases such as AD have mostly relied on animal models. Advances in iPSC technology have opened new opportunities for creation of novel human-derived in vitro models to help bridge the gap between animal models and clinical development.
iPSC cells can be generated from various adult cell types such as skin fibroblasts, hair follicle keratinocyte, or liver or blood cells. Similar to embryonic stem cells, iPSCs can self-renew and be differentiated into all three germ layers to allow further investigations in relevant tissue/cell types . The first iPSCs for AD were generated from fibroblasts of familial AD patients carrying mutations in PSEN1 or PSEN2 . Neurons differentiated from these iPSCs have elevated Aβ1-42 secretion, which can be modulated by gamma-secretase inhibitors. A later study by Israel et al.  created iPSCs using fibroblasts from familial AD, sporadic AD, and non-demented controls. Similar to the earlier report, differentiated neurons from iPSCs from familial and sporadic AD showed higher levels of Aβ and phosphorylated tau compared to those from controls. These studies indicated that it is possible to generate iPSC-derived neurons from elderly patients that capture basic biochemical changes associated with disease states yielding potential in vitro systems for compound screening. Promising recent technical developments also suggest the feasibility of producing iPSCs with appropriate disease phenotypes from LCLs . This could be very important in maximizing the value of ADNI data as LCLs are available for almost every participant from all study phases (ADNI-1/GO/2).
The vast majority of AD cases are sporadic with contributions from multiple genetic and environmental factors. Even though individuals with AD share similar pathological changes, the patient population is highly heterogeneous. The preliminary comparison reported by Israel et al. showed that only one of the two sporadic AD iPSC differentiated neurons shared similar phenotypes with those from familial AD participants, indicating that sporadic AD patients may have different degrees of similarity to familial AD phenotypes. A larger panel of iPSC lines derived from sporadic AD patients may provide further information related to disease heterogeneity and help define potential disease subtypes within sporadic AD. Recent work from the University of California, San Diego group suggests the potential of iPSC studies to identify molecular phenotypic signatures associated with established AD candidate genes such as SORL1 (sortilin-related receptor, L(DLR class) A repeats containing) .
ADNI has stimulated international efforts to characterize AD phenotypes across a number of technology platforms including imaging, biochemistry, and genetics. iPSC lines derived from these cohorts, including prodromal dementia participants and cognitively normal at-risk participants with rich characterizations, will enable linking in vitro molecular profiles (e.g., epigenetic changes) with other relevant disease phenotypes. In fact, the next phase of ADNI would be well-positioned to collect samples for iPSC development on a large series of older adults at risk for MCI and AD with highly detailed phenotypic characterization that could help to determine the relationship between clinical and pathophysiological phenotypes and cellular/molecular phenotypes. Recent developments in 3-dimensional (3D) culture systems may permit capture of additional disease features such as neurofibrillary tangles . This will provide better in vitro model systems to understand disease mechanisms that, combined with improved phenotypic characterization of disease heterogeneity, can be expected to advance AD therapeutics and move the field closer to the goal of precision medicine.
The topic of return of research results (RORR) to research participants and/or their physicians has been a source of considerable interest and controversy across the entire spectrum of the research enterprise [142–144], but particularly in the domains of imaging [145–147] and genetics [148–150]. In brief, as human studies systematically collect information on participants, participants may request the information and arguments may be made for participant benefit of receiving this information. Discussions of RORR sometimes differentiate information that is relevant to the disease in question and the aims of the study (sometimes called “incidental research findings”) versus information unrelated to the purpose of the study (sometimes referred to as “incidental findings”). The discussion also sometimes differentiates RORR that are aligned with clinical standards of care (for example, returning an unexpectedly elevated blood pressure or the presence of an unexpected mass on an imaging study) versus returning results that are not yet part of expected standards of clinical care where the implications of the results may not yet be well-understood (such as new assays or novel imaging modalities).
There have been active discussions about RORR within various ADNI committees and within the ADNI Genetics Core. Institutional review boards at various ADNI clinical sites have also raised questions with regard to how both imaging and genetic RORR issues should be handled. In ADNI, the issue has arisen in two principal areas: amyloid PET imaging and genomics.
A large subset of ADNI participants received amyloid PET scans as part of the ADNI research protocol. In some of these instances, either the participants themselves have requested their amyloid PET results, or their clinicians (who are sometimes also the ADNI investigators at that particular site) have requested the results of the amyloid PET scans in order to influence the clinical care of these participants – particularly among cognitively normal participants who are found to have positive amyloid PET scans. Arguments against disclosing these data were that they were ordered outside of the clinical work stream, often in situations (such as in unaffected individuals) that were outside the scope of current medical practice, and that they were not all necessarily read in a standardized way by interpreters who had been trained to render a clinical report. A further argument against disclosing these results was that a major component of the scientific value of ADNI is in the correlative analyses between cognitive testing, biomarker collection, structural and functional imaging, and longitudinal outcome. Indeed, it has been demonstrated that simply learning that one has a neurocognitive diagnosis , or that one is at increased genetic risk for AD dementia , can have a deleterious effect neurocognitive performance. If the results of amyloid PET scans performed under a research protocol were prematurely fed back into the clinical work stream, the choices of clinicians and the reactions of participants could be altered in ways that could diminish the scientific validity of the results.
Arguments for RORR to participants and their clinicians are predominantly focused upon the autonomy of participants and the rights of participants to learn information about themselves that could influence their health [142, 150]. There is also the universal appetite among research participants of all types for the return of health-related research information . In just the arena of dementia, a nationally representative survey of United States adults , and an entirely separate survey of adults in the United States and four European countries , showed that nearly 70% of those queried would take a test to learn more about their risk of AD, even if no preventative treatment was available. In a 2012 survey of 260 academic physicians specializing in dementia care and research, 135 (52%) responded, and of those, 24% reported that if amyloid scans were FDA approved and covered by insurance, they planned to use this technology for screening asymptomatic individuals . Since that time, published guidelines have recommended against amyloid scans in cognitively normal individuals [157, 158] and United States Centers for Medicare and Medicaid Services (CMS) issued a decision denying payment for clinical use of this technology [159, 160]. Of note, CMS approved coverage with evidence development (CED) for amyloid PET in which patients meeting appropriate use criteria can receive coverage if they are enrolled in an approved comparative effectiveness trial designed to clarify the benefit to clinical care. In our 2012 survey of ADNI investigators, asking respondents to assume FDA approval of [18F]Florbetapir, over half of the 159 respondents indicated that cognitively normal study participants had requested the results of their amyloid scans and 9 respondents (6%) reported that they had already disclosed amyloid imaging results to these participants . Within ADNI, there was a considerable range of opinion regarding the policy that should be adopted for RORR of amyloid PET scans, and in 2012 a policy was adopted that recommended against the RORR for this modality (http://www.adni-info.org/scientists/doc/10_Green_RORR.pdf). These issues are likely to be revisited periodically in this rapidly evolving area.
In addition to imaging studies, ADNI generates individual level data on a variety of serum, plasma and cerebrospinal fluid biomarkers (see ADNI Biomarker Core report in this Special Issue), as well as the genotype and sequencing results described earlier in this paper. In the areas of biomarkers and genetic variants related to AD, the arguments for and against disclosure are similar to those made about research imaging results, i.e. respect for participant autonomy and possible benefit to ongoing and future medical care versus the possibility that longitudinal measures of important outcomes could be biased by RORR or by actions taken in response to RORR. In the area of laboratory tests there is another consideration, which is that most of the biomarker and genetic data is generated in laboratories that are not CLIA-certified. CLIA is the Clinical Laboratory Improvement Amendments certification by CMS that regulates quality assurance in human clinical laboratory testing in the United States. The position of CMS, and of most Institutional Review Boards (IRBs), is quite unambiguously that no health-related laboratory results should be returned to participants unless those results are generated (or replicated) in a CLIA-certified laboratory [162, 163].
As large scale studies like ADNI collect and analyze genomic information, the societal trends toward return of research results and individual empowerment are encouraging extensive discussion about whether specific processes should be put in place to return incidental or secondary genomic findings of medical importance [150, 164]. Similar discussions have been occurring in the clinical realm as whole exome and whole genome sequencing are increasingly being utilized in clinical diagnosis [165, 166], sparked in part by the recommendations of the American College of Medical Genetics and Genomics (ACMG) that were published in 2013 and updated in 2015 [167, 168]. The ACMG recommended that an explicit panel of mutations in specific genes should be routinely assessed when sequencing is performed for clinical purposes. The ACMG recommendations recognized that this area is evolving and suggested a need for regular re-evaluations of the minimum list, as well as the importance of clear communication with the patient, but attempted to place the return of secondary findings in clinical sequencing squarely within existing traditions of clinical medicine . These discussions in the clinical arena have prompted extensive considerations by researchers and IRBs about the distinctions between the clinical and research domains [162, 163], but consensus recommendations appear to be evolving toward the position that genomic results of medical importance and medical actionability should be shared with research participants and after the death of participants, with their families, particularly if the consent forms are clearly written to include this contingency [150, 170].
Large-scale genotyping and sequencing in a dementia study raises a number of other issues. Many individuals and families are interested in learning their APOE genotype, as this is a risk marker for AD that has become widely recognized throughout society. APOE genotype is a robust marker for AD risk, but does not meet the criteria for medical importance and medical actionability that have been described in the above referenced discussions of RORR. Thus, while clinical trials have demonstrated that APOE genotype can be returned without undue distress [171–174], this is not typically included in discussions of possible genomic RORR.
Both imaging and genome-scale testing also raise issues of re-identifiability. With high resolution MRI scanning, it is theoretically possible to reconstruct faces to the point where they might be identifiable. There have also been well-publicized accounts in which de-identified genomes could be linked to specific surnames, and potentially to specific individuals, through linkage to publically available databases . In this regard, ADNI genomes are as vulnerable, or protected, as other genomes that are being made available to qualified researchers through various sharing mechanisms such as dbGaP. While re-identification through imaging or genomic information is therefore possible, it is strictly prohibited by ADNI policies and by IRBs, and any attempt to do so by investigators would be in clear violation of these policies.
In summary, the RORR to participants in ADNI has not been pursued to date because ADNI consent forms do not promise that incidental findings will be returned and in some cases are explicit in noting that they will not be returned. It remains to be seen if IRB policies toward return of incidental genomic information will request or require ADNI sites to revisit this question in the years ahead.
ADNI-2 is now about to enter its final year and plans are being finalized for the ADNI-3 proposal to continue and extend this research in important new directions. Tau PET imaging studies, computer-based cognitive assessments, and other new features will be added this year and hopefully carried forward into the next phase of ADNI. From a genetic perspective, these novel phenotypes will be important topics for future association studies. In addition to continued banking, dissemination, and assays on longitudinal genetic biosample data, major plans include analysis of existing baseline and follow-up visit data and similar collection and analysis of new participants to be recruited in the future. Most resources available to the Genetics Core are for sample collection and processing. However, analysis of existing data on the association of genetic variation and key clinical/cognitive, imaging, and biomarker phenotypes is a priority. Longitudinal blood RNA and epigenetic markings (including but not limited to methylation) in DNA will be important and potentially highly informative. As discussed above, a longitudinal convergent “multi-omics” collection and analysis strategy will continue, along with the challenge of finding resources for complete analysis and dissemination of all biosample results. Partnerships with the Alzheimer’s Association, other foundations and donors, and ADNI’s industry partners via the Foundation for the National Institutes of Health have made possible the early ADNI-1 GWAS, the recent WGS on over 800 samples, and the RNA expression profiling using blood samples from most of the same participants.
The ADNI Genetics Core hosts an extended Working Group that holds conference calls monthly or more often as needed. The Genetics Core also sponsors an ad hoc RNA Working Group to support collaborative expression profiling and transcriptome analysis. Further, the Genetics Core has recently established an ADNI Systems Biology Working Group to enhance the potential contributions of this specialized area for analysis of ADNI data and for collaboration on analyses that span multiple data types and cohorts. Thought leaders in systems biology and network medicine from academia, foundations, and industry are engaged in determining the key scientific questions, opportunities, and priorities that these areas of science can contribute to ADNI.
The ADNI Genetics Core has served as a liaison to the relevant consortia including the Alzheimer’s Disease Genetics Consortium (ADGC)/Alzheimer’s Disease Sequencing Project (ADSP), ROS/MAP, AddNeuroMed, World-Wide ADNI (WW-ADNI), Enhancing Neuro Imaging Genetics through Meta-Analysis (ENIGMA), Multi-Institutional Research in Alzheimer’s Genetic Epidemiology (MIRAGE), IMAS, and HRS investigators, as well as other cohort studies that could serve as replication samples or collaborate in other synergistic research activities. Various inquiries about ADNI genetic data within and outside of ADNI have been addressed to encourage the individual and collaborative investigation of ADNI genetic data. Current plans include direct comparisons between ADNI and the autosomal dominant form of AD seen in the Dominantly Inherited Alzheimer Network (DIAN) Consortium. One limitation of ADNI genetic analyses is the absence of large data sets for replication with the vast array of imaging and fluid biomarkers available in ADNI. To date, some studies have utilized local data sets not publically shared, as well as some partially publically available data sets. Additional open data sharing in the future will assist in providing replication samples for ADNI genetic analyses. Further, when genetic data become available for the Australian Imaging, Biomarker & Lifestyle Flagship Study of Ageing (AIBL) Study and other publically available cohorts in the WW-ADNI consortium there will be tremendously enhanced power to detect associations and replicate ADNI genetic findings.
As promising leads are discovered using ADNI genetic and phenotypic data, there is potentially very high scientific and translational value to be gained from follow-up with functional genomics experiments (as illustrated in Figure 4). One particularly promising direction for functional genomics follow-on studies is creation of iPSCs from blood or fibroblast samples collected from the ADNI cohort. iPSCs coupled with other clinical and imaging information from the same individual would help to construct “dementia-on-a-chip” models for assessing candidate diagnostic and therapeutic agents. Although it is beyond the scope of ADNI to follow-up all promising leads, the Genetics Core will continue to facilitate and coordinate with partners and the research community to enable these experiments to be conducted and results shared expeditiously. Finally, given the complexity of rich longitudinal “multi-omics” phenotypes, intelligent analytic strategies closely linked to biological knowledge need to be adapted and applied to ADNI data. Fortunately, the computational science community is stepping up to this challenge with dozens of publications employing advanced strategies to interrogate ADNI data and the Genetics Core looks forward to supporting and collaborating with such efforts to extract biologically significant and clinically relevant information from the proverbial haystack of information.
Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen Idec Inc.; Bristol-Myers Squibb Company; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; ; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Medpace, Inc.; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Synarc Inc.; and Takeda Pharmaceutical Company. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern Alzheimer’s Disease Cooperative Study at the University of California, San Diego. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California. Samples from the National Cell Repository for Alzheimer’s Disease (NCRAD), which receives government support under a cooperative agreement (U24AG021886) awarded by the National Institute on Aging, were used in this study. Data analysis was supported in part by NIH R01 LM011360, U01 AG024904, RC2 AG036535, R01 AG19771, P30 AG10133, UL1TR001108, DOD W81XWH-14-2-0151, NCAA 14132004, the Alzheimer’s Association, and NSF IIS-1117335.
5. Supplementary Information: Endnote database of ADNI Genetics Publications (ADNI_Genetics_papers_2008-2014.enlx)
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.