|Home | About | Journals | Submit | Contact Us | Français|
Background: The Consortium on the Genetics of Schizophrenia (COGS) is an ongoing, National Institute of Mental Health–funded, 7-site collaboration investigating the occurrence and genetic architecture of quantitative endophenotypes related to schizophrenia. The purpose of this article is to provide a description of the COGS structure and methods, including participant recruitment and assessment. Methods: The hypothesis-driven recruitment strategy ascertains families that include a proband with a Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition diagnosis of schizophrenia, and at least one unaffected full sibling available for genotyping and endophenotyping, along with parents available for genotyping and (optional depending on age) endophenotyping. The family structure is selected to provide contrast in quantitative endophenotypic traits and thus to maximize the power of the planned genetic analyses. Probands are recruited from many sources including clinician referrals, local National Alliance for the Mentally Ill chapters, and advertising via the media. All participants undergo a standardized protocol that includes clinical characterization, a blood draw for genotyping, and endophenotype assessments (P50 suppression, prepulse inhibition, antisaccade performance, continuous performance tasks, letter-number span, verbal memory, and a computerized neurocognitive battery). Investigators participate in weekly teleconferences to coordinate and evaluate recruitment, clinical assessment, endophenotyping, and continuous quality control of data gathering and analyses. Data integrity is maintained through use of a highly quality-assured, centralized web-based database. Results: As of February 2006, 355 families have been enrolled and 688 participants have been endophenotyped, including schizophrenia probands (n = 154, M:F = 110:44), first-degree biological relatives (n = 343, M:F = 151:192), and community comparison subjects (n = 191, M:F = 81:110). Discussion: Successful multisite genetics collaborations must institute standardized methodological criteria for assessment and recruitment that are clearly defined, well communicated, and uniformly applied. In parallel, studies utilizing endophenotypes require strict adherence to criteria for cross-site data acquisition, equipment calibration and testing and software equivalence, and continuous quality assurance for many measures obtained across sites. This report describes methods and presents the structure of the COGS as a model of multisite endophenotype genetic studies. It also provides demographic information after the first 2 years of data collection on a sample for whom the behavioral data and genetics of endophenotype performance will be fully characterized in future articles. Some issues discussed in the reviews that follow reflect the challenges of evaluating endophenotypes in studies of the genetic architecture of endophenotypes in schizophrenia.
Decades of research have shown that genetic factors play a key role in the pathogenesis of schizophrenia (eg, Faraone et al,1 Gottesman and Shields,2 Braff and Freedman3). Indeed, replicable linkage for schizophrenia has been reported at a number of chromosomal locations (eg, Badner and Gershon,4 Baron5). However, identification of a specific schizophrenia susceptibility gene, or a set of genes, remains elusive. One strategy that holds promise for understanding the genetic architecture of this heterogeneous disorder is the analysis of discrete and neurobiologically relevant “endophenotypic” abnormalities rather than relying on the noisy and heterogeneous clinical phenotype of schizophrenia.3,6–10
The Consortium on the Genetics of Schizophrenia (COGS) is a 7-site National Institute of Mental Health (NIMH) sponsored collaboration seeking to understand the genetic basis of 6 primary and multiple secondary candidate endophenotypes, including a neurocognitive battery,11,12 in schizophrenia patients and their relatives. The study is unusual in both its national scope and focus on the collection of extensive endophenotype measures in families. The 7 participating sites are Harvard University, Mount Sinai School of Medicine, University of California Los Angeles, University of California San Diego (central administrative site), University of Colorado, University of Pennsylvania, and University of Washington. The COGS was designed to enable the reliable collection of endophenotype measures across all 7 sites so as to maximize the power of genetic linkage and association studies. This article outlines the rigorous selection and quality assurance (QA) procedures that were considered essential for a successful multisite, endophenotype-based neuropsychiatric genetic study.
The COGS endophenotype battery includes 6 primary neurophysiologic and neurocognitive measures that have been reported to exhibit quantitative neurobiological deficits in schizophrenia patients and their first-degree biological relatives. The neurophysiological measures include P50 event-related potential suppression,13–15 prepulse inhibition (PPI) of the startle response,16–18 and the antisaccade oculomotor task.19,20 Neurocognitive measures include the Continuous Performance Test (CPT21–23), tests of verbal memory (cf, Braff et al,17Cirillo et al,24 Faraone et al25), and working memory (cf, Perry et al,26 Conklin et al,27 Gold et al,28 Park and Holzman29). In addition, a computerized neurocognitive battery (CNB), developed at the University of Pennsylvania, was included to characterize the participants and to provide additional endophenotypes.11,12
The 6 candidate endophenotype measures have been selected based on empirical evidence that they fulfill 6 criteria: (1) they show moderate to large effect sizes between schizophrenia patients and control participants, (2) clinically unaffected relatives also have deficits compared with normal controls, (3) these endophenotypes have adequate psychometric properties of heritability, stability, and between-site reliability, (4) in the extant literature, the measures have a known or partially explicated neurobiological substrate (usually assessed via animal model and functional brain imaging studies) that is relevant to schizophrenia, (5) the range of scores reported on these measures is sufficiently broad for control participants, clinically unaffected family members, and schizophrenia patients to support using them to test genetic hypotheses, and (6) at the time of the study's design, medication effects were not known to irretrievably obscure schizophrenia-normal differences and effect sizes. Other considerations include the ease with which uniformity of training and administration (eg, manuals, videotapes, and hands-on training in groups) of these endophenotypes can be achieved in a large, multisite study. The COGS will evaluate the unique and shared genetic vs environmental determinants of these endophenotypes. Recruitment and research design goals are to assess at least 420 clearly defined schizophrenia pedigrees (1680 participants) and 525 community comparison subjects (CCSs) over 5 years. Since the start of the COGS project, additional more specific measures of verbal memory have been added across all sites. In addition, site-specific pilot data from brain imaging, mismatch negativity, and functional outcome measures have been obtained for evaluation as new candidate endophenotypes.
Successful multisite collaborations, especially those that involve complex endophenotyping, require careful standardization and continuous quality-assured administration of measures across sites. The COGS involves standardized clinical characterization, collection of neurophysiological and neurocognitive data, and genotyping from a sample of probands, their first-degree relatives, and CCSs collected across 7 sites. Before initiating subject testing and data collection, 6 months of consortium-wide standardization of equipment and test protocols, training and QA efforts, and database development took place. The purpose of the current report is to (1) provide a detailed description of COGS recruitment, clinical and endophenotype assessment, and continuous QA methods and (2) present the preliminary sample characteristics of participants from whom endophenotype data and DNA have been collected in order to guide other, similar efforts and to serve as a definitive point of reference for COGS publications. Specific data and analytic issues, such as cross-site reliability of endophenotypes and statistical analysis of family data, are addressed in several COGS empirical articles that are in various stages of submission or publication (eg, Greenwood et al,30 Horan et al,31 Radant et al32). A separate article that details the statistical analytic approach for DNA analysis is included in the current issue of Schizophrenia Bulletin.33 Finally, because the COGS family DNA will eventually be available to other investigators through the NIMH data-sharing process, we aimed to provide a description that can be used as a resource for those using the COGS database in the future.
The COGS infrastructure, governance interaction among sites, and data flow were planned in detail over a 2-year period commencing before the essentially identical 7 individual COGS R01′s were submitted to the NIMH and reviewed by the Genomics Review Committee. Moreover, recruitment, diagnostic assessment, and endophenotype methods were standardized using in-person meetings, written manuals, videotapes, and regularly scheduled teleconferences.
The COGS is organized into 7 core units that work interactively to collect, process, and analyze data.
The Director's Unit at University of California San Diego (UCSD) (David L. Braff, Director) provides scientific leadership, administrative support, and coordination of information across all sites. The Statistical Genetics Core (Nicholas J. Schork, Director), also located at UCSD, provides specialized expertise and data analysis for the statistical genetics component of the consortium. The Bioinformatics and Data Core at Univeristy of California Los Angeles (UCLA) (Jim Mintz, Director) is responsible for constructing and maintaining the secure web-based COGS data warehouse and for providing methodological and statistical consultation to COGS investigators. The Clinical Core Committee (CCC), comprised of one faculty-level representative from each site, oversees the recruitment and clinical assessment of research participants and assists in the development of the clinical components of the COGS database. The Endophenotype Committee (EPC), comprised of the senior QA scientists at each site, oversees procedures for accurate collection, data transmission, and analysis of the 6 primary endophenotypes, secondary phenotypes, and the Penn Neurocognitive Battery measures. The chairperson of the Clinical and Endophenotype Cores rotates annually among investigators at the 7 sites. The General Investigators Meeting allows for the integration and discussion of all issues, especially those raised by the CCC and the EPC. The Principal Investigators Committee (PIC) is the final decision-making body regarding policies, procedures, and publications.
Thus, investigators and research staff participate in 4 separate teleconferences, each held twice per month to (1) coordinate and evaluate recruitment and assessment progress and procedures (CCC), (2) discuss and provide continuous QA of the 7-site endophenotyping endeavor (EPC), (3) integrate the CCC and EPC recommendations and discuss COGS-wide issues with faculty and staff (Investigators Committee), and (4) review overall site performance, set policies and publication plans, and resolve administrative issues (PIC). Training and development is an important mission of the COGS. In general, the COGS structure and practice is designed to enhance the training and facilitate authorship contributions of junior faculty, who often serve as site coordinators and/or fill other crucial roles. All procedures are approved by the local institutional review boards (IRBs) at each institution.
Figure 1 provides an overview of the flow of COGS data. Participants at each COGS site undergo detailed clinical assessment and collection of all 7 endophenotypes. Blood samples for DNA analysis are sent from the local site to the Rutgers University Cell and DNA Repository using standardized collection and shipping procedures.
Data management depends on a constant iterative relationship between the 7 sites and the UCLA Data Core. All data are uploaded to a rigorously “firewalled,” password-protected, web-based data warehouse developed collaboratively by the COGS investigators and maintained by the Central Data Core at UCLA (http://www.npistat.com/cogs/). Backup copies of these data are stored locally at all 7 sites, and all data are deidentified before electronic transmission to the Central Data Core. Clinical assessment data and endophenotype data are uploaded separately. Once the information has been transmitted to the Central Data Core, the QA scientist responsible for each endophenotype regularly downloads all new endophenotype data collected on that measure across all sites. The QA scientists review and score the endophenotype data according to procedures established for each particular measure. Any data questions that arise during this QA process are reviewed with the personnel at the site at which the data were collected. Separation between clinical and endophenotype information ensures that QA scientists remain blinded to diagnosis and other clinical information at the time of data scoring.
Data from the Penn CNB are transmitted electronically from the administrator's laptop to a central repository at the University of Pennsylvania, with immediate e-mail confirmation of receipt to the local administrator, site coordinator, and key PENN personnel. In addition to error messages generated by the computerized scoring program, a PENN research team member assesses the validity of each submitted battery and addresses any questions to the administrator and site coordinator. A weekly tabulation of completed batteries, including weekly and cumulative totals, is e-mailed to each site and to the Central Data Core at UCLA.
After the QA review and scoring are completed, the scored data from each endophenotype and from the CNB are again uploaded to the central data warehouse. Finally, the scored endophenotype data are merged with the clinical data to create a composite COGS data set. The composite data set is updated and distributed to principal investigators (PIs) and site coordinators at each site on a monthly basis. Primary data analyses are conducted using this composite data set.
From the initial (prefunding) meetings, the COGS Investigators were focused on the absolute need for uniformity in equipment, experimental procedures, physical environments of the testing rooms, experimenter behavior, and an array of other factors (see below). To further ensure data QA, all equipment is calibrated monthly in a standardized fashion, and backup equipment is available to each site in case a malfunction is detected.
Diagnostic interviewers learn to competently administer the clinical assessment tools using a standardized training procedure developed by a clinical psychologist (Monica E. Calkins) with extensive training and experience in semistructured interviewing. Before the start of the study, diagnostic interviewers met for a 4-day training workshop at the Central Site at UCSD. When new interviewers are hired, training is supervised by senior clinical site coordinators who implement the training protocol. The fundamental goals of the protocol are to enhance and facilitate the clinical skill and knowledge of psychopathology required to expertly administer semistructured interviews. Accordingly, the protocol consists of didactic sessions, observation, and supervised practice. Ten videotaped didactic sessions cover major psychopathology and differential diagnosis, particulars of semistructured interviewing, interview documentation practices, and item-by-item review of assessment instruments, rating scales, and COGS-specific operational definitions. Contemporaneously, each trainee practices the interview modules with mock participants and observes interviews conducted by senior interviewers. Finally, trainees engage in supervised practice with at least 3 participants and, subsequently, obtain ongoing regular supervision by a senior clinician.
All 6 endophenotypes have an assigned QA site that reviews and scores data coming from all 7 COGS sites. To ensure that each site is testing according to protocol, UCSD hosted an initial 4-day training workshop attended by testers from all 7 sites along with all 6 endophenotype faculty QA experts prior to initiation of the study. Training materials for each endophenotype include written procedure manuals and instructional videos demonstrating step-by-step administration of the protocol generated at the yearly COGS-wide training sessions. The procedure manuals include scripts for task instructions, which must be stated verbatim to each participant by each tester, and equipment settings. Each site maintains a library of these materials for training of current and newly hired staff. Furthermore, “retraining” is held at UCSD on an annual basis to minimize experimenter drift in methods and train newly hired testers. During these extensive training workshops, each QA expert reviews the endophenotyping methodology and observes all testers as they setup and administer each test, thereby approving these individuals and their respective sites as “COGS certified.”
In addition to the training workshops, the COGS Project Coordinator (Andras Kovach) visits all 7 COGS sites every year in order to ascertain testing procedures across sites. His role is to ensure the standard and consistent practice of administering the endophenotypes according to the original protocol and to inspect the endophenotype equipment. During each 2-day visit, he spends 1 day acting as if he is a naïve CCS and then observes an actual CCS being tested. He compares tester performance during both days to QA-approved checklists on proper protocol and reports any deviations from the protocol of testing or calibration to the QA site for their advice and action. He works with the site PIs and with the faculty QA experts for each endophenotype to ensure that all sites strictly follow COGS testing guidelines.
It is thus a complex and challenging endeavor (for this and other, similar studies) to uniformly collect and ensure the quality of endophenotypes. At the outset, some sites had little or no experience with some endophenotypes. Consequently, the group training, video tutorials, and manuals had to be precisely followed. The yearly retraining and on-site visits yielded very high uniformity and some surprises that cost us lost data for some endophenotypes over short time periods. Slight deviations in instructions and procedures had to be uniformly corrected. For example, deviations in the placement of visual fixation points can cause oculomotor discharge that can alter blink startle measures. Fortunately, with the combination of rigorous video and manualized retraining, onsite inspections, and yearly retraining workshops, projects of this kind can be “quality assured.” The initial “data-based” COGS articles include analyses of site differences to assess the comparability of measurement across sites.
Schizophrenia probands have been recruited from clinics/clinician referrals (37%), local chapters of the National Alliance for the Mentally Ill (16%), Web sites (http://www.schizophrenia.com, http://www.SchizophreniaResearch.net, local sites; 9%), and various other sources (38%) including the media, local flyers, and referral from other studies. CCS are recruited primarily through newspaper advertisements (32%), Web sites (25%), local flyers (24%), and various other sources (19%), including referrals from previous participants and other studies. Once identified, potential participants undergo semistructured screening, either in person or on the telephone, the purpose of which is to provide an initial informed determination regarding subject eligibility for study participation. Information obtained in the screening assessment includes referral source, demographic data, current clinical symptoms, brief medical history, family structure, and family history of mental illness. Permission to review relevant medical records is also obtained at this time. Persons identified as potentially eligible are invited to participate in the more detailed, standardized clinical assessment as described below.
All participants who are endophenotyped are between the ages of 18–65 and able to understand and provide informed consent. Parents older than 65 have their blood drawn for genotyping purposes but do not routinely undergo endophenotyping. Eligible families fulfill the following criteria: (1) a proband meets Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV) criteria for schizophrenia as determined by the Best Estimate Final Diagnosis (BEFD) procedures described below, (2) both biological parents are available for genotyping, and (3) at least one full sibling who is unaffected with schizophrenia is available for endophenotyping and genotyping. First-degree biological relatives include parents and siblings of the proband. Examples of eligible and ineligible family pedigrees are depicted in figure 2. This family configuration was selected to optimize the study of quantitative phenotypic variation, in which contrasts between individuals with high and low phenotype values generate greater power.34–37 Probands having only one available parent but 2 or more available siblings (with at least one unaffected by schizophrenia) are also included, as are probands having no available parents but 3 or more available siblings (with at least one unaffected by schizophrenia), though these configurations are less powerful for the proposed analyses (cf, Ott38). Multigenerational (extended) families are included when an additional relative (eg, nephew or child of the proband) meets criteria for schizophrenia, and the proband has living parents and at least one unaffected sibling available (see figure 2).
Having met these basic eligibility requirements, participants are excluded if they fulfill any of the exclusion criteria presented in table 1. To minimize genetic heterogeneity, families that include only a proband with schizoaffective disorder (either bipolar or depressed subtype) are not included. Additionally, families with evidence of “parental bilineality” (ie, both parents with schizophrenia) or no “contrast” in sibships (all sibs have a diagnosis of schizophrenia) are not included (see figure 2). Again, the family structure criteria were developed to maximize power for quantitative genetic analyses of the endophenotypic measures.
If subjects fulfill the basic eligibility requirements described above, all available first-degree family members complete diagnostic and endophenotype testing regardless of comorbid medical and psychiatric conditions. CCS, who serve as a reference group against which to gauge performance of probands and family members, participate in all components of testing, including diagnostic and endophenotype assessment, and blood draw. To parallel psychiatric comorbidity in relatives of probands, nonpsychotic axis I psychopathology is accepted in approximately 30% of CCS, but clinical stability and/or remission is required. Although relatives and CCS with medical and psychiatric conditions are included in the sample, data are coded to allow subset analyses restricted to medically and psychiatrically healthy participants (eg, Calkins et al19). Data on these subjects provide the opportunity to test the influence of clinical conditions that may affect endophenotype performance and provide a comparison sample for relatives who are not affected by schizophrenia but are afflicted with other psychiatric conditions.
The assessment protocol is typically completed over the course of 2 days, which are scheduled in as close temporal proximity as possible. The first session consists of the clinical assessment battery and blood draw, and the second includes the endophenotype assessment battery.
All participants undergo an extensive, standardized assessment protocol that was developed to provide a comprehensive evaluation of psychotic, mood, and substance-related disorders. The assessment procedure includes the following components.
The Diagnostic Interview for Genetics Studies (DIGS), which was developed by the NIMH Genetics Initiative for genetic studies of schizophrenia and mood disorders, provides for a detailed assessment of psychotic, mood, and substance-related symptoms, which in turn allows for the reliable differential diagnosis of related disorders.39 In addition, it provides self-reported demographic and medical history data. Several instruments embedded within the DIGS allow for Mini-Mental Status Examination, functional assessment (Global Assessment Scale, with standardized anchors40), clinical symptom ratings via the Scale for the Assessment of Positive Symptoms (SAPSs41) and the Scale for the Assessment of Negative Symptoms (SANSs42), and assessment of schizotypal and other axis II cluster A personality features via a modified version of the Structured Interview for Schizotypy.43
The Family Interview for Genetic Studies (FIGS), also developed by the NIMH Human Genetics Initiative,44 is a method of systematically gathering information about psychiatric disorders in the family from an informant. Although this method has low sensitivity compared with direct assessment of each family member, its specificity is high.1 In the COGS, FIGS information is gathered independently from all available family members, resulting in the creation of a composite pedigree that is informed by multiple participants. FIGS administration involves drawing a family pedigree—which is sent to the Statistical Genetics Core—obtaining psychiatric information about relatives in the pedigree, and following up on endorsed symptom categories, including depression, mania, substance-related disorders, psychosis, and cluster A personality disorders.
This scale from a version of the Brief Psychiatric Rating Scale developed by the UCLA Clinical Research Center's Diagnosis and Psychopathology Unit is administered to assess symptoms of anxiety spectrum disorders. It provides a standardized method of eliciting descriptions about and rating the severity of anxiety symptoms, including apprehension, tension, fear, panic, or worry.
After gathering all sources of information, including available medical records, the interviewer prepares a comprehensive narrative summary of the results of the diagnostic assessment. The narrative provides behavioral observations along with an integrated timeline of the onset, duration, description, and temporal relationship of symptoms.
For each participant, the BEFD process begins at the point of initial contact, where screening information and informal observations are recorded by recruiters. Following the diagnostic assessment battery for all participants, all available sources of information, including DIGS (and its embedded instruments), FIGS, rating scales, narrative summary, and medical records are reviewed by at least 2 faculty-level clinicians from the site assessing the participant. The clinicians then assign DSM-IV lifetime BEFD based on consensus review of these materials. Prior to the beginning of data collection, a subcommittee of clinicians reviewed and operationalized standardized diagnostic criteria for major disorders. The criteria were then discussed and approved by the PI's and are taught to new staff during their training. In addition to routine communications among the CCC at biweekly meetings, these DSM-IV based criteria ensure that perceived ambiguities in DSM-IV criteria are handled in a uniform manner across sites. A pilot study of intersite reliability of BEFD for 5 cases across all 7 sites, in which each site reviewed identical assessment materials, yielded 100% agreement for the BEFD of schizophrenia.
The endophenotype and neuropsychological battery is administered in one of 2 fixed orders (1 through 6 from the numbered tests below or 6 through 1) that take a total testing time of 4–6 hours. A break is typically scheduled between the neurophysiologic and the neurocognitive testing batteries to help minimize subject fatigue. For each of the 6 endophenotypes, a single primary outcome measure is used for subsequent analyses, based on prior evidence of maximal group separation and relationship to specific clinical features. In addition, secondary measures are obtained from the test session itself (eg, blink latency in the PPI session) or via “extension” of the tests (eg, smooth pursuit eye movement, following the antisaccade testing). Finally, each site can “recall” subjects for additional testing, such as neuroimaging or other EEG testing, per their locally approved IRB consents. The endophenotype assessment battery includes the following measures:
From the start of testing in the fall of 2003 until February 2006, 355 families were enrolled, 792 participants were interviewed, and 688 participants were endophenotyped. Endophenotyped participants included 154 probands, 343 first-degree relatives (n of siblings = 209, n of parents = 134), and 191 CCS. Each of these individuals completed at least one endophenotype, with the majority completing all endophenotypes. Reasons for missing data and criteria for removing data from any particular endophenotype are described in individual endophenotype articles. This report presents demographic information on the preliminary sample because recruitment and testing of participants are ongoing.
Table 2 presents a summary of families according to the number of first-degree relatives who were endophenotyped as of February 2006. In 159 families, ≥1 member completed the endophenotype assessment. Unaffected sibling pairs will be critical to COGS genetics analyses. In 5 families, the proband had a sibling who was also affected with schizophrenia. These families, however, all included at least one other unaffected sibling who had completed endophenotype assessment, so the “contrast outcome” requirement (at least one sibling with and one sibling without schizophrenia) was met. Of the families with endophenotypes, 21 did not yet have a sibling who had completed the battery (see table 2, Row 1). Thus, at the time these data were downloaded, there were a total of 138 families with ≥1 unaffected sibling who had been endophenotyped (see table 2, Rows 2–7).
Basic demographic characteristics of endophenotyped participants are presented in tables 3 (age, education, parental education, and sex) and 4 (ethnicity). Preliminary analyses indicate that relatives are significantly older than CCS and patients, who do not differ from each other. Importantly, siblings do not differ in age from CCS. Thus, in order to evaluate the potential impact of age on endophenotype performance in relatives, we will be able to compare the subgroup of siblings to the CCS, in addition to the standard approach of using age as a covariate. However, because the parents of the probands tend to be older than the CCS, the COGS is enhancing recruitment of older CCS who will be better matched in age to parents of the probands.
As expected, probands have achieved less education than CCS and relatives. Notably, relatives and CCS do not significantly differ in their years of education (see table 3) making this direct relative-CCS comparison straightforward regarding possible education effects. Because participant's educational attainment is associated with illness status (eg, Isohanni et al60), some COGS endophenotype analyses will use education of the most educated parent rather than of the participant as an analytic variable. Relatives and CCS are well matched in both personal and parental education, indicating that educational disparities are unlikely to account for any endophenotype deficits observed in relatives.
As anticipated, the groups are not balanced by sex. The proband group is disproportionately male compared with the relative group and CCS who do not differ from each other. Our recruitment efforts have therefore been altered to recruit more male controls. Sex balance is important so that any observed endophenotype impairments will not be attributable to varying distributions of males and females.
The sample is composed of participants from many ethnicities, with the majority of European descent. see (table 4) Because ethnicity itself is a complex, sometimes subjective, and often misunderstood construct, our statistical genetics strategy will account for any final observed group × ethnicity interactions.61,62 Individual articles reporting analyses of endophenotypes will evaluate the existence of any group differences in these demographic variables at the given point of time. Where relevant for particular endophenotypes, demographic variables will then be entered as factors or covariates in the analyses.
Table 5 presents lifetime BEFD in each participant group. The most common comorbid conditions among probands are substance-related disorders, both abuse and dependence. Approximately 4% of relatives were diagnosed with schizophrenia spectrum disorders (ie, schizophrenia, schizoaffective disorder, psychotic disorder not otherwise specified, and schizotypal and paranoid personality disorders). In both the relative and the CCS groups, substance abuse, substance dependence, and major depressive disorder (MDD)–single episode each affect between 7–10% of participants. Thus, relatives and CCS in this sample exhibit similar patterns of axis I nonschizophrenia spectrum psychopathology, except that more relatives (12%) than CCS (5%) have experienced MDD-recurrent. Note, however, that unlike relatives, potential CCS would have been excluded if they had current substance abuse (past 1 month) or dependence (past 6 months) (see table 5). Again, where relevant for particular endophenotypes, conditions such as mood disorders and substance-related disorders, as well as psychotropic medications, will be handled as factors or covariates in the analyses.
In this preliminary analysis, probands had a mean (SD) total SANS score of 10.1 (5.9) and a SAPS score of 7.2 (3.9). Global assessment of functioning for the past month among the 3 groups was probands (mean = 44.8, SD = 12.8), relatives (mean = 78.9, SD = 13.5), and CCS (mean = 82.4, SD = 9.8). More detailed clinical data will be presented in future articles.
Multisite genetic collaborations, especially these involving multigenerational families, require assessment and recruitment methods that are precisely defined and uniformly applied. These requirements sound straightforward, but in practice entail a great deal of planning and thought, standardization of training and methods, and ongoing retraining to attain cross-site QA. Figure 3 provides a summary of the key components of the multisite collaboration. This article presents preliminary demographic findings on a carefully diagnosed 7-site sample of probands, families, and normal comparison participants for endophenotype performance and genetic substrates, which will both be fully characterized in current and future reports. In the COGS cohort, parental DNA, the ascertainment of a sibship with diagnostic “contrast,” and sibs who are endophenotyped and genotyped are the hallmark minimal requirements for family eligibility. After over 2 years of active recruitment, 688 participants have been successfully evaluated and endophenotyped, including 138 families with at least one sibpair discordant (contrasting) for schizophrenia, and a corresponding sample of 191 CCS. DNA has been collected for genotyping on all participants.
As emphasized throughout this article, the COGS has been designed and organized in such a way as to facilitate the reliable collection of refined endophenotype measures across all 7-subject ascertainment and clinical centers. The design and goals of the COGS are thus quite different in orientation from other large-scale genetic consortia sponsored by the National Institutes of Health (NIH), which often focus on integrating information provided by participating study sites using very different endpoints and study designs or consider the collection of families with a single disease using common, accepted diagnostic criteria and/or perhaps a few additional clinical or neurocognitive parameters collected in a uniform manner. In this light, the COGS is unique among large-scale NIH-sponsored genetic consortia in that it focuses on collection of multiple subclinical measures—that are difficult enough to assay at any single site—in a uniform manner across all data collection sites so as to maximize the power of genetic linkage and association studies.
Because the central aim of the COGS is to explore the familiality and genetics of quantitative endophenotypic traits, sufficient contrasts between individuals with high and low phenotypic variation will generate adequate statistical power for informative genetic analyses.34,37,63–66 This approach is distinct from the strategies used for studying a single qualitatively defined disease, where affected sibling pairs and multiplex families provide the most efficient and powerful design for allele-sharing linkage analysis.67 When studying quantitative phenotypes underlying or contributing to qualitative disease outcomes like schizophrenia, it is important to include families that do not exclusively have affected individuals because, if impaired performance on these phenotypes truly associate with schizophrenia, one would actually reduce the variation exhibited by this trait in the samples and thereby reduce power for studying the determinants of the quantitative phenotypes.
However, the requirement of relatively intact families could lead to the ascertainment of a group of families that is perhaps “healthier,” or at least better connected, than those participating in other studies. It is possible that intact families (especially probands and their siblings) are not representative of “all” schizophrenia kindred (eg,Horan et al31). For example, perhaps these volunteer families carry less of a “paranoid” or “suspicious” genetic load, which could make positive findings less likely and therefore reduce false positive (or increase false negative) results. Similarly, families willing to participate in a lengthy diagnostic and testing protocol, which can be tedious, may be different from families in other schizophrenia family studies that are not as demanding of time and effort. To the extent that such differences are associated with endophenotype performance, participants from these families may not show the typical expected pattern of endophenotype impairments observed in families recruited with other strategies. Future analyses will address the impact of this ascertainment strategy on clinical features and endophenotype performance of COGS participants in comparison with those from other family genetic studies.
The inclusion of CCS in the COGS provides the additional comparative data necessary for evaluating the grouped results of neurophysiological and neurocognitive tests. CCS both with and without a lifetime history of nonpsychotic mental disorders are included in the COGS. While our field continues to struggle with the ideal criteria for selecting comparison subjects (eg, Kendler68), we believe that the COGS criteria provides a sufficient number of CCS diagnosed with psychiatric conditions to allow both (1) meaningful subanalyses of the impact of comorbid diagnoses on endophenotype measurements and (2) direct comparisons of the CCS group to the relatives, a portion of whom also have psychiatric conditions.
There are several other challenges and unique opportunities available to the COGS. First, while the minimum COGS family structure requires participation of a discordant sibpair and 2 living parents, additional siblings are not always available or willing to participate. If these nonparticipating siblings share relevant clinical characteristics that cluster with endophenotype abnormalities (eg, schizophrenia and schizotypy), differences between relatives and CCS may be more difficult to detect. Fortunately, the availability of FIGS data on nonparticipating relatives will enable an estimate of the magnitude of these potential biases. Second, the cross-sectional design may lead to diagnostic misclassification, especially among young adult siblings of affected probands who are still at high risk for developing schizophrenia. This concern is addressed in part by the comprehensive assessment of subsyndromal and schizotypal symptoms in subsections of the DIGS, and by the strategy of using neurobiological endophenotypes, rather than clinical phenotypes, as the trait of interest in genetic analyses. Finally, in the interest of efficiency, the structured portion of the DIGS interview has been modified slightly to emphasize the major diagnoses that are relevant to schizophrenia. Thus, some diagnoses (eg, phobias) may be less thoroughly assessed than others (eg, major depression, schizophrenia, bipolar disorder, and substance-use disorders).
In summary, the COGS has established a rigorous infrastructure for participant recruitment, clinical characterization, measurement of multiple endophenotypes, data management, and ultimately genotyping. The structure and function of this endeavor has and will serve as a model for other endophenotyping consortia for medical and neuropsychiatric disorders. This multisite study of patients with schizophrenia, their family members, and CCS will help to clarify the genetic architecture of quantitative endophenotypes underlying schizophrenia susceptibility, as described in other articles of this Special Issue of Schizophrenia Bulletin and in the first generation of COGS original data papers cited in those reviews.
This work was supported by the NIMH. We are grateful for the generosity of time and effort by the families who make this COGS research possible. We thank the following key personnel for their dedicated efforts to the COGS: Harvard University (RO1-MH065562; MH43518; Commonwealth Research Center of the Massachusetts, Department of Mental Health): Lynda Jacobs, Monica Landi, Erica Lee, Andrea Roe, Frances Schopick, and Alison Thomas; Mount Sinai School of Medicine (RO1-MH065554): Leigh Baitler, Rui Ferreira, Robert Fieo, Frances Schopick, Christopher Smith, and Rebecca West; University of California Los Angeles (RO1-MH65707): William Horan and Mark Sergi; University of California, San Diego (R01-MH065571): Andras Kovach, Joyce Sprock, Katrin Meyer-Gomes, Barbara Haugeland, Kari Tweedale, Sheldrick Holmes, and Emmeline Crowley; University of Colorado (RO1-MH65588): Jamey Ellis, Jeff Hollis, Vicki Pender, Bernadette Sullivan, Bettye Clement, Christopher Cason, and Alexis Ritvo; University of Pennsylvania (RO1-MH65578): Felipe Da Silva, Alexandra Duncan Ramos, Jarrod Gutman, CarlaAnn Henry, Paul Hughett, Jennifer Jackson Greene, Adrienne Mishkin, J. Daniel Ragland, Leslie Ramsey, David Rice, Jan Richard, Devon Seward, and Robert Witalec; University of Washington (R01-MH65558): Kate B. Alvey, Andrew C. David, Sean P. Meichle, and Denise O. Pritzl. Portions of this article were presented at the American College of Neuropsychopharmacology Meeting, December 2005, Waikoloa, Hawaii.