Search tips
Search criteria 


Logo of nihpaAbout Author manuscriptsSubmit a manuscriptHHS Public Access; Author Manuscript; Accepted for publication in peer reviewed journal;
Biol Psychiatry. Author manuscript; available in PMC 2013 October 15.
Published in final edited form as:
PMCID: PMC3437244

Genomewide Linkage Analysis of Obsessive Compulsive Disorder Implicates Chromosome 1p36

Carol A. Mathews, M.D,1 Judith A. Badner, M.D., Ph.D.,2 J. Michael Andresen, Ph.D.,3 Brooke Sheppard, B.S.,1 Joseph A. Himle, Ph.D.,4,5 Jon E. Grant, M.D.,6 Kyle A Williams, M.D.,6 Denise A. Chavira, Ph.D.,7 Amin Azzam, M.D., M.A.,1 Maxine Schwartz, B.S.,1 Victor I. Reus, M.D.,1 Suck Won Kim, M.D.,6 Edwin H. Cook, M.D.,8 and Gregory L. Hanna, M.D.4



Obsessive compulsive disorder (OCD) has a complex etiology involving both genetic and environmental factors. However, the genetic causes of OCD are largely unknown, despite the identification of several promising candidate genes and linkage regions.


Our objective was to conduct genetic linkage studies of the type of OCD thought to have the strongest genetic etiology (i.e., childhood-onset OCD), in 33 Caucasian families with ≥2 childhood-onset OCD-affected individuals from the United States (US) (N=245 individuals with genotype data). Parametric and non-parametric genome-wide linkage analyses were conducted with Morgan and Merlin in these families using a selected panel of single nucleotide repeat polymorphisms (SNPs) from the Illumina 610-Quad Bead Chip. The initial analyses were followed by fine-mapping analyses in genomic regions with initial heterogeneity LOD (HLOD) scores of ≥2.0.


We identified five areas of interest (HLOD score ≥2) on chromosomes 1p36, 2p14, 5q13, 6p25, and 10p13. The strongest result was on chromosome 1p36.33-p36.32 (HLOD=3.77, suggestive evidence for linkage after fine-mapping). At this location, several of the families showed haplotypes co-segregating with OCD.


The results of this study represent the strongest linkage finding for OCD in a primary analysis to date, and suggest that chromosome 1p36, and possibly several other genomic regions, may harbor susceptibility loci for OCD. Multiple brain-expressed genes lie under the primary linkage peak (approximately 4 mb in size). Follow-up studies, including replication in additional samples and targeted sequencing of the areas of interest, are needed to confirm these findings and to identify specific OCD risk variants.

Keywords: linkage, genomewide, pedigree, obsessive-compulsive, genetics, multigenerational


Obsessive compulsive disorder (OCD) [MIM 164230] is a common neuropsychiatric disorder consisting of repeated, distressing ego-dystonic thoughts (obsessions) and behaviors (compulsions) with a world-wide prevalence of 1–3%(14). Family, segregation, and twin studies clearly demonstrate that OCD is familial, with estimated heritabilities for obsessive-compulsive (OC) symptoms of 27% to 47% for adults and 45% to 65% for children(518). Genetic epidemiological studies also indicate that OCD with symptom onset before age 18 has a stronger genetic contribution than OCD with later onset, with a doubling of the OCD risk for first-degree family members in families of probands with childhood-onset compared to adult-onset of symptoms(1118).

Although several promising genes and genomic regions of interest have been identified, clear susceptibility genes for OCD have not yet been demonstrated. Most studies have focused on candidate gene approaches, although four primary genome-wide linkage studies have also been conducted(15, 1928). These studies have identified eleven genomic regions with LOD scores of ≥1.4 on chromosomes 1q, 3q, 6p, 6q, 7p, 9p, 10p, 11p, 14q, 15q, and 19q, most with a broad definition of the OCD phenotype(15, 1928). The strongest linkage finding in a primary analysis of OCD reported to date was on chromosome 15q14 in three Costa Rican families (LOD score=3.13); this region was also previously identified in a Caucasian sample (25, 28). Other than 15q14 and the 9p region identified by Hanna et al, and subsequently examined as a targeted replication in a separate sample(26, 27), no linkage region has been identified in more than one study. Genome-wide association studies (GWAS) for OCD and Tourette Syndrome (TS), a related disorder, have recently been completed. However, in the context of genetic and environmental heterogeneity, multiple approaches are appropriate, and linkage studies continue to play an important role. While GWAS are useful for the identification of common variants with relatively small effect sizes, linkage studies of multiplex families are particularly useful for the identification of rare variants with larger effect sizes that are increasingly believed to underlie a substantial proportion of the risk for complex disorders(29). Individual variants identified via linkage approaches may be family specific, each accounting for a small proportion of the overall variance; however, the genes implicated by these variants will be of interest in multiple samples and populations, likely accounting for a much larger proportion of the overall OCD risk, in addition to providing insights about the biology of OCD.

The aim of this study was to search for genomic regions that potentially harbor OCD susceptibility genes using genome-wide linkage approaches in Caucasian families with childhood-onset OCD.


Sample Collection

The sample consisted of 33 families (245 individuals with genotype data) ascertained for ongoing genetic studies of OCD in the US (Table 1). We included families for whom phenotype data were available for ≥2 OCD-affected individuals (broad or narrow phenotype) and who had ≥ 1 affected (narrow phenotype) and ≥1 unaffected individual with genotype data. Families ranged in size from 4 individuals in two generations to 58 individuals in four generations (examples in Figure 1). Families were ascertained via probands with DSM-IV OCD whose symptoms began before age 18 and who did not have a pervasive developmental disorder, bipolar disorder, schizophrenia, or a primary psychotic disorder. All families were Caucasian ethnicity of European (primarily Northern European) descent. Families were ascertained and collected at the University of California, San Diego and subsequently the University of California, San Francisco (CAM), the University of Michigan (GLH), and the University of Minnesota (SWK). Genome-wide linkage analyses using a less dense set of microsatellite markers in 17 of the 18 Michigan families have been previously reported(24, 26). The study was approved by the Institutional Review Boards of the participating sites. After complete discussion of the study with the participants, written informed consent or assent was obtained; parental permission was also obtained for participants under age 18.

Figure 1
Examples of large, multigenerational pedigrees used in the genome-wide linkage analysis of OCD. Filled black symbols indicate narrow OCD phenotype, half filled black symbols indicate broad OCD phenotype, grey symbols indicate unknown phenotype, and unfilled ...
Table 1
Characteristics of individuals in 33 early-onset OCD families included in the linkage analysis. All families had ≥2 OCD-affected members (narrow or broad definition) with available clinical data, with genotype data available on ≥1 OCD-affected ...

Clinical assessments at UCSF/UCSD and Minnesota were conducted by psychiatrists or PhD-level psychologists specializing in OCD and trained in the research instruments. Clinical assessments at Michigan were conducted by interviewers with at least a master’s degree plus clinical training who were trained to ≥90% diagnostic agreement with the assessment instruments. The primary assessment instruments for all sites included the adult and child versions of the Yale-Brown Obsessive Compulsive Scale (Y-BOCS and CY-BOCS, respectively) (UCSF/UCSD and Minnesota) or the Schedule for Tourette and Other Behavioral Syndromes (STOBS), which includes a modified version of the Y-BOCS (Michigan), complemented with the Diagnostic Interview for Genetics Studies (DIGS) (UCSF/UCSD) or the Structured Clinical Interview for DSM-IV Axis I diagnoses (SCID) (Michigan and Minnesota) for adults, and the Schedule for Affective Disorders and Schizophrenia for School-Age Children (KSADS) (all sites) (3035) (Additional details in Supplement). Because phenotypic data were collected independently and at different time periods by the sites, clinical data collection could not be standardized prospectively. Instead, phenotypes were standardized at the diagnostic level, using a common phenotype matrix across all sites that included two OCD phenotypes, an unaffected phenotype, and an unknown phenotype (see below). Concordance between sites was achieved via a best estimate (BE) consensus approach(36). This approach, which uses all available sources of information (e.g., medical records, clinical interviews, self-report questionnaires, and family history interviews), requires 100% concordance on all elements of the diagnostic criteria, and reduces the phenotypic heterogeneity that may arise from the use of different assessment instruments (details in Supplement).

Two OCD diagnoses were assigned: narrow and broad OCD. A narrow OCD diagnosis was given if the individual met all DSM-IV criteria for OCD. The broad OCD diagnosis encompassed both DSM-IV OCD and subclinical OCD, which was considered present if the individual had clear obsessions and/or compulsions, but did not quite meet the impairment or distress criteria (e.g., OC symptoms taking less than an hour and causing mild rather than moderate to severe distress and/or impairment). The broad definition was designed to capture a robust phenotype that is likely to be etiologically related to OCD, but was not severe enough to meet strict DSM-IV criteria. Participants were considered unknown for both phenotypes if there was a history of thoughts or behaviors suggestive of OC symptoms that met most, but not all criteria for subclinical OCD, or if they were under age 40 and did not have OC symptoms. Individuals with subclinical OCD who were coded as affected for the broad analyses were coded as unknown in the narrow analyses. Participants with no history of any OC symptoms who were ≥40 years old at the time of the interview were classified as unaffected. The mean age of onset of OC symptoms was 8.7, and the mean lifetime worst-ever Y-BOCS/CY-BOCS severity score was 24.0 for the broad phenotype and 24.9 for the narrow phenotype (Table 2). 15% had a co-occuring chronic tic disorder (Tourette Syndrome, chronic motor or vocal tic disorder), and 4% had a co-occurring eating disorder.

Table 2
Clinical characteristics of individuals with either narrow or broad OCD (N=158) in the 33 early-onset OCD families included in the linkage analysis.


DNA extraction was performed from blood or immortalized lymphoblastoid cell lines according to standard procedures. A small number of individuals from the UCSF sample were genotyped using the Illumina Linkage Panel IVb at the UCSF Genome Core Facility (UCSF GCF). The rest were genotyped using the Illumina Human 610-Quad BeadChip at the Broad Institute (Massachussets General Hospital). Data were analyzed for quality control and Mendel errors using GenomeStudio software (Illumina). 540,123 SNPs were retained for analysis for the samples genotyped with the 610-Quad BeadChip. Those samples that were genotyped on the Illumina Linkage Panel 4b had 2,157 markers that overlapped with the Human 610-Quad BeadChip; genotypes for the additional markers were coded as missing.

Statistical analysis

Pedigree relationships were confirmed prior to analysis using PREST and PLINK(37, 38). In two families, pedigree structures were altered to incorporate non-paternities that were identified through these assessments. Parametric and nonparametric linkage analyses were conducted using Morgan version 3.0 and Merlin (details in Supplement)(39, 40). We chose to use both Merlin and Morgan because of the size and complexity of the pedigrees combined with the number of genetic markers available. While Morgan can analyze very large pedigrees, the number of markers that can be analyzed are limited, and must be in linkage equilibrium. In contrast, Merlin controls for the effects of linkage disequilibrium between markers, utilizing all available genotype information, but cannot use all individuals due to the size and complexity of the largest families. PedShrink was used to trim the pedigrees as needed for the Merlin analysis, with priority on trimming uninformative individuals(41).

We used a model-based (dominant and recessive) approach because simulation studies show that formulating a genetic model that approximates the true inheritance may have more power than nonparametric analyses, in part because parametric models can utilize information about unaffected individuals, which is not the case for nonparametric analyses(42). We also conducted non-parametric analyses (details in Supplement) because, for loci with high frequencies and low penetrances, non-parametric models may be more powerful. Because we had three different analytic approaches, each with different strengths, we were able to compare results across approaches, identifying and prioritizing those that were consistent across analyses as the most likely to represent true linkage regions.

The linkage parameters (see Supplement) were chosen to model a relatively rare locus with a large effect size and to reduce the risk of false positives due to phenocopies, given the high degree of bilineality that is seen in OCD families, including ours. We note that power to detect linkage is not sensitive to misspecification of penetrance or allele frequency, but instead is most sensitive to degree of dominance(43). Heterogeneity LOD (HLOD) scores were calculated by allowing the proportion of linked families to vary and estimating the proportion that gave the highest LOD scores for a given region.


Fine-mapping using additional SNP markers from the Illumina 610-Quad Bead Chip was conducted on chromosomal regions where the HLOD scores were ≥2.0 using the model and phenotype that showed the strongest evidence of linkage. All SNPs from the 610-Quad marker panel that were under the linkage peak of interest and had a MAF of >0.25 were identified and used for the Merlin analyses; for the Morgan analyses, this marker set was then pruned for linkage disequilibrium so that only SNPs with a pairwise r2<0.1 were included.

Haplotype analysis

Haplotypes of SNPs pruned for linkage disequilibrium were generated for all linked pedigrees in the genomic region with the highest HLOD score using the “haplotype analysis” command in Simwalk2snp and visualized using Haplopainter(44). Haplotypes that were inherited identical by descent and co-segregated with the OCD phenotype were assessed within each family.

Estimation of genome-wide significance values

Estimates of genome-wide significance values, incorporating both the markers used in the original linkage analyses and those used for fine-mapping, were calculated using 1) simulations and 2) the autoregressive method described by Bacanau(45). Permutations were performed using gene-dropping simulations, as implemented in Morgan and Merlin. For both Morgan and Merlin, 1000 replicates were simulated and each replicate was analyzed with both parametric models and both affection statuses. The significance for each LOD score was assessed by: 1) counting the number of replicates (nr) in which the maximum LOD score exceeded the observed lod score; and 2) calculating the p-value as (nr + 1)/1001. The threshold for genome-wide significant linkage was taken to be the 49th highest LOD score of the 1000 replicates. Criteria for significant genome-wide linkage (occurring in 5% of genome scans by chance) was determined to be LOD of 2.8–2.9 for a single analysis in Morgan (3.1–3.3 in Merlin), and 3.3 considering all four parametric analyses (3.8 in Merlin). It is likely that the thresholds are higher for Merlin because many more markers were analyzed. In comparison, Lander and Krugylak suggested that a LOD score of 3.3 for a parametric linkage analysis of an “infinitely dense” map be considered the threshold for a genome-wide significant result(46). We also used the autoregressive method to generate genome-wide significance thresholds from the data and made a (conservative) Bonferroni correction for the number of genome scans that were done (both parametric and non-parametric). The range of LOD score thresholds for suggestive linkage using this approach was 3.1 to 3.5, and the range of LOD score thresholds for significant linkage was 4.1 to 4.8 (Table S1 in the Supplement).


Genome-wide linkage analysis

We identified eleven chromosomal regions with HLOD scores ≥1.5, a threshold commonly used to identify linkage regions of interest, and five with a HLOD score ≥2 (Table 3). Figures S1–S3 in the Supplement shows the results of the genome-wide parametric and non-parametric analyses. The region with the highest HLOD score was on chromosome 1p36, with a maximum HLOD score of 2.96 using Merlin under the dominant model and broad phenotype (LOD score without correction for heterogeneity = −3.88) and a maximum HLOD score of 2.88 with the dominant model and narrow phenotype (LOD score without correction for heterogeneity = −2.74). The maximum HLOD score in this region with Morgan was 2.66 under a dominant model using the narrow phenotype, and the maximum LOD score in this region using the nonparametric approach was 0.87 at marker rs2377041 with Morgan and 0.93 at markers rs6676961 to rs6677984 with Morgan. Note that the difference between the parametic and nonparametric LOD scores is most likely due to the added information provided by the unaffected individuals in the parametric analyses.

Table 3
Chromosomal regions with LOD scores ≥1.5.

Seventeen of the 33 families showed LOD scores >0 in this region. Only one of the 11 identified linkage regions has been previously reported as potentially harboring OCD susceptibility genes; the linkage region on chromosome 6p25, at ~3Mb, had a maximum HLOD score of 2.56, and is near the 6p25 region identified by Hanna et al at ~5Mb(26). When the six pedigrees that were linked in both the previous and the current studies were excluded, the evidence for linkage in this region remained, although somewhat diminished (Table 4).

Table 4
Finemapping results for chromosomal regions with initial LOD scores ≥2.0.

Individual family LOD scores

Individual family LOD scores for the genomic regions with HLODs ≥2 are shown in Table S2 in the Supplement. The strongest genome-wide individual family LOD scores for the three largest families (shown in Figure 1) were 3.4 for family 1 on chromosome 1p36 (broad phenotype, dominant inheritance), 2.0 on chromosome 18q22.1 for family 2 (narrow phenotype, dominant inheritance), and 1.4 on chromosome 1q31 for family 3 (narrow phenotype, dominant inheritance).

Fine mapping

Fine-mapping analyses were conducted on chromosomes 1p36, 2p14, 5q13, 6p25, and 10p13 (Figure 2). For chromosome 1p36, which had similar high HLOD scores for both the broad and narrow phenotypes in Merlin, we conducted fine-mapping analyses for the narrow phenotype in both Morgan and Merlin and for the broad phenotype in Merlin only, as Morgan gave a HLOD score of only 1.35 for the broad phenotype. For chromosome 2, the HLOD score decreased with the inclusion of additional markers, and for chromosomes 5 and 10, the HLOD score increased in one of the two analyses only. For all other genomic regions, the HLOD scores increased with the inclusion of additional markers for both analyses (Table 4). As in the genome-wide analysis, the highest overall HLOD score was obtained at chromosome 1p36.33 to 1p36.32, with a maximum HLOD score of 3.77 at marker rs897615 using Merlin and 3.08 at marker rs884080 using Morgan (dominant model, narrow phenotype in both) (Figure 2). The confidence interval for this linkage peak using HLOD>2.0 as the cutoff was bounded by SNPs rs884080 to rs7518255 for the Morgan analysis and by SNPs rs4475691 to rs1874266 for the Merlin analysis.

Figure 2Figure 2
Fine-mapping analyses for genomic regions with HLOD scores ≥2.0. Y axis indicates HLOD score; X axis indicates genomic position. Graphs are presented for phenotype, model, and analysis type (Merlin or Morgan) with the highest HLOD score.

Examination of haplotypes

We examined the haplotypes generated by Simwalk2snp in all families linked to chromosome 1p36 using the LD-pruned SNP set. We identified haplotypes that co-segregated with OCD, encompassing the region with the highest genome-wide HLOD scores, in the majority of the linked families. In the largest family, which had a LOD score of 2.9 under the narrow phenotype (LOD = 3.4 under the broad phenotype), 11 of the 14 individuals with the narrow OCD phenotype carried a common haplotype inherited from the founder, along with all four individuals with the broad OCD phenotype and one obligate carrier. In the next largest family, which had a LOD score of 0.8 under the narrow phenotype, five of the seven biologically related individuals with the narrow OCD diagnosis (including the founder) carry a shared haplotype, along with three of the four biologically related individuals with the broad OCD phenotype, and two obligate carriers (Figure S4 in the Supplement). While there was haplotype sharing within each family, we did not identify a haplotype that was shared between families.


In this study we report the results of a genome-wide linkage analysis in multiply-affected pedigrees with childhood-onset OCD. We identifed several genomic regions of potential interest for OCD on chromosomes 1p36, 2p14, 5q13, 6p25, and 10p13. The linkage region on chromosome 1p36.33-1p36.32, which meets genome-wide criteria for suggestive linkage after fine-mapping based on our calculated significance thresholds (HLOD=3.77) and spans 4 Mb, is the strongest linkage finding for OCD reported to date, and the most interesting region that we identified. The majority of the linkage signal on 1p36 comes from our largest family, however, 16 other families also contributed to the LOD score, and had haplotypes that co-segregated with either the narrow or the broad OCD phenotype, suggesting that the finding is not specific to a single family.

Although this is the first reported linkage for OCD on chromosome 1p36, several other neuropsychiatric disorders have been linked to the 1p36 region or nearby including major depressive disorder (MDD), eating disorders (ED), childhood-onset mood disorders, and childhood epilepsy(4750). A whole genome linkage scan of recurrent MDD identifed a maximum LOD score of 3.03 in females (there was no evidence of linkage in males) at 1p36.23 to 1p36.22 (7.6 Mb to 12.3 Mb), a region that adjoins our linkage region(47). Major depression co-occurs with OCD in about 50% of cases, including in our families, and a recent family study has suggested that childhood-onset OCD with MDD may represent a distinct etiological syndrome(51).

Similarly, a whole-genome linkage scan for ED identified a linkage region on chromosome 1p33 to 1p36 (fine mapping peak multipoint NPL score of 3.45, restricting subtype of anorexia nervosa), just centromeric to our linkage region (1p36.33 to 1p36.32, or 0 to 4 Mb)(49, 50). As with MDD, there is evidence that OCD and ED show substantial phenotypic and etiological overlap(49, 5255). A recent comprehensive review of epidemiological, longitudinal, and family studies suggests a much higher rate of co-occurrence of ED and OCD than expected by chance, as well as a clear etiological relationship between these disorders. The data suggest that the two most likely models are 1) OCD and ED are alternate expressions (or different phases) of the same underlying etiological risk factors, or 2) OCD, which often has an earlier age of onset than ED, is a risk factor for the development of an ED(56). Rates of ED were very low in our families (~4%), and were not concentrated in the families that were linked to chromosome 1, suggesting that the second model is not likely for our sample. However, both models are consistent with shared genetic risk factors, and therefore candidate genomic regions that are identified in both OCD and ED are of increased interest.

The 1p36 region has also been implicated in a deletion syndrome (1p36 syndrome) with a complex phenotype characterized by intellectual disability and multiple system anomalies(57). Behavioral disorders, including self-biting, temper tantrums, reduced social interactions, stereotypies or other repetitive movements, and hyperphagia have been reported in approximately half of the deletion 1p36 cases(57). OCD symptoms have not specifically been reported, but given the phenomenological similarities between stereotypies and compulsive behaviors, and the role of the striatum in both phenotypes, the overlap of this known deletion syndrome with our primary linkage region strengthens the hypothesis that it may harbor susceptibility genes for OCD and perhaps for other related disorders of childhood, as do the linkage findings for MDD and ED.


The major strengths of our study are the size of the sample and the informativeness of the families for linkage analyses. We believe that the use of both parametric and nonparametric analyses is also a strength—this approach was chosen to maximize the information available from complex pedigrees that are not easily accomodated by a single phenotype or model in a disorder with an unknown mode of inheritance. Nevertheless, we recognize that multiple analyses can lead to an inflation of the LOD scores. We have corrected for this by calculating the relevant genome-wide significance thresholds using the actual data, and by examining the evidence for linkage across the multiple analyses, as well as by examining haplotypes and segregation patterns in our families. The results of the simulations indicate that our LOD scores are not artifically inflated. We believe that, for chromosome 1 at least, the convergence of results across analyses strengthens rather than weakens the evidence for linkage.

The primary limitation of this study relates to heterogeneity. For example, there may be phenotypic heterogeneity among families asertained from the different sites due to the nonuniformity of clinical assessments used (although all sites used a version of the Y-BOCS or CY-BOCS and an additional semi-structured, well-validated instrument for clinical assessment). We have addressed this by conducting BE diagnoses and requiring 100% concordance on all of the diagnostic criteria for phenotypic assignment of both OCD (narrow phenotype) and subclinical OCD (broad phenotype). We believe that this rigorous approach minimizes the problem of potential phenotypic heterogeneity.

There is also genetic heterogeneity in the study population, perhaps due to subtle ethnic variation (e.g., northern vs southern European descent). However, the use of an HLOD approach, which identifies and incorporates heterogeneity in the linkage analysis, addresses this concern. Genetic heterogeneity is also evident in the observation that, while we identified haplotypes on chromosome 1p36 that co-segregate with OCD within families, we did not identify a consistent haplotype that co-segregated with OCD across families. This is not surprising, given that the families in our sample are from outbred Caucasian populations rather than from a genetic isolate, and does not necessarily reduce our confidence in the results. It does highlight the need for further investigation of this region and the other genomic regions of interest identified by our study, however, as discussed below.


In conclusion, this work identifies a new region of interest for OCD on chromosome 1p36. Despite meeting suggestive rather than significant linkage criteria based on our simulations, this is the strongest linkage result reported to date for this complex disorder. This region has previously been associated with several neuropsychiatric phenotypes that are related to OCD, including eating and mood disorders, and the 1p36 deletion syndrome. As with all genetic investigations, follow-up studies are needed to validate and extend these findings, including replication of the linkage findings, and sequencing of the region to identify possible functional variants within particular gene(s).

Although for a time supplanted in favor of case control or trio-based approaches such as GWAS, the advent of high-throughput sequencing technology has caused a renewed interest in linkage studies for complex traits. Such studies are needed to help determine which of the many rare, potentially deleterious, variants identified through sequencing co-segregate with disease. Genome-wide linkage studies such as this one can help to identify and prioritize genomic regions of interest, as well as helping to identify the most informative individuals within linked pedigrees for either targeted or complete genome sequencing.

In addition, although the common disease/common variant approach has driven the interest in GWAS, rare variants have also been shown to be important in the etiology of common disease (e.g., Crohn’s disease)(58). Variants identified through family-based approaches, such as linkage and translocation studies, while individually affecting only a small proportion of families or individuals, have led to the identification of biologically relevant genes, gene clusters, or gene networks that are involved in the pathogenesis of Alzheimer disease (presenilin genes) and schizophrenia (DISC1 and 2)(59, 60). As with these complex traits, in order to more fully understand the biological underpinnings of OCD, multiple approaches, including (but likely not limited to) linkage, GWAS, whole-genome sequencing, and animal model studies, will ultimately be required.

Supplementary Material



This research was supported by grants to CAM from the National Center for Research Resources (K23 RR015533), the National Alliance for Research on Schizophrenia and Affective Disorders, the Obsessive Compulsive Foundation, and the Althea Foundation, by grants to GLH from the National Institute of Mental Health (K20 MH 01065 and R01 MH 58376) and the Obsessive Compulsive Foundation, and to DAC from the National Institute of Mental Health (K01 MH072952).


FINANCIAL DISCLOSURES: The authors report no biomedical financial interests or potential conflicts of interest.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.


1. Ruscio AM, Stein DJ, Chiu WT, Kessler RC. The epidemiology of obsessive-compulsive disorder in the National Comorbidity Survey Replication. Mol Psychiatry. 2008 Aug 26; [PMC free article] [PubMed]
2. APA. Diagnostic and Statistical Manual of Mental Disorders (DSM-IV TR) 4. Washington, DC: American Psychiatric Association; 2000. Text Revision.
3. Fontenelle LF, Mendlowicz MV, Versiani M. The descriptive epidemiology of obsessive-compulsive disorder. Prog Neuropsychopharmacol Biol Psychiatry. 2006 May;30(3):327–37. [PubMed]
4. Horwath E, Weissman MM. The epidemiology and cross-national presentation of obsessive-compulsive disorder. Psychiatr Clin North Am. 2000 Sep;23(3):493–507. [PubMed]
5. Eley TC, Bolton D, O’Connor TG, Perrin S, Smith P, Plomin R. A twin study of anxiety-related behaviours in pre-school children. J Child Psychol Psychiatry. 2003 Oct;44(7):945–60. [PubMed]
6. Van Grootheest DS, Cath DC, Beekman AT, Boomsma DI. Genetic and environmental influences on obsessive-compulsive symptoms in adults: a population-based twin-family study. Psychol Med. 2007 Nov;37(11):1635–44. [PubMed]
7. van Grootheest DS, Cath DC, Beekman ATF, Boomsma DI, editors. International Congress on Psychiatric Genetics. Boston, MA: 2005. Genetic and environmental influences on OC symptoms in adults: a population based twin-family study.
8. Hudziak JJ, Van Beijsterveldt CE, Althoff RR, Stanger C, Rettew DC, Nelson EC, et al. Genetic and environmental contributions to the Child Behavior Checklist Obsessive-Compulsive Scale: a cross-cultural twin study. Arch Gen Psychiatry. 2004 Jun;61(6):608–16. [PubMed]
9. van Grootheest DS, Cath DC, Beekman AT, Boomsma DI. Twin studies on obsessive-compulsive disorder: a review. Twin Res Hum Genet. 2005 Oct;8(5):450–8. [PubMed]
10. van Grootheest DS, Bartels M, Cath DC, Beekman AT, Hudziak JJ, Boomsma DI. Genetic and environmental contributions underlying stability in childhood obsessive-compulsive behavior. Biol Psychiatry. 2007 Feb 1;61(3):308–15. [PubMed]
11. Pauls DL, Alsobrook JP, 2nd, Goodman W, Rasmussen S, Leckman JF. A family study of obsessive-compulsive disorder. Am J Psychiatry. 1995 Jan;152(1):76–84. [PubMed]
12. Nestadt G, Lan T, Samuels J, Riddle M, Bienvenu OJ, 3rd, Liang KY, et al. Complex segregation analysis provides compelling evidence for a major gene underlying obsessive-compulsive disorder and for heterogeneity by sex. Am J Hum Genet. 2000 Dec;67(6):1611–6. [PubMed]
13. Nestadt G, Samuels J, Riddle M, Bienvenu OJ, 3rd, Liang KY, LaBuda M, et al. A family study of obsessive-compulsive disorder. Arch Gen Psychiatry. 2000 Apr;57(4):358–63. [PubMed]
14. Geller D, Biederman J, Jones J, Park K, Schwartz S, Shapiro S, et al. Is juvenile obsessive-compulsive disorder a developmental subtype of the disorder? A review of the pediatric literature. J Am Acad Child Adolesc Psychiatry. 1998 Apr;37(4):420–7. [PubMed]
15. Pauls DL. The genetics of obsessive compulsive disorder: a review of the evidence. Am J Med Genet C Semin Med Genet. 2008 May 15;148C(2):133–9. [PubMed]
16. Hanna GL, Fingerlin TE, Himle JA, Boehnke M. Complex Segregation Analysis of Obsessive-Compulsive Disorder in Families with Pediatric Probands. Hum Hered. 2005 Jul 27;60(1):1–9. [PubMed]
17. Hanna GL, Fischer DJ, Chadha KR, Himle JA, Van Etten M. Familial and sporadic subtypes of early-onset Obsessive-Compulsive disorder. Biol Psychiatry. 2005 Apr 15;57(8):895–900. [PubMed]
18. Hanna GL, Himle JA, Curtis GC, Gillespie BW. A family study of obsessive-compulsive disorder with pediatric probands. Am J Med Genet B Neuropsychiatr Genet. 2005 Apr 5;134(1):13–9. [PubMed]
19. Dickel DE, Veenstra-VanderWeele J, Cox NJ, Wu X, Fischer DJ, Van Etten-Lee M, et al. Association testing of the positional and functional candidate gene SLC1A1/EAAC1 in early-onset obsessive-compulsive disorder. Arch Gen Psychiatry. 2006 Jul;63(7):778–85. [PubMed]
20. Arnold PD, Sicard T, Burroughs E, Richter MA, Kennedy JL. Glutamate transporter gene SLC1A1 associated with obsessive-compulsive disorder. Arch Gen Psychiatry. 2006 Jul;63(7):769–76. [PubMed]
21. Bienvenu OJ, Wang Y, Shugart YY, Welch JM, Grados MA, Fyer AJ, et al. Sapap3 and pathological grooming in humans: Results from the OCD collaborative genetics study. Am J Med Genet B Neuropsychiatr Genet. 2008 Dec 2; [PubMed]
22. Wang Y, Samuels JF, Chang YC, Grados MA, Greenberg BD, Knowles JA, et al. Gender differences in genetic linkage and association on 11p15 in obsessive-compulsive disorder families. Am J Med Genet B Neuropsychiatr Genet. 2009 Jan 5;150B(1):33–40. [PubMed]
23. Samuels J, Shugart YY, Grados MA, Willour VL, Bienvenu OJ, Greenberg BD, et al. Significant linkage to compulsive hoarding on chromosome 14 in families with obsessive-compulsive disorder: results from the OCD Collaborative Genetics Study. Am J Psychiatry. 2007 Mar;164(3):493–9. [PubMed]
24. Hanna GL, Veenstra-Vanderweele J, Cox NJ, Van Etten M, Fischer DJ, Himle JA, et al. Evidence for a susceptibility locus on chromosome 10p15 in early-onset obsessive-compulsive disorder. Biol Psychiatry. 2007 Oct 15;62(8):856–62. [PMC free article] [PubMed]
25. Shugart YY, Samuels J, Willour VL, Grados MA, Greenberg BD, Knowles JA, et al. Genomewide linkage scan for obsessive-compulsive disorder: evidence for susceptibility loci on chromosomes 3q, 7p, 1q, 15q, and 6q. Mol Psychiatry. 2006 Aug;11(8):763–70. [PubMed]
26. Hanna GL, Veenstra-VanderWeele J, Cox NJ, Boehnke M, Himle JA, Curtis GC, et al. Genome-wide linkage analysis of families with obsessive-compulsive disorder ascertained through pediatric probands. Am J Med Genet. 2002 Jul 8;114(5):541–52. [PubMed]
27. Willour VL, Yao Shugart Y, Samuels J, Grados M, Cullen B, Bienvenu OJ, 3rd, et al. Replication study supports evidence for linkage to 9p24 in obsessive-compulsive disorder. Am J Hum Genet. 2004 Sep;75(3):508–13. [PubMed]
28. Ross J, Badner J, Garrido H, Sheppard B, Chavira DA, Grados M, et al. Genomewide linkage analysis in Costa Rican families implicates chromosome 15q14 as a candidate region for OCD. Hum Genet. 2011 Jun 21; [PMC free article] [PubMed]
29. Cirulli ET, Goldstein DB. Uncovering the roles of rare variants in common disease through whole-genome sequencing. Nat Rev Genet. 2010 Jun;11(6):415–25. [PubMed]
30. Goodman WK, Price LH, Rasmussen SA, Mazure C, Fleischmann RL, Hill CL, et al. The Yale-Brown Obsessive Compulsive Scale. I. Development, use, and reliability. Arch Gen Psychiatry. 1989 Nov;46(11):1006–11. [PubMed]
31. Kaufman J, Birmaher B, Brent D, Rao U, Flynn C, Moreci P, et al. Schedule for Affective Disorders and Schizophrenia for School-Age Children-Present and Lifetime Version (K-SADS-PL): initial reliability and validity data. J Am Acad Child Adolesc Psychiatry. 1997 Jul;36(7):980–8. [PubMed]
32. Nurnberger JI, Jr, Blehar MC, Kaufmann CA, York-Cooler C, Simpson SG, Harkavy-Friedman J, et al. Diagnostic interview for genetic studies. Rationale, unique features, and training. NIMH Genetics Initiative. Arch Gen Psychiatry. 1994 Nov;51(11):849–59. discussion 63-4. [PubMed]
33. Scahill L, Riddle MA, McSwiggin-Hardin M, Ort SI, King RA, Goodman WK, et al. Children’s Yale-Brown Obsessive Compulsive Scale: reliability and validity. J Am Acad Child Adolesc Psychiatry. 1997 Jun;36(6):844–52. [PubMed]
34. Pauls DLHC. Schedule for Tourette and other behavioral syndromes. New Haven, CT: Child Study Center, Yale University; 1991.
35. Orvaschel H. Schedule for Affective Disorders and Schizophrenia for School-Age Children-Epidemiologic Version-5 (K-SADS-E-5) Fort Lauderdale, FL: Nova Southeastern University, Center for Psychological Studies; 1995.
36. Leckman JF, Sholomskas D, Thompson WD, Belanger A, Weissman MM. Best estimate of lifetime psychiatric diagnosis: a methodological study. Arch Gen Psychiatry. 1982 Aug;39(8):879–83. [PubMed]
37. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007 Sep;81(3):559–75. [PubMed]
38. McPeek MS, Sun L. Statistical tests for detection of misspecified relationships by use of genome-screen data. Am J Hum Genet. 2000 Mar;66(3):1076–94. [PubMed]
39. Wijsman EM, Rothstein JH, Thompson EA. Multipoint linkage analysis with many multiallelic or dense diallelic markers: Markov chain-Monte Carlo provides practical approaches for genome scans on general pedigrees. Am J Hum Genet. 2006 Nov;79(5):846–58. [PubMed]
40. Abecasis GR, Cherny SS, Cookson WO, Cardon LR. Merlin--rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet. 2002 Jan;30(1):97–101. [PubMed]
41. Schaid DJ. pedigree.shrink. 2009 Available from:
42. Greenberg DA, Abreu P, Hodge SE. The power to detect linkage in complex disease by means of simple LOD-score analyses. Am J Hum Genet. 1998 Sep;63(3):870–9. [PubMed]
43. Clerget-Darpoux F, Bonaiti-Pellie C, Hochez J. Effects of misspecifying genetic parameters in lod score analysis. Biometrics. 1986 Jun;42(2):393–9. [PubMed]
44. Thiele H, Nurnberg P. HaploPainter: a tool for drawing pedigrees with complex haplotypes. Bioinformatics. 2005 Apr 15;21(8):1730–2. [PubMed]
45. Bacanu SA. Robust estimation of critical values for genome scans to detect linkage. Genet Epidemiol. 2005 Jan;28(1):24–32. [PubMed]
46. Lander E, Kruglyak L. Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results. Nat Genet. 1995 Nov;11(3):241–7. [PubMed]
47. McGuffin P, Knight J, Breen G, Brewster S, Boyd PR, Craddock N, et al. Whole genome linkage scan of recurrent depressive disorder from the depression network study. Hum Mol Genet. 2005 Nov 15;14(22):3337–45. [PubMed]
48. Feng Y, Kapornai K, Kiss E, Tamas Z, Mayer L, Baji I, et al. Association of the GABRD gene and childhood-onset mood disorders. Genes Brain Behav. 2010 Aug;9(6):668–72. [PMC free article] [PubMed]
49. Grice DE, Halmi KA, Fichter MM, Strober M, Woodside DB, Treasure JT, et al. Evidence for a susceptibility gene for anorexia nervosa on chromosome 1. Am J Hum Genet. 2002 Mar;70(3):787–92. [PubMed]
50. Bergen AW, van den Bree MB, Yeager M, Welch R, Ganjei JK, Haque K, et al. Candidate genes for anorexia nervosa in the 1p33–36 linkage region: serotonin 1D and delta opioid receptor loci exhibit significant association to anorexia nervosa. Mol Psychiatry. 2003 Apr;8(4):397–406. [PubMed]
51. Hanna GL, Himle JA, Hanna BS, Gold KJ, Gillespie BW. Major depressive disorder in a family study of obsessive-compulsive disorder with pediatric probands. Depress Anxiety. 2011 Jun;28(6):501–8. [PMC free article] [PubMed]
52. Cavallini MC, Bertelli S, Chiapparino D, Riboldi S, Bellodi L. Complex segregation analysis of obsessive-compulsive disorder in 141 families of eating disorder probands, with and without obsessive-compulsive disorder. Am J Med Genet. 2000 Jun 12;96(3):384–91. [PubMed]
53. Bellodi L, Cavallini MC, Bertelli S, Chiapparino D, Riboldi C, Smeraldi E. Morbidity risk for obsessive-compulsive spectrum disorders in first-degree relatives of patients with eating disorders. Am J Psychiatry. 2001 Apr;158(4):563–9. [PubMed]
54. Murphy R, Nutzinger DO, Paul T, Leplow B. Conditional-associative learning in eating disorders: a comparison with OCD. J Clin Exp Neuropsychol. 2004 Apr;26(2):190–9. [PubMed]
55. Boghi A, Sterpone S, Sales S, D’Agata F, Bradac GB, Zullo G, et al. In vivo evidence of global and focal brain alterations in anorexia nervosa. Psychiatry Res. 2011 Jun 30;192(3):154–9. [PubMed]
56. Altman SE, Shankman SA. What is the association between obsessive-compulsive disorder and eating disorders? Clin Psychol Rev. 2009 Nov;29(7):638–46. [PubMed]
57. Battaglia A, Hoyme HE, Dallapiccola B, Zackai E, Hudgins L, McDonald-McGinn D, et al. Further delineation of deletion 1p36 syndrome in 60 patients: a recognizable phenotype and common cause of developmental delay and mental retardation. Pediatrics. 2008 Feb;121(2):404–10. [PubMed]
58. Pritchard JK, Cox NJ. The allelic architecture of human disease genes: common disease-common variant...or not? Hum Mol Genet. 2002 Oct 1;11(20):2417–23. [PubMed]
59. St George-Hyslop PH, Petit A. Molecular biology and genetics of Alzheimer’s disease. C R Biol. 2005 Feb;328(2):119–30. [PubMed]
60. Brandon N, Millar K, Korth C, Sive H, Sing KK, Sawa A. Understanding the role of DISC1 in psychiatric disease and during normal development. J Neurosci. 2009;29(41):12768–75. [PubMed]