This study represents a classical approach for the identification of genes underlying common and complex disorders, such as the disabling neuropsychiatric condition, OCD. The approach entailed a genome-wide linkage scan in a large sample of rigorously assessed families, each with multiple members diagnosed with DSM-IV OCD (
Shugart et al 2006). Chromosomal regions with the strongest linkage signals were then investigated further; first with additional linkage fine-mapping using STRP markers, and then with staged, association fine-mapping using SNP markers.
The original linkage study suggested that six regions were most promising for further investigation. After additional linkage and association fine-mapping, two regions on chromosomes 15q and 1q, and, more specifically, two genes in each region, were associated with OCD. These genes are MEIS2 and NANOGP8 (15q) and PBX1 and LMX1A (1q).
The strongest signals on chromosome 15q are at rs11854468 (p=0.000054), located approximately 60 kb from NANOGP8, and at rs4625687 (p=0.00007), located in the intron of MEIS2. There is another strong signal at rs11858145 (p=0.0007) located in a coding region of FMN1. The best signal on chromosome 1q is at rs4387163 (p= 0.0002) which is located at 1q23.3. The GRR at this locus is 1.9. This SNP is located 150kb equidistant from both PBX1 and LMX1A. SNP rs11854468 on 15q remained significant after adjustment for multiple testing (p<0.05). Although statistically this is the most robust signal, rs4625687 in the MEIS2 gene (p after adjustment = 0.065) remains an interesting signal, because of its location and the signals at surrounding SNPs.
An outstanding feature of these results is that PBX1, MEIS2, LMX1A, and the NANOG-related NANOGP8 gene are all transcription factors, or closely related to transcription factors, important in neurodevelopment. While we recognize that there are many transcription factors involved in neurodevelopment, and even more that could plausibly be implicated in some hypothetical relationship to the origin of OCD, we assert that the probability is exceedingly small that genes with these functional properties would be identified. Specifically, of the thousands of genes across the genome there are approximately 233 Homeobox genes identified; therefore identifying these four genes by chance is highly improbable.
MEIS2, PBX1, LMX1A, and
NANOG code for homeobox proteins: the former two are members of the three amino acid loop extension (TALE) superclass, whereas
LMX1A is a member of the LIM superclass. All are cofactors for transcription regulation and have important roles in brain development.
MEIS2 and
PBX are intimately involved as cofactors with homeobox (
HOX) and myogenic differentiation
(MyoD) genes (
Berkes et al 2004). It has been shown that
MEIS2 and
PBX1 form a heterodimeric complex which acts together with
HOX genes to activate transcription (
Sagerström, 2004) and
MEIS2 has been found to be the preferred partner of
PBX1 (
Chang, et al 1997).
NANOG also binds to
PBX1 which regulate its function (
Chan et al., 2009).
NANOGP8 is a retrogene (
Zhang, et al., 2006) which belongs to a family of 11
NANOG pseudogenes. The
NANOGP8 gene in 15q14 is unique among
NANOG pseudogenes in that it has an intact open reading frame and is actively transcribed (
Jeter et al 2009). It is the only
NANOG-related gene that is uniquely expressed in humans compared to chimpanzees (
Fairbanks et al 2006).
FMN1 is a regulator of another important transcription factor in neurodevelopment, glioma-associated oncogene family zinc finger 3
(GLI3; Zeller et al 1999;
Zuniga & Zeller, 1999).
MEIS2 and
PBX1 have been shown to be expressed in the developing rat striatum (
Cecconi et al 1997;
Toresson et al 2009), as has
NANOG (Molero et al 2009).
Takahashi et al (2008) suggest that
MEIS2 and
PBX1 might play a “pivotal role in establishing the neurons of the islands of Calleja, within the ventral striatum, as well as striosomal neurons in the striatum” (
Takahashi et al 2008). These transcription factors have also been reported to be associated with the
ephA8 regulatory sequence in mesencephalon development (
Sungbo et al 2007).
MEIS2 (with
ISLET1) has also been shown to activate transcription through
I56ii (a conserved regulatory element of
DLX genes) during mid-gestation. Additionally, both
PBX1 and
MEIS2 have been shown to have a functional link to
SOX3, which is involved in specifying neural fate (
Mojsin and Stevanovic, 2009). Determining the pathways through which these and other transcription factors interact in the development of the relevant brain regions is in progress (Long
et al 2009a; Long
et al 2009b), and will ultimately enhance our understanding of brain development and possibly the origin of neuropsychiatric conditions.
Either of these two genes (i.e.
MEIS2 and
PBX1) alone could be considered interesting candidate genes involved in the pathogenesis of OCD (
Takahashi et al 2008). However, identifying signals in the region of both genes, on different chromosomes, makes these findings more compelling; both are involved in brain development in a brain region, the striatum, that is the primary focus of interest in OCD; and they exert their developmental effect in tandem, forming a heterodimeric complex for this purpose. The genetic effects of
FMN1 and
NANOGP8 also plausibly impinge on neurodevelopment, albeit in different ways. While
FMN1 directly supports cell structure via actin/microtubules and regulates a neurodevelopment gene,
GLI3, NANOGP8 is related to
NANOG which maintains stem cell pluripotency, and therefore conceivably affects neurogenesis generally, and early stage serotonin neurons specifically (
Bethea et al 2009).
An obvious step to investigate a possible inter-relationship between these two genes in the origin of OCD is to determine whether there is a statistical interaction between them in their association with OCD. We did not find evidence for statistical interaction in this sample using the method described above; however, definitive identification of interaction would require a far larger sample of families than were available in this study.
LMX1A is adjacent to
PBX1 on 1q and both are equidistant to the association signal in that region.
LMX1A acts in a pathway that involves the development of dopamine neurons in the midbrain. It has also been reported to be involved in postnatal brain development and the maintenance of neuronal function (
Andersson et al 2006). This gene is therefore also an interesting candidate in OCD. Dopaminergic function has been hypothesized to be involved in the etiology (and treatment) of OCD (
Graybiel and Rauch, 2000).
FMN1 is a member of the formin family of proteins that remodel the actin and microtubule cytoskeletons (
Goode and Eck, 2007); however, it also directly regulates
GLI3, a key gene in development, including cortical morphology (
Friedrichs et al 2008;
Theil, 2005). Thus, a plausible explanation for the involvement of
FMN1 in OCD may be related to the involvement of microtubule and actin cytoskeleton in neural migration (
Kerjan and Gleeson, 2007) or in the gene network that regulates brain morphology (
Yu et al., 2009;
Quinn et al., 2009).
The likelihood of genetic heterogeneity is a fundamental concern in conducting studies such as this one. We investigated whether any clinical characteristic of the cases were associated with SNP rs4625687 in MEIS2. Only this SNP was investigated, out of several potential SNPs, as an exploratory exercise. The presence of body dysmorphic disorder, tics, and symmetry/ordering OCD symptoms were significantly more common. It should be noted that probands with Tourette Disorder were excluded. Nevertheless, these three clinical features frequently co-occur in OCD cases who exhibit tics; this and suggests that a ‘tic-form’ of OCD may be a relevant sub-phenotype underlying these findings.
Patients with OCD typically score extremely high on ‘Neuroticism’, a well-characterized personality trait in the psychological literature (
Samuels et al 2000). It is therefore interesting that a WGA study of ‘neuroticism’ reported a single nucleotide polymorphism (SNP) in
PBX1 as one of the ten best signals (7.18 × 10
−6), and this SNP was most significant for the “anxiety” facet of neuroticism (
van den Oord et al 2008). The hypothesis that ‘neuroticism’ and OCD share a common etiology would be borne out if the relationship between
PBX1 and both ‘neuroticism’ and OCD were replicated.