Our study represents the first large scale genome-wide effort to identify chromosomal regions with putative loci affecting T2D and lipid traits in a unique community of Asian Sikhs from Northern India. This diabetic cohort from a genetically homogenous subgroup was collected with the initial goal of identifying T2D predisposing genes. However, the results of our non-parametric linkage scan did not identify any chromosomal region to be significantly linked to T2D
(online Figure S1). Note that the non-parametric method for linkage (used in our study) only considers allele sharing between affected individuals, therefore, the ambiguous phenotype of unaffected members is unlikely to have led to the failure to detect linkage in this large sample. These results reaffirm the highly complex nature of T2D phenotype. Essentially, our study failed to identify genes associated with T2D even when a homogenous population was used to control genetic heterogeneity associated with T2D phenotype and a sample collected from one geographic location was used to reduce environmental heterogeneity. These finding suggest that the genes responsible for T2D in Sikhs have small effects, as seen in other ethnic groups, and are difficult to detect using linkage analysis. It can be argued that in comparison to random-mating population, higher identity by descent (IBD) sharing in this inbred population might have reduced the power of detecting significant linkage. In this scenario, one would expect to see increased average IBD leading to false positive indications of linkage. On the contrary, we found the opposite with no substantial increase in IBD among affected individuals and thus no linkage. At the same time, we believe that our linkage data may still contain considerably useful information that could enable the discrimination of causal variant from a near-by variant that is merely in linkage disequilibrium (LD)
[34]. Interestingly, our case-control association studies have confirmed some Caucasian GWAS loci (
TCF7L2, PPARG, KCNJ11, FTO and
KCNQ1) associated with T2D in this population
[35],
[36],
[37],
[38]. Therefore, further fine mapping especially in the elevated regions using high-density SNP panel and whole genome sequencing may identify rare and functional variants with large effects contributing to T2D. These Investigations also may answer the questions of ‘missing heritability' which is expected to lie in the ‘rare' variants and which the GWA studies are unable to explain
[39].
The other aim of this investigation was to identify genomic regions affecting lipid-related phenotypes in this cohort. We performed QTL-ALL analysis on this non-randomly ascertained dataset, which revealed several suggestive linkage signals associated with serum lipid levels
(). Classical multiple linear regression models were used to adjust for environmental effects on the serum lipid traits. In view of strong environmental component associated with T2D and lipid metabolism, we have carefully analyzed the environmental factors, particularly the unique life style factors such as diet, physical activity, obesity, job status, socio-economic status, gender, and medication that could potentially influence these traits. As explained in the Results section, the significant covariates with potential to modify linkage effect were identified and included in the analysis model. The strongest evidence of linkage (p

=

0.0011) for HDL cholesterol was detected on chromosome 10q21.1–21.2. Suggestive evidence of linkage to ApoA-I was observed on chromosome 10q21.1 in the Quebec Family Study (QFS)
[40]. The same region containing protocadherin 15 (
PCDH15) gene (10q21.1) has been associated with multiple lipid traits in Finnish and Dutch multigenerational dyslipidemic families
[41]. Another strong GWAS candidate gene linked with metabolic traits is solute carrier family 16, member 9 (
SCL16A9) that also maps to chromosome 10q21.2
[42]. It is a proton-linked monocarboxylate transporter and catalyzes the rapid transport of many monocarboxylates across the plasma membrane. Chromosome 10 also carried a signal for LDL cholesterol at 10p11.23 (p

=

0.0045) in our study. The closest candidate gene at this region is
KIAA1462 (10p11.23) that encodes a yet uncharacterized protein. However, a recently published GWAS showed an unambiguous evidence for association of rs3739998 (p

=

7.2×10
−8) within this gene with CVD and myocardial infarction in German MI Family cohort (GerMIFS) III (KORA)
[43]. Interestingly, some common variants in
PCDH15 and
SCL16A9 and
KIAA1462 genes are also associated with multiple lipid traits including HDL cholesterol, LDL cholesterol, and triglycerides ( p values of 0.004 to 0.0001) in our provisional results of lipid GWAS being performed on the population originated from the same Asian Indian community (unpublished results).
A linkage peak for total serum cholesterol (p

=

0.0031) was detected near marker D5S2488 at the proximal region of chromosome 5p15.33. This region was previously linked to LDL cholesterol in the NHLBI Family Heart Study
[44] and HDL cholesterol in the Hypertension Genetic Epidemiology Network Blood Pressure Study
[45]. Additionally, meta-analysis of linkage scans from four studies revealed a modest signal for LDL cholesterol (LOD 1.6) on chromosome 5p15.33
[46]. The suggestive linkage for LDL cholesterol at 9q21.13 near marker D9S1122 in our cohort was also associated with the triglyceride phenotype in a linkage study performed in families with myocardial infarction
[47].
The linkage signal at chromosome 22q13.32 near marker TCTA015M (p

=

0.0016), detected for total cholesterol was linked with familial hypercholesterolemia in a Utah study
[48]. The chromosomal region 22q11–13 was also reported to effect HDL cholesterol in the Old Order Amish
[49]. Notably, the strongest candidate gene in this region is
PPARα, which is a ligand-activated nuclear transcription factor and controls extracellular and intracellular lipid metabolism, and also inhibits progression of atherosclerotic lesions
[50]. Lipid-lowering drugs of fiberate class are synthetic ligands of pparα
[51]. Variants in this gene were reported to be associated with T2D and CVD
[52]. Another gene
CELSR1 (located at 22q11–13) is associated with ischemic stroke in recent Japanese GWAS
[53]. Furthermore, a single nucleotide polymorphism (SNP) near
CELSR2 on chromosome 1p13 (homologous to
CELSR1) is associated with LDL cholesterol and myocardial infarction in a meta-analysis study by Myocardial Infraction Genetics Consortium
[54].
Our study does not represent a common replication attempt to identify lipid loci in an independent population. Rather, this investigation has been carefully carried out in this unique family-based cohort using a conservative statistical approach applying score-based statistics to map quantitative lipid traits in a non-randomly ascertained dataset. Exceeding our expectations, this study has identified linkage regions, primarily HDL cholesterol (10q21.1–21.2) and total cholesterol (22q13.32) that were previously reported for lipid traits or CVD. The most interesting part of this study is that some of these linkage signals also harbor important candidate loci (e.g.,
KIAA1462, PCDH15, PPARα, SLC16A9, and
CELSR1) implicated with lipid traits in recent GWAS and meta-analysis studies and also some of these regions overlap with prior linkage studies
[55],
[56],
[57]. Therefore, our findings suggest that these regions might contain some novel genes for blood lipids rather than chance findings, and perhaps some of the loci may have larger effects in this Khatri Sikh cohort. Notably, the presence of HDL cholesterol signal on chromosome 10q21.2 is particularly important in view of low HDL cholesterol-associated CVD risk in Asian Indian men, in general, and may strongly relate to gene-environmental interaction which is enhanced by rapidly emerging western lifestyle
[58],
[59]. Further fine mapping with more efficacious strategy using SNP-based arrays (which would also help determine LD over small intervals), sequencing, and functional studies should allow rapid detection of novel target genes of therapeutic importance under these candidate regions.