By using a custom 50 K SNP genotyping array with dense, multi-population tag SNP coverage of >2 000 genes in CVD-related pathways including coagulation, we further characterized regions of the genome associated with FVII levels in EAs and AAs. Our findings can be summarized as follows: (i) the chromosome 13q34
F7–
F10 gene region contained multiple variants associated with higher and lower FVII in both EAs and AAs. (ii) The lower extent of LD among AA enabled finer localization of the strongest
F7 signals. (iii) We identified two novel genomic regions significantly associated with FVII in EAs, the
ZNF259–
BUD13–
APOA5 gene region on chromosome 11q23 and chromosome 20q12
HNF4A gene (
4). Of other candidate regions associated with FVII in EAs, we found evidence for replication of the chromosome 20
PROCR SNP rs867186 (Ser219Gly) and
GCKR associations with higher FVII levels in AAs; on the other hand, FVII association with polymorphisms within the
MS4A2,
APOA5 and
HNF4A gene regions were not evident in AAs.
Polymorphisms in the
F7 gene could be grouped in into three major haplotypes in European descent populations (
5). Compared with the most common or ‘wild-type’ haplotype, the haplotype tagged by the minor allele of rs555212 (and rs3093230) was associated with higher FVII levels while the haplotype tagged by the minor allele at rs561241 was associated with lower levels. These results confirm previous findings from smaller European population studies that SNPs tagging these haplotypes constitute the major determinants of FVII activity in EAs (
5,
15). Because of the large sample size of CARe and the dense SNP coverage of the IBC genotyping array, we were able to identify at least three additional variants within the
F7–
F10 genes independently associated FVII levels. Two of these are located within intron 5 of the
F7 gene (rs493833 and rs3093265) and one within intron 1 of the
F10 gene (rs3093268).The strong LD among the promoter variants in Europeans has previously hampered detailed characterization of the functional variants responsible for reduced FVII levels. Functional analysis of the
F7 promoter suggested that the −323ins10 and −122C (rs561241) variants are strongly associated with decreased promoter activity. It has remained uncertain whether the −402A variant (rs510317) or other upstream variants are responsible for the increased F7 promoter activity (
16). In our analysis, AA showed greater
F7 haplotype diversity, which suggested further localization of the higher-expression
F7 promoter region to rs3093230 (or an untyped variant in LD).
Characterization of common
cis-acting
F7 gene variants may have relevance for athero-thrombotic disease risk. The
F7 polymorphisms associated with higher FVII levels have been associated with increased risk of stroke and myocardial infarction (MI) in young women (
5,
17,
18).The relationship of haplotype C FVII-lowering polymorphisms to risk of clinical CVD has been equivocal, with published meta-analyses of MI (
10) and stroke (
19) showing no significant association. Since our results suggest independent associations of several additional
F7 polymorphisms on FVII level, evaluation of these variants in the context of clinical CVD risk, or extent of subclinical atherosclerosis in young adults (
20) both in EA and AA populations, may be warranted.
Polymorphisms in LD with the
PROCR Ser219Gly variant (rs867186) have been strongly associated with higher soluble endothelial protein C receptor (EPCR) levels, as well as with higher levels of clotting FVII, FVIIa, protein C antigen and several downstream markers of activated coagulation in the extrinsic pathway (
21–
22) in whites. Binding of FVII/FVIIa to soluble and/or membrane-associated EPCR and subsequent endocytosis of the receptor-ligand complexes may be important for regulating the circulating concentrations of FVII and protein C (
23). These findings have potential implications for the role of the
PROCR Ser219Gly dimorphism in risk of clinical thrombotic disease (
24,
25).
FVII levels are correlated with other CVD risk factors, such as lipid levels, body mass index and insulin resistance (
6,
26). The causal nature and mechanisms of these associations remain to be elucidated. There is a fairly well-described connection between high-fat diet, triglycerides and FVII activity. FVII and other vitamin K-dependent coagulation proteins bind to triglyceride-rich lipid particles (
27). Moreover, expression of the
F7 gene can be modulated by glucose and insulin levels due to a specific promoter element (
28). The association of FVII levels with polymorphisms of the
GCKR,
APOA5,
MLXIPL and
HNF4A genes suggests an additional level of shared genetic regulation between FVII synthesis, glucose metabolism and low-grade inflammation. The
APOA5 SNP rs6589566 that was discovered in our study was also associated with low density lipoprotein levels in a recent genome-wide association scan (
29). Other SNPs in LD with rs6589566 (e.g. rs964184, rs1558861, rs4938303, rs12280753) have been associated with high density lipoprotein (HDL-C) levels and triglyceride levels (
30–
34).
While the same SNPs were not identified in this study, variants in or near
PRKCQ have been found to be associated at a genome-wide significance level with type I diabetes (
35–
36) and rheumatoid arthritis (
37–
38). One non-synonymous SNP in
MXLIPL was also significantly associated with plasma triglycerides in another recent genome-wide scan (
33), although LD information was not available between our SNP (rs7777102) and theirs (rs3812316).
The
GCKR gene product inhibits glucokinase in liver and pancreatic islet cells and is considered a susceptibility gene candidate for a form of maturity-onset diabetes of the young. Polymorphisms of
GCKR have been associated with a number of metabolic and CVD traits, including HDL, triglycerides, glucose, insulin resistance, renal function, C-reactive protein and protein C levels. GWASs have shown that the
GCKR SNP identified in our study, rs1260326, is also associated with triglyceride levels (
32), chronic kidney disease (
39) and 2 h post-load glucose levels (
40).
This study has several notable strengths, including a large sample size for both EA and AA participants. The IBC custom chip enables a more comprehensive study of lower frequency variants in comparison to GWAS. When comparing findings between populations, it is important to note differences in findings for individual loci may be due to different LD patterns, lower statistical power in AAs due to a smaller sample size, difference in minor allele frequency or effect modification by genetic or environmental factors.
In summary, we have confirmed and refined known loci, and identified several new genomic loci associated with FVII levels, including APOA5 and HNF4A in EAs. The chromosome 13q34 F7–F10 gene region contains multiple variants independently associated with higher and lower FVII in both EAs and AAs. Of other candidate regions associated with FVII in EA, we found evidence for replication of the chromosome 20 PROCR gene Ser219Gly and GCKR associations with higher FVII levels in AAs; on the other hand, FVII association with polymorphisms within the MS4A2, APOA5 and HNF4A gene regions were not evident in AAs. Larger samples and investigation of lower frequency variants may be required to identify additional FVII-associated loci in AAs.