1.  Sequencing rare and common APOL1 coding variants to determine kidney disease risk 
Kidney international  2015;88(4):754-763.
A third of African Americans with sporadic focal segmental glomerulosclerosis (FSGS) or HIV-associated nephropathy (HIVAN) do not carry APOL1 renal risk genotypes. This raises the possibility that other APOL1 variants may contribute to kidney disease. To address this question, we sequenced all APOL1 exons in 1,437 Americans of African and European descent, including 464 patients with biopsy-proven FSGS/HIVAN. Testing 33 common and rare variants for association with FSGS/HIVAN revealed no association independent of the strong recessive G1 and G2 effects. Seeking additional variants that might have been under selection by pathogens and could represent candidates for kidney disease risk, we also sequenced an additional 1,112 individuals representing 53 global populations. Except for G1 and G2, none of the 7 common codon-altering variants showed evidence of selection or could restore lysis against trypanosomes causing human African trypanosomiasis. Thus, only APOL1 G1 and G2 confer renal risk and other common and rare APOL1 missense variants, including the archaic G3 haplotype, do not contribute to sporadic FSGS and HIVAN in the United States population. Hence, in most potential clinical or screening applications, our study suggests that sequencing APOL1 exons is unlikely to bring additional information compared to genotyping only APOL1 G1 and G2 risk alleles.
PMCID: PMC4591109  PMID: 25993319
APOL1; FSGS; HIVAN; chronic kidney disease; association; population genetics; selection; trypanolysis; personalized medicine
2.  Variants in CXCR4 associate with juvenile idiopathic arthritis susceptibility 
BMC Medical Genetics  2016;17:24.
Juvenile idiopathic arthritis (JIA) is the most common chronic rheumatic disease among children. Its etiology involves a strong genetic component, but many of the underlying genetic determinants remain unknown. Our aim was to identify novel genetic variants that predispose to JIA.
We performed a genome-wide association study (GWAS) and replication in a total of 1166 JIA cases and 9500 unrelated controls of European ancestry. Correlation of SNP genotype and gene expression was investigated. Then we conducted targeted resequencing of a candidate locus, among a subset of 480 cases and 480 controls. SUM test was performed to evaluate the association of the identified rare functional variants.
The CXCR4 locus on 2q22.1 was found to be significantly associated with JIA, peaking at SNP rs953387. However, this result is subject to subpopulation stratification within the subjects of European ancestry. After adjusting for principal components, a nominally significant association remained (p < 10⁻⁴). Because of the known function of CXCR4 in immune regulation, we carried out further analyses to assess its relationship with JIA. Expression of CXCR4 was correlated with CXCR4 rs953387 genotypes in lymphoblastoid cell lines (p = 0.014) and T-cells (p = 0.0054). In addition, rare non-synonymous and stop-gain sequence variants in CXCR4, putatively damaging for CXCR4 function, were significantly enriched in JIA cases (p = 0.015).
Our results suggest an association of CXCR4 variants with JIA, implying that this gene may be involved in the pathogenesis of autoimmune disease. However, because this locus is subject to population stratification within the subjects of European ancestry, additional replication is still necessary for this locus to be considered a true risk locus for JIA. This cell-surface chemokine receptor has already been targeted in other diseases and, with additional replication and functional validation of the locus, may serve as a tractable therapeutic target for a specific subset of pediatric arthritis patients.
Electronic supplementary material
The online version of this article (doi:10.1186/s12881-016-0285-3) contains supplementary material, which is available to authorized users.
PMCID: PMC4804485  PMID: 27005825
Juvenile idiopathic arthritis; Genome-wide association study; CXCR4; Targeted resequencing
3.  Role of APOBEC3F Gene Variation in HIV-1 Disease Progression and Pneumocystis Pneumonia 
PLoS Genetics  2016;12(3):e1005921.
Human APOBEC3 cytidine deaminases are intrinsic resistance factors to HIV-1. However, HIV-1 encodes a viral infectivity factor (Vif) that degrades APOBEC3 proteins. In vitro, APOBEC3F (A3F) anti-HIV-1 activity is weaker than that of A3G but, unlike A3G, A3F is partially resistant to Vif degradation. It is unknown whether A3F protein affects HIV-1 disease in vivo. To assess the effect of the A3F gene on host susceptibility to HIV-1 acquisition and disease progression, we performed a genetic association study in six well-characterized HIV-1 natural cohorts. A common six-Single Nucleotide Polymorphism (SNP) haplotype of A3F tagged by a codon-changing variant (p.I231V, with allele (V) frequency of 48% in European Americans) was associated with significantly lower set-point viral load and slower rate of progression to AIDS (Relative Hazards (RH) = 0.71, 95% CI: 0.56–0.91) and delayed development of pneumocystis pneumonia (PCP) (RH = 0.53, 95% CI: 0.37–0.76). A validation study in the International Collaboration for the Genomics of HIV (ICGH) showed a consistent association with lower set-point viral load. An in vitro assay revealed that the A3F I231V variant may influence Vif-mediated A3F degradation. Our results provide genetic epidemiological evidence that A3F modulates HIV-1/AIDS disease progression.
Author Summary
Cytidine deaminases of the human APOBEC3 (A3) gene family serve as intrinsic resistance factors to HIV-1 and other retroviruses. HIV-1 encodes the viral infectivity factor (Vif) protein that degrades APOBEC3 proteins via the ubiquitination-proteasomal pathway. APOBEC3F (A3F), unlike APOBEC3G (A3G), is partially resistant to Vif-mediated degradation. The antiviral activity of the A3 family has largely been demonstrated in in vitro experiments, and there is mounting evidence that A3G genetic variants influence HIV disease progression. It is not resolved whether A3F protein affects HIV disease in vivo. To assess the in vivo effect of A3F, we performed a genetic association study of variants in A3F for their influence on HIV-1 acquisition and HIV disease progression. A common A3F haplotype was associated with a 30% reduced rate of AIDS disease progression, lower set-point viral load and delayed development of pneumocystis pneumonia (PCP) in European Americans. This study provides the first epidemiological evidence that A3F might modify HIV-1/AIDS pathogenesis.
PMCID: PMC4778847  PMID: 26942578
4.  Genetic Ancestry Influences Asthma Susceptibility and Lung Function Among Latinos 
Childhood asthma prevalence and morbidity vary among Latinos in the United States, with Puerto Ricans having the highest and Mexicans the lowest.
To determine whether genetic ancestry is associated with the odds of asthma among Latinos, and secondarily whether genetic ancestry is associated with lung function among Latino children.
We analyzed 5,493 Latinos with and without asthma from three independent studies. For each participant we estimated the proportion of African, European, and Native American ancestry using genome-wide data. We tested whether genetic ancestry was associated with the presence of asthma and lung function among subjects with and without asthma. Odds ratios (OR) and effect sizes were assessed for every 20% increase in each ancestry.
Native American ancestry was associated with lower odds of asthma (OR=0.72, 95% confidence interval [CI]: 0.66–0.78, p=8.0×10⁻¹⁵), while African ancestry was associated with higher odds of asthma (OR=1.40, 95% CI: 1.14–1.72, p=0.001). These associations were robust to adjustment for covariates related to early life exposures, air pollution and socioeconomic status. Among children with asthma, African ancestry was associated with lower lung function, including both pre- and post-bronchodilator measures of forced expiratory volume in the first second (−77±19 ml, p=5.8×10⁻⁵ and −83±19 ml, p=1.1×10⁻⁵, respectively) and forced vital capacity (−100±21 ml, p=2.7×10⁻⁶ and −107±22 ml, p=1.0×10⁻⁶, respectively).
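Because the odds ratios in this abstract are reported per 20% increase in ancestry, it can help to see how such an estimate translates to other increments. The following is a minimal sketch, assuming the log-odds of asthma is linear in ancestry proportion (the usual logistic-regression assumption, which the abstract does not state explicitly); the function name `rescale_odds_ratio` is hypothetical and not taken from the study.

```python
import math


def rescale_odds_ratio(or_per_step, step, new_step):
    """Convert an odds ratio reported per `step` units of exposure
    into the odds ratio per `new_step` units.

    Assumption: the log-odds is linear in the exposure, so the
    log odds ratio scales proportionally with the increment.
    """
    return math.exp(math.log(or_per_step) * new_step / step)


# OR per 20% increase in Native American ancestry, as reported above.
or_per_20 = 0.72

# Implied ORs per 10% and per 40% increase under the linearity assumption.
or_per_10 = rescale_odds_ratio(or_per_20, step=20, new_step=10)
or_per_40 = rescale_odds_ratio(or_per_20, step=20, new_step=40)

print(f"OR per 10%: {or_per_10:.3f}")
print(f"OR per 40%: {or_per_40:.3f}")
```

Under this assumption the odds ratio per 40% is simply the square of the reported value, and the odds ratio per 10% is its square root; confidence limits can be rescaled the same way.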
Differences in the proportions of genetic ancestry can partially explain disparities in asthma susceptibility and lung function among Latinos.
PMCID: PMC4289103  PMID: 25301036
genetic admixture; childhood asthma; Hispanics; minority; pulmonary function
5.  APOL1 Toxin, Innate Immunity and Kidney Injury 
Kidney international  2015;88(1):28-34.
The discovery that two common APOL1 alleles were strongly associated with non-diabetic kidney diseases in African descent populations led to hope for improved diagnosis and treatment. Unfortunately, we still do not have a clear understanding of the biological function played by APOL1 in podocytes or other kidney cells, nor how the renal risk alleles initiate the development of nephropathies. Important clues for APOL1 function may be gleaned from the natural defense mechanism of APOL1 against trypanosome infections and from similar proteins (e.g. diphtheria toxin, mammalian Bcl-2 family members). This review provides an update on the biological functions for circulating (trypanosome resistance) and intracellular (emerging role for autophagy) APOL1. Further, we introduce a multimer model for APOL1 in kidney cells that reconciles the gain-of-function variants with the recessive inheritance pattern of APOL1 renal risk alleles.
PMCID: PMC4490079  PMID: 25853332
APOL1; renal function; innate defense; autophagy; multimer; toxicity
6.  Regulatory Variation in HIV-1 Dependency Factor ZNRD1 Associates with Host Resistance to HIV-1 Acquisition 
The Journal of Infectious Diseases  2014;210(10):1539-1548.
Background. ZNRD1 was identified as a host protein required for the completion of the human immunodeficiency virus (HIV) lifecycle in a genome-wide screen using small interfering RNA gene silencing. Subsequently, a genome-wide association study (GWAS) of host determinants for HIV-1 disease identified an association of single nucleotide polymorphisms (SNPs) in the ZNRD1 region with CD4+ T-cell depletion.
Methods. We investigated the effects of SNPs in the ZNRD1 region on human immunodeficiency virus type 1 (HIV-1) infection and progression to clinical outcomes in 5 US-based HIV-1 longitudinal cohorts consisting of men who have sex with men, males with hemophilia, and injection drug users (IDUs) (n = 1865). SNP function was evaluated by electrophoretic mobility shift assay and promoter luciferase assay.
Results. A haplotype in the ZNRD1 gene showed significant association with a 35% decreased risk of HIV-1 acquisition (OR = 0.65, 95% CI, .47–.89), independent of HLA-C rs9264942, in European Americans. The SNP rs3132130 tagging this haplotype, located in the ZNRD1 5′ upstream region, caused a loss of nuclear factor binding and decrease in ZNRD1 promoter activity. ZNRD1 variants also affected HIV-1 disease progression in European- and African-American cohorts.
Conclusions. This study provides novel evidence that ZNRD1 polymorphism may confer host resistance to HIV-1 acquisition.
PMCID: PMC4215072  PMID: 24842830
HIV-1; infection; host susceptibility; AIDS; SNP; single nucleotide polymorphism; ZNRD1; genetic association
7.  Global diversity, population stratification, and selection of human copy number variation 
Science (New York, N.Y.)  2015;349(6253):aab3761.
In order to explore the diversity and selective signatures of duplication and deletion human copy number variants (CNVs), we sequenced 236 individuals from 125 distinct human populations. We observed that duplications exhibit fundamentally different population genetic and selective signatures than deletions and are more likely to be stratified between human populations. Through reconstruction of the ancestral human genome, we identify megabases of DNA lost in different human lineages and pinpoint large duplications that introgressed from the extinct Denisova lineage now found at high frequency exclusively in Oceanic populations. We find that the proportion of CNV base pairs to single nucleotide variant base pairs is greater among non-Africans than it is among African populations, but we conclude that this difference is likely due to unique aspects of non-African population history as opposed to differences in CNV load.
PMCID: PMC4568308  PMID: 26249230
8.  APOL1 Kidney Risk Alleles: Population Genetics and Disease Associations 
APOL1 kidney disease is a unique case in the field of the genetics of common disease: 2 variants (termed G1 and G2) with high population frequency have been repeatedly associated with nondiabetic CKDs, with very strong effect size (odds ratios 3–29) in populations of sub-Saharan African descent. This review provides an update on the spectrum of APOL1 kidney disease and on the worldwide distribution of these kidney risk variants. We also summarize the proper way to run a recessive analysis on joint and independent effects of APOL1 G1 and G2 kidney risk variants.
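A recessive analysis of G1 and G2 treats a person as high risk only when two risk alleles are carried in any combination (G1/G1, G1/G2, or G2/G2). The sketch below illustrates one way to code that, assuming G1 and G2 never occur on the same chromosome, so the per-variant copy counts sum to at most 2; the function names are hypothetical and this is not the procedure published in the review.

```python
def apol1_risk_allele_count(g1_copies, g2_copies):
    """Return the total number of APOL1 risk alleles (0, 1, or 2).

    Assumption: G1 and G2 sit on different haplotypes, so a person
    cannot carry more than two risk alleles in total.
    """
    if g1_copies not in (0, 1, 2) or g2_copies not in (0, 1, 2):
        raise ValueError("copy counts must be 0, 1, or 2")
    total = g1_copies + g2_copies
    if total > 2:
        raise ValueError("G1 + G2 copies cannot exceed 2 under this model")
    return total


def is_high_risk(g1_copies, g2_copies):
    """Recessive coding: high risk only with two risk alleles."""
    return apol1_risk_allele_count(g1_copies, g2_copies) == 2


# Hypothetical genotypes as (G1 copies, G2 copies).
genotypes = {"G0/G0": (0, 0), "G1/G0": (1, 0), "G1/G2": (1, 1), "G2/G2": (0, 2)}
for label, (g1, g2) in genotypes.items():
    print(label, "high risk" if is_high_risk(g1, g2) else "low risk")
```

Coding the compound heterozygote G1/G2 as high risk is the point of the joint model: analysing each variant separately under a recessive model would misclassify those carriers as low risk.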
PMCID: PMC4157456  PMID: 25168832
Glomerular disease; Apolipoprotein L1; African admixture; APOL1 demographics; Chronic kidney disease
9.  GWAS and admixture mapping identify different asthma-associated loci in Latinos: The GALA II Study 
Asthma is a complex disease with both genetic and environmental causes. Genome-wide association studies of asthma have mostly involved European populations and replication of positive associations has been inconsistent.
To identify asthma-associated genes in a large Latino population with genome-wide association analysis and admixture mapping.
Latino children with asthma (n = 1,893) and healthy controls (n = 1,881) were recruited from five sites in the United States: Puerto Rico, New York, Chicago, Houston, and the San Francisco Bay Area. Subjects were genotyped on an Affymetrix World Array IV chip. We performed genome-wide association and admixture mapping to identify asthma-associated loci.
We identified a significant association between ancestry and asthma at 6p21 (lowest p-value: rs2523924, p < 5 × 10⁻⁶). This association replicates in a meta-analysis of the EVE Asthma Consortium (p = 0.01). Fine mapping of the region in this study and the EVE Asthma Consortium suggests an association between PSORS1C1 and asthma. We confirmed the strong allelic association between the 17q21 locus and asthma in Latinos (IKZF3, lowest p-value: rs90792, OR: 0.67, 95% CI 0.61–0.75, p = 6 × 10⁻¹³) and replicated associations in several genes that had previously been associated with asthma in genome-wide association studies.
Admixture mapping and genome-wide association are complementary techniques that provide evidence for multiple asthma-associated loci in Latinos. Admixture mapping identifies a novel locus on 6p21 that replicates in a meta-analysis of several Latino populations, while genome-wide association confirms the previously identified locus on 17q21.
PMCID: PMC4085159  PMID: 24406073
Asthma; Latinos; Admixture Mapping; Genome-wide Association Study; Local Ancestry; 17q21; 6p21
10.  Evaluation of non-viral risk factors for nasopharyngeal carcinoma in a high-risk population of Southern China 
To understand the role of environmental and genetic influences on nasopharyngeal carcinoma (NPC) in populations at high risk of NPC, we performed a case-control study in Guangxi Province of Southern China in 2004-2005. NPC cases (n=1049) were compared to 785 NPC-free matched controls who were seropositive for IgA antibodies (IgA) to Epstein-Barr virus (EBV) capsid antigen (VCA), a predictive marker for NPC in Chinese populations. A questionnaire was used to capture exposure and NPC family history data. Risk factors associated with NPC in a multivariate analysis model were the following: 1) a first, second or third degree relative with NPC [Attributable risk (AR) = 6%, Odds ratio (OR) = 3.1, 95%CI = 2.0-4.9, p < 0.001]; 2) consumption of salted fish 3 or more times per month (AR = 3%, OR = 1.9, 95%CI = 1.1-3.5, p = 0.035); 3) exposure to domestic wood cooking fires for more than 10 years (AR = 69%, OR = 5.8, 95%CI = 2.5-13.6, p < 0.001); and 4) exposure to occupational solvents for 10 or fewer years (AR = 4%, OR = 2.6, 95%CI = 1.4-4.8, p = 0.002). Neither consumption of preserved meats nor a history of tobacco smoking was associated with NPC (p > 0.05). We also assessed the contribution of EBV/IgA/VCA antibody serostatus to NPC risk: 32.2% of NPC can be explained by IgA+ status. However, family history and environmental risk factors cumulatively explained only 2.7% of NPC development in this high-risk NPC population. These findings should have important public health implications for NPC risk reduction in endemic regions.
PMCID: PMC4406046  PMID: 19296536
Nasopharyngeal Carcinoma; Risk factor; Epidemiology; Southern China; Epstein Barr Virus
11.  Ancient human genomes suggest three ancestral populations for present-day Europeans 
Lazaridis, Iosif | Patterson, Nick | Mittnik, Alissa | Renaud, Gabriel | Mallick, Swapan | Kirsanow, Karola | Sudmant, Peter H. | Schraiber, Joshua G. | Castellano, Sergi | Lipson, Mark | Berger, Bonnie | Economou, Christos | Bollongino, Ruth | Fu, Qiaomei | Bos, Kirsten I. | Nordenfelt, Susanne | Li, Heng | de Filippo, Cesare | Prüfer, Kay | Sawyer, Susanna | Posth, Cosimo | Haak, Wolfgang | Hallgren, Fredrik | Fornander, Elin | Rohland, Nadin | Delsate, Dominique | Francken, Michael | Guinet, Jean-Michel | Wahl, Joachim | Ayodo, George | Babiker, Hamza A. | Bailliet, Graciela | Balanovska, Elena | Balanovsky, Oleg | Barrantes, Ramiro | Bedoya, Gabriel | Ben-Ami, Haim | Bene, Judit | Berrada, Fouad | Bravi, Claudio M. | Brisighelli, Francesca | Busby, George B. J. | Cali, Francesco | Churnosov, Mikhail | Cole, David E. C. | Corach, Daniel | Damba, Larissa | van Driem, George | Dryomov, Stanislav | Dugoujon, Jean-Michel | Fedorova, Sardana A. | Romero, Irene Gallego | Gubina, Marina | Hammer, Michael | Henn, Brenna M. | Hervig, Tor | Hodoglugil, Ugur | Jha, Aashish R. | Karachanak-Yankova, Sena | Khusainova, Rita | Khusnutdinova, Elza | Kittles, Rick | Kivisild, Toomas | Klitz, William | Kučinskas, Vaidutis | Kushniarevich, Alena | Laredj, Leila | Litvinov, Sergey | Loukidis, Theologos | Mahley, Robert W. | Melegh, Béla | Metspalu, Ene | Molina, Julio | Mountain, Joanna | Näkkäläjärvi, Klemetti | Nesheva, Desislava | Nyambo, Thomas | Osipova, Ludmila | Parik, Jüri | Platonov, Fedor | Posukh, Olga | Romano, Valentino | Rothhammer, Francisco | Rudan, Igor | Ruizbakiev, Ruslan | Sahakyan, Hovhannes | Sajantila, Antti | Salas, Antonio | Starikovskaya, Elena B. | Tarekegn, Ayele | Toncheva, Draga | Turdikulova, Shahlo | Uktveryte, Ingrida | Utevska, Olga | Vasquez, René | Villena, Mercedes | Voevoda, Mikhail | Winkler, Cheryl | Yepiskoposyan, Levon | Zalloua, Pierre | Zemunik, Tatijana | Cooper, Alan | Capelli, Cristian | Thomas, Mark G. 
| Ruiz-Linares, Andres | Tishkoff, Sarah A. | Singh, Lalji | Thangaraj, Kumarasamy | Villems, Richard | Comas, David | Sukernik, Rem | Metspalu, Mait | Meyer, Matthias | Eichler, Evan E. | Burger, Joachim | Slatkin, Montgomery | Pääbo, Svante | Kelso, Janet | Reich, David | Krause, Johannes
Nature  2014;513(7518):409-413.
We sequenced the genomes of a ~7,000-year-old farmer from Germany and eight ~8,000-year-old hunter-gatherers from Luxembourg and Sweden. We analyzed these and other ancient genomes [1–4] with 2,345 contemporary humans to show that most present Europeans derive from at least three highly differentiated populations: West European Hunter-Gatherers (WHG), who contributed ancestry to all Europeans but not to Near Easterners; Ancient North Eurasians (ANE) related to Upper Paleolithic Siberians [3], who contributed to both Europeans and Near Easterners; and Early European Farmers (EEF), who were mainly of Near Eastern origin but also harbored WHG-related ancestry. We model these populations’ deep relationships and show that EEF had ~44% ancestry from a “Basal Eurasian” population that split prior to the diversification of other non-African lineages.
PMCID: PMC4170574  PMID: 25230663
12.  ALDsuite: Dense marker MALD using principal components of ancestral linkage disequilibrium 
BMC Genetics  2015;16:23.
Mapping by admixture linkage disequilibrium (MALD) is a whole-genome gene mapping method that uses LD from extended blocks of ancestry inherited from parental populations among admixed individuals to map associations for diseases that vary in prevalence among human populations. Because the extended LD is queried for marker association with ancestry, far fewer comparisons are needed than in standard genome-wide association studies. As ancestral population LD tends to confound the analysis of admixture LD, the earliest algorithms for MALD required marker sets sufficiently sparse to lack significant ancestral LD between markers. However, current genotyping technologies routinely provide dense SNP data, which convey more information than sparse sets if this information can be efficiently used. There are currently no software solutions that offer both local ancestry inference using dense marker data and disease association statistics.
We present here an R package, ALDsuite, which accounts for local LD using principal components of haplotypes from surrogate ancestral population data, and includes tools for quality control of data, MALD, downstream analysis of results and visualization graphics.
ALDsuite offers fast, accurate estimation of global and local ancestry and comes bundled with the tools needed for MALD, from data quality control through mapping and visualization of disease genes.
PMCID: PMC4408589  PMID: 25886794
Admixture linkage disequilibrium; MALD; Admixture inference
13.  Factors associated with degree of atopy in Latino children in a nationwide pediatric sample: The GALA II Study 
Atopy varies by ethnicity even within Latino groups. This variation may be due to environmental, socio-cultural or genetic factors.
To examine risk factors for atopy within a nationwide study of U.S. Latino children with and without asthma.
Aeroallergen skin test response was analyzed in 1830 U.S. Latino subjects. Key determinants of atopy included: country/region of origin, generation in the U.S., acculturation, genetic ancestry and site to which individuals migrated. Serial multivariate zero-inflated negative binomial regressions, stratified by asthma status, examined the association of each key determinant variable with the number of positive skin tests. In addition, the independent effect of each key variable was determined by including all key variables in the final models.
In baseline analyses, African ancestry was associated with 3 times as many positive skin tests in participants with asthma (95% CI:1.62–5.57) and 3.26 times as many positive skin tests in control participants (95% CI: 1.02–10.39). Generation and recruitment site were also associated with atopy in crude models. In final models adjusted for key variables, Puerto Rican [exp(β) (95%CI): 1.31(1.02–1.69)] and mixed ethnicity [exp(β) (95%CI):1.27(1.03–1.56)] asthmatics had a greater probability of positive skin tests compared to Mexican asthmatics. Ancestry associations were abrogated by recruitment site, but not region of origin.
Puerto Rican ethnicity and mixed origin were associated with degree of atopy within U.S. Latino children with asthma. African ancestry was not associated with degree of atopy after adjusting for recruitment site. Local environment variation, represented by site, was associated with degree of sensitization.
PMCID: PMC3788073  PMID: 23684070
Latino; atopy; region of origin; genetic ancestry; immigration; skin test; aeroallergen
14.  Evaluation and Integration of Genetic Signature for Prediction Risk of Nasopharyngeal Carcinoma in Southern China 
BioMed Research International  2014;2014:434072.
Genetic factors, as well as environmental factors, play a role in the development of nasopharyngeal carcinoma (NPC). A number of single nucleotide polymorphisms (SNPs) have been reported to be associated with NPC. To confirm these genetic associations with NPC, two independent case-control studies from Southern China comprising 1166 NPC cases and 2340 controls were conducted. Seven SNPs in ITGA9 at 3p21.3 and 9 SNPs within the 6p21.3 HLA region were genotyped. To explore the potential clinical application of these genetic markers in NPC, we further evaluated the predictive/diagnostic role of significant SNPs by calculating the area under the curve (AUC). The reported associations between ITGA9 variants and NPC were not replicated. Multiple loci of GABBR1, HLA-F, HLA-A, and HCG9 were statistically significant in both cohorts (Pcombined ranging from 5.96 × 10⁻¹⁷ to 0.02). We show for the first time that these factors influence NPC development independent of environmental risk factors. This study also indicated that a SNP alone cannot serve as a predictive/diagnostic marker for NPC. Integrating the most significant SNP with IgA antibody status to EBV, which is presently used as a screening/diagnostic marker for NPC in Chinese populations, did not improve the AUC estimate for diagnosis of NPC.
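The AUC used above to judge a marker's predictive value can be read as the probability that a randomly chosen case has a higher marker score than a randomly chosen control. The following is a minimal sketch of that pairwise definition, not the authors' analysis; the function name `auc` and the risk-allele counts in the example are hypothetical illustration data.

```python
def auc(case_scores, control_scores):
    """Area under the ROC curve via pairwise comparison.

    Counts the fraction of (case, control) pairs in which the case
    has the higher score; tied pairs contribute one half.
    """
    if not case_scores or not control_scores:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical scores: number of risk alleles (0, 1, or 2) at a single SNP.
cases = [2, 1, 1, 0]
controls = [1, 0, 0, 0]
print(auc(cases, controls))
```

A single SNP takes only three values, so many case-control pairs tie and the AUC tends to stay close to 0.5 unless the effect is very large, which is consistent with the abstract's conclusion that a SNP alone is a poor diagnostic marker.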
PMCID: PMC4142549  PMID: 25180181
15.  NPHS2 Variation in Sporadic Focal Segmental Glomerulosclerosis 
Mutations in NPHS2, the gene that encodes podocin, are well-established causes of both familial and sporadic steroid-resistant focal segmental glomerulosclerosis (FSGS) in the pediatric population, but have not been well-characterized in late-onset disease. To investigate the role of NPHS2 polymorphisms in sporadic cases of late-onset FSGS, we studied 377 biopsy-confirmed FSGS cases and 919 controls. We identified 18 single nucleotide polymorphisms (SNPs) by resequencing a subgroup of cases and controls, and subsequently genotyped African-American and European-American cases and controls for five missense SNPs, three SNPs within introns, and four SNPs in the 3′ untranslated region. No homozygotes or compound heterozygotes were observed for any missense mutation. R138Q carriers were more frequent among FSGS cases relative to controls (OR = 4.9, P = 0.06), but heterozygosity for the other four missense mutations was equally distributed among FSGS cases and controls. Finally, a common haplotype of noncoding SNPs carried by 20% of African-Americans, but not observed in European-Americans, was strongly associated with a 50% reduction in risk for sporadic FSGS (OR = 0.5, P = 0.001). These results indicate that genetic variation or mutation of NPHS2 may play a role in late-onset sporadic FSGS.
PMCID: PMC4096868  PMID: 17942957
16.  Genetic Variations Affecting Serum Carcinoembryonic Antigen Levels and Status of Regional Lymph Nodes in Patients with Sporadic Colorectal Cancer from Southern China 
PLoS ONE  2014;9(6):e97923.
Serum carcinoembryonic antigen (sCEA) level might be an indicator of disease. Indeed, an elevated sCEA level is a prognostic factor in colorectal cancer (CRC) patients. However, the genetic determinants of sCEA level in healthy and CRC populations remain unclear. We therefore investigated the genetic markers associated with elevated sCEA level in these two populations and their clinical implications.
Methods and Findings
A genome-wide association study (GWAS) was conducted in a cohort of 4,346 healthy male adults using the Illumina Omni 1M chip. Candidate SNPs associated with elevated sCEA levels were validated in 194 CRC patients on the ABI TaqMan platform. Eight candidate SNPs were validated in CRC patients. The rs1047781 (chr19-FUT2) (A/T) was associated with elevated sCEA levels, and rs8176746 (chr9-ABO) was associated with regional lymph node metastasis in the CRC patients. The preoperative sCEA level was a risk factor for tumor recurrence within 5 years after surgery (OR = 1.427, 95% CI: 1.005–1.843, P = 0.006). It was also one of the risk factors for regional lymph node metastasis (OR = 2.266, 95% CI: 1.196–4.293, P = 0.012). The sCEA level in rs1047781-T carriers was higher than that in the A carriers in CRC patients without lymph node metastasis (P = 0.006). Regional lymph node metastasis was more common in patients homozygous (AA) for rs8176746 than in heterozygous AG carriers (P = 0.022). In addition, rs1047781-AT and TT CRC patients exhibited worse disease-free survival than AA genotype carriers (P = 0.023).
We found candidate SNPs associated with elevated sCEA levels in both healthy males and the CRC population. Rs1047781 (chr19-FUT2) may be a susceptibility locus for recurrence of CRC in a population from Southern China.
PMCID: PMC4062418  PMID: 24941225
17.  APOL1 Risk Variants, Race, and Progression of Chronic Kidney Disease 
The New England journal of medicine  2013;369(23):2183-2196.
Among patients in the United States with chronic kidney disease, black patients are at increased risk for end-stage renal disease, as compared with white patients.
In two studies, we examined the effects of variants in the gene encoding apolipoprotein L1 (APOL1) on the progression of chronic kidney disease. In the African American Study of Kidney Disease and Hypertension (AASK), we evaluated 693 black patients with chronic kidney disease attributed to hypertension. In the Chronic Renal Insufficiency Cohort (CRIC) study, we evaluated 2955 white patients and black patients with chronic kidney disease (46% of whom had diabetes) according to whether they had 2 copies of high-risk APOL1 variants (APOL1 high-risk group) or 0 or 1 copy (APOL1 low-risk group). In the AASK study, the primary outcome was a composite of end-stage renal disease or a doubling of the serum creatinine level. In the CRIC study, the primary outcomes were the slope in the estimated glomerular filtration rate (eGFR) and the composite of end-stage renal disease or a reduction of 50% in the eGFR from baseline.
In the AASK study, the primary outcome occurred in 58.1% of the patients in the APOL1 high-risk group and in 36.6% of those in the APOL1 low-risk group (hazard ratio in the high-risk group, 1.88; P<0.001). There was no interaction between APOL1 status and trial interventions or the presence of baseline proteinuria. In the CRIC study, black patients in the APOL1 high-risk group had a more rapid decline in the eGFR and a higher risk of the composite renal outcome than did white patients, among those with diabetes and those without diabetes (P<0.001 for all comparisons).
Renal risk variants in APOL1 were associated with the higher rates of end-stage renal disease and progression of chronic kidney disease that were observed in black patients as compared with white patients, regardless of diabetes status. (Funded by the National Institute of Diabetes and Digestive and Kidney Diseases and others.)
PMCID: PMC3969022  PMID: 24206458
18.  F8 haplotype and inhibitor risk: results from the Hemophilia Inhibitor Genetics Study (HIGS) Combined Cohort 
Ancestral background, specifically African descent, confers higher risk for development of inhibitory antibodies to factor VIII (FVIII) in hemophilia A. It has been suggested that differences in the distribution of factor VIII gene (F8) haplotypes, and mismatch between endogenous F8 haplotypes and those comprising products used for treatment could contribute to risk.
Design and Methods
Data from the HIGS Combined Cohort were used to determine the association between F8 haplotype 3 (H3) vs. haplotypes 1 and 2 (H1+H2) and inhibitor risk among individuals of genetically-determined African descent. Other variables known to affect inhibitor risk including type of F8 mutation and HLA were included in the analysis. A second research question regarding risk related to mismatch in endogenous F8 haplotype and recombinant FVIII products used for treatment was addressed.
H3 was associated with higher inhibitor risk among those genetically-identified (N=49) as of African ancestry, but the association did not remain significant after adjustment for F8 mutation type and the HLA variables. Among subjects of all racial ancestries enrolled in HIGS who reported early use of recombinant products (N=223), mismatch in endogenous haplotype and the FVIII proteins constituting the products used did not confer greater risk for inhibitor development.
H3 was not an independent predictor of inhibitor risk. Further, our findings did not support a higher risk of inhibitors in the presence of a haplotype mismatch between the FVIII molecule infused and that of the individual.
PMCID: PMC3521089  PMID: 22958194
F8 haplotype; FVIII inhibitors; haplotype mismatch
19.  Genome-Wide and Differential Proteomic Analysis of Hepatitis B Virus and Aflatoxin B1 Related Hepatocellular Carcinoma in Guangxi, China 
PLoS ONE  2013;8(12):e83465.
Both hepatitis B virus (HBV) and aflatoxin B1 (AFB1) exposure can cause liver damage as well as increase the probability of hepatocellular carcinoma (HCC). To investigate the underlying genetic changes that may influence development of HCC associated with HBV infection and AFB1 exposure, HCC patients were subdivided into 4 groups depending upon HBV and AFB1 exposure status: HBV(+)/AFB1(+), HBV(+)/AFB1(-), HBV(-)/AFB1(+), and HBV(-)/AFB1(-). Genetic abnormalities and protein expression profiles were analyzed by array-based comparative genomic hybridization and isobaric tagging for quantitation. A total of 573 chromosomal aberrations (CNAs), including 184 increased and 389 decreased, were detected in our study population. Twenty-five recurrently altered regions (RARs; chromosomal alterations observed in ≥10 patients) in chromosomes were identified. Loss of 4q13.3-q35.2, 13q12.1-q21.2 and gain of 7q11.2-q35 were observed with a higher frequency in the HBV(+)/AFB1(+), HBV(+)/AFB1(-) and HBV(-)/AFB1(+) groups compared to the HBV(-)/AFB1(-) group. Loss of 8p12-p23.2 was associated with high TNM stage tumors (P = 0.038) and was an unfavorable prognostic factor for tumor-free survival (P = 0.045). A total of 133 differentially expressed proteins were identified in iTRAQ proteomics analysis, 69 (51.8%) of which mapped within identified RARs. The most common biological processes affected by HBV and AFB1 status in HCC tumorigenesis were detoxification and drug metabolism pathways, antigen processing and anti-apoptosis pathways. Expression of AKR1B10 was increased significantly in the HBV(+)/AFB1(+) and HBV(-)/AFB1(+) groups. A significant correlation between the expression of AKR1B10 mRNA and protein levels as well as AKR1B10 copy number was observed, which suggests that AKR1B10 may play a role in AFB1-related hepatocarcinogenesis. In summary, a number of genetic and gene expression alterations were found to be associated with HBV- and AFB1-related HCC. The possible synergistic effects of HBV and AFB1 in hepatocarcinogenesis warrant further investigation.
PMCID: PMC3877066  PMID: 24391771
20.  A Genome-Wide Search for Linkage of Estimated Glomerular Filtration Rate (eGFR) in the Family Investigation of Nephropathy and Diabetes (FIND) 
PLoS ONE  2013;8(12):e81888.
Estimated glomerular filtration rate (eGFR), a measure of kidney function, is heritable, suggesting that genes influence renal function. Genes that influence eGFR have been identified through genome-wide association studies. However, family-based linkage approaches may identify loci that explain a larger proportion of the heritability. This study used genome-wide linkage and association scans to identify quantitative trait loci (QTL) that influence eGFR.
Genome-wide linkage and sparse association scans of eGFR were performed in families ascertained by probands with advanced diabetic nephropathy (DN) from the multi-ethnic Family Investigation of Nephropathy and Diabetes (FIND) study. This study included 954 African Americans (AA), 781 American Indians (AI), 614 European Americans (EA) and 1,611 Mexican Americans (MA). A total of 3,960 FIND participants were genotyped for 6,000 single nucleotide polymorphisms (SNPs) using the Illumina Linkage IVb panel. GFR was estimated by the Modification of Diet in Renal Disease (MDRD) formula.
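As an illustration of the eGFR estimate mentioned above, the following is a minimal sketch of the four-variable MDRD equation. It assumes the IDMS-traceable re-expression (coefficient 175, race factor 1.212); the abstract does not state which version of the equation the FIND investigators used, and the function name `mdrd_egfr` and its parameters are hypothetical.

```python
def mdrd_egfr(scr_mg_dl, age_years, female, black):
    """Estimate GFR (mL/min/1.73 m^2) with the 4-variable MDRD equation.

    Assumption: IDMS-traceable form (coefficient 175). The original
    equation used 186 for non-standardized creatinine assays.
    """
    if scr_mg_dl <= 0 or age_years <= 0:
        raise ValueError("serum creatinine and age must be positive")
    # Core term: eGFR falls as serum creatinine and age rise.
    egfr = 175.0 * scr_mg_dl ** -1.154 * age_years ** -0.203
    if female:
        egfr *= 0.742  # sex adjustment
    if black:
        egfr *= 1.212  # race adjustment in the IDMS-traceable form
    return egfr


if __name__ == "__main__":
    # Illustrative input only, not data from the FIND study.
    print(round(mdrd_egfr(1.0, 50, female=False, black=False), 1))
```

The equation is a pure function of serum creatinine, age, sex, and race, which is why it can be applied uniformly across a large multi-ethnic cohort without additional measurements.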
The non-parametric linkage analysis, accounting for the effects of diabetes duration and BMI, identified the strongest evidence for linkage of eGFR on chromosome 20q11 (log of the odds [LOD] = 3.34; P = 4.4×10^−5) in MA and chromosome 15q12 (LOD = 2.84; P = 1.5×10^−4) in EA. In all subjects, the strongest linkage signal for eGFR was detected on chromosome 10p12 (P = 5.5×10^−4) at 44 cM near marker rs1339048. A subsequent association scan in both ancestry-specific groups and the entire population identified several SNPs significantly associated with eGFR across the genome.
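The relationship between the LOD scores and P values quoted above can be sketched as follows. This assumes the common convention that 2·ln(10)·LOD follows a chi-square distribution with 1 degree of freedom and that the pointwise P value is one-sided; the abstract does not say how its P values were derived, and the function name `lod_to_pointwise_p` is hypothetical.

```python
import math


def lod_to_pointwise_p(lod):
    """Convert a LOD score to a one-sided pointwise P value.

    Assumption: 2*ln(10)*LOD ~ chi-square with 1 df, halved because
    linkage is tested in one direction only.
    """
    if lod < 0:
        raise ValueError("LOD score must be non-negative")
    chi2 = 2.0 * math.log(10.0) * lod
    # Upper tail of a 1-df chi-square: P(X > x) = erfc(sqrt(x / 2)).
    upper_tail = math.erfc(math.sqrt(chi2 / 2.0))
    return 0.5 * upper_tail


if __name__ == "__main__":
    for lod in (3.34, 2.84):
        print(lod, f"{lod_to_pointwise_p(lod):.2e}")
```

Under these assumptions, LOD scores of 3.34 and 2.84 give values close to the reported 4.4×10^−5 and 1.5×10^−4.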
The present study describes the localization of QTL influencing eGFR on 20q11 in MA, 15q21 in EA and 10p12 in the combined ethnic groups participating in the FIND study. Identification of causal genes/variants influencing eGFR, within these linkage and association loci, will open new avenues for functional analyses and development of novel diagnostic markers for DN.
PMCID: PMC3866106  PMID: 24358131
21.  Host genes associated with HIV/AIDS: advances in gene discovery 
Trends in genetics : TIG  2010;26(3):119-131.
Twenty-five years after the discovery of HIV as the cause of AIDS, there is still no effective vaccine and no cure for this disease. HIV susceptibility shows a substantial degree of individual heterogeneity, much of which can be conferred by host genetic variation. In an effort to discover host factors required for HIV replication, identify critical pathogenic pathways, and reveal the full armament of host defenses, there has been a shift from candidate gene studies to unbiased genome-wide genetic and functional studies. However, the number of securely identified host factors involved in HIV disease remains small, explaining only ~15–20% of the observed heterogeneity, most of which is attributable to HLA. Multidisciplinary approaches integrating methods from genetic epidemiology to systems biology will be required to fully understand viral-host interactions and to effectively combat HIV/AIDS.
PMCID: PMC3792714  PMID: 20149939
22.  Risk Factors for Symptomatic Hyperlactatemia and Lactic Acidosis Among Combination Antiretroviral Therapy-Treated Adults in Botswana: Results from a Clinical Trial 
Nucleoside analogue reverse transcriptase inhibitors are an integral component of combination antiretroviral treatment regimens. However, their ability to inhibit polymerase-γ has been associated with several mitochondrial toxicities, including potentially life-threatening lactic acidosis. A total of 650 antiretroviral-naive adults (69% female) initiated combination antiretroviral therapy (cART) and were intensively screened for toxicities including lactic acidosis as part of a 3-year clinical trial in Botswana. Patients were categorized as having no lactic acidosis symptoms, minor symptoms but lactate <4.4 mmol/liter, or symptoms with lactate ≥4.4 mmol/liter [moderate to severe symptomatic hyperlactatemia (SH) or lactic acidosis (LA)]. Of 650 participants, 111 (17.1%) developed symptoms and/or laboratory results suggestive of lactic acidosis and had a serum lactate drawn; 97 (87.4%) of these were female. There were 20 events, 13 with SH and 7 with LA; all 20 (100%) occurred in females (p<0.001). Cox proportional hazard analysis limited to the 451 females revealed that a higher baseline BMI was predictive of the development of SH/LA [aHR=1.17 per one-unit increase (1.08–1.25), p<0.0001]. Ordered logistic regression performed among all 650 patients revealed that a lower baseline hemoglobin [aOR=1.28 per one-unit decrease (1.1–1.49), p=0.002] and randomization to d4T/3TC-based cART [aOR=1.76 relative to ZDV/3TC (1.03–3.01), p=0.04] were predictive of symptoms and/or the development of SH/LA. cART-treated women in sub-Saharan Africa, especially those with higher body mass indices, should receive additional monitoring for SH/LA. Women presently receiving d4T/3TC-based cART in such settings also warrant more intensive monitoring.
PMCID: PMC3399551  PMID: 22540188
23.  Association Study of Common Genetic Variants and HIV-1 Acquisition in 6,300 Infected Cases and 7,200 Controls 
PLoS Pathogens  2013;9(7):e1003515.
Multiple genome-wide association studies (GWAS) have been performed in HIV-1 infected individuals, identifying common genetic influences on viral control and disease course. Similarly, common genetic correlates of acquisition of HIV-1 after exposure have been interrogated using GWAS, although in generally small samples. Under the auspices of the International Collaboration for the Genomics of HIV, we have combined the genome-wide single nucleotide polymorphism (SNP) data collected by 25 cohorts, studies, or institutions on HIV-1 infected individuals and compared them to carefully matched population-level data sets (a list of all collaborators appears in Note S1 in Text S1). After imputation using the 1000 Genomes Project reference panel, we tested approximately 8 million common DNA variants (SNPs and indels) for association with HIV-1 acquisition in 6,334 infected patients and 7,247 population samples of European ancestry. Initial association testing identified the SNP rs4418214, the C allele of which is known to tag the HLA-B*57:01 and B*27:05 alleles, as genome-wide significant (p = 3.6×10^−11). However, restricting analysis to individuals with a known date of seroconversion suggested that this association was due to the frailty bias in studies of lethal diseases. Further analyses including testing recessive genetic models, testing for bulk effects of non-genome-wide significant variants, stratifying by sexual or parenteral transmission risk and testing previously reported associations showed no evidence for genetic influence on HIV-1 acquisition (with the exception of CCR5Δ32 homozygosity). Thus, these data suggest that genetic influences on HIV acquisition are either rare or have smaller effects than can be detected by this sample size.
Author Summary
Comparing the frequency differences between common DNA variants in disease-affected cases and in unaffected controls has been successful in uncovering the genetic component of multiple diseases. This approach is most effective when large samples of cases and controls are available. Here we combine information from multiple studies of HIV infected patients, including more than 6,300 HIV+ individuals, with data from 7,200 general population samples of European ancestry to test nearly 8 million common DNA variants for an impact on HIV acquisition. With this large sample we did not observe any single common genetic variant that significantly associated with HIV acquisition. We further tested 22 variants previously identified by smaller studies as influencing HIV acquisition. With the exception of a deletion polymorphism in the CCR5 gene (CCR5Δ32) we found no convincing evidence to support these previous associations. Taken together these data suggest that genetic influences on HIV acquisition are either rare or have smaller effects than can be detected by this sample size.
PMCID: PMC3723635  PMID: 23935489
24.  Apolipoprotein L1 gene variants associate with hypertension-attributed nephropathy and the rate of kidney function decline in African Americans 
Kidney international  2012;83(1):114-120.
Despite intensive anti-hypertensive therapy, there was a high incidence of renal end-points in participants of the African American Study of Kidney Disease and Hypertension (AASK) cohort. To better understand this, coding variants in the apolipoprotein L1 (APOL1) and the non-muscle myosin heavy chain 9 (MYH9) genes were evaluated for an association with hypertension-attributed nephropathy and clinical outcomes in a case-control study. Clinical data and DNA were available for 675 AASK participant cases and 618 African American non-nephropathy control individuals. APOL1 G1 and G2 and MYH9 E1 variants, along with 44 ancestry informative markers, were genotyped, and allele frequency differences between cases and controls were analyzed using multivariable logistic regression models adjusting for ancestry, age, and gender. In recessive models, APOL1 risk variants were significantly associated with kidney disease in all cases compared to controls, with an odds ratio of 2.57. In AASK cases with more advanced disease, such as a baseline urine protein to creatinine ratio over 0.6 g/g or a serum creatinine over 3 mg/dL during follow-up, the association was strengthened, with odds ratios of 6.29 and 4.61, respectively. APOL1 risk variants were consistently associated with renal disease progression across medication classes and blood pressure targets. Thus, kidney disease in AASK participants was strongly associated with APOL1 renal risk variants.
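The recessive model referred to above can be sketched as a genotype coding step. This is a minimal illustration, assuming that an individual is coded as high-risk only when both APOL1 alleles are risk alleles (G1 or G2, in any combination); the label "G0" for a non-risk allele and the function name `apol1_high_risk` are assumptions for the example, not taken from the abstract.

```python
RISK_ALLELES = {"G1", "G2"}  # APOL1 renal risk alleles named in the abstract


def apol1_high_risk(allele_a, allele_b):
    """Recessive coding: True only if both alleles are risk alleles.

    Assumption: "G0" (or any other label) denotes a non-risk allele.
    """
    n_risk = sum(allele in RISK_ALLELES for allele in (allele_a, allele_b))
    return n_risk == 2


if __name__ == "__main__":
    # Hypothetical genotypes, not study data.
    for genotype in [("G1", "G1"), ("G1", "G2"), ("G0", "G2"), ("G0", "G0")]:
        print(genotype, int(apol1_high_risk(*genotype)))
```

The resulting 0/1 indicator is the kind of predictor that would enter a logistic regression alongside covariates such as ancestry, age, and gender.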
PMCID: PMC3484228  PMID: 22832513
25.  APOBEC3B Deletion and Risk of HIV-1 Acquisition 
The Journal of infectious diseases  2009;200(7):1054-1058.
The human APOBEC3 family of cytidine deaminases provides intrinsic immunity to retroviral infection. A naturally occurring 29.5-kb deletion removes the entire APOBEC3B gene. We examined the impact of the APOBEC3B gene deletion in >4000 individuals from five HIV-1 natural history cohorts. The hemizygous genotype had no effect on either infection or progression. However, the homozygous deletion was significantly associated with unfavorable outcomes for HIV-1 acquisition (OR=7.37, P=0.024), progression to AIDS (RH = 4.01, P=0.03), and viral set-point (P=0.04). These findings suggest that the loss of APOBEC3B may increase host susceptibility to HIV-1/AIDS and warrant further study.
PMCID: PMC3690486  PMID: 19698078

