The Hedgehog (Hh) gene family codes for a class of secreted proteins composed of two active domains that act as signalling molecules during embryo development, namely for the development of the nervous and skeletal systems and the formation of the testis cord. While only one Hh gene is found typically in invertebrate genomes, most vertebrates species have three (Sonic hedgehog – Shh; Indian hedgehog – Ihh; and Desert hedgehog – Dhh), each with different expression patterns and functions, which likely helped promote the increasing complexity of vertebrates and their successful diversification. In this study, we used comparative genomic and adaptive evolutionary analyses to characterize the evolution of the Hh genes in vertebrates following the two major whole genome duplication (WGD) events. To overcome the lack of Hh-coding sequences on avian publicly available databases, we used an extensive dataset of 45 avian and three non-avian reptilian genomes to show that birds have all three Hh paralogs. We find suggestions that following the WGD events, vertebrate Hh paralogous genes evolved independently within similar linkage groups and under different evolutionary rates, especially within the catalytic domain. The structural regions around the ion-binding site were identified to be under positive selection in the signaling domain. These findings contrast with those observed in invertebrates, where different lineages that experienced gene duplication retained similar selective constraints in the Hh orthologs. Our results provide new insights on the evolutionary history of the Hh gene family, the functional roles of these paralogs in vertebrate species, and on the location of mutational hotspots.
As genome-wide sequence analyses for complex human disease determinants are expanding, it is increasingly necessary to develop strategies to promote discovery and validation of potential disease-gene associations.
Here we present a dynamic web-based platform – GWATCH – that automates and facilitates four steps in genetic epidemiological discovery: 1) Rapid gene association search and discovery analysis of large genome-wide datasets; 2) Expanded visual display of gene associations for genome-wide variants (SNPs, indels, CNVs), including Manhattan plots, 2D and 3D snapshots of any gene region, and a dynamic genome browser illustrating gene association chromosomal regions; 3) Real-time validation/replication of candidate or putative genes suggested from other sources, limiting Bonferroni genome-wide association study (GWAS) penalties; 4) Open data release and sharing by eliminating privacy constraints (The National Human Genome Research Institute (NHGRI) Institutional Review Board (IRB), informed consent, The Health Insurance Portability and Accountability Act (HIPAA) of 1996 etc.) on unabridged results, which allows for open access comparative and meta-analysis.
GWATCH is suitable for both GWAS and whole genome sequence association datasets. We illustrate the utility of GWATCH with three large genome-wide association studies for HIV-AIDS resistance genes screened in large multicenter cohorts; however, association datasets from any study can be uploaded and analyzed by GWATCH.
AIDS; HIV; Complex diseases; Genome-wide association studies (GWAS); Whole genome sequencing (WGS)
Adaptation of mammals to terrestrial life was facilitated by the unique vertebrate trait of body hair, which occurs in a range of morphological patterns. Keratin associated proteins (KRTAPs), the major structural hair shaft proteins, are largely responsible for hair variation.
We exhaustively characterized the KRTAP gene family in 22 mammalian genomes, confirming the existence of 30 KRTAP subfamilies evolving at different rates with varying degrees of diversification and homogenization. Within the two major classes of KRTAPs, the high cysteine (HS) subfamily experienced strong concerted evolution, high rates of gene conversion/recombination and high GC content. In contrast, high glycine-tyrosine (HGT) KRTAPs showed evidence of positive selection and low rates of gene conversion/recombination. Species with more hair and of higher complexity tended to have more KRATP genes (gene expansion). The sloth, with long and coarse hair, had the most KRTAP genes (175 with 141 being intact). By contrast, the “hairless” dolphin had 35 KRTAPs and the highest pseudogenization rate (74% relative to the 19% mammalian average). Unique hair-related phenotypes, such as scales (armadillo) and spines (hedgehog), were correlated with changes in KRTAPs. Gene expression variation probably also influences hair diversification patterns, for example human have an identical KRTAP repertoire as apes, but much less hair.
We hypothesize that differences in KRTAP gene repertoire and gene expression, together with distinct rates of gene conversion/recombination, pseudogenization and positive selection, are likely responsible for micro and macro-phenotypic hair diversification among mammals in response to adaptations to ecological pressures.
Electronic supplementary material
The online version of this article (doi:10.1186/1471-2164-15-779) contains supplementary material, which is available to authorized users.
Concerted evolution; Gene family; Keratin Associated Proteins; Keratin; Hair; Gene conversion; Recombination; Positive selection
Allotypes of the natural killer (NK) cell receptor KIR3DL1 vary in both NK cell expression patterns and inhibitory capacity upon binding to their ligands, HLA-B Bw4 molecules, present on target cells. Using a sample size of over 1,500 human immunodeficiency virus (HIV)+ individuals, we show that various distinct allelic combinations of the KIR3DL1 and HLA-B loci significantly and strongly influence both AIDS progression and plasma HIV RNA abundance in a consistent manner. These genetic data correlate very well with previously defined functional differences that distinguish KIR3DL1 allotypes. The various epistatic effects observed here for common, distinct KIR3DL1 and HLA-B Bw4 combinations are unprecedented with regard to any pair of genetic loci in human disease, and indicate that NK cells may have a critical role in the natural history of HIV infection.
Genetic factors, as well as environmental factors, play a role in development of nasopharyngeal carcinoma (NPC). A number of single nucleotide polymorphisms (SNPs) have been reported to be associated with NPC. To confirm these genetic associations with NPC, two independent case-control studies from Southern China comprising 1166 NPC cases and 2340 controls were conducted. Seven SNPs in ITGA9 at 3p21.3 and 9 SNPs within the 6p21.3 HLA region were genotyped. To explore the potential clinical application of these genetic markers in NPC, we further evaluate the predictive/diagnostic role of significant SNPs by calculating the area under the curve (AUC). Results. The reported associations between ITGA9 variants and NPC were not replicated. Multiple loci of GABBR1, HLA-F, HLA-A, and HCG9 were statistically significant in both cohorts (Pcombined range from 5.96 × 10−17 to 0.02). We show for the first time that these factors influence NPC development independent of environmental risk factors. This study also indicated that the SNP alone cannot serve as a predictive/diagnostic marker for NPC. Integrating the most significant SNP with IgA antibodies status to EBV, which is presently used as screening/diagnostic marker for NPC in Chinese populations, did not improve the AUC estimate for diagnosis of NPC.
Domestic cats enjoy an extensive veterinary medical surveillance which has described nearly 250 genetic diseases analogous to human disorders. Feline infectious agents offer powerful natural models of deadly human diseases, which include feline immunodeficiency virus, feline sarcoma virus and feline leukemia virus. A rich veterinary literature of feline disease pathogenesis and the demonstration of a highly conserved ancestral mammal genome organization make the cat genome annotation a highly informative resource that facilitates multifaceted research endeavors.
Here we report a preliminary annotation of the whole genome sequence of Cinnamon, a domestic cat living in Columbia (MO, USA), bisulfite sequencing of Boris, a male cat from St. Petersburg (Russia), and light 30× sequencing of Sylvester, a European wildcat progenitor of cat domestication. The annotation includes 21,865 protein-coding genes identified by a comparative approach, 217 loci of endogenous retrovirus-like elements, repetitive elements which comprise about 55.7% of the whole genome, 99,494 new SNVs, 8,355 new indels, 743,326 evolutionary constrained elements, and 3,182 microRNA homologues. The methylation sites study shows that 10.5% of cat genome cytosines are methylated. An assisted assembly of a European wildcat, Felis silvestris silvestris, was performed; variants between F. silvestris and F. catus genomes were derived and compared to F. catus.
The presented genome annotation extends beyond earlier ones by closing gaps of sequence that were unavoidable with previous low-coverage shotgun genome sequencing. The assembly and its annotation offer an important resource for connecting the rich veterinary and natural history of cats to genome discovery.
Felis catus; Domestic cat; Felis silvestris silvestris; European wildcat; Genome sequence; Annotation; Assembly
The Dominant White locus (W) in the domestic cat demonstrates pleiotropic effects exhibiting complete penetrance for absence of coat pigmentation and incomplete penetrance for deafness and iris hypopigmentation. We performed linkage analysis using a pedigree segregating White to identify KIT (Chr. B1) as the feline W locus. Segregation and sequence analysis of the KIT gene in two pedigrees (P1 and P2) revealed the remarkable retrotransposition and evolution of a feline endogenous retrovirus (FERV1) as responsible for two distinct phenotypes of the W locus, Dominant White, and white spotting. A full-length (7125 bp) FERV1 element is associated with white spotting, whereas a FERV1 long terminal repeat (LTR) is associated with all Dominant White individuals. For purposes of statistical analysis, the alternatives of wild-type sequence, FERV1 element, and LTR-only define a triallelic marker. Taking into account pedigree relationships, deafness is genetically linked and associated with this marker; estimated P values for association are in the range of 0.007 to 0.10. The retrotransposition interrupts a DNAase I hypersensitive site in KIT intron 1 that is highly conserved across mammals and was previously demonstrated to regulate temporal and tissue-specific expression of KIT in murine hematopoietic and melanocytic cells. A large-population genetic survey of cats (n = 270), representing 30 cat breeds, supports our findings and demonstrates statistical significance of the FERV1 LTR and full-length element with Dominant White/blue iris (P < 0.0001) and white spotting (P < 0.0001), respectively.
White; domestic cat; deaf; white spotting; retrotransposition; FERV1
Although highly active antiretroviral therapy has improved survivorship dramatically and decreased the incidence of cytomegalovirus retinitis among patients with AIDS, other ophthalmic complications continue to occur. One complication observed in ~12% of HIV-infected patients is a presumed neuroretinal disorder (NRD), manifested as decreased contrast sensitivity and associated with vague subjective complaints of hazy vision. Pathologically, patients with AIDS even without ocular opportunistic infections have loss of optic nerve axons, suggestive of mitochondrial dysfunction. We explored whether variation in mitochondrial DNA was associated with time to NRD in HIV-infected patients in the Longitudinal Study of Ocular Complications of AIDS cohort. Within the Western European, or “N”, mitochondrial DNA macrohaplogroup, haplogroup J, was associated with 80% decrease in the risk of progression to NRD during the study (hazard ratio = 0.20, P = 0.039) and suggested an independent association with protection against NRD in a cross-section of all patients taken at enrollment (1.5% vs. 8.9% in patients with vs. without haplogroup J, respectively, P = 0.05). These data suggest that mitochondrial genotype may influence propensity to develop HIV-associated NRD in patients with AIDS.
AIDS; mitochondrial DNA; neuroretinal disorder
Repetitive short interspersed elements (SINEs) are retrotransposons ubiquitous in mammalian genomes and are highly informative markers to identify species and phylogenetic associations. Of these, SINEs unique to the order Carnivora (CanSINEs) yield novel insights on genome evolution in domestic dogs and cats, but less is known about their role in related carnivores. In particular, genome-wide assessment of CanSINE evolution has yet to be completed across the Feliformia (cat-like) suborder of Carnivora. Within Feliformia, the cat family Felidae is composed of 37 species and numerous subspecies organized into eight monophyletic lineages that likely arose 10 million years ago. Using the Felidae family as a reference phylogeny, along with representative taxa from other families of Feliformia, the origin, proliferation and evolution of CanSINEs within the suborder were assessed.
We identified 93 novel intergenic CanSINE loci in Feliformia. Sequence analyses separated Feliform CanSINEs into two subfamilies, each characterized by distinct RNA polymerase binding motifs and phylogenetic associations. Subfamily I CanSINEs arose early within Feliformia but are no longer under active proliferation. Subfamily II loci are more recent, exclusive to Felidae and show evidence for adaptation to extant RNA polymerase activity. Further, presence/absence distributions of CanSINE loci are largely congruent with taxonomic expectations within Feliformia and the less resolved nodes in the Felidae reference phylogeny present equally ambiguous CanSINE data. SINEs are thought to be nearly impervious to excision from the genome. However, we observed a nearly complete excision of a CanSINEs locus in puma (Puma concolor). In addition, we found that CanSINE proliferation in Felidae frequently targeted existing CanSINE loci for insertion sites, resulting in tandem arrays.
We demonstrate the existence of at least two SINE families within the Feliformia suborder, one of which is actively involved in insertional mutagenesis. We find SINEs are powerful markers of speciation and conclude that the few inconsistencies with expected patterns of speciation likely represent incomplete lineage sorting, species hybridization and SINE-mediated genome rearrangement.
Incomplete lineage sorting; SINEs; Carnivora; Speciation; transposable elements; Adaptation; Feliformia; Felidae
Serum carcinoembryonic antigen (sCEA) level might be an indicator of disease. Indeed, an elevated sCEA level is a prognostic factor in colorectal cancer (CRC) patients. However, the genetic determinants of sCEA level in healthy and CRC population remains unclear. Thus we investigated the genetic markers associated with elevated serum sCEA level in these two populations and its clinical implications.
Methods and Findings
Genome-wide association study (GWAS) was conducted in a cohort study with 4,346 healthy male adults using the Illumina Omni 1 M chip. Candidate SNPs associated with elevated sCEA levels were validated in 194 CRC patients on ABI Taqman platform. Eight candidate SNPs were validated in CRC patients. The rs1047781 (chr19- FUT2) (A/T) was associated with elevated sCEA levels, and rs8176746 (chr9- ABO) was associated with the regional lymph metastasis in the CRC patients. The preoperative sCEA level was a risk factor for tumor recurrence in 5 years after operation (OR = 1.427, 95% CI: 1.005∼1.843, P = 0.006). It was also one of the risk factors for regional lymph node metastasis (OR = 2.266, 95% CI: 1.196∼4.293, P = 0.012). The sCEA level in rs1047781-T carriers was higher than that in the A carriers in CRC patients without lymph node metastasis (P = 0.006). The regional lymph node metastasis in patients with homozygote AA of rs8176746 was more common than that in the heterozygote AG carriers (P = 0.022). In addition, rs1047781-AT and TT CRC patients exhibited a worse disease-free survival than AA genotype carriers (P = 0.023).
We found candidate SNPs associated with elevated sCEA levels in both healthy males and CRC population. Rs1047781 (chr19- FUT2) may be the susceptible locus for recurrence of CRC in a population from Southern China.
Tuberculosis (TB) poses a worldwide threat due to advancing multidrug-resistant strains and deadly co-infections with Human immunodeficiency virus. Today large amounts of Mycobacterium tuberculosis whole genome sequencing data are being assessed broadly and yet there exists no comprehensive online resource that connects M. tuberculosis genome variants with geographic origin, with drug resistance or with clinical outcome.
Here we describe a broadly inclusive unifying Genome-wide Mycobacterium tuberculosis Variation (GMTV) database, (http://mtb.dobzhanskycenter.org) that catalogues genome variations of M. tuberculosis strains collected across Russia. GMTV contains a broad spectrum of data derived from different sources and related to M. tuberculosis molecular biology, epidemiology, TB clinical outcome, year and place of isolation, drug resistance profiles and displays the variants across the genome using a dedicated genome browser. GMTV database, which includes 1084 genomes and over 69,000 SNP or Indel variants, can be queried about M. tuberculosis genome variation and putative associations with drug resistance, geographical origin, and clinical stages and outcomes.
Implementation of GMTV tracks the pattern of changes of M. tuberculosis strains in different geographical areas, facilitates disease gene discoveries associated with drug resistance or different clinical sequelae, and automates comparative genomic analyses among M. tuberculosis strains.
Mycobacterium tuberculosis; Genome variations; Mutation; Genetic diversity; Whole genome sequencing; Database
The AIDS era has seen multiple advances in the power of genetics research; scores of host genetic protective factors have been nominated and several have translated to the bedside. We discuss how genomics may inform HIV/AIDS prevention, treatment and eradication.
Chronic hepatitis B virus (HBV) infection is a major health issue, especially in Asia. A recent genome-wide association study (GWAS) has implicated genetic variants in the HLA-DP locus associated with chronic hepatitis B in Japanese and Thai populations. To confirm whether the polymorphisms at the HLA-DP genes are associated with persistent chronic hepatitis B virus infection in Han Chinese, we conducted an independent case-control study using 521 persistent chronic HBV carriers and 819 controls that included 571 persons with HBV natural clearance and 248 never HBV-infected (healthy) individuals. Eleven single nucleotide polymorphisms (SNPs) in a region including HLA-DPA and HLA-DPB and an adjacent SNP in strong linkage disequilibrium (LD) with a neighboring HLA-DR13 locus were genotyped using TaqMan SNP genotyping assay. Eleven variants at HLA-DP showed a strong association with persistent chronic HBV carrier status (p = 1.82×10−12 to 0.01). We also stratified the analysis by HBV clearance status to test the association between these polymorphisms and HBV natural clearance; similar results were obtained (p = 2.70×10−11 to 0.003). Included SNPs define highly structured haplotypes which were also strongly associated with HBV chronic infection (Block 1: odds ratio (OR) = 0.54, p = 8.73×10−7; block 2: OR = 1.98, p = 1.37×10−10). These results further confirm that genetic variants in the HLA-DP locus are strongly associated with persistent HBV infection in the Han Chinese population.
Chronic hepatitis B; Haplotype association; GWAS; SNPs; Joint effects
Background. High-throughput genome-wide techniques have facilitated the identification of previously unknown host proteins involved in cellular human immunodeficiency virus (HIV) infection. Recently, 3 independent studies have used small interfering RNA technology to silence each gene in the human genome to determine the importance of each in HIV infection. Genes conferring a significant effect were termed HIV-dependency factors (HDFs).
Methods. We assembled high-density panels of 6380 single-nucleotide polymorphisms (SNPs) in 278 HDF genes and tested for genotype associations with HIV infection and AIDS progression in 1633 individuals from clinical AIDS cohorts.
Results. After statistical correction for multiple tests, significant associations with HIV acquisition were found for SNPs in 2 genes, NCOR2 and IDH1. Weaker associations with AIDS progression were revealed for SNPs within the TM9SF2 and EGFR genes.
Conclusions. This study independently verifies the influence of NCOR2 and IDH1 on HIV transmission, and its findings suggest that variation in these genes affects susceptibility to HIV infection in exposed individuals.
Both hepatitis B virus (HBV) and aflatoxin B1 (AFB1) exposure can cause liver damage as well as increase the probability of hepatocellular carcinoma (HCC). To investigate the underlying genetic changes that may influence development of HCC associated with HBV infection and AFB1 exposure, HCC patients were subdivided into 4 groups depending upon HBV and AFB1 exposure status: (HBV(+)/AFB1(+), HBV(+)/AFB1(-), HBV(-)/AFB1(+), HBV(-)/AFB1(-)). Genetic abnormalities and protein expression profiles were analyzed by array-based comparative genomic hybridization and isobaric tagging for quantitation. A total of 573 chromosomal aberrations (CNAs) including 184 increased and 389 decreased were detected in our study population. Twenty-five recurrently altered regions (RARs; chromosomal alterations observed in ≥10 patients) in chromosomes were identified. Loss of 4q13.3-q35.2, 13q12.1-q21.2 and gain of 7q11.2-q35 were observed with a higher frequency in the HBV(+)/AFB1(+), HBV(+)/AFB1(-) and HBV(-)/AFB1(+) groups compared to the HBV(-)/AFB(-) group. Loss of 8p12-p23.2 was associated with high TNM stage tumors (P = 0.038) and was an unfavorable prognostic factor for tumor-free survival (P =0.045). A total of 133 differentially expressed proteins were identified in iTRAQ proteomics analysis, 69 (51.8%) of which mapped within identified RARs. The most common biological processes affected by HBV and AFB1 status in HCC tumorigenesis were detoxification and drug metabolism pathways, antigen processing and anti-apoptosis pathways. Expression of AKR1B10 was increased significantly in the HBV(+)/AFB1(+) and HBV(-)/AFB1(+) groups. A significant correlation between the expression of AKR1B10 mRNA and protein levels as well as AKR1B10 copy number was observered, which suggest that AKR1B10 may play a role in AFB1-related hepatocarcinogenesis. In summary, a number of genetic and gene expression alterations were found to be associated with HBV and AFB1- related HCC. The possible synergistic effects of HBV and AFB1 in hepatocarcinogenesis warrant further investigations.
During knee arthroscopy, narrowness and tightness maybe encountered in the medial compartment that does not allow sufficient visualization or instrumentation. When this occurs, our team has found it helpful to perform a percutaneous clysis of the deep portion of the medial collateral ligament with a spinal needle. With the knee positioned in 10° to 20° of flexion and a valgus stress is applied. A spinal needle (18 Gauge) is passed percutaneously through the medial collateral ligament between the tibial plateau and undersurface of the medial meniscus. Several passes are made with the spinal needle with the bevel of the needle angled to selectively divide the fibers while keeping the medial collateral ligament under tension. Then with controlled valgus force, the medial compartment will progressively open allowing improved visualization to the posteromedial corner of the knee. This increase in space gives an enhanced visual field and further allows more room for arthroscopic instrumentation.
medial collateral ligament; knee arthroscopy; menisectomy; ligament release
The mechanosensory lateral line, found only in fishes and amphibians, is an important sense organ associated with aquatic life. Lateral line patterns differ among teleost, the most diverse vertebrate taxa, hypothetically in response to selective pressures from different aquatic habitats. In this article, we conduct evolutionary genomic analyses of 34 genes associated with lateral line system development in teleosts to elucidate the significance of contrasting evolutionary rates and changes in the protein coding sequences. We find that duplicated copies of these genes are preferentially retained in the teleost genomes and that episodic events of positive selection have occurred in 22 of the 30 postduplication branches. In general, teleost genes evolved at a faster rate relative to their tetrapod counterparts, and the mutation rates of 26 of the 34 genes differed among teleosts and tetrapods. We conclude that following whole genome duplication, evolutionary rates and episodic events of positive selection on the lateral line system development genes might have been one of the factors favoring the subsequent adaptive radiation of teleosts into diverse habitats. These results provide the foundation for further detailed explorations into lateral line system genes and the evolution of diverse phenotypes and adaptations.
lateral line; teleost; adaptive evolution; positive selection
The puma is an iconic predator that ranges throughout the Americas, occupying diverse habitats. Previous phylogeographic analyses have revealed that it exhibits moderate levels of genetic structure across its range, with few of the classically recognized subspecies being supported as distinct demographic units. Moreover, most of the species’ molecular diversity was found to be in South America. To further investigate the phylogeographic structure and demographic history of pumas we analyzed mtDNA sequences from 186 individuals sampled throughout their range, with emphasis on South America. Our objectives were to refine the phylogeographic assessment within South America and to investigate the demographic history of pumas using a coalescent approach. Our results extend previous phylogeographic findings, reassessing the delimitation of historical population units in South America and demonstrating that this species experienced a considerable demographic expansion in the Holocene, ca. 8,000 years ago. Our analyses indicate that this expansion occurred in South America, prior to the hypothesized re-colonization of North America, which was therefore inferred to be even more recent. The estimated demographic history supports the interpretation that pumas suffered a severe demographic decline in the Late Pleistocene throughout their distribution, followed by population expansion and re-colonization of the range, initiating from South America.
conservation genetics; mammals; mitochondrial DNA; molecular time estimate; phylogeography
Cytomegalovirus (CMV) retinitis is a common opportunistic infection among patients with AIDS and still causes visual morbidity despite the wide spread usage of highly active antiretroviral therapy (HAART). The ubiquitous CMV pathogen contains a human interleukin-10 (IL-10) homolog in its genome and utilizes it to evade host immune reactions through an IL-10 receptor mediated immune-suppression pathway.
Effects of IL-10R1, IL-10 and previously described AIDS restriction gene variants are investigated on the development of CMV retinitis in the Longitudinal Study of the Ocular Complications of AIDS (LSOCA) cohort (n=1284).
In Europen Americans (n=750), a haplotype carrying an amino acid changing variation in the cytoplasmic domain (S420L) of IL-10R1 can be protective (OR = 0.14, CI: 0.02–0.94, P = 0.04) against, whereas another haplotype carrying an amino acid changing variation in the extracellular domain (I224V) of IL-10R1 can be more susceptible (OR = 6.21, CI: 1.22–31.54, P = 0.03) to CMV retinitis. In African Americans (n=534), potential effects of IL-10 variants are observed.
Host genetics may have a role in the occurrence of CMV retinitis in patients infected with HIV.
AIDS; CMV retinitis; HIV-1; host genetics; interleukin-10 receptor
Approximately 10 to 15% of patients with AIDS but without ocular opportunistic infections will have a presumed neuroretinal disorder (HIV-NRD), manifested by reduced contrast sensitivity and abnormal visual fields. The loss of contrast sensitivity often is sufficient to impair reading speed. To evaluate the effect of host genetics on HIV-NRD, we explored validated AIDS restriction gene variants CCR5Δ32, CCR2-64I, CCR5 P1, SDF-3`A, IL-10-5`A, RANTES -403A, RANTES -28G, RANTES-In1.1C, CX3CR1-249I, CX3CR1-280M, IFNG-179T, MDR1-3435T, and MCP-1364G, each of which has been implicated previously to influence HIV-1 infection, AIDS progression, therapy response, and antiviral drug metabolism, and an IL-10 receptor gene, IL-10R1, in the Longitudinal Study of the Ocular Complications of AIDS (LSOCA) cohort. In European Americans (cases=55, controls=290), IL-10-5`A variant and its promoter haplotype (HR=2.09, CI: 1.19–3.67, P = 0.01); in African Americans (cases=54, controls=180) RANTES-In1.1C and the associated haplotype (HR=2.72, CI: 1.48–5.00, P = 0.001), showed increased HIV-NRD susceptibility. While sample sizes are small and P values do not pass a strict Bonferroni correction, our results suggest that, in European Americans, an IL-10-related pathway, and, in African Americans, chemokine receptor ligand polymorphisms in RANTES are risk factors for HIV- NRD development. Clearly, further studies are warrented.
AIDS; HIV-1; host genetics; HIV-neuroretinal disorder
HIV-1 sequence diversity is affected by selection pressures arising from host genomic factors. Using paired human and viral data from 1071 individuals, we ran >3000 genome-wide scans, testing for associations between host DNA polymorphisms, HIV-1 sequence variation and plasma viral load (VL), while considering human and viral population structure. We observed significant human SNP associations to a total of 48 HIV-1 amino acid variants (p<2.4 × 10−12). All associated SNPs mapped to the HLA class I region. Clinical relevance of host and pathogen variation was assessed using VL results. We identified two critical advantages to the use of viral variation for identifying host factors: (1) association signals are much stronger for HIV-1 sequence variants than VL, reflecting the ‘intermediate phenotype’ nature of viral variation; (2) association testing can be run without any clinical data. The proposed genome-to-genome approach highlights sites of genomic conflict and is a strategy generally applicable to studies of host–pathogen interaction.
Developing treatments or vaccines for HIV is challenging because the genetic makeup of the virus is constantly changing in an effort to outwit the human immune system. Moreover, the immune system is highly variable as a result of the long-standing co-evolution of humans and microbes. Each individual will try to oppose the invading virus in a unique way, forcing the virus to acquire specific mutations that can be interpreted as the genetic signature of this one-against-one battle.
To explore the influence of co-evolution on HIV, Bartha et al. took samples of both human and viral genomes from 1071 individuals infected with HIV, the AIDS virus, and used genotyping and sequencing technology to obtain a comprehensive description of the genetic variation in both. Computational techniques were then used to search for links between variants in the human DNA sequences and variants in the viral sequences.
The most common type of genetic variation found in the human genome is a single nucleotide polymorphism, or SNP for short: a SNP is produced when a single nucleotide – an A, C, G or T – is replaced by a different nucleotide. Bartha et al. found that SNPs within the human DNA sequences in their study were linked to variations in 48 amino acids in HIV. Moreover, all these SNPs were found within a group of genes known as the HLA (human leukocyte antigen) system, which encodes for proteins that play a vital role in the immune response. This work identified the areas of the human genome that put pressure on the AIDS virus, and the regions of the virus that serve to escape human control.
The approach developed by Bartha et al. allows the interactions between a microbe and a human host to be studied by looking at the genome of the microbe and the genome of the infected person. It also differentiates host-induced mutations that limit the capacity of the virus to do harm from those that are tolerated by the pathogen. A similar strategy could be used to study other infectious diseases.
human genomics; HIV; viral mutations; Human
We report the construction of a 1.5 Mb resolution radiation hybrid map of the domestic cat genome. This new map includes novel microsatellite loci and markers derived from the 2X genome sequence that target previous gaps in the feline-human comparative map. Ninety-six percent of the 1793 cat markers we mapped have identifiable orthologues in the canine and human genome sequences. The updated autosomal and X chromosome comparative maps identify 152 cat-human and 134 cat-dog homologous synteny blocks. Comparative analysis shows the marked change in chromosomal evolution in the canid lineage relative to the felid lineage since divergence from their carnivoran ancestor. The canid lineage has a thirty-fold difference in the number of interchromosomal rearrangments relative to felids, while the felid lineage has primarily undergone intrachromosomal rearrangements. We have also refined the pseudoautosomal region and boundary in the cat and show that it is markedly longer than those of human or mouse. This improved RH comparative map provides a useful tool to facilitate positional cloning studies in the feline model.
domestic cat; radiation hybrid map; canine genome; genome evolution; synteny; chromosome rearrangement
To determine the genetic regulation of hair length in the domestic cat, a whole genome scan was performed in a multi-generational pedigree in which the long-haired phenotype was segregating. The two markers that demonstrated the greatest linkage to the long-haired trait (LOD≥6), flanked an estimated 10 Mb region on cat chromosome B1 containing the Fibroblast Growth Factor 5 gene (FGF5), a candidate gene implicated in regulating hair follicle growth cycle in other species. Sequence analyses of FGF5 in 26 cat breeds and two pedigrees of non-breed cats, revealed four separate mutations predicted to disrupt the biological activity of the FGF5 protein. Pedigree analyses demonstrated that different combinations of paired mutant FGF5 alleles segregated with the long-haired phenotype in an autosomal recessive manner. Association analyses of over 380 genotyped breed and non-breed cats were consistent with mutations in the FGF5 gene causing the long-haired phenotype in an autosomal recessive manner. In combination, these genomic approaches demonstrated that FGF5 is the major genetic determinant of hair length in the domestic cat.
Nucleoside analogue reverse transcriptase inhibitors are an integral component of combination antiretroviral treatment regimens. However, their ability to inhibit polymerase-γ has been associated with several mitochondrial toxicities, including potentially life-threatening lactic acidosis. A total of 650 antiretroviral-naive adults (69% female) initiated combination antiretroviral therapy (cART) and were intensively screened for toxicities including lactic acidosis as part of a 3-year clinical trial in Botswana. Patients were categorized as no lactic acidosis symptoms, minor symptoms but lactate <4.4 mmol/liter, and symptoms with lactate ≥4.4 mmol/liter [moderate to severe symptomatic hyperlactatemia (SH) or lactic acidosis (LA)]. Of 650 participants 111 (17.1%) developed symptoms and/or laboratory results suggestive of lactic acidosis and had a serum lactate drawn; 97 (87.4%) of these were female. There were 20 events, 13 having SH and 7 with LA; all 20 (100%) were female (p<0.001). Cox proportional hazard analysis limited to the 451 females revealed that having a higher baseline BMI was predictive for the development of SH/LA [aHR=1.17 per one-unit increase (1.08–1.25), p<0.0001]. Ordered logistic regression performed among all 650 patients revealed that having a lower baseline hemoglobin [aOR=1.28 per one-unit decrease (1.1–1.49), p=0.002] and being randomized to d4T/3TC-based cART [aOR=1.76 relative to ZDV/3TC (1.03–3.01), p=0.04] were predictive of the symptoms and/or the development of SH/LA. cART-treated women in sub-Saharan Africa, especially those having higher body mass indices, should receive additional monitoring for SH/LA. Women presently receiving d4T/3TC-based cART in such settings also warrant more intensive monitoring.
Multiple genome-wide association studies (GWAS) have been performed in HIV-1 infected individuals, identifying common genetic influences on viral control and disease course. Similarly, common genetic correlates of acquisition of HIV-1 after exposure have been interrogated using GWAS, although in generally small samples. Under the auspices of the International Collaboration for the Genomics of HIV, we have combined the genome-wide single nucleotide polymorphism (SNP) data collected by 25 cohorts, studies, or institutions on HIV-1 infected individuals and compared them to carefully matched population-level data sets (a list of all collaborators appears in Note S1 in Text S1). After imputation using the 1,000 Genomes Project reference panel, we tested approximately 8 million common DNA variants (SNPs and indels) for association with HIV-1 acquisition in 6,334 infected patients and 7,247 population samples of European ancestry. Initial association testing identified the SNP rs4418214, the C allele of which is known to tag the HLA-B*57:01 and B*27:05 alleles, as genome-wide significant (p = 3.6×10−11). However, restricting analysis to individuals with a known date of seroconversion suggested that this association was due to the frailty bias in studies of lethal diseases. Further analyses including testing recessive genetic models, testing for bulk effects of non-genome-wide significant variants, stratifying by sexual or parenteral transmission risk and testing previously reported associations showed no evidence for genetic influence on HIV-1 acquisition (with the exception of CCR5Δ32 homozygosity). Thus, these data suggest that genetic influences on HIV acquisition are either rare or have smaller effects than can be detected by this sample size.
Comparing the frequency differences between common DNA variants in disease-affected cases and in unaffected controls has been successful in uncovering the genetic component of multiple diseases. This approach is most effective when large samples of cases and controls are available. Here we combine information from multiple studies of HIV infected patients, including more than 6,300 HIV+ individuals, with data from 7,200 general population samples of European ancestry to test nearly 8 million common DNA variants for an impact on HIV acquisition. With this large sample we did not observe any single common genetic variant that significantly associated with HIV acquisition. We further tested 22 variants previously identified by smaller studies as influencing HIV acquisition. With the exception of a deletion polymorphism in the CCR5 gene (CCR5Δ32) we found no convincing evidence to support these previous associations. Taken together these data suggest that genetic influences on HIV acquisition are either rare or have smaller effects than can be detected by this sample size.