Search tips
Search criteria

Results 1-25 (840271)

Clipboard (0)

Related Articles

1.  HIV-1 Subtype B Protease and Reverse Transcriptase Amino Acid Covariation 
PLoS Computational Biology  2007;3(5):e87.
Despite the high degree of HIV-1 protease and reverse transcriptase (RT) mutation in the setting of antiretroviral therapy, the spectrum of possible virus variants appears to be limited by patterns of amino acid covariation. We analyzed patterns of amino acid covariation in protease and RT sequences from more than 7,000 persons infected with HIV-1 subtype B viruses obtained from the Stanford HIV Drug Resistance Database ( In addition, we examined the relationship between conditional probabilities associated with a pair of mutations and the order in which those mutations developed in viruses for which longitudinal sequence data were available. Patterns of RT covariation were dominated by the distinct clustering of Type I and Type II thymidine analog mutations and the Q151M-associated mutations. Patterns of protease covariation were dominated by the clustering of nelfinavir-associated mutations (D30N and N88D), two main groups of protease inhibitor (PI)–resistance mutations associated either with V82A or L90M, and a tight cluster of mutations associated with decreased susceptibility to amprenavir and the most recently approved PI darunavir. Different patterns of covariation were frequently observed for different mutations at the same position including the RT mutations T69D versus T69N, L74V versus L74I, V75I versus V75M, T215F versus T215Y, and K219Q/E versus K219N/R, and the protease mutations M46I versus M46L, I54V versus I54M/L, and N88D versus N88S. Sequence data from persons with correlated mutations in whom earlier sequences were available confirmed that the conditional probabilities associated with correlated mutation pairs could be used to predict the order in which the mutations were likely to have developed. Whereas accessory nucleoside RT inhibitor–resistance mutations nearly always follow primary nucleoside RT inhibitor–resistance mutations, accessory PI-resistance mutations often preceded primary PI-resistance mutations.
Author Summary
The identification of which mutations in a protein covary has played a major role in both structural and evolutionary biology. Covariation analysis has been used to help predict unsolved protein structures and to better understand the functions of proteins with known structures. The large number of published genetic sequences of the targets of HIV-1 therapy has provided an unprecedented opportunity to identify dependencies among mutations in these proteins that can be exploited to design inhibitors that have high genetic barriers to resistance. In our analysis, we identified many pairs of covarying drug-resistance mutations in HIV-1 protease and reverse transcriptase and organized them into clusters of mutations that often develop in a predictable order. Inhibitors that are active against early drug-resistant mutants are likely to be less prone to the development of resistance, whereas inhibitors that are active against fully evolved clusters of mutations may be useful drugs for salvage therapy.
PMCID: PMC1866358  PMID: 17500586
2.  Distinguishing Functional Amino Acid Covariation from Background Linkage Disequilibrium in HIV Protease and Reverse Transcriptase 
PLoS ONE  2007;2(8):e814.
Correlated amino acid mutation analysis has been widely used to infer functional interactions between different sites in a protein. However, this analysis can be confounded by important phylogenetic effects broadly classifiable as background linkage disequilibrium (BLD). We have systematically separated the covariation induced by selective interactions between amino acids from background LD, using synonymous (S) vs. amino acid (A) mutations. Covariation between two amino acid mutations, (A,A), can be affected by selective interactions between amino acids, whereas covariation within (A,S) pairs or (S,S) pairs cannot. Our analysis of the pol gene — including the protease and the reverse transcriptase genes — in HIV reveals that (A,A) covariation levels are enormously higher than for either (A,S) or (S,S), and thus cannot be attributed to phylogenetic effects. The magnitude of these effects suggests that a large portion of (A,A) covariation in the HIV pol gene results from selective interactions. Inspection of the most prominent (A,A) interactions in the HIV pol gene showed that they are known sites of independently identified drug resistance mutations, and physically cluster around the drug binding site. Moreover, the specific set of (A,A) interaction pairs was reproducible in different drug treatment studies, and vanished in untreated HIV samples. The (S,S) covariation curves measured a low but detectable level of background LD in HIV.
PMCID: PMC1950573  PMID: 17726544
3.  Mutation Patterns and Structural Correlates in Human Immunodeficiency Virus Type 1 Protease following Different Protease Inhibitor Treatments 
Journal of Virology  2003;77(8):4836-4847.
Although many human immunodeficiency virus type 1 (HIV-1)-infected persons are treated with multiple protease inhibitors in combination or in succession, mutation patterns of protease isolates from these persons have not been characterized. We collected and analyzed 2,244 subtype B HIV-1 isolates from 1,919 persons with different protease inhibitor experiences: 1,004 isolates from untreated persons, 637 isolates from persons who received one protease inhibitor, and 603 isolates from persons receiving two or more protease inhibitors. The median number of protease mutations per isolate increased from 4 in untreated persons to 12 in persons who had received four or more protease inhibitors. Mutations at 45 of the 99 amino acid positions in the protease—including 22 not previously associated with drug resistance—were significantly associated with protease inhibitor treatment. Mutations at 17 of the remaining 99 positions were polymorphic but not associated with drug treatment. Pairs and clusters of correlated (covarying) mutations were significantly more likely to occur in treated than in untreated persons: 115 versus 23 pairs and 30 versus 2 clusters, respectively. Of the 115 statistically significant pairs of covarying residues in the treated isolates, 59 were within 8 Å of each other—many more than would be expected by chance. In summary, nearly one-half of HIV-1 protease positions are under selective drug pressure, including many residues not previously associated with drug resistance. Structural factors appear to be responsible for the high frequency of covariation among many of the protease residues. The presence of mutational clusters provides insight into the complex mutational patterns required for HIV-1 protease inhibitor resistance.
PMCID: PMC152121  PMID: 12663790
4.  A Novel Substrate-Based HIV-1 Protease Inhibitor Drug Resistance Mechanism 
PLoS Medicine  2007;4(1):e36.
HIV protease inhibitor (PI) therapy results in the rapid selection of drug resistant viral variants harbouring one or two substitutions in the viral protease. To combat PI resistance development, two approaches have been developed. The first is to increase the level of PI in the plasma of the patient, and the second is to develop novel PI with high potency against the known PI-resistant HIV protease variants. Both approaches share the requirement for a considerable increase in the number of protease mutations to lead to clinical resistance, thereby increasing the genetic barrier. We investigated whether HIV could yet again find a way to become less susceptible to these novel inhibitors.
Methods and Findings
We have performed in vitro selection experiments using a novel PI with an increased genetic barrier (RO033-4649) and demonstrated selection of three viruses 4- to 8-fold resistant to all PI compared to wild type. These PI-resistant viruses did not have a single substitution in the viral protease. Full genomic sequencing revealed the presence of NC/p1 cleavage site substitutions in the viral Gag polyprotein (K436E and/or I437T/V) in all three resistant viruses. These changes, when introduced in a reference strain, conferred PI resistance. The mechanism leading to PI resistance is enhancement of the processing efficiency of the altered substrate by wild-type protease. Analysis of genotypic and phenotypic resistance profiles of 28,000 clinical isolates demonstrated the presence of these NC/p1 cleavage site mutations in some clinical samples (codon 431 substitutions in 13%, codon 436 substitutions in 8%, and codon 437 substitutions in 10%). Moreover, these cleavage site substitutions were highly significantly associated with reduced susceptibility to PI in clinical isolates lacking primary protease mutations. Furthermore, we used data from a clinical trial (NARVAL, ANRS 088) to demonstrate that these NC/p1 cleavage site changes are associated with virological failure during PI therapy.
HIV can use an alternative mechanism to become resistant to PI by changing the substrate instead of the protease. Further studies are required to determine to what extent cleavage site mutations may explain virological failure during PI therapy.
Changes in the cleavage site of the Gag substrate for the HIV protease can convey resistance to protease inhibitors and might contribute to virologic failure during therapy that includes these drugs.
Editors' Summary
Twenty-five years ago, infection with the human immunodeficiency virus (HIV)—the causative agent of AIDS—was a death sentence. However, drugs that attack various stages of the HIV life cycle were soon developed that, although not curing the infection, kept it in check when used in combination and greatly increased the life expectancy of people infected with HIV. Unfortunately, viruses resistant to these drugs have rapidly emerged and antiviral therapy now fails in many patients. The use of HIV protease inhibitors (PIs) in combination therapies, for example, has led to the stepwise selection of viral variants resistant to these drugs. Resistance is first acquired when the viral protease changes so that PIs no longer bind to it and inhibit it efficiently. These changes often reduce the efficiency with which the protease binds its substrates—polyproteins called Gag and GagPol that it chops up into smaller proteins to make new viral particles. So the next step is the accumulation of changes elsewhere in the protease that make it work better, and sometimes changes in its substrate that make it easier to cut; these compensatory changes do not directly affect viral resistance to PIs.
Why Was This Study Done?
To prevent viruses with resistance to PIs emerging, drug doses are kept high in patients and new PIs are being developed with high potency against known PI-resistant HIV variants. Both approaches set a “high genetic barrier” to the development of PI resistance by ensuring that HIV has to incorporate many changes in its protease to become resistant. But, the HIV genome naturally changes—mutates—very rapidly, so novel HIV variants could emerge that are less susceptible to the new potent PIs without the virus having to leap this high genetic barrier. In this study, the researchers have investigated whether HIV can find an alternative route to PI resistance that does not involve the introduction of multiple changes into its protease.
What Did the Researchers Do and Find?
The researchers took wild-type HIV and treated it in the laboratory with a new PI regimen that has a high genetic barrier. By gradually increasing its concentration, the researchers selected three viral populations that were able to grow in 4- to 8-fold higher concentrations of the PI than wild-type virus. None of these populations had mutations in the viral protease. Instead, they all had mutations near one of the sites—the NC/p1 site—where the protease normally cuts the Gag polyprotein. These mutations, the researchers report, enhanced the overall efficiency with which the wild-type protease cleaved the polyprotein, and a selection experiment with another PI showed that the development of PI resistance through alterations near the NC/p1 cleavage site was not unique to one PI. The researchers also investigated the potential clinical significance of this new drug resistance mechanism by looking for the same mutations in nearly 30,000 patient samples. Many of the samples did indeed have these mutations. Finally, they showed that mutations at the NC/p1 cleavage site were associated with virological failure (increased viral replication) during PI therapy in an ongoing clinical trial.
What Do These Findings Mean?
These results suggest that increased polyprotein processing because of mutations in the natural substrate of the HIV protease might be a new mechanism by which HIV can become resistant to PIs. This strategy, which occurs in the laboratory and in patients, allows HIV to develop PI resistance without the need for multiple changes in its protease and so avoids the high genetic barrier to resistance that new PIs provide. Clinical studies are now needed to test which of the mutations seen in this study contribute to virological failure, whether the degree of this failure is clinically relevant, and whether these substrate mutations enhance the effect of protease mutations. If the clinical importance of the new mechanism is confirmed, genetic examination of both the polyprotein and the protease will be needed when trying to figure out why a PI-containing therapy is failing in individual patients. Furthermore, it will be necessary to test whether this mechanism can contribute to the development of resistance when evaluating new drugs.
Additional Information.
Please access these Web sites via the online version of this summary at
US National Institute of Allergy and Infectious Diseases factsheet on HIV infection and AIDS
US Department of Health and Human Services information on AIDS
US Centers for Disease Control and Prevention information on HIV/AIDS
Aidsmap information on HIV and AIDS provided by the charity NAM
BioAfrica, Bioinformatics for HIV Research, information on HIV-1 protease cleavage sites
PMCID: PMC1769415  PMID: 17227139
5.  An Evolutionary-Network Model Reveals Stratified Interactions in the V3 Loop of the HIV-1 Envelope 
PLoS Computational Biology  2007;3(11):e231.
The third variable loop (V3) of the human immunodeficiency virus type 1 (HIV-1) envelope is a principal determinant of antibody neutralization and progression to AIDS. Although it is undoubtedly an important target for vaccine research, extensive genetic variation in V3 remains an obstacle to the development of an effective vaccine. Comparative methods that exploit the abundance of sequence data can detect interactions between residues of rapidly evolving proteins such as the HIV-1 envelope, revealing biological constraints on their variability. However, previous studies have relied implicitly on two biologically unrealistic assumptions: (1) that founder effects in the evolutionary history of the sequences can be ignored, and; (2) that statistical associations between residues occur exclusively in pairs. We show that comparative methods that neglect the evolutionary history of extant sequences are susceptible to a high rate of false positives (20%–40%). Therefore, we propose a new method to detect interactions that relaxes both of these assumptions. First, we reconstruct the evolutionary history of extant sequences by maximum likelihood, shifting focus from extant sequence variation to the underlying substitution events. Second, we analyze the joint distribution of substitution events among positions in the sequence as a Bayesian graphical model, in which each branch in the phylogeny is a unit of observation. We perform extensive validation of our models using both simulations and a control case of known interactions in HIV-1 protease, and apply this method to detect interactions within V3 from a sample of 1,154 HIV-1 envelope sequences. Our method greatly reduces the number of false positives due to founder effects, while capturing several higher-order interactions among V3 residues. By mapping these interactions to a structural model of the V3 loop, we find that the loop is stratified into distinct evolutionary clusters. We extend our model to detect interactions between the V3 and C4 domains of the HIV-1 envelope, and account for the uncertainty in mapping substitutions to the tree with a parametric bootstrap.
Author Summary
The third variable loop (V3) of the human immunodeficiency virus type 1 (HIV-1) envelope is a principal determinant of viral growth characteristics and an important target for the immune system. Interactions between residues of V3 allow the virus to shift between combinations of residues to escape the immune system while retaining its structure and functions. Comparative study of HIV-1 V3 sequences can detect such interactions by the covariation of sites in the sequence, which can then be used to inform vaccine development, but current methods for detecting such associations rely on biologically unrealistic assumptions. We demonstrate that these assumptions cause an excessive number of spurious associations, and present a new approach that couples phylogenetic and Bayesian network models, and greatly reduces this number while retaining the ability to detect real associations. Our analysis reveals that the V3 loop is stratified into discrete layers of interacting residues, suggesting a partition of functions along this viral structure with implications for vaccine development.
PMCID: PMC2082504  PMID: 18039027
6.  Relative Fitness and Replication Capacity of a Multinucleoside Analogue-Resistant Clinical Human Immunodeficiency Virus Type 1 Isolate with a Deletion of Codon 69 in the Reverse Transcriptase Coding Region▿  
Journal of Virology  2007;81(9):4713-4721.
Deletions, insertions, and amino acid substitutions in the β3-β4 hairpin loop-coding region of human immunodeficiency virus type 1 (HIV-1) reverse transcriptase (RT) have been associated with resistance to nucleoside RT inhibitors when appearing in combination with other mutations in the RT-coding region. In this work, we have measured the in vivo fitness of HIV-1 variants containing a deletion of 3 nucleotides affecting codon 69 (Δ69) of the viral RT as well as the replication capacity (RC) ex vivo of a series of recombinant HIV-1 variants carrying an RT bearing the Δ69 deletion or the T69A mutation in a multidrug-resistant (MDR) sequence background, including the Q151M complex and substitutions M184V, K103N, Y181C, and G190A. Patient-derived viral clones having RTs with Δ69 together with S163I showed increased RCs under drug pressure. These data were consistent with the viral population dynamics observed in a long-term-treated HIV-1-infected patient. In the absence of drugs, viral clones containing T69A replicated more efficiently than those having Δ69, but only when patient-derived sequences corresponding to RT residues 248 to 527 were present. These effects could be attributed to a functional interaction between the C-terminal domain of the p66 subunit (RNase H domain) and the DNA polymerase domain of the RT. Finally, recombinant HIV-1 clones bearing RTs with MDR-associated mutations, including deletions at codon 69, showed increased susceptibilities to protease inhibitors in phenotypic assays. These effects correlated with impaired Gag cleavage and could be attributed to delayed maturation and decreased production of active protease in those variants.
PMCID: PMC1900151  PMID: 17314158
7.  Molecular Basis for the Relative Substrate Specificity of Human Immunodeficiency Virus Type 1 and Feline Immunodeficiency Virus Proteases 
Journal of Virology  2001;75(19):9458-9469.
We have used a random hexamer phage library to delineate similarities and differences between the substrate specificities of human immunodeficiency virus type 1 (HIV-1) and feline immunodeficiency virus (FIV) proteases (PRs). Peptide sequences were identified that were specifically cleaved by each protease, as well as sequences cleaved equally well by both enzymes. Based on amino acid distinctions within the P3-P3′ region of substrates that appeared to correlate with these cleavage specificities, we prepared a series of synthetic peptides within the framework of a peptide sequence cleaved with essentially the same efficiency by both HIV-1 and FIV PRs, Ac-KSGVF↓VVNGLVK-NH2 (arrow denotes cleavage site). We used the resultant peptide set to assess the influence of specific amino acid substitutions on the cleavage characteristics of the two proteases. The findings show that when Asn is substituted for Val at the P2 position, HIV-1 PR cleaves the substrate at a much greater rate than does FIV PR. Likewise, Glu or Gln substituted for Val at the P2′ position also yields peptides specifically susceptible to HIV-1 PR. In contrast, when Ser is substituted for Val at P1′, FIV PR cleaves the substrate at a much higher rate than does HIV-1 PR. In addition, Asn or Gln at the P1 position, in combination with an appropriate P3 amino acid, Arg, also strongly favors cleavage by FIV PR over HIV PR. Structural analysis identified several protease residues likely to dictate the observed specificity differences. Interestingly, HIV PR Asp30 (Ile-35 in FIV PR), which influences specificity at the S2 and S2′ subsites, and HIV-1 PR Pro-81 and Val-82 (Ile-98 and Gln-99 in FIV PR), which influence specificity at the S1 and S1′ subsites, are residues which are often involved in development of drug resistance in HIV-1 protease. The peptide substrate KSGVF↓VVNGK, cleaved by both PRs, was used as a template for the design of a reduced amide inhibitor, Ac-GSGVFΨ(CH2NH)VVNGL-NH2. This compound inhibited both FIV and HIV-1 PRs with approximately equal efficiency. These findings establish a molecular basis for distinctions in substrate specificity between human and feline lentivirus PRs and offer a framework for development of efficient broad-based inhibitors.
PMCID: PMC114513  PMID: 11533208
8.  Episodic Sexual Transmission of HIV Revealed by Molecular Phylodynamics 
PLoS Medicine  2008;5(3):e50.
The structure of sexual contact networks plays a key role in the epidemiology of sexually transmitted infections, and their reconstruction from interview data has provided valuable insights into the spread of infection. For HIV, the long period of infectivity has made the interpretation of contact networks more difficult, and major discrepancies have been observed between the contact network and the transmission network revealed by viral phylogenetics. The high rate of HIV evolution in principle allows for detailed reconstruction of links between virus from different individuals, but often sampling has been too sparse to describe the structure of the transmission network. The aim of this study was to analyze a high-density sample of an HIV-infected population using recently developed techniques in phylogenetics to infer the short-term dynamics of the epidemic among men who have sex with men (MSM).
Methods and Findings
Sequences of the protease and reverse transcriptase coding regions from 2,126 patients, predominantly MSM, from London were compared: 402 of these showed a close match to at least one other subtype B sequence. Nine large clusters were identified on the basis of genetic distance; all were confirmed by Bayesian Monte Carlo Markov chain (MCMC) phylogenetic analysis. Overall, 25% of individuals with a close match with one sequence are linked to 10 or more others. Dated phylogenies of the clusters using a relaxed clock indicated that 65% of the transmissions within clusters took place between 1995 and 2000, and 25% occurred within 6 mo after infection. The likelihood that not all members of the clusters have been identified renders the latter observation conservative.
Reconstruction of the HIV transmission network using a dated phylogeny approach has revealed the HIV epidemic among MSM in London to have been episodic, with evidence of multiple clusters of transmissions dating to the late 1990s, a period when HIV prevalence is known to have doubled in this population. The quantitative description of the transmission dynamics among MSM will be important for parameterization of epidemiological models and in designing intervention strategies.
Using viral genotype data from HIV drug resistance testing at a London clinic, Andrew Leigh Brown and colleagues derive the structure of the transmission network through phylogenetic analysis.
Editors' Summary
Human immunodeficiency virus (HIV), the cause of acquired immunodeficiency syndrome (AIDS), is mainly spread through unprotected sex with an infected partner. Like other sexually transmitted diseases, HIV/AIDS spreads through networks of sexual contacts. The characteristics of these complex networks (which include people who have serial sexual relationships with single partners and people who have concurrent sexual relationships with several partners) affect how quickly diseases spread in the short term and how common the disease is in the long term. For many sexually transmitted diseases, sexual contact networks can be reconstructed from interview data. The information gained in this way can be used for partner notification so that transmitters of the disease and people who may have been unknowingly infected can be identified, treated, and advised about disease prevention. It can also be used to develop effective community-based prevention strategies.
Why Was This Study Done?
Although sexual contact networks have provided valuable information about the spread of many sexually transmitted diseases, they cannot easily be used to understand HIV transmission patterns. This is because the period of infectivity with HIV is long and the risk of infection from a single sexual contact with an infected person is low. Another way to understand the spread of HIV is through phylogenetics, which examines the genetic relatedness of viruses obtained from different individuals. Frequent small changes in the genetic blueprint of HIV allow the virus to avoid the human immune response and to become resistant to antiretroviral drugs. In this study, the researchers use recently developed analytical methods, viral sequences from a large proportion of a specific HIV-infected population, and information on when each sample was taken, to learn about transmission of HIV/AIDS in London among men who have sex with men (MSM; a term that encompasses gay, bisexual, and transgendered men and heterosexual men who sometimes have sex with men). This new approach, which combines information on viral genetic variation and viral population dynamics, is called “molecular phylodynamics.”
What Did the Researchers Do and Find?
The researchers compared the sequences of the genes encoding the HIV-1 protease and reverse transcriptase from more than 2,000 patients, mainly MSM, attending a large London HIV clinic between 1997 and 2003. 402 of these sequences closely matched at least one other subtype B sequence (the HIV/AIDS epidemic among MSM in the UK primarily involves HIV subtype B). Further analysis showed that the patients from whom this subset of sequences came formed six clusters of ten or more individuals, as well as many smaller clusters, based on the genetic relatedness of their HIV viruses. The researchers then used information on the date when each sample was collected and a “relaxed clock” approach (which accounts for the possibility that different sequences evolve at different rates) to determine dated phylogenies (patterns of genetic relatedness that indicate when gene sequences change) for the clusters. These phylogenies indicated that at least in one in four transmissions between the individuals in the large clusters occurred within 6 months of infection, and that most of the transmissions within each cluster occurred over periods of 3–4 years during the late 1990s.
What Do These Findings Mean?
This phylodynamic reconstruction of the HIV transmission network among MSM in a London clinic indicates that the HIV epidemic in this population has been episodic with multiple clusters of transmission occurring during the late 1990s, a time when the number of HIV infections in this population doubled. It also suggests that transmission of the virus during the early stages of HIV infection is likely to be an important driver of the epidemic. Whether these results apply more generally to the MSM population at risk for transmitting or acquiring HIV depends on whether the patients in this study are representative of that group. Additional studies are needed to determine this, but if the patterns revealed here are generalizable, then this quantitative description of HIV transmission dynamics should help in the design of strategies to strengthen HIV prevention among MSM.
Additional Information.
Please access these Web sites via the online version of this summary at
Read a related PLoS Medicine Perspective article
Information is available from the US National Institute of Allergy and Infectious Diseases on HIV infection and AIDS
HIV InSite has comprehensive information on all aspects of HIV/AIDS, including a list of organizations that provide information for gay men and MSM
The US Centers for Disease Control and Prevention provides information on HIV/AIDS and on HIV/AIDS among MSM (in English and Spanish)
Information is available from Avert, an international AIDS charity, on HIV, AIDS, and men who have sex with men
The Center for AIDS Prevention Studies (University of California, San Francisco) provides information on sexual networks and HIV prevention
The US National Center for Biotechnology Information provides a science primer on molecular phylogenetics
UK Collaborative Group on HIV Drug Resistance maintains a database of resistance tests
HIV i-Base offers HIV treatment information for health-care professionals and HIV-positive people
The NIH-funded HIV Sequence Database contains data on genetic sequences, resistance, immunology, and vaccine trials
PMCID: PMC2267814  PMID: 18351795
9.  Computational Analysis of HIV-1 Protease Protein Binding Pockets 
Mutations that arise in HIV-1 protease after exposure to various HIV-1 protease inhibitors have proved to be a difficult aspect in the treatment of HIV. Mutations in the binding pocket of the protease can prevent the protease inhibitor from binding to the protein effectively. In the present study, the crystal structures of 68 HIV-1 proteases complexed with one of the nine FDA approved protease inhibitors from the Protein Data Bank (PDB) were analyzed by (a) identifying the mutational changes with the aid of a developed mutation map and (b) correlating the structure of the binding pockets with the complexed inhibitors. The mutations of each crystal structure were identified by comparing the amino acid sequence of each structure against the HIV-1 wild type strain HXB2. These mutations were visually presented in the form of a mutation map to analyze mutation patterns corresponding to each protease inhibitor. The crystal structure mutation patterns of each inhibitor (in vitro) were compared against the mutation patterns observed in in vivo data. The in vitro mutation patterns were found to be representative of most of the major in vivo mutations. We then performed a data mining analysis of the binding pockets from each crystal structure in terms of their chemical descriptors to identify important structural features of the HIV-1 protease protein with respect to the binding conformation of the HIV-1 protease inhibitors. Data mining analysis is performed using several classification techniques: Random Forest (RF), linear discriminant analysis (LDA), and logistic regression (LR). We developed two hybrid models, RF-LDA and RF-LR. Random Forest is used as a feature selection proxy, reducing the descriptor space to a few of the most relevant descriptors determined by the classifier. These descriptors are then used to develop the subsequent LDA, LR, and hierarchical classification models. Clustering analysis of the binding pockets using the selected descriptors used to produce the optimal classification models reveals conformational similarities of the ligands in each cluster. This study provides important information in understanding the structural features of HIV-1 protease which cannot be studied by other existing in vivo genomic datasets.
PMCID: PMC2981608  PMID: 20925403
10.  pol gene diversity of five human immunodeficiency virus type 1 subtypes: evidence for naturally occurring mutations that contribute to drug resistance, limited recombination patterns, and common ancestry for subtypes B and D. 
Journal of Virology  1997;71(9):6348-6358.
Naturally occurring mutations in the polymerase gene of human immunodeficiency virus type 1 (HIV-1) have important implications for therapy and the outcome of clinical studies. Using 42 virus isolates obtained from the UNAIDS sample collection, we analyzed the protease (99 amino acids [aa]) and the first 297 aa of reverse transcriptase (RT) coding regions. Based on the V3 sequence analysis, the collection includes subtype A (n = 5), subtype B (n = 12), subtype C (n = 1), subtype D (n = 11), and subtype E (n = 13) viruses. Of the 42 protease genes, 37 contained naturally occurring mutations at positions in the gene that contribute to resistance to protease inhibitors (indinavir, saquinavir, ritonavir, and nelfinavir) in clade B isolates. The phenotypic effect of these substitutions in non-B isolates is unclear. The The 5'half RT coding region of the 42 isolates was found to be less variable, although 19 of the 42 RT sequences contained amino acid substitutions known to contribute to nucleoside and/or nonnucleoside drug resistance. Since the virus isolates were obtained in 1992, it is unlikely that the infected subjects received protease inhibitors, but we found evidence that one subject acquired a zidovudine (AZT)-resistant HIV-1 strain from a contact who had received AZT. Phylogenetic analysis identified five subtype pol clusters: A, B, C, D, and A'. Comparison of env and pol sequences of the same viruses showed no more recombination events than were already identified on the basis of gag/env comparison (M. Cornelissen, G. Kampinga, F. Zorgdrager, J. Goudsmit, and the UNAIDS Network for HIV Isolation and Characterization, J. Virol. 70:8209-8212, 1996). In one of the known recombinants, a crossover site between subtypes A and C could be identified, and in another, a crossover site could not be identified due to lack of a reference subtype F pol sequence. We analyzed the ds/da ratio of gag, pol, and env sequences of 35 isolates, excluding the recombinants. Our analysis showed that gag and pol are subjected to purifying selection with an average ds/da ratio above 1, independent of the subtype and in contrast with V3 (ds/da approximately 1). Based on the low ds/da ratio of the intergroup analysis of A/E and B/D gag and pol sequences, we analyzed the evolutionary relation between subtypes B and D in more detail by constructing separate phylogenetic trees for synonymous and nonsynonymous substitutions. Our analysis suggests a common ancestry for subtypes B and D that is distinct from that of subtypes A and E.
PMCID: PMC191908  PMID: 9261352
11.  Global analysis of sequence diversity within HIV-1 subtypes across geographic regions 
Future virology  2012;7(5):505-517.
HIV-1 sequence diversity can affect host immune responses and phenotypic characteristics such as antiretroviral drug resistance. Current HIV-1 sequence diversity classification uses phylogeny-based methods to identify subtypes and recombinants, which may overlook distinct subpopulations within subtypes. While local epidemic studies have characterized sequence-level clustering within subtypes using phylogeny, identification of new genotype – phenotype associations are based on mutational correlations at individual sequence positions. We perform a systematic, global analysis of position-specific pol gene sequence variation across geographic regions within HIV-1 subtypes to characterize subpopulation differences that may be missed by standard subtyping methods and sequence-level phylogenetic clustering analyses.
Materials & methods
Analysis was performed on a large, globally diverse, cross-sectional pol sequence dataset. Sequences were partitioned into subtypes and geographic subpopulations within subtypes. For each subtype, we identified positions that varied according to geography using VESPA (viral epidemiology signature pattern analysis) to identify sequence signature differences and a likelihood ratio test adjusted for multiple comparisons to characterize differences in amino acid (AA) frequencies, including minority mutations. Synonymous nonsynonymous analysis program (SNAP) was used to explore the role of evolutionary selection witihin subtype C.
In 7693 protease (PR) and reverse transcriptase (RT) sequences from untreated patients in multiple geographic regions, 11 PR and 11 RT positions exhibited sequence signature differences within subtypes. Thirty six PR and 80 RT positions exhibited within-subtype geography-dependent differences in AA distributions, including minority mutations, at both conserved and variable loci. Among subtype C samples from India and South Africa, nine PR and nine RT positions had significantly different AA distributions, including one PR and five RT positions that differed in consensus AA between regions. A selection analysis of subtype C using SNAP demonstrated that estimated rates of nonsynonymous and synonymous mutations are consistent with the possibility of positive selection across geographic subpopulations within subtypes.
We characterized systematic genotypic pol differences across geographic regions within subtypes that are not captured by the subtyping nomenclature. Awareness of such differences may improve the interpretation of future studies determining the phenotypic consequences of genetic backgrounds.
PMCID: PMC3400699  PMID: 22822410
geography; HIV-1; pol gene sequences; protease; reverse transcriptase; subtyping
12.  The Transcription Factor Mrr1p Controls Expression of the MDR1 Efflux Pump and Mediates Multidrug Resistance in Candida albicans 
PLoS Pathogens  2007;3(11):e164.
Constitutive overexpression of the MDR1 (multidrug resistance) gene, which encodes a multidrug efflux pump of the major facilitator superfamily, is a frequent cause of resistance to fluconazole and other toxic compounds in clinical Candida albicans strains, but the mechanism of MDR1 upregulation has not been resolved. By genome-wide gene expression analysis we have identified a zinc cluster transcription factor, designated as MRR1 (multidrug resistance regulator), that was coordinately upregulated with MDR1 in drug-resistant, clinical C. albicans isolates. Inactivation of MRR1 in two such drug-resistant isolates abolished both MDR1 expression and multidrug resistance. Sequence analysis of the MRR1 alleles of two matched drug-sensitive and drug-resistant C. albicans isolate pairs showed that the resistant isolates had become homozygous for MRR1 alleles that contained single nucleotide substitutions, resulting in a P683S exchange in one isolate and a G997V substitution in the other isolate. Introduction of these mutated alleles into a drug-susceptible C. albicans strain resulted in constitutive MDR1 overexpression and multidrug resistance. By comparing the transcriptional profiles of drug-resistant C. albicans isolates and mrr1Δ mutants derived from them and of C. albicans strains carrying wild-type and mutated MRR1 alleles, we defined the target genes that are controlled by Mrr1p. Many of the Mrr1p target genes encode oxidoreductases, whose upregulation in fluconazole-resistant isolates may help to prevent cell damage resulting from the generation of toxic molecules in the presence of fluconazole and thereby contribute to drug resistance. The identification of MRR1 as the central regulator of the MDR1 efflux pump and the elucidation of the mutations that have occurred in fluconazole-resistant, clinical C. albicans isolates and result in constitutive activity of this trancription factor provide detailed insights into the molecular basis of multidrug resistance in this important human fungal pathogen.
Author Summary
The Candida albicans MDR1 (multidrug resistance) gene encodes a multidrug efflux pump of the major facilitator superfamily that is constitutively overexpressed in many fluconazole-resistant strains. Although MDR1 overexpression is a major cause of resistance to this widely used antifungal agent and other metabolic inhibitors, so far the molecular basis of MDR1 upregulation in resistant strains has remained elusive. By comparing the transcription profiles of MDR1 overexpressing, clinical C. albicans isolates and matched, drug-susceptible isolates from the same patients, we identified a transcription factor, termed multidrug resistance regulator 1 (MRR1), which was upregulated in all resistant isolates and turned out to be a central regulator of MDR1 expression. Resistant isolates contained point mutations in MRR1, which rendered the transcription factor constitutively active. Introduction of these mutated alleles into a susceptible strain caused MDR1 overexpression und multidrug resistance. Inactivation of MRR1 in clinical isolates abolished MDR1 expression and affected fluconazole resistance even more strongly than deletion of the MDR1 efflux pump itself, indicating that additional Mrr1p target genes, which were identified by genome-wide gene expression analysis, contribute to fluconazole resistance. These findings provide detailed insights into the molecular basis of multidrug resistance in one of the most important human fungal pathogens.
PMCID: PMC2048531  PMID: 17983269
13.  Genetic Diversity of Mycobacterium tuberculosis in Peru and Exploration of Phylogenetic Associations with Drug Resistance 
PLoS ONE  2013;8(6):e65873.
There is limited available data on the strain diversity of M tuberculosis in Peru, though there may be interesting lessons to learn from a setting where multidrug resistant TB has emerged as a major problem despite an apparently well-functioning DOTS control programme.
Spoligotyping was undertaken on 794 strains of M tuberculosis collected between 1999 and 2005 from 553 community-based patients and 241 hospital-based HIV co-infected patients with pulmonary tuberculosis in Lima, Peru. Phylogenetic and epidemiologic analyses permitted identification of clusters and exploration of spoligotype associations with drug resistance.
Mean patient age was 31.9 years, 63% were male and 30.4% were known to be HIV+. Rifampicin mono-resistance, isoniazid mono-resistance and multidrug resistance (MDR) were identified in 4.7%, 8.7% and 17.3% of strains respectively. Of 794 strains from 794 patients there were 149 different spoligotypes. Of these there were 27 strains (3.4%) with novel, unique orphan spoligotypes. 498 strains (62.7%) were clustered in the nine most common spoligotypes: 16.4% SIT 50 (clade H3), 12.3% SIT 53 (clade T1), 8.3% SIT 33 (LAM3), 7.4% SIT 42 (LAM9), 5.5% SIT 1 (Beijing), 3.9% SIT 47 (H1), 3.0% SIT 222 (clade unknown), 3.0% SIT1355 (LAM), and 2.8% SIT 92 (X3). Amongst HIV-negative community-based TB patients no associations were seen between drug resistance and specific spoligotypes; in contrast HIV-associated MDRTB, but not isoniazid or rifampicin mono-resistance, was associated with SIT42 and SIT53 strains.
Two spoligotypes were associated with MDR particularly amongst patients with HIV. The MDR-HIV association was significantly reduced after controlling for SIT42 and SIT53 status; residual confounding may explain the remaining apparent association. These data are suggestive of a prolonged, clonal, hospital-based outbreak of MDR disease amongst HIV patients but do not support a hypothesis of strain-specific propensity for the acquisition of resistance-conferring mutations.
PMCID: PMC3691179  PMID: 23826083
14.  Emergence of protease inhibitor resistance mutations in human immunodeficiency virus type 1 isolates from patients and rapid screening procedure for their detection. 
Antimicrobial Agents and Chemotherapy  1996;40(11):2535-2541.
Patient human immunodeficiency virus type 1 (HIV-1) isolates that are resistant to protease inhibitors may contain amino acid substitutions L10I/V, M46L/I, G-48V, L63P, V82A/F/T, I84V, and L90M in the protease gene. Substitutions at positions 82 and/or 90 occur in variants that display high levels of resistance to certain protease inhibitors. Nucleotide substitutions at these two sites also lead to the loss of two HindII restriction enzyme digestion sites, and these changes make possible a rapid procedure for the detection of drug-resistant variants in patients on protease inhibitor therapy. This procedure was used to detect the emergence of mutated viruses at various times after the initiation of therapy with the HIV-1 protease inhibitor indinavir. The method includes viral RNA isolation from plasma and reverse transcription PCR amplification of the protease gene with fluorescence-tagged primers. The PCR product is digested with HindII, the cleavage products are separated on a urea-acrylamide gel in a DNA sequencer, and the extent of cleavage is automatically analyzed with commercially available software. In viruses from 34 blood samples from four patients, mutations leading to an amino acid change at residue 82 appeared as early as 6 weeks after the start of therapy and persisted throughout the course of the study period (48 weeks). Mutations leading to double substitutions at residues 82 and 90 were seen at a lower frequency and appeared later than the change at position 82. The changes detected by restriction enzyme cleavage were confirmed by DNA sequencing of the cloned protease genes by reverse transcription PCR amplification of viral RNA from isolates in plasma. In addition to the changes at positions 82 and 90, we have identified M46L/I, G48V, and I54V substitutions in isolates derived from indinavir-treated patients. HindII analysis of uncloned, PCR-amplified DNA offers a rapid screening procedure for the detection of virus isolates containing mutations at amino acid residues 82 and 90 in the HIV-1 protease gene. By using other restriction enzymes, the same method can be used to detect additional protease drug-resistant variants and is generally applicable for the detection of mutations.
PMCID: PMC163570  PMID: 8913459
15.  The Role of Viral Introductions in Sustaining Community-Based HIV Epidemics in Rural Uganda: Evidence from Spatial Clustering, Phylogenetics, and Egocentric Transmission Models 
PLoS Medicine  2014;11(3):e1001610.
Using different approaches to investigate HIV transmission patterns, Justin Lessler and colleagues find that extra-community HIV introductions are frequent and likely play a role in sustaining the epidemic in the Rakai community.
Please see later in the article for the Editors' Summary
It is often assumed that local sexual networks play a dominant role in HIV spread in sub-Saharan Africa. The aim of this study was to determine the extent to which continued HIV transmission in rural communities—home to two-thirds of the African population—is driven by intra-community sexual networks versus viral introductions from outside of communities.
Methods and Findings
We analyzed the spatial dynamics of HIV transmission in rural Rakai District, Uganda, using data from a cohort of 14,594 individuals within 46 communities. We applied spatial clustering statistics, viral phylogenetics, and probabilistic transmission models to quantify the relative contribution of viral introductions into communities versus community- and household-based transmission to HIV incidence. Individuals living in households with HIV-incident (n = 189) or HIV-prevalent (n = 1,597) persons were 3.2 (95% CI: 2.7–3.7) times more likely to be HIV infected themselves compared to the population in general, but spatial clustering outside of households was relatively weak and was confined to distances <500 m. Phylogenetic analyses of gag and env genes suggest that chains of transmission frequently cross community boundaries. A total of 95 phylogenetic clusters were identified, of which 44% (42/95) were two individuals sharing a household. Among the remaining clusters, 72% (38/53) crossed community boundaries. Using the locations of self-reported sexual partners, we estimate that 39% (95% CI: 34%–42%) of new viral transmissions occur within stable household partnerships, and that among those infected by extra-household sexual partners, 62% (95% CI: 55%–70%) are infected by sexual partners from outside their community. These results rely on the representativeness of the sample and the quality of self-reported partnership data and may not reflect HIV transmission patterns outside of Rakai.
Our findings suggest that HIV introductions into communities are common and account for a significant proportion of new HIV infections acquired outside of households in rural Uganda, though the extent to which this is true elsewhere in Africa remains unknown. Our results also suggest that HIV prevention efforts should be implemented at spatial scales broader than the community and should target key populations likely responsible for introductions into communities.
Please see later in the article for the Editors' Summary
Editors' Summary
About 35 million people (25 million of whom live in sub-Saharan Africa) are currently infected with HIV, the virus that causes AIDS, and about 2.3 million people become newly infected every year. HIV destroys immune system cells, leaving infected individuals susceptible to other infections. HIV infection can be controlled by taking antiretroviral drugs (antiretroviral therapy, or ART) daily throughout life. Although originally available only to people living in wealthy countries, recent political efforts mean that 9.7 million people in low- and middle-income countries now have access to ART. However, ART does not cure HIV infection, so prevention of viral transmission remains extremely important. Because HIV is usually transmitted through unprotected sex with an infected partner, individuals can reduce their risk of infection by abstaining from sex, by having one or a few partners, and by using condoms. Male circumcision also reduces HIV transmission. In addition to reducing illness and death among HIV-positive people, ART also reduces HIV transmission.
Why Was This Study Done?
Effective HIV control requires an understanding of how HIV spreads through sexual networks. These networks include sexual partnerships between individuals in households, between community members in different households, and between individuals from different communities. Local sexual networks (household and intra-community sexual partnerships) are sometimes assumed to be the dominant driving force in HIV spread in sub-Saharan Africa, but are viral introductions from sexual partnerships with individuals outside the community also important? This question needs answering because the effectiveness of interventions such as ART as prevention partly depends on how many new infections in an intervention area are attributable to infection from partners residing in that area and how many are attributable to infection from partners living elsewhere. Here, the researchers use three analytical methods—spatial clustering statistics, viral phylogenetics, and egocentric transmission modeling—to ask whether HIV transmission in rural Uganda is driven predominantly by intra-community sexual networks. Spatial clustering analysis uses the geographical coordinates of households to measure the tendency of HIV-infected people to cluster spatially at scales consistent with community transmission. Viral phylogenetic analysis examines the genetic relatedness of viruses; if transmission is through local networks, viruses in newly infected individuals should more closely resemble viruses in other community members than those in people outside the community. Egocentric transmission modelling uses information on the locations of recent sexual partners to estimate the proportions of new transmissions from household, intra-community, and extra-community partners.
What Did the Researchers Do and Find?
The researchers applied their three analytical methods to data collected from 14,594 individuals living in 46 communities (governmental administrative units) in Rakai District, Uganda. Spatial clustering analysis indicated that individuals who lived in households with individuals with incident HIV (newly diagnosed) or prevalent HIV (previously diagnosed) were 3.2 times more likely than the general population to be HIV-positive themselves. Spatial clustering outside households was relatively weak, however, and was confined to distances of less than half a kilometer. Viral phylogenetic analysis indicated that 44% of phylogenetic clusters (viruses with related genetic sequences found in more than one individual) were within households, but that 40% of clusters crossed community borders. Finally, analysis of the locations of self-reported sexual partners indicated that 39% of new viral transmissions occurred within stable household partnerships, but that among people newly infected by extra-household partners, nearly two-thirds were infected by partners from outside their community.
What Do These Findings Mean?
The results of all three analyses suggest that HIV introductions into communities are frequent and are likely to play an important role in sustaining HIV transmission in the Rakai District. Specifically, within this rural HIV-endemic region (a region where HIV infection is always present), viral introductions combined with intra-household transmission account for the majority of new infections, although community-based sexual networks also play a critical role in HIV transmission. These findings may not be generalizable to the broader Ugandan population or to other regions of Africa, and their accuracy is likely to be limited by the use of self-reported sexual partner data. Nevertheless, these findings indicate that the dynamics of HIV transmission in rural Uganda (and probably elsewhere) are complex. Consequently, to halt the spread of HIV, prevention efforts will need to be implemented at spatial scales broader than individual communities, and key populations that are likely to introduce HIV into communities will need to be targeted.
Additional Information
Please access these websites via the online version of this summary at
Information is available from the US National Institute of Allergy and Infectious Diseases on HIV infection and AIDS
NAM/aidsmap provides basic information about HIV/AIDS, and summaries of recent research findings on HIV care and treatment
Information is available from Avert, an international AIDS charity, on many aspects of HIV/AIDS, including information on HIV and AIDS in Uganda and on HIV prevention strategies (in English and Spanish)
The UNAIDS Report on the Global AIDS Epidemic 2013 provides up-to-date information about the AIDS epidemic and efforts to halt it
The Center for AIDS Prevention Studies (University of California, San Francisco) has a fact sheet about sexual networks and HIV prevention
Wikipedia provides information on spatial clustering analysis (note that Wikipedia is a free online encyclopedia that anyone can edit; available in several languages)
A PLOS Computational Biology Topic Page (a review article that is a published copy of record of a dynamic version of the article as found in Wikipedia) about viral phylodynamics is available
Personal stories about living with HIV/AIDS are available through Avert, NAM/aidsmap, and Healthtalkonline
PMCID: PMC3942316  PMID: 24595023
16.  Inferring within-patient HIV-1 evolutionary dynamics under anti-HIV therapy using serial virus samples with vSPA 
BMC Bioinformatics  2009;10:360.
Analysis of within-patient HIV evolution under anti-HIV therapy is crucial to a better understanding the possible mechanisms of HIV drug-resistance acquisition. The high evolutionary rate of HIV allows us to trace its evolutionary process in real time by analyzing virus samples serially collected from the same patient. However, such studies are still uncommon due to the lack of powerful computational methods designed for serial virus samples. In this study, we develop a computational method, vSPA (viral Sequential Pathway Analysis), which groups viral sequences from the same sampling time into clusters and traces the evolution between clusters over sampling times. The method makes use of information of different sampling times and traces the evolution of important amino acid mutations. Second, a permutation test at the codon level is conducted to determine the threshold of the correlation coefficient for clustering viral quasispecies. We applied vSPA to four large data sets of HIV-1 protease and reverse transcriptase genes serially collected from two AIDS patients undergoing anti-HIV therapy over several years.
The results show that vSPA can trace within-patient HIV evolution by detecting many amino acid changes, including important drug-resistant mutations, and by classifying different viral quasispecies coexisting during different periods of the therapy.
Given that many new anti-HIV drugs will be available in the near future, vSPA may be useful for quickly providing information on the acquisition of HIV drug-resistant mutations by monitoring the within-patient HIV evolution under anti-HIV therapy as a computational approach.
PMCID: PMC2776027  PMID: 19863822
17.  Amino Acid Preferences for a Critical Substrate Binding Subsite of Retroviral Proteases in Type 1 Cleavage Sites 
Journal of Virology  2005;79(7):4213-4218.
The specificities of the proteases of 11 retroviruses representing each of the seven genera of the family Retroviridae were studied using a series of oligopeptides with amino acid substitutions in the P2 position of a naturally occurring type 1 cleavage site (Val-Ser-Gln-Asn-Tyr↓Pro-Ile-Val-Gln; the arrow indicates the site of cleavage) in human immunodeficiency virus type 1 (HIV-1). This position was previously found to be one of the most critical in determining the substrate specificity differences of retroviral proteases. Specificities at this position were compared for HIV-1, HIV-2, equine infectious anemia virus, avian myeloblastosis virus, Mason-Pfizer monkey virus, mouse mammary tumor virus, Moloney murine leukemia virus, human T-cell leukemia virus type 1, bovine leukemia virus, human foamy virus, and walleye dermal sarcoma virus proteases. Three types of P2 preferences were observed: a subgroup of proteases preferred small hydrophobic side chains (Ala and Cys), and another subgroup preferred large hydrophobic residues (Ile and Leu), while the protease of HIV-1 preferred an Asn residue. The specificity distinctions among the proteases correlated well with the phylogenetic tree of retroviruses prepared solely based on the protease sequences. Molecular models for all of the proteases studied were built, and they were used to interpret the results. While size complementarities appear to be the main specificity-determining features of the S2 subsite of retroviral proteases, electrostatic contributions may play a role only in the case of HIV proteases. In most cases the P2 residues of naturally occurring type 1 cleavage site sequences of the studied proteases agreed well with the observed P2 preferences.
PMCID: PMC1061542  PMID: 15767422
18.  Human Immunodeficiency Virus Type 1 Reverse-Transcriptase and Protease Subtypes 
The Journal of infectious diseases  2001;184(8):998-1006.
Phylogenetic analysis of the reverse transcriptase (RT) and protease of 117 published complete human immunodeficiency virus (HIV) type 1 genome sequences demonstrated that these genes cluster into distinct subtypes. There was a slightly higher proportion of informative sites in the RT (40.4%) than in the protease (34.8%; P = .03). Although most variation between subtypes was due to synonymous nucleotide substitutions, several subtype-specific amino acid patterns were observed. In the protease, the subtype-specific variants included 7 positions associated with drug resistance. Variants at positions 10, 20, 36, and 82 were more common in non-B isolates, whereas variants at positions 63, 77, and 93 were more common in subtype B isolates. In the RT, the subtype-specific mutations did not include positions associated with anti-retroviral drug resistance. RT and protease sequences from 2246 HIV-infected persons in northern California were also examined: 99.4% of the sequences clustered with subtype B, whereas 0.6% clustered with subtype A, C, or D.
PMCID: PMC2597357  PMID: 11574914
19.  Multidrug-Resistant Tuberculosis in Panama Is Driven by Clonal Expansion of a Multidrug-Resistant Mycobacterium tuberculosis Strain Related to the KZN Extensively Drug-Resistant M. tuberculosis Strain from South Africa 
Journal of Clinical Microbiology  2013;51(10):3277-3285.
Multidrug-resistant tuberculosis (MDR-TB) is a significant health problem in Panama. The extent to which such cases are the result of primary or acquired resistance and the strain families involved are unknown. We performed whole-genome sequencing of a collection of 66 clinical MDR isolates, along with 31 drug-susceptible isolates, that were isolated in Panama between 2001 and 2010; 78% of the MDR isolates belong to the Latin American-Mediterranean (LAM) family. Drug resistance mutations correlated well with drug susceptibility profiles. To determine the relationships among these strains and to better understand the acquisition of resistance mutations, a phylogenetic tree was constructed based on a genome-wide single-nucleotide polymorphism analysis. The phylogenetic tree shows that the isolates are highly clustered, with a single strain (LAM9-c1) accounting for nearly one-half of the MDR isolates (29/66 isolates). The LAM9-c1 strain was most prevalent among male patients of working age and was associated with high mortality rates. Members of this cluster all share identical mutations conferring resistance to isoniazid (KatG S315T mutation), rifampin (RpoB S531L mutation), and streptomycin (rrs C517T mutation). This evidence of primary resistance supports a model in which MDR-TB in Panama is driven by clonal expansion and ongoing transmission of several strains in the LAM family, including the highly successful MDR strain LAM9-c1. The phylogenetic analysis also shows that the LAM9-c1 strain is closely related to the KwaZulu-Natal (KZN) extensively drug-resistant TB strain identified in KwaZulu-Natal, South Africa. The LAM9-c1 and KZN strains likely arose from a recent common ancestor that was transmitted between Panama and South Africa and had the capacity to tolerate an accumulation of multiple resistance mutations.
PMCID: PMC3811646  PMID: 23884993
20.  A new ensemble coevolution system for detecting HIV-1 protein coevolution 
Biology Direct  2015;10:1.
A key challenge in the field of HIV-1 protein evolution is the identification of coevolving amino acids at the molecular level. In the past decades, many sequence-based methods have been designed to detect position-specific coevolution within and between different proteins. However, an ensemble coevolution system that integrates different methods to improve the detection of HIV-1 protein coevolution has not been developed.
We integrated 27 sequence-based prediction methods published between 2004 and 2013 into an ensemble coevolution system. This system allowed combinations of different sequence-based methods for coevolution predictions. Using HIV-1 protein structures and experimental data, we evaluated the performance of individual and combined sequence-based methods in the prediction of HIV-1 intra- and inter-protein coevolution. We showed that sequence-based methods clustered according to their methodology, and a combination of four methods outperformed any of the 27 individual methods. This four-method combination estimated that HIV-1 intra-protein coevolving positions were mainly located in functional domains and physically contacted with each other in the protein tertiary structures. In the analysis of HIV-1 inter-protein coevolving positions between Gag and protease, protease drug resistance positions near the active site mostly coevolved with Gag cleavage positions (V128, S373-T375, A431, F448-P453) and Gag C-terminal positions (S489-Q500) under selective pressure of protease inhibitors.
This study presents a new ensemble coevolution system which detects position-specific coevolution using combinations of 27 different sequence-based methods. Our findings highlight key coevolving residues within HIV-1 structural proteins and between Gag and protease, shedding light on HIV-1 intra- and inter-protein coevolution.
This article was reviewed by Dr. Zoltán Gáspári.
Electronic supplementary material
The online version of this article (doi:10.1186/s13062-014-0031-8) contains supplementary material, which is available to authorized users.
PMCID: PMC4332441  PMID: 25564011
HIV-1; Protein coevolution; Gag; Protease; Ensemble coevolution system; Sequence-based method
21.  Crystal Structures of a Multidrug-Resistant Human Immunodeficiency Virus Type 1 Protease Reveal an Expanded Active-Site Cavity 
Journal of Virology  2004;78(6):3123-3132.
The goal of this study was to use X-ray crystallography to investigate the structural basis of resistance to human immunodeficiency virus type 1 (HIV-1) protease inhibitors. We overexpressed, purified, and crystallized a multidrug-resistant (MDR) HIV-1 protease enzyme derived from a patient failing on several protease inhibitor-containing regimens. This HIV-1 variant contained codon mutations at positions 10, 36, 46, 54, 63, 71, 82, 84, and 90 that confer drug resistance to protease inhibitors. The 1.8-angstrom (Å) crystal structure of this MDR patient isolate reveals an expanded active-site cavity. The active-site expansion includes position 82 and 84 mutations due to the alterations in the amino acid side chains from longer to shorter (e.g., V82A and I84V). The MDR isolate 769 protease “flaps” stay open wider, and the difference in the flap tip distances in the MDR 769 variant is 12 Å. The MDR 769 protease crystal complexes with lopinavir and DMP450 reveal completely different binding modes. The network of interactions between the ligands and the MDR 769 protease is completely different from that seen with the wild-type protease-ligand complexes. The water molecule-forming hydrogen bonds bridging between the two flaps and either the substrate or the peptide-based inhibitor are lacking in the MDR 769 clinical isolate. The S1, S1′, S3, and S3′ pockets show expansion and conformational change. Surface plasmon resonance measurements with the MDR 769 protease indicate higher koff rates, resulting in a change of binding affinity. Surface plasmon resonance measurements provide kon and koff data (Kd = koff/kon) to measure binding of the multidrug-resistant protease to various ligands. This MDR 769 protease represents a new antiviral target, presenting the possibility of designing novel inhibitors with activity against the open and expanded protease forms.
PMCID: PMC354404  PMID: 14990731
22.  Amino Acid Preferences of Retroviral Proteases for Amino-Terminal Positions in a Type 1 Cleavage Site▿ ‡  
Journal of Virology  2008;82(20):10111-10117.
The specificities of the proteases of 11 retroviruses were studied using a series of oligopeptides with amino acid substitutions in the P1, P3, and P4 positions of a naturally occurring type 1 cleavage site (Val-Ser-Gln-Asn-Tyr↓Pro-Ile-Val-Gln) in human immunodeficiency virus type 1 (HIV-1). Previously, the substrate specificity of the P2 site was studied for the same representative set of retroviral proteases, which included at least one member from each of the seven genera of the family Retroviridae (P. Bagossi, T. Sperka, A. Fehér, J. Kádas, G. Zahuczky, G. Miklóssy, P. Boross, and J. Tözsér, J. Virol. 79:4213-4218, 2005). Our enzyme set comprised the proteases of HIV-1, HIV-2, equine infectious anemia virus, avian myeloblastosis virus (AMV), Mason-Pfizer monkey virus, mouse mammary tumor virus (MMTV), Moloney murine leukemia virus, human T-lymphotropic virus type 1, bovine leukemia virus, walleye dermal sarcoma virus, and human foamy virus. Molecular models were used to interpret the similarities and differences in specificity between these retroviral proteases. The results showed that the retroviral proteases had similar preferences (Phe and Tyr) for the P1 position in this sequence context, but differences were found for the P3 and P4 positions. Importantly, the sizes of the P3 and P4 residues appear to be a major contributor for specificity. The substrate specificities correlated well with the phylogenetic tree of the retroviruses. Furthermore, while the specificities of some enzymes belonging to different genera appeared to be very similar (e.g., those of AMV and MMTV), the specificities of the primate lentiviral proteases substantially differed from that observed for a nonprimate lentiviral protease.
PMCID: PMC2566267  PMID: 18701588
23.  Transmission Cluster of Multiclass Highly Drug-Resistant HIV-1 Among 9 Men Who Have Sex With Men in Seattle/King County, WA, 2005−2007 
From 2005 through 2007, Seattle health care providers identified cases of primary multiclass drug-resistant (MDR) HIV-1 with common patterns of resistance to antiretrovirals (ARVs). Through surveillance activities and genetic analysis, the local Health Department and the University of Washington identified phylogenetically linked cases among ARV treatment–naive and -experienced individuals.
HIV-1 pol nucleotide consensus sequences submitted to the University of Washington Clinical Virology Laboratory were assessed for phylogenetically related MDR HIV. Demographic and clinical data collected included HIV diagnosis date, ARV history, and laboratory results.
Seven ARV-naive men had phylogenetically linked MDR strains with resistance to most ARVs; these were linked to 2 ARV-experienced men. All 9 men reported methamphetamine use and multiple anonymous male partners. Primary transmissions were diagnosed for more than a 2-year period, 2005−2007. Three, including the 2 ARV-experienced men, were prescribed ARVs.
This cluster of 9 men with phylogenetically related highly drug-resistant MDR HIV strains and common risk factors but without reported direct epidemiologic links may have important implications to public health. This cluster demonstrates the importance of primary resistance testing and of collaboration between the public and private medical community in identifying MDR outbreaks. Public health interventions and surveillance are needed to reduce transmission of MDR HIV-1.
PMCID: PMC2586929  PMID: 18769347
HIV; HIV-1; multiple drug resistance; disease clustering; highly active antiretroviral therapy
24.  Susceptibility to PNU-140690 (Tipranavir) of Human Immunodeficiency Virus Type 1 Isolates Derived from Patients with Multidrug Resistance to Other Protease Inhibitors 
In our study we examined the anti-human immunodeficiency virus type 1 (anti-HIV-1) activity of a novel HIV-1 protease inhibitor, PNU-140690 (tipranavir), against patient-derived isolates resistant to multiple other protease inhibitors (PIs). The aim of our experiments was to investigate the genotypes and the in vitro phenotypes of drug resistance of PNU-140690. We carried out drug susceptibility tests with peripheral blood mononuclear cells and a fixed amount of infectious virus (1,000 50% tissue culture infective doses) to determine the 50% inhibitory concentration (IC50) and IC90, PCR assays for the detection of drug resistance mutations in RNA in plasma, and direct sequencing of PCR products. Phenotypic resistance to PIs was invariably related to genotypic mutations. The substitutions among the amino acid residues of the protease included L10I, K20R, L24I, M36I, N37D, G48V, I54V, L63P, I64V, A71V, V77I, V82A, I84V, and L90M. Isolates from all of the patients had developed a maximal degree of resistance to indinavir, ritonavir, and nelfinavir (IC50s, >0.1 μM). We also compared these mutations with the amino acid changes previously described in association with in vivo tipranavir administration. The mutations included the following: I15V, E35D, N37D, R41K, D60E, and A71T. Infections with IIIB, 14aPre, and N70 were inhibited by an average drug IC90 of 0.18 ± 0.02 μM in multiple experiments. The average mean ± standard error of mean IC90 for the entire group of multidrug-resistant isolates derived from the mean values for two culture wells with p24 antigen supernatant appeared to be 0.619 ± 0.055 μM (range, 0.31 to 0.86 μM). Tipranavir retained a sustained antiviral activity against PI-MDR clinical isolates and might be useful in combination regimens with other antiretroviral agents for patients who have already failed other PI-containing therapies.
PMCID: PMC89863  PMID: 10770770
25.  Correlated Electrostatic Mutations Provide a Reservoir of Stability in HIV Protease 
PLoS Computational Biology  2012;8(9):e1002675.
HIV protease, an aspartyl protease crucial to the life cycle of HIV, is the target of many drug development programs. Though many protease inhibitors are on the market, protease eventually evades these drugs by mutating at a rapid pace and building drug resistance. The drug resistance mutations, called primary mutations, are often destabilizing to the enzyme and this loss of stability has to be compensated for. Using a coarse-grained biophysical energy model together with statistical inference methods, we observe that accessory mutations of charged residues increase protein stability, playing a key role in compensating for destabilizing primary drug resistance mutations. Increased stability is intimately related to correlations between electrostatic mutations – uncorrelated mutations would strongly destabilize the enzyme. Additionally, statistical modeling indicates that the network of correlated electrostatic mutations has a simple topology and has evolved to minimize frustrated interactions. The model's statistical coupling parameters reflect this lack of frustration and strongly distinguish like-charge electrostatic interactions from unlike-charge interactions for of the most significantly correlated double mutants. Finally, we demonstrate that our model has considerable predictive power and can be used to predict complex mutation patterns, that have not yet been observed due to finite sample size effects, and which are likely to exist within the larger patient population whose virus has not yet been sequenced.
Author Summary
HIV is incurable because its enzymes evolve rapidly by developing resistance mutations to retroviral inhibitors. Most of these mutations work synergistically, but the biophysical basis behind their cooperation is not well understood. Our work addresses these important issues by bridging the gap between the statistical modeling of HIV protease subtype B sequences with the energetics of mutations involving charged amino acids by showing that electrostatic stability is intimately related to correlations. Moreover, we demonstrate that our statistical model has considerable predictive power and can be used to predict complex mutation patterns that have not yet been observed due to the finite sizes of the current sequence databases. In other words, as the database size increases, our model has the ability to predict the identities of the high probability mutations patterns, which are more likely to be observed. Knowing which currently unobserved mutations are more likely to be observed can be very advantageous in combating the disease.
PMCID: PMC3435258  PMID: 22969420

Results 1-25 (840271)