Recent years have seen revolutions in both genomics and computational science (Lander et al.,
2001; Venter et al.,
2001; Chen et al.,
2012). In both of these fields, capabilities are advancing exponentially (Kahn,
2011). The impact of this non-linear development on biology is pervasive and multifaceted. With respect to virus research, the influence has been profound and is the focus of this special issue of
Frontiers. Medical interest in viruses is focused on pathogens and their infection, and the biological mirror of infection is the host immune response. Advances in genomics and computational science have the potential for an equally profound impact on our understanding of the immune response. Here we focus on the application of new genomic and computational techniques, particularly 454 pyrosequencing of B cell transcripts (Reddy et al.,
2009; Reddy and Georgiou,
2011; Wu et al.,
2011) and systems-level bioinformatics (Kitano,
2002), to understand the antibody response to infection.
The human immunodeficiency virus type I, HIV-1, is the etiological agent of a global pandemic, which has killed over 30 million people, and currently infects ~1% of adults worldwide (UNAIDS,
2010). HIV-1 is a retrovirus and member of the lentivirus genus (Gonda et al.,
1985; Sonigo et al.,
1985). Global genetic diversity of HIV-1 is extraordinarily high (Starcich et al.,
1986; Korber et al.,
2001), and this is thought to result from the low fidelity of its genome replication (Preston et al.,
1988) as well as the persistent nature of the infection: the diversity of HIV-1 virus within a single individual after 6

years of infection is equivalent to the global diversity of H1N1 influenza observed annually (Korber et al.,
2001). Infection by HIV-1 elicits many antibodies, but in general these are not capable of neutralization of diverse strains of HIV-1. However, after several years of infection, 10–25% of infected individuals develop broadly neutralizing antibodies (Li et al.,
2007; Gray et al.,
2009; Sather et al.,
2009; Simek et al.,
2009; Stamatatos et al.,
2009; Doria-Rose et al.,
2010; Gnanakaran et al.,
2010). These antibodies provide little or no benefit to the infected host, as the evolution of the virus outpaces the immune response (Parren et al.,
1999; Poignard et al.,
1999; Wei et al.,
2003). Nevertheless these antibodies, when tested in humanized mice or macaque models by passive antibody transfer, impart effective immunity to challenge with HIV-1 or simian/human chimeric immunodeficiency viruses (Mascola et al.,
1999,
2000; Parren et al.,
2001; Mascola,
2003; Veazey et al.,
2003; Hessell et al.,
2009a,
b; Balazs et al.,
2011), indicating the potential for their use as targets for re-elicitation by rationally designed vaccines (reviewed in Walker and Burton,
2010; Kwong et al.,
2011). Thus, substantial interest has focused on understanding human antibodies that effectively neutralize diverse strains of HIV-1.
A number of techniques have recently been applied to identification of such antibodies. These methods – including antigen-specific B cell sorting (Scheid et al.,
2009; Wu et al.,
2010) and direct assessment of neutralization by antibodies secreted from individually sorted B cells (Walker et al.,
2009,
2011), each coupled to single B cell sequencing techniques – have so far yielded dozens of broadly HIV-1-neutralizing antibodies. These antibodies represent an extraordinarily sparse sampling of the humoral immune response, which typically generates roughly a billion new B cells in a healthy individual each day. We therefore asked whether the revolutionary new capabilities of next-generation sequencing (Mardis,
2008a,
b; Boyd et al.,
2010; Hawkins et al.,
2010) and computational science could expand this sampling to generate a more complete understanding of the humoral immune response. In principle, memory B cells contain a persistent record of the antibody response to infection. As memory B cells are readily attained from blood, they provide a convenient means to access the antibody record, with B cell transcripts in peripheral blood mononuclear cells (PBMCs) providing a genetic representation. Using three antibodies, PGT135–137 from Protocol G donor 39 (Walker et al.,
2011) as an example, we used 454 pyrosequencing of PCR-amplified heavy- and light-chain transcripts to capture a more comprehensive genetic record. We used bioinformatics approaches to interrogate this record, to identify populations of neutralizing antibodies, and to characterize their ontogenies. We link these ontogenies to the natural mechanisms of B cell development to provide a view of how somatic populations of antibodies engender a diverse immunological response to infection.