The ability to simultaneously screen for a large panel of pathogens in clinical samples, especially viruses, will represent a major development in the diagnosis of infectious diseases and in surveillance programs for emerging pathogens. Currently, most diagnostic methods are based on species-specific viral nucleic acid amplification. Although rapid and extremely sensitive, these methods are suboptimal when testing for a large number of known pathogens, when viral sequence divergence is high, when new but related viruses are anticipated, or when no clear viral etiologic agent is suspected. To overcome these technical difficulties, newer technologies have been employed, especially microarrays dedicated to pathogen detection. Indeed, DNA microarrays have been shown to be a powerful platform for the highly multiplexed differential diagnosis of infectious diseases. For example, pathogen microarrays can be simultaneously used to screen various viral or bacterial families and have been successfully used in the detection of microbial agents from different clinical samples (
10-
12,
19,
32,
35,
41,
42,
48).
The “classical” DNA microarrays developed so far are based on the use of long-oligonucleotide pathogen-specific probes (≥50 nucleotides [nt]). Although powerful in terms of sensitivity, these diagnostic tools have the disadvantage of decreased specificity, making it necessary to target multiple markers, and rely on hybridization patterns for pathogen identification, leading to unquantifiable errors (
4). Moreover, these methods lack comprehensive information about the pathogen at the single-nucleotide level, which could represent a major problem when the sequences in question show a high degree of similarity (
21). The microarray-based pathogen resequencing assay represents a promising alternative tool with which to overcome these limitations. This method identifies each specific pathogen and is capable of resequencing, or “fingerprinting,” multiple pathogens in a single test. Indeed, this technology uses tiled sets of 10
5 to 10
6 probes of 25mers, which contain one perfectly matched and three mismatched probes per base for both strands of the target genes (
16). This technology also offers the potential for a single test that detects and discriminates between a target pathogen and its closest phylogenetic neighbors, which expands the repertoire of identifiable organisms far beyond those that are initially included in the array. Successful results have been obtained using this technology, especially for the detection of broad-spectrum respiratory tract pathogens using respiratory pathogen microarrays (
2,
25,
26) or the detection of a broad range of biothreat agents (
1,
23,
36,
45). The amplification step, which is more often limiting for this technology, has also benefited from recent developments. Phi29 polymerase-based amplification methods provide amplified DNA with minimal changes in sequence and relative abundance for many biomedical applications (
3,
31,
40). The amplification factor varied from 10
6 to 10
9, and it was also demonstrated that coamplification occurred when viral RNA was mixed with bacterial DNA (
3). This whole-transcriptome amplification (WTA) approach can also be successfully applied to viral genomic RNA of all sizes. Amplifying viral RNA by WTA provides considerably better sensitivity and accuracy of detection than random reverse transcription (RT)-PCR in the context of resequencing microarrays (RMAs) (
3).
The rhabdoviruses are single-stranded, negative-sense RNA genome viruses classified into six genera, three of which—
Vesiculovirus,
Lyssavirus, and
Ephemerovirus—include arthropod-borne agents that infect birds, reptiles, and mammals, as well as a variety of non-vector-borne mammalian or fish viruses (International Committee on Taxonomy of Viruses database [ICTVdb]) (reviewed in reference
7). These rhabdoviruses are the etiological agents of human diseases, such as rabies, that cause serious public health problems. Some rhabdoviruses also cause important economic losses in livestock. The three others genera include
Nucleorhabdovirus and
Cytorhabdovirus, which are arthropod-borne viruses infecting plants, and
Novirhabdovirus, which comprises fish viruses. Other than the well-characterized rhabdoviruses that are known to be important for agriculture and public health, there is also a constantly growing list of rhabdoviruses, isolated from a variety of vertebrate and invertebrate hosts, that are partially characterized and are still waiting for definitive genus or species assignment. Considering the large spectrum of potential animal reservoirs of these viruses compared to the few identified virus species, it is highly likely that the number of uncharacterized rhabdoviruses is immense.
Unclassified or unassigned viruses have been tentatively identified as members of the family
Rhabdoviridae by electron microscopy, based on their bullet-shaped morphology—a characteristic trait of members of this family—or using their antigenic relationships based on serological tests (
9,
38). Gene sequencing and phylogenetic relationships have then been progressively applied to complete this initial virus taxonomy (
6,
22,
27). Importantly, a strongly conserved domain in the rhabdovirus genome, within the polymerase gene, is a useful target for the exploration of the distant evolutionary relationships among these diverse viruses (
6). This region corresponds to block III of the viral polymerase, a region predicted to be essential for RNA polymerase function, as it is highly conserved among most of the RNA-dependent RNA polymerases (
14,
33,
46). A direct application using this sequence region was recently described for lyssavirus RNA detection in human rabies diagnosis (
13). Taking advantage of these characteristics, this polymerase region was also used to design probes for high-density RMAs, also called PathogenID arrays (Affymetrix), which are optimized for the detection and sequence determination of several RNA viruses, particularly rhabdoviruses (
1).
In the present study, PathogenID microarrays containing probes for the detection of up to 126 viruses were tested using a consensus sequence determination strategy for the analysis of output RMA data. We demonstrate that this approach has the potential to identify, in experimentally infected and clinical specimens, known but also phylogenetically related rhabdoviruses for which precise sequence information was not available.