The category of virus is biological, not evolutionary. Viruses are intracellular parasites that depend on the host cell for their protein synthesis, most of the reactions of nucleic acid precursor biosynthesis and, to a variable extent, transcription and replication (
15). Clearly, viruses are not a monophyletic group. There is little doubt, for example, that small viruses with single-stranded RNA genomes of only 5 to 10 kb, such as poliovirus or tobacco mosaic virus, on the one hand, and large viruses with double-stranded DNA (dsDNA) genomes of 100 to 500 kb, such as herpesviruses, poxviruses, or iridoviruses, on the other hand, have evolved independently. However, comparative analyses of the genomes of many groups of viruses have suggested common origins for large, heterogeneous assemblages. For example, it appears most likely that all reverse-transcribing viruses and mobile elements, in spite of the extreme diversity of their life cycles and the sets of encoded proteins, have evolved from a common ancestor (
17,
56,
70). Even more unexpected evolutionary connections are suggested by the involvement of homologous enzymes, such as superfamily III helicases, in genome replication of both RNA and DNA viruses with small genomes (
23), and the central role of the conserved rolling circle replication initiator protein in single-stranded DNA (ssDNA) viruses of eukaryotes and bacteria and in bacterial plasmids (
26).
Viruses with large, dsDNA genomes are generally thought to have evolved by capturing multiple genes from the genomes of cellular organisms, their hosts. Indeed, many genes of these viruses, particularly those involved in virus-host interactions, show high levels of protein sequence similarity to their cellular homologs, which is apparently indicative of relatively recent acquisition by the viral genomes (
12,
51,
59). However, viruses belonging to a particular large family, such as the herpesvirus family or the poxvirus family, share between themselves a core set of genes encoding proteins involved in DNA replication, transcription, and virion biogenesis, most of which are only moderately similar to cellular homologs, if such are detectable at all (
3,
51). The existence of core sets of up to 40 to 50 conserved viral genes (
8,
22) establishes beyond reasonable doubt that the extant members of the families
Herpesviridae and
Poxviridae have diverged from the respective ancestral viruses that already possessed the principal features of genome replication and expression and of virion structure that are typical of these viral families. In contrast, it remains unclear whether there are any evolutionary connections between different viral families. Poxviruses, African swine fever virus (ASFV, the archetypal member of the family
Asfarviridae), and iridoviruses are the three families of eukaryotic viruses with large dsDNA genomes that undergo their replication cycle either entirely in the cytoplasm (poxviruses) or start their replication in the nucleus and complete it in the cytoplasm (
20,
22,
38,
40,
63,
67), as opposed to herpesviruses and baculoviruses, whose DNA replication and transcription occur exclusively in the nucleus (
30,
65). Poxviruses, asfarviruses, and iridoviruses encode their own transcription machinery, which includes, in each case, several RNA polymerase subunits and additional transcription factors, and share several other conserved genes (
58,
72). Large DNA viruses isolated from very diverse algae, the
Paramecium bursaria chlorella virus (PBCV) and the related
Ectocarpus siliculosus virus (ESV), members of the
Phycodnaviridae family, also share several genes with nucleocytoplasmic large DNA viruses, although genomes of these viruses are transcribed in the nucleus and, accordingly, they lack genes for RNA polymerase subunits (
41,
61). The four families of large eukaryotic DNA viruses,
Poxviridae,
Asfarviridae,
Iridoviridae, and
Phycodnaviridae, to which we collectively refer here as nucleocytoplasmic large DNA viruses (NCLDV), have both common and unique features of genomic DNA and virion structure. Poxviruses, ASFV, and PBCV have linear DNA genomes with terminal inverted repeats that form covalently closed hairpins (
40,
67,
75), iridoviruses have circularly permuted linear genomes (
60), and ESV appears to have a circular genome (
41). The virions of ASFV, iridoviruses, and PBCV consist of a DNA-protein core that is surrounded by a lipid bilayer, which in turn is encased in one or more icosahedral capsid shells (
58,
63,
66). Poxviruses have a more complex, unique virion structure, with a core surrounded by a “brick-shaped” proteolipid shell (
40).
It remains uncertain whether the similarities between the gene repertoires, genome structures, and virion architectures of different families of NCLDV are due to independent recruitment of the same or related host genes driven by the common functional requirements for the viral replication cycles or by origin from a common viral ancestor. This crucial dilemma is not readily amenable to conventional phylogenetic analysis because even homologous proteins of viruses from different families show moderate or weak sequence conservation and may be less similar to each other than to the corresponding cellular homologs (
51). At face value, these observations appear to favor the polyphyletic origin of different viral families. However, this aspect of the relationships between viruses needs to be interpreted with caution given the realistic possibility of rapid evolution of viral genes (
44). Moreover, such rapid divergence potentially might even preclude the very detection of evolutionary relationships between some viral genes. Given these considerations, we were interested in delineating the complete set of conserved genes among NCLDV by applying the most advanced available methods for sequence similarity detection and assessing the hypothesis of independent recruitment of similar sets of genes from the host as opposed to an origin of several viral families from a single, ancestor virus. We expand the list of conserved genes shared by all or a majority of NCLDV families and show that origin from a common viral ancestor is the most parsimonious scenario for the evolution of all of these viruses.