Although Staurastrum and Zygnema cpDNAs bear high similarity in primary sequence and gene content to their Chaetosphaeridium and land plant counterparts, they differ substantially from one another and from the latter genomes in overall structure, gene order and intron content. From our comparative analysis of streptophyte cpDNAs, we infer that the chloroplast genome of the last common ancestor of Staurastrum and Zygnema probably lacked a large IR encoding the rRNA genes, had a low gene density, and more closely resembled Chaetosphaeridium and land plant cpDNAs at the gene organizational and intron levels than do Zygnema and Staurastrum cpDNAs. At least 16 of the 22 intron positions commonly found in land plant cpDNAs, including three sites that have not been identified in Chaetosphaeridium, were probably present in the common ancestor of Staurastrum and Zygnema.
Considering the absence of an rDNA-encoding IR region in both
Staurastrum and
Zygnema cpDNAs, it is not surprising that these genomes are considerably rearranged relative to their coleochaetalean and land plants counterparts that have retained the quadripartite structure. All green plant cpDNAs that have lost the IR tend to be highly scrambled in gene order [
26,
27]. It has been hypothesized that the loss of the IR enhances opportunities for intramolecular recombination between small dispersed repeats [
28]. In agreement with the idea that there is a direct link between the frequency of intramolecular recombination events and the abundance of small dispersed repeats [
28], we identified more rearrangements in the repeat-rich genome of
Zygnema than in the repeat-poor genome of
Staurastrum. As in the cpDNAs of the nonphotosynthetic, parasitic flowering plant
Epifagus virginiana [
29] and the evening primrose
Oenothera [
30], the repeated sequences in
Zygnema cpDNA consist essentially of tandem repeats that probably arose by replication slippage.
A single event of IR loss likely accounts for the absence of a quadripartite structure from both
Staurastrum and
Zygnema cpDNAs. This hypothesis is more parsimonious than the alternative scenario involving two independent losses, and is consistent with previous evidence that the cpDNA of
Spirogyra (a distant relative of
Zygnema) has no IR [
19]. It is also supported by our finding that
Staurastrum and
Zygnema cpDNAs share 11 rearrangement breakpoints within ancestral gene clusters. Given the close connection between IR loss and gene rearrangements, several of these shared breakpoints might have appeared following the loss of the IR in the lineage leading to the last common ancestor of
Staurastrum and
Zygnema. Considering that this ancestor occupies a basal position in the tree describing the relationships among zygnematalean green algae [
21,
22], then most, if not all, of the algae belonging to the Zygnematales are expected to lack an IR in their chloroplast genome.
As introns appear to be generally stable in land plant cpDNAs [
28], the important difference in intron content displayed by
Staurastrum and
Zygnema cpDNAs is unexpected. The two zygnematalean cpDNAs share only five of the 16 intron insertion sites they exhibit in total.
Staurastrum cpDNA lacks seven of the 13 introns that are present in
Zygnema cpDNA, whereas the latter cpDNA lacks five of the eight introns found in the former genome. The intron distributions in these cpDNAs are best explained by assuming that all 16 insertion sites were populated with introns in the common ancestor of
Staurastrum and
Zygnema and that subsequently, several introns were specifically lost in each of the lineages leading to these green algae. Obviously, we cannot exclude the possibility that chloroplast introns occupying common insertion sites were lost independently in the
Staurastrum and
Zygnema lineages; thus, the predicted number of introns in the common ancestor of these algae may represent a minimal estimate. Given that intron losses are thought to result from insertions, through homologous recombination, of intron-less cDNA copies generated by reverse transcription [
31], the frequency of homologous recombination events or the level of reverse transcriptase activity might be higher in the chloroplasts of conjugating green algae than in land plant chloroplasts. In this respect, it is interesting to note that the
Staurastrum trans-spliced
rps12 intron specifies a reverse transcriptase and is the only known streptophyte chloroplast intron encoding such an activity.
Our finding that
matK is free-standing in
Zygnema cpDNA together with the absence of the
trnK(uuu) intron in which it usually resides strongly suggests that its putative maturase product is essential for the splicing of group II introns other than the
trnK(uuu) intron. Circumstantial evidence that MatK functions in splicing of multiple introns has previously been reported for land plant chloroplasts. The
matK gene is located within the group II intron of
trnK(uuu) in all photosynthetic land plants, but occurs as a free-standing gene in
Epifagus cpDNA [
29].
In vivo splicing analyses of the complete set of chloroplast group II introns in land plant mutants lacking chloroplast ribosomes disclosed specific splicing defects involving mainly group IIA introns (in
atpF,
rpl2,
rps12,
trnA,
trnI,
trnK), thus implying that cpDNA-encoded protein(s) act as splicing factors [
32-
35]. It has been proposed that MatK evolved from a
trnK(uuu) intron-specific maturase to a more versatile maturase that assists the splicing of most or all group IIA introns of land plants [
32-
35].