Mobile elements make up ~45% of the human genome
[1]. Among them are L1 and
Alu elements, that have been active since well before the divergence of the human and chimpanzee lineages, and remain active in their host genomes. These two elements mobilize via a “copy and paste” mechanism and integrate into new genomic regions by means of an RNA intermediate
[2]. A full-length functional L1 element is about 6 kb in length and able to code for enzymes which are required for L1 retrotransposition, making the L1 an autonomous element
[3]. By contrast, the
Alu element is 300 bp long and does not encode the means of its own retrotransposition, instead borrowing the enzymatic machinery of the L1 elements for its propagation
[4],
[5], making it a non-autonomous mobile element. Although L1 elements contribute the most to the genome in terms of total size,
Alu elements are the most successful mobile element family in terms of copy number, reaching a copy number of ~1.2 million in the human genome
[6].
L1 and
Alu elements have played an important role in shaping their host genomes. They can alter gene expression patterns and cause chromosomal rearrangements through various mechanisms including novel insertion, insertion-mediated deletion, and unequal homologous recombination between elements
[7]–
[9]. Sequence identity between two retrotransposons of the same type (e.g.,
Alu-
Alu and L1-L1) can lead to non-allelic homologous recombination between them, that subsequently results in chromosomal rearrangements such as duplications, deletions, translocations, and inversions
[9]–
[12]. Such recombination can cause species-specific local genomic instability and has been reported as a major source of genomic disorders
[13].
Inverted
Alu and L1 pairs (i.e., two
Alu elements or two L1 elements inserted in opposite orientations along a chromosome) have caused chromosomal rearrangements in their host genomes through several mechanisms including large inverted duplications, translocations, inversions, and deletions
[14]–
[16]. Due to their sequence similarity, they have the ability to form a hairpin structure in single-stranded DNA or a cruciform structure in double-stranded DNA
[15],
[17],
[18]. These structures can potentially block progression of the replication fork and cause intra- or inter-molecular template switching of DNA polymerase between the inverted elements
[15],
[19]. In reality, inverted
Alu pairs cause a 1000-fold increase in homologous recombination
[15]. Here, we report for the first time a genome-wide analysis of
retrotransposon
recombination-
mediated
inversion (RRMI), causing genomic and subsequently phenotypic differences between humans and chimpanzees. The previously reported mechanism,
Alu recombination-
mediated
deletion (ARMD), alters or interrupts gene function through the deletion of intronic and exonic regions. By contrast, RRMI usually does not cause any change in genome size. Instead, it could alter the structure of genes or transcription of genes by inverting intron or exon sequences and introducing alternative gene splicing sites. Through the comparison of human and chimpanzee draft genome sequences
[6],
[20], we identified 49 RRMI loci, 28 of which were human-specific inversions and 21 were chimpanzee-specific inversions. Among them, 53% of the RRMI occurred within genic regions. Interestingly, we found that three RRMI events caused alteration of exonic regions in known genes with ten RRMIs that are polymorphic within a species. These findings suggest that recombination between inverted L1 and
Alu pairs might have generated genomic variation within a species as well as between species.