Centromeres, the DNA regions at which microtubules attach to and segregate daughter chromosomes, are essential for genome integrity. Point centromeres have been characterized extensively because of their small size and relative simplicity, especially in
Saccharomyces cerevisiae. They are composed of one centromere-specific nucleosome that spans <200 bp of DNA, organized in a tripartite structure that includes a specific DNA binding site necessary for centromere function (reviewed in
[1],
[2]). In contrast, regional centromeres are found in most eukaryotes, including fungi other than a subgroup of the Saccharomycotina
[3]. They span very large DNA domains (10′s to 1000′s of kb) and are organized into core DNA regions associated with centromere-specific nucleosomes and flanked by highly repetitive DNA packaged into pericentric heterochromatin (reviewed in
[4]). Regional centromere function is epigenetic in character
[5]–
[8]: rather than being dependent upon a specific DNA sequence, the presence of CENP-A, the centromere-specific histone H3 variant, defines the position of a functional centromere. It is thought that CENP-A is regulated by its stabilization at functional centromeres/kinetochores, because at non-centromeric loci CENP-A is removed and proteolyzed
[9], (reviewed in
[7]).
Neocentromeres, defined as functional kinetochores that assemble at ectopic positions, usually appear together with other chromosome rearrangement events (reviewed in
[6]). Neocentromeres have the properties of active centromeres, and, by definition, they associate with CENP-A
[10]–
[12]. Over 90 examples of human neocentromeres have been documented, most involving the formation of supernumerary chromosomes and often associated with developmental disabilities or specific cancers
[6]. Many more neocentromeres likely escape detection because they are eliminated during development
[6],
[8],
[13]. It is not known if neocentromere formation occurs first, followed by mutation of the natural centromere or, conversely, if mutation of the natural centromere leads to neocentromere formation.
The chromosomal position of neocentromeres is different in different organisms. In Drosophila, they have been found only adjacent to chromosome breaks that inactivated the original centromere
[14],
[15]. In
S. pombe they appear only at telomeric loci
[16]. In contrast, human neocentromeres exhibit flexible adaptation to changes in chromosome structure, often appearing far from the site of the original centromere at either terminal or submetacentric loci (reviewed in
[6]).
No specific DNA sequence properties necessary for functional regional centromere assembly have been identified. Regional centromeres in humans, flies, plants and fungi are composed of long tracts of repetitive DNA, yet repeat tracts are not absolutely required for centromere function or for the formation of neocentromeres (reviewed in
[8],
[17]–
[19]). In
S. pombe, pericentric heterochromatin formation is necessary for efficient
de novo assembly of a functional kinetochore
[20], as well as for formation of telocentric neocentromeres
[16].
C. albicans, an opportunistic fungal pathogen that resides as a commensal in its human host, possess regional centromeres that are much smaller and simpler than other regional centromeres
[3],
[21]. Each of its 8 diploid chromosomes has a centromere that is regional based on its size (~3–4.5 kb that specifically associates with CENP-A
Cse4p [22],
[23]), the lack of tripartite point-centromere DNA structure, the presence of several orthologs of proteins found only at regional centromeres, and the absence of orthologs of proteins found only at point centromeres
[3].
Most notably,
C. albicans CENs lack pericentric heterochromatin: CENP-A associated core sequences are not embedded in long tracts of repetitive DNA
[21],
[22]; there are no clear orthologs of either heterochromatin protein 1 (HP1) or of enzymes necessary for the methylation of histone H3 lysine 9; and there is no homolog of CENP-V, a protein that regulates the extent of pericentric chromatin in human cells
[24]. Consistent with a lack of pericentric heterochromatin, genes near the centromeres are transcribed at levels close to the average level of transcription across the genome (K. E. S. Tang and JB, data not shown). Furthermore periodic nucleosome spacing is seen at inactive centromere DNA and not at active CENs, suggesting that nucleosomes at active CENs do not associate tightly with a specific DNA sequence
[21]. Naked
CEN7 DNA used to transform
C. albicans did not permit
de novo assembly of centromere function
[21],
[22]. Taken together, these observations suggest that, like other regional CENs, the assembly of a centromere and the inheritance of centromere function in
C. albicans requires epigenetic properties conferred by the association of CENP-A and other kinetochore proteins, rather than by a specific DNA sequence.
The stoichiometry of microtubules and centromere-specific nucleosomes differs in different organisms.
S. cerevisiae has only one CENP-A nucleosome and one microtubule per centromere, while
S. pombe has ~2–3 CENP-A nucleosomes
[25] and ~2–4 microtubules per centromere
[26]. This suggests that one microtubule is attached to kinetochore proteins assembled at each CENP-A nucleosome
[25]. In humans, the number of CENP-A nucleosomes is thought to be far larger than the number of microtubule attachments
[4],
[27]. In
C. albicans there are ~8 CENP-A
Cse4p molecules per centromere, presumed to be organized into 4 centromere-specific nucleosomes, and only one microtubule per centromere
[25]. This suggests that only one of the four CENP-A nucleosomes at each centromere assembles a kinetochore structure that binds a microtubule. Thus, the prevailing model is that at
C. albicans CENs, as at human centromeres, some CENP-A-containing nucleosomes bind microtubules while others do not.
Seven of the eight
C. albicans centromeres are near short repeats; only the centromere of Chr7 (
CEN7) does not have obvious repeat sequences nearby
[22]. Yet most analysis of
C. albicans centromere function has been performed with
CEN7 DNA, which is necessary for Chr7 stability
[23]. On Chrs 2, 3 and 6 there are direct repeats within ~3 kb of the centromere core sequence. On Chrs 1, 4, 5 and R the CENP-A bound centromere core DNA is flanked by a short inverted repeat (IR). The palindromic structure of the four centromeres with a flanking IR is most reminiscent of the structure of
S. pombe and other regional centromeres.
Here, we studied the properties of C. albicans CEN5, a centromere with a palindromic structure, by replacing the CEN5 DNA and the flanking IR with a URA3 marker. We found that it behaved like a regional centromere: the resulting cen5Δ::URA3 Chr5 derivatives were stably maintained through mitosis by efficient formation of a neocentromere at one of several non-centromeric loci. Loss of the wild-type chromosome 5 homolog and homozygosis of the neoCEN5 homolog demonstrated that cells can survive in the absence of any CEN5 DNA. Some of the cen5Δ transformants formed a “proximal neoCEN” near cen5Δ::URA3, which subsequently moved onto, and silenced the URA3 gene. Other cen5Δ transformants formed “distal neoCENs” at several different loci far from the deleted CEN5 locus. Thus, neocentromere formation does not require a specific DNA sequence and can occur at several different chromosomal loci.