Kinetochores are eukaryote-specific structures that assemble on centromeric (
CEN) DNA and perform three crucial functions: they bind paired sister chromatids to spindle microtubules (MTs) in a bipolar fashion compatible with chromatid disjunction; they couple MT (+)-end polymer dynamics to chromosome movement during metaphase and anaphase [
1]; and they generate the spindle checkpoint signals linking anaphase onset to the completion of kinetochore-MT attachment [
2]. Despite the conservation of these functions, and of MT structure and dynamics,
CENs in closely related organisms are highly diverged in sequence, as are
CENs on different chromosomes in a single organism [
2,
3]. The simplest known
CENs, those in the budding yeast
Saccharomyces cerevisiae, consist of 125 base-pairs (bp) of DNA and three protein-binding motifs (CDEI, CDEII and CDEIII) that are present on all 16 chromosomes [
4]. These short
CEN sequences, often called 'point'
CENs, are structurally similar to enhancers and transcriptional regulators in that their assembly is initiated by highly sequence-selective DNA-protein interactions [
5]. In contrast,
CEN DNA in fungi such as the budding yeast
Candida albicans and fission yeast
Schizosaccharomyces pombe, plants such as
Arabidopsis thaliana, and metazoans such as
Drosophila melanogaster and
Homo sapiens, are longer and more complex and exhibit poor sequence conservation [
6-
10]. These regional
CENs range in size from 1 kb in
C. albicans [
6], to several megabases in
H. sapiens [
8] and typically contain long stretches of repetitive AT-rich DNA.
CEN organization is particularly divergent in nematodes such as
Caenorhabditis elegans, which contain holocentric
CENs with MT-attachment sites distributed along the length of chromosomes [
11]. Sequence-selective DNA-protein interactions have not been identified in regional
CENs and it is thought that kinetochore position is determined by a specialized chromatin domain whose formation at one site on each chromosome is controlled by epigenetic mechanisms [
2,
12].
A combination of genetics and mass spectrometry in
S. cerevisiae has yielded a fairly detailed view of the composition and architecture of its simple kinetochores.
S. cerevisiae kinetochores contain upwards of 70 protein subunits organized into 14 or more multi-protein complexes that together have a molecular mass in excess of 5 to 10 MDa [
5].
S. cerevisiae kinetochore proteins can be assigned to DNA-binding, linker, MT-binding and regulatory functions. While 'linker protein' is used rather loosely, all linkers exhibit a clear hierarchical relationship with respect to DNA and MT-binding proteins: linker proteins require DNA binding proteins, and possibly also other linker proteins, for
CEN DNA binding but not MTs or MT-associated proteins (MAPs).
Kinetochore assembly in
S. cerevisiae is initiated by association of the essential four-protein CBF3 complex with the CDEIII region of
CEN DNA. CBF3-CDEIII association then recruits several additional DNA binding proteins, including scCse4, a specialized histone H3 found only at
CENs (CenH3). CenH3-containing nucleosomes are thought to be core components of all kinetochores [
13]. When
CEN associated, the DNA binding subunits of
S. cerevisiae kinetochores recruit four essential multi-protein linker complexes, the NDC80 complex (four proteins), COMA (four proteins), MIND (four proteins) and the SPC105 complex (two proteins). These complexes, in turn, recruit a multiplicity of motor proteins and MAPs to form a fully functional MT-attachment site (P De Wulf and PK Sorger, unpublished observation) [
14-
16].
A key question in the study of kinetochores is whether architectural features currently being elucidated in
S. cerevisiae are conserved in higher cells. Some
S. cerevisiae proteins have been shown to have orthologs in one or more metazoa. These metazoan orthologs include CenH3, CENP-C
Mif2, Mis6
Ctf3/CENP-I, Spc105
KNL-1/Kia1570, members of the NDC80 and MIND complexes as well as MT-associated proteins such as EB1
Bim1 and CLIP170
Bik1, Mad-Bub spindle checkpoint proteins and some regulatory kinases [
2,
17-
26]. To date, however, only CenH3 and CENP-C have been carefully compared at a sequence level in a wide range of organisms [
27]. Here we report a systematic analysis of sequence relationships among a set of approximately 50 fungal, plant and metazoan kinetochore proteins with the overall aim of exploring their structural and evolutionary relationships. Our analysis supports the conclusion that the four linkers at the core of
S. cerevisiae kinetochores, the NDC80 complex, MIND, COMA, and the SPC105 complex, have been conserved through eukaryotic evolution. A subset of kinetochore proteins, perhaps 20% of the total in
S. cerevisiae, seems to be specific to point
CENs, all of which are very closely related. A second set of kinetochore proteins is found only on regional
CENs. It appears, therefore, that all kinetochores have a single ancestor, probably based on a regional
CEN, from which contemporary kinetochores diverged rapidly while conserving key structural features.