Cyclin-dependent kinases (CDKs) are protein kinases involved in critical cellular processes, such as cell cycle or transcription, whose activity requires association with specific cyclin subunits. Based on sequence similarity, the human genome contains 21 genes encoding CDKs and five additional genes encoding a more distant group of proteins known as CDK like (CDKL) kinases. The current nomenclature for CDK proteins includes 11 classical CDKs (CDK1-11), two newly proposed family members (CDK12-13) and additional proteins whose names are based on the presence of a cyclin-binding element (PFTAIRE and PCTAIRE proteins) or simply based on sequence relationship with the original CDKs, such as CDC2-like kinases (CDC2L) or Cell cycle-related kinases (CCRK)1. We propose here the use of ‘CDK’ for all CDK family members (CDK1-20) based on similarities in sequence and function as described below. This nomenclature will facilitate the rational and comparative study of the poorly understood family members and the analysis of their relevance in human disease.
During the Cold Spring Harbor Symposium on the Cell Cycle in 1991, a group of interested scientists proposed that members of this kinase family would be called cyclin dependent kinases (CDKs). The consensus was that no kinase should be called a ‘CDK’ until it was proven rigorously that its activity depended on association with some cyclin-like regulatory subunit. Known family members were then were renamed CDK1-6 and further members (CDK7-CDK10) were cloned and characterized in the following years1. CDK11 has been used to refer to the protein encoded by three different human loci (CDC2L1, CDC2L2 and CDC2L6) in different publications. CDC2L1 and CDC2L2 are two highly similar human genes originated by duplication in human chromosome 1 and they encode almost identical protein kinases. Each of these loci encode at least two major peptides: a protein kinase of 110 kDa, and a smaller 58 kDa isoform that is expressed following alternative initiation of translation. To make the situation more complex, some publications refer to CDK11 as the protein encoded by CDC2L6, including the original description of the human kinome2 (Table S1). The single Cdc2l1 gene in the mouse genome encodes a protein more similar to human CDC2L2 (91% identity) than CDC2L1 (87%). We therefore propose the use of Cdk11 for the mouse gene and CDK11A and CDK11B for the human CDC2L2 and CDC2L1 loci, respectively (Figure 1 and Table S1). The corresponding human proteins should be referred to as CDK11Ap58 or CDK11Ap110 for the CDC2L2-encoded proteins, and CDK11Bp58 or CDK11Bp110 for the CDC2L1-encoded proteins. Two additional kinases, formerly known as Crk7 (CrkRS) and Ched (Cdc2L5), were recently renamed CDK12 and CDK13, as they were reported to interact with cyclin L1 and cyclin L23,4.
Additional members of the family, such as PCTAIRE or PFTAIRE proteins, CCRK or CDC2L6, although being very similar to other CDKs in their primary structure (Figure 1), and their predicted cyclin-binding domain (Figure S1) still maintain their original names (Table S1). Ironically, some these proteins have not received CDK names because their association with cyclins has not been demonstrated, and yet they are known by their predicted cyclin association element sequence (PFTAIRE or PCTAIRE; Figure S1). Although the strict ‘cyclin dependent’ rule has been useful during the gradual characterization of the CDK family, we feel that there is now enough evidence to group all these CDK-related proteins into a single protein family with sufficient sequence and functional similarities to justify the use of the ‘CDK’ term for all family members. PFTAIRE (PFTK) and PCTAIRE (PCTK) proteins are encoded by 5 genes more related to CDK1 than other classical family members such as CDK4 or CDK6 (Figure 1). Recent studies indicate that PFTK1 is activated by cyclin D3 and cyclin Y, and phosphorylates the retinoblastoma protein (pRb) (Refs 5,6 and Table S1). PFTK1 should therefore be renamed CDK14. PFTK2 displays 60% identity with CDK14/PFTK1 but no studies of this protein have been published since its identification in 2001. Based on conservation of the PFTAIRE motif and surrounding sequences, as well as overall similarity, APFTK2 seems very likely to have a cyclin partner and should therefore be renamed as CDK15. The related PCTAIRE kinases (PCTK1-3) are also poorly studied, but retain a cyclin binding motif and have some evidence for cyclin interaction. PCTK3 was found to interact with cyclin K in a large-scale interactions study7. The two other PCTK kinases also interact with p35, the major activator of CDK5 (Table S1). Given the interaction with cyclin K, p35 and the high similarity between PCTAIRE proteins and CDK5, we suggest that these proteins should be renamed as CDK16, CDK17 and CDK18. The same rationale can be applied to the remaining members of the family, CDC2L6 and CCRK. CDC2L6 is highly similar to CDK8, contains an identical SMSACRE motif and it is also a component of the Mediator complex8, suggesting not only sequence homology but also functional similarities. CDC2L6 form complexes with cyclin C although they interaction has not been described in detail (Table S1). CDK19 should therefore be used instead of CDC2L6. Finally, CCRK (also known as CAKp42) is selectively similar to CDK7, the catalytic subunit of the CDK-activating kinase (CAK) complex (Figure 1), and it has also been suggested to function similarly by phosphorylating and activating CDK2 (Refs. 9,10). We therefore propose to use the CDK20 symbol for this protein.
Although a few of these kinases do not have an identified cyclin partner, we believe that the CDK term should be applied to the whole family. First, it is quite likely that all these proteins are activated by cyclins or cyclin-related proteins, many of which are still poorly studied. Second, even if some family members are not activated by cyclins, this does not invalidate the use of the ‘CDK’ root to denominate all family members. There are multiple examples in gene nomenclature where some family members do not share the major features that originally defined that family. For example, 45 human protein kinases lack at least one of the conserved catalytic residues2 and may not have kinase activity. Kinase activity has not been detected for several CDK-related proteins but they are still known as PFTK2, PCTK2 or CCRK, where “K” stands for “kinase”. Finally, most ‘cyclins’ do not cycle.
The current nomenclature has probably contributed to the lack of consideration of the therapeutic value of the lesser-known CDKs, and also a failure to examine these protein kinases for specificity when developing CDK inhibitors. Protein names like PFTAIRE-1 do not encourage the consideration of the therapeutic value of these proteins, which are usually not tested for therapeutic purposes. In summary, we believe that sequence and functional relationships suggest that all of the 21 human CDKs should be given the ‘CDK’ prefix. This expanded CDK family nomenclature has also been agreed upon by the HGNC and the Mouse Genomic Nomenclature Committee, and will be listed in all the major databases for the human and mouse genes encoding the CDK family proteins. The analysis of sequence and functional trends in the CDK family may facilitate new research (e.g. compensatory functions in loss-off function studies) directed toward their relevance in the cell cycle, transcription or other cellular processes. In addition, this simplified nomenclature may increase awareness of their potential importance as therapeutic targets in human disease. Researchers and drug discoverers should keep the whole family picture in mind.