TOR (
Target
of
rapaymcin) is a serine/theronine protein kinase and signaling protein that is conserved in all eukaryotes. TOR was initially identified in
Saccharomyces cerivisiae as a gene locus that confers drug resistance to the immunosuppressant rapamycin, an inhibitor of TOR kinase activity [
1-
5]. TOR is one of six members of the phosphoinositide-3-OH-related kinase (PIKK) protein family of eukaryotic signaling proteins that are essential for monitoring cellular nutrient supply, genome and transcript integrity, and regulating chromatin structure [
6-
8]. Importantly, TOR regulates both cell growth and cell cycle progression and is a major target for cancer therapeutics [
9-
11].
The overall domain structure of TOR is well conserved and is defined by a C-terminal PI3K (
phosphatidyl
inositol-
3-
kinase) catalytic domain that is immediately flanked by four different alpha helical domains [
8]. The amino terminus of the PI3K domain is flanked by a four-helix bundle called the FRB (
FKBP12
rapamycin
binding) domain and an extended helical region called the FAT (
FRAP,
ATM, and
TRRAP) domain [
2,
12,
13]. The FAT domain is conserved among PIKK family members and possibly folds into a combination of HEAT (Huntington, EF3A, ATM, TOR) and TPR (
Tetratricopeptide
repeat) repeats [
2,
12,
14,
15]. The C-terminus of TOR consists of the FATC (
FRAP,
ATM and
TRRAP C-terminal) domain, composed of one short and one long helix joined by a disulfide bond [
16]. The N-terminus of PIKKs, which is less conserved at the primary structure level, is composed almost entirely of tandem alpha helices that possibly fold into repeats [
14,
15].
Little is known about the structure and function of the repeat domains of PIKKs, but in other repeat-containing proteins these structures mediate protein-protein interactions. Sequence analysis of the PIKK family indicates that these proteins contain HEAT and TPR repeats in their HEAT and FAT domains [
14,
15]. However, the number and position of these repeats remains undetermined. Individual repeat units vary in length from 30 to 40 amino acid residues and number from as few as two repeats to more than dozens within a protein [
17,
18]. HEAT repeats consist of two antiparallel alpha helices, forming a helical hairpin ~30 amino acids in length. These repeats pack together in parallel via hydrophobic interactions between adjacent repeat helices and ultimately fold into an elegant superhelix [
19]. TPR repeats also consist of ~34 amino acids that fold into two alpha helices in an anti-parallel arrangement that in turn pack into a helical superstructure similar to that described for HEAT repeat-containing proteins [
20,
21].
Understanding the repeat architectures of TOR is key to understanding how this protein ultimately functions in the cell and how TOR interacts with its targets. A major obstacle to detailed structural and functional analysis of TOR and other members of the PIKK protein family is their large size (~280-470 KDa), which renders experimental manipulation difficult. To date, only three structures of TOR have been solved: two high resolution structures of the FRB and FATC domains and a low resolution EM structure of the full-length protein [
2,
13,
16,
22]. Structural and functional studies of PIKKs have also been hindered by poor understanding of their repeat architectures, as traditional web-based repeat prediction methods detect few to none of the repeats in their HEAT and FAT domains. The main reason for this is that the primary amino acid structures of protein repeats exhibit too much variation for accurate prediction. Therefore, the HEAT and FAT domains of PIKKs may contain more divergent protein repeats that would not be identified by sequence similarity alone.
For this reason, I developed a simple method capable of predicting structural repeats in TOR that could be used to guide structural and functional analysis. The strategy uses HHPred, a powerful and fast sequence homology detection method that predicts a new and potentially more complete depiction of the domain and repeat architecture of the TOR protein family. The HHpred strategy presented here performed significantly better than traditional web-based methods by predicting 100% of the HEAT and TPR repeats within a test set of proteins of known three dimensional (3D) structure. Based on these results, I predict that the HEAT and FAT domains of TOR fold into a continuous superhelix of 48 tandem HEAT and TPR repeats. Comparative sequence analysis of TOR proteins from divergent eukaryotic species provides further supporting evidence that TOR proteins from fly, plant, yeast, and humans fold into remarkably similar structures and exhibit several intriguing structural differences in their HEAT and FAT domains that are examined in this report. These findings were used to assemble a 3D structural model for the TOR protein.