Nucleic Acids Res. 1997 March 15; 25(6): 1087–1093.
PMCID: PMC146560

Compilation and analysis of intein sequences.


We have compiled a list of all the inteins (protein splicing elements) whose sequences had been published or were available from on-line sequence databases as of September 18, 1996. Analysis of the 36 available intein sequences refines the previously described intein motifs and reveals the presence of another intein motif, Block H. Furthermore, analysis of the new inteins reshapes our view of the conserved splice junction residues, since three inteins lack the penultimate His of the intein seen in prior examples. Comparison of intein sequences suggests that, in general: (i) inteins present at the same location within extein homologs from different organisms are very closely related to each other in paired sequence comparison or phylogenetic analysis, and we suggest that they be considered intein alleles; (ii) multiple inteins present in the same gene are no more similar to each other than to inteins present in different genes; and (iii) inteins are so divergent that phylogenetic trees with statistically significant branches cannot be generated except for intein alleles.
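
The paired sequence comparison mentioned above can be illustrated with a minimal sketch. It assumes the sequences are already aligned to equal length and scores percent identity over ungapped columns. The sequence names and residues below are invented for illustration and are not real intein sequences, and this is not necessarily the scoring method used in the article.

```python
from itertools import combinations


def percent_identity(a: str, b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    # Keep only columns in which both sequences carry a residue.
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not compared:
        return 0.0
    matches = sum(x == y for x, y in compared)
    return 100.0 * matches / len(compared)


def pairwise_table(seqs: dict) -> dict:
    """Percent identity for every unordered pair of named sequences."""
    return {
        (n1, n2): percent_identity(s1, s2)
        for (n1, s1), (n2, s2) in combinations(seqs.items(), 2)
    }


# Hypothetical, made-up aligned fragments (NOT real intein sequences).
aligned = {
    "geneA_site1_org1": "CLAGDTLITLA",
    "geneA_site1_org2": "CLAGDTLVTLA",
    "geneB_site1_org3": "CFSKGTNV-MA",
}

if __name__ == "__main__":
    for (n1, n2), pid in pairwise_table(aligned).items():
        print(f"{n1} vs {n2}: {pid:.1f}% identity")
```

In this toy input, the two fragments labeled as occupying the same site in the same gene score far higher against each other than against the third, which is the kind of pattern the abstract describes for intein alleles.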



Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press