Search tips
Search criteria 


Logo of narLink to Publisher's site
Nucleic Acids Res. 1994 June 25; 22(12): 2360–2365.
PMCID: PMC523695

HOVERGEN: a database of homologous vertebrate genes.


Comparison of homologous genes is a major step for many studies related to genome structure, function or evolution. Similarity search programs easily find genes homologous to a given sequence. However, only very tedious manual procedures allow the retrieval of all sets of homologous genes sequenced for a given set of species. Moreover, this search often generates errors due to the complexity of data to be managed simultaneously: phylogenetic trees, alignments, taxonomy, sequences and related information. HOVERGEN helps to solve these problems by integrating all this information. HOVERGEN corresponds to GenBank sequences from all vertebrate species, with some data corrected, clarified, or completed, notably to address the problem of redundancy. Coding sequences have been classified in gene families. Protein multiple alignments and phylogenetic trees have been calculated for each family. Sequences and related information have been structured in an ACNUC database which permits complex selections. A graphical interface has been developed to visualize and edit trees. Genes are displayed in color, according to their taxonomy. Users have directly access to all information attached to sequences and to multiple alignments simply by clicking on genes. This graphical tool gives thus a rapid and simple access to all data necessary to interpret homology relationships between genes. HOVERGEN allows the user to easily select sets of homologous vertebrate genes, and thus is particularly useful for comparative sequence analysis, or molecular evolution studies.

Full text

Full text is available as a scanned copy of the original print version. Get a printable copy (PDF file) of the complete article (1.5M), or click on a page image below to browse page by page. Links to PubMed are also available for Selected References.

Images in this article

Click on the image to see a larger version.

Selected References

These references are in PubMed. This may not be the complete list of references from this article.
  • Green P, Lipman D, Hillier L, Waterston R, States D, Claverie JM. Ancient conserved regions in new gene sequences and the protein databases. Science. 1993 Mar 19;259(5102):1711–1716. [PubMed]
  • Duret L, Dorkeld F, Gautier C. Strong conservation of non-coding sequences during vertebrates evolution: potential involvement in post-transcriptional regulation of gene expression. Nucleic Acids Res. 1993 May 25;21(10):2315–2322. [PMC free article] [PubMed]
  • Mouchiroud D, Bernardi G. Compositional properties of coding sequences and mammalian phylogeny. J Mol Evol. 1993 Aug;37(2):109–116. [PubMed]
  • Collins F, Galas D. A new five-year plan for the U.S. Human Genome Project. Science. 1993 Oct 1;262(5130):43–46. [PubMed]
  • O'hUigin C, Li WH. The molecular clock ticks regularly in muroid rodents and hamsters. J Mol Evol. 1992 Nov;35(5):377–384. [PubMed]
  • Goodman M, Czelusniak J, Koop BF, Tagle DA, Slightom JL. Globins: a case study in molecular phylogeny. Cold Spring Harb Symp Quant Biol. 1987;52:875–890. [PubMed]
  • Burks C, Cassidy M, Cinkosky MJ, Cumella KE, Gilna P, Hayden JE, Keen GM, Kelley TA, Kelly M, Kristofferson D, et al. GenBank. Nucleic Acids Res. 1991 Apr 25;19 (Suppl):2221–2225. [PMC free article] [PubMed]
  • Larsen F, Gundersen G, Lopez R, Prydz H. CpG islands as gene markers in the human genome. Genomics. 1992 Aug;13(4):1095–1107. [PubMed]
  • Bird A. The essentials of DNA methylation. Cell. 1992 Jul 10;70(1):5–8. [PubMed]
  • Mouchiroud D, D'Onofrio G, Aïssani B, Macaya G, Gautier C, Bernardi G. The distribution of genes in the human genome. Gene. 1991 Apr;100:181–187. [PubMed]
  • Li WH, Sadler LA. Low nucleotide diversity in man. Genetics. 1991 Oct;129(2):513–523. [PubMed]
  • Krawetz SA. Sequence errors described in GenBank: a means to determine the accuracy of DNA sequence interpretation. Nucleic Acids Res. 1989 May 25;17(10):3951–3957. [PMC free article] [PubMed]
  • Kristensen T, Lopez R, Prydz H. An estimate of the sequencing error frequency in the DNA sequence databases. DNA Seq. 1992;2(6):343–346. [PubMed]
  • Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990 Oct 5;215(3):403–410. [PubMed]
  • Higgins DG, Bleasby AJ, Fuchs R. CLUSTAL V: improved software for multiple sequence alignment. Comput Appl Biosci. 1992 Apr;8(2):189–191. [PubMed]
  • Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987 Jul;4(4):406–425. [PubMed]
  • Gouy M, Gautier C, Attimonelli M, Lanave C, di Paola G. ACNUC--a portable retrieval system for nucleic acid sequence databases: logical and physical designs and usage. Comput Appl Biosci. 1985 Sep;1(3):167–172. [PubMed]
  • Mouchiroud D, Gautier C. Codon usage changes and sequence dissimilarity between human and rat. J Mol Evol. 1990 Aug;31(2):81–91. [PubMed]
  • Wolfe KH, Sharp PM. Mammalian gene evolution: nucleotide sequence divergence between mouse and rat. J Mol Evol. 1993 Oct;37(4):441–456. [PubMed]
  • Bloch KD, Friedrich SP, Lee ME, Eddy RL, Shows TB, Quertermous T. Structural organization and chromosomal assignment of the gene encoding endothelin. J Biol Chem. 1989 Jun 25;264(18):10851–10857. [PubMed]
  • Inoue A, Yanagisawa M, Kimura S, Kasuya Y, Miyauchi T, Goto K, Masaki T. The human endothelin family: three structurally and pharmacologically distinct isopeptides predicted by three separate genes. Proc Natl Acad Sci U S A. 1989 Apr;86(8):2863–2867. [PubMed]
  • Barker WC, George DG, Hunt LT. Protein sequence database. Methods Enzymol. 1990;183:31–49. [PubMed]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press