New search engine
Each online gene record contains 23 fields, with 14 links to other relevant resources including: Ensembl (
2), GENATLAS (
3), GeneCards (
4), GeneClinics/GeneTests (
http://www.genetests.org), the international ImMunoGeneTics database® (IMGT) (
5), LocusLink (
6), MGD (
7), OMIM (
8), Ref_Seq (
6) and Swiss-Prot (
9).
The new Genew search engine has received a total of 422 113 hits (since July 2002), with an average of 31 038 hits per month. Table gives an indication of how many of these hits are followed by searches of the database.
| Table 1.Use of the Genew search engine (July 2002–August 2003) |
We also monitor the top 20 search terms used, as this assists us in developing both a more user-friendly search engine and a better understanding of commonly used (but possibly not approved) gene symbols. Table shows the total number of searches for the top 20 search terms and their approved symbols (which are the same in all bar one case: TP53 is the approved symbol for ‘p53’).
| Table 2.Top 20 search terms used in the Genew search engine |
Non-human orthologues
With increased requests for gene symbols in other species, we have added a new gene status, ‘Approved Non-Human’. This currently includes 98 entries that we have approved in order to maintain the orthologous symbol in the human gene family series. It is quite likely that most of these genes will ultimately be found in the human genome. Each ‘Approved Non-Human’ gene symbol has links to the appropriate non-human sequence accession ID where possible. The orthologous species currently include: mouse, cow, rat, African clawed toad, pig, zebrafish and dog.
LocusLink updates
In order to update correctly the LocusLink entries with approved gene symbols we have added a new field designated<! COMMENT -- sgml op. please revert these next three commands (/p, p, zzsection_fullout) to a normal variable space -- KB>
‘Locus Type’. This includes designations such as:
(i) gene with no protein product;
(ii) model, supported by EST alignments;
(iii) phenotype only;
(iv) pseudogene;
(v) RNA, ribosomal.
Confidential gene records
Unnamed genes are placed into the confidential section of Genew (known previously as ‘pending’). This includes those genes that have been submitted by authors and/or journals for symbol approval prior to publication. In addition, we have further increased this resource with unnamed genes from two major public data sets: the ‘Interim’ human genes from LocusLink and the interim mouse genes from MGD which are updated once a week. There are now just over 3000 unnamed gene records awaiting approval.