dbSNP can be searched directly or via other NCBI resources that
comprise the NCBI discovery space as illustrated in Figure . Direct searching can be done by submitter
handle (laboratory), new batches of submissions, identification
method used, population type studied, publication title, level of
population variation or STS mapping information. As an integrated
part of NCBI, the contents of dbSNP are cross-linked to records
in other information resources such as GenBank, LocusLink, the human
genome sequence and PubMed. The result sets from queries in any
of these resources will point the user back to the relevant records
in dbSNP.
BLAST. dbSNP can be searched
with the standard BLAST algorithm that will compare a user-submitted
sequence against all flanking sequence records in dbSNP. The BLAST
service is provided on the dbSNP homepage, rather than the general NCBI
BLAST page.
LocusLink. dbSNP can also be
queried by integrating it with other NCBI resources. Via LocusLink,
queries can be done by gene name or nomenclature association. Query
results from the LocusLink database will show a purple ‘V’ button
in SNP records have been mapped to the gene. Clicking on the ‘V’ will lead
to a list of the reference SNP records for any gene in the LocusLink
database.
Entrez. Selecting the ‘Linkout’ display
format from the pull-down menu beneath the query bar will provide
a hypertext link to dbSNP (labeled as ‘NCBI variation database’)
when dbSNP submissions have been mapped to the queried sequence. Following
the link will lead to the list of reference SNP records similar
to those for LocusLink.
Genome sequence. The NCBI genome viewer can
be set to show ‘variations’ as a sequenced-based
map. This map is aligned in common sequence-based coordinates with
other sequence features such as gene regions, STS markers, reference
sequence contigs and clone sequences. By clicking ‘display
options’ the user may select the variation map as the ‘master’ map
and ‘show verbose’. In this configuration, the full
annotation of variations on the genome sequence are visible. The
extended display shows ‘at-a-glance’ indicators of:
quality of the mapped location of the variation; functional classification
(locus-region, transcript, or coding region) if variation is in
a gene region; 95% confidence interval for average heterozygosity;
quality of marker (validation or success rate probability); genotypes
available; and submitter linkouts available.