PMCCPMCCPMCC

Search tips
Search criteria 

Advanced

 
Logo of bmcgenoBioMed Centralsearchsubmit a manuscriptregisterthis articleBMC Genomics
 
BMC Genomics. 2009; 10: 4.
Published online Jan 6, 2009. doi:  10.1186/1471-2164-10-4
PMCID: PMC2637895
Mining for single nucleotide polymorphisms in pig genome sequence data
Hindrik HD Kerstens,1 Sonja Kollers,2 Arun Kommadath,1 Marisol del Rosario,1 Bert Dibbits,1 Sylvia M Kinders,1 Richard P Crooijmans,1 and Martien AM Groenencorresponding author1
1Animal Breeding and Genetics Group, Wageningen University, PO Box 9101, Wageningen, 6701 BH, the Netherlands
2IPG, Institute for Pig Genetics, PO Box 43, Beuningen, 6640 AA, the Netherlands
corresponding authorCorresponding author.
Hindrik HD Kerstens: hindrik.kerstens/at/wur.nl; Sonja Kollers: sonjakollers/at/gmx.de; Arun Kommadath: arun.kommadath/at/wur.nl; Marisol del Rosario: marisol_del_rosario/at/hotmail.com; Bert Dibbits: bert.dibbits/at/wur.nl; Sylvia M Kinders: sylvia.kinders/at/wur.nl; Richard P Crooijmans: richard.crooijmans/at/wur.nl; Martien AM Groenen: martien.groenen/at/wur.nl
Received September 26, 2008; Accepted January 6, 2009.
Abstract
Background
Single nucleotide polymorphisms (SNPs) are ideal genetic markers due to their high abundance and the highly automated way in which SNPs are detected and SNP assays are performed. The number of SNPs identified in the pig thus far is still limited.
Results
A total of 4.8 million whole genome shotgun sequences obtained from the NCBI trace-repository with center name "SDJVP", and project name "Sino-Danish Pig Genome Project" were analysed for the presence of SNPs. Available BAC and BAC-end sequences and their naming and mapping information, all obtained from SangerInstitute FTP site, served as a rough assembly of a reference genome. In 1.2 Gb of pig genome sequence, we identified 98,151 SNPs in which one of the sequences in the alignment represented the polymorphism and 6,374 SNPs in which two sequences represent an identical polymorphism. To benchmark the SNP identification method, 163 SNPs, in which the polymorphism was represented twice in the sequence alignment, were selected and tested on a panel of three purebred boar lines and wild boar. Of these 163 in silico identified SNPs, 134 were shown to be polymorphic in our animal panel.
Conclusion
This SNP identification method, which mines for SNPs in publicly available porcine shotgun sequences repositories, provides thousands of high quality SNPs. Benchmarking in an animal panel showed that more than 80% of the predicted SNPs represented true genetic variation.
Articles from BMC Genomics are provided here courtesy of
BioMed Central