PMCCPMCCPMCC

Search tips
Search criteria 

Advanced

 
Logo of bmcpsBioMed Centralsearchsubmit a manuscriptregisterthis articleBMC Plant Biology
 
BMC Plant Biol. 2012; 12: 94.
Published online Jun 19, 2012. doi:  10.1186/1471-2229-12-94
PMCID: PMC3444431
PeanutDB: an integrated bioinformatics web portal for Arachis hypogaea transcriptomics
Xiaohong Duan,1 Emily Schmidt,2,3 Pei Li,2 Douglas Lenox,2,3 Lin Liu,2 Changlong Shu,1 Jie Zhang,corresponding author1 and Chun Liangcorresponding author2,3
1State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
2Department of Botany, Miami University, Oxford, OH, 45056, USA
3Department of Computer Science and Software Engineering, Miami University, Oxford, OH, 45056, USA
corresponding authorCorresponding author.
Xiaohong Duan: duanhong1013/at/163.com; Emily Schmidt: schmidee/at/muohio.edu; Pei Li: peil/at/muohio.edu; Douglas Lenox: lenoxdj/at/muohio.edu; Lin Liu: liul2/at/muohio.edu; Changlong Shu: clshu/at/ippcaas.cn; Jie Zhang: jiezhang/at/caas.net.cn; Chun Liang: liangc/at/muohio.edu
Received February 5, 2012; Accepted June 5, 2012.
Abstract
Background
The peanut (Arachis hypogaea) is an important crop cultivated worldwide for oil production and food sources. Its complex genetic architecture (e.g., the large and tetraploid genome possibly due to unique cross of wild diploid relatives and subsequent chromosome duplication: 2n = 4x = 40, AABB, 2800 Mb) presents a major challenge for its genome sequencing and makes it a less-studied crop. Without a doubt, transcriptome sequencing is the most effective way to harness the genome structure and gene expression dynamics of this non-model species that has a limited genomic resource.
Description
With the development of next generation sequencing technologies such as 454 pyro-sequencing and Illumina sequencing by synthesis, the transcriptomics data of peanut is rapidly accumulated in both the public databases and private sectors. Integrating 187,636 Sanger reads (103,685,419 bases), 1,165,168 Roche 454 reads (333,862,593 bases) and 57,135,995 Illumina reads (4,073,740,115 bases), we generated the first release of our peanut transcriptome assembly that contains 32,619 contigs. We provided EC, KEGG and GO functional annotations to these contigs and detected SSRs, SNPs and other genetic polymorphisms for each contig. Based on both open-source and our in-house tools, PeanutDB presents many seamlessly integrated web interfaces that allow users to search, filter, navigate and visualize easily the whole transcript assembly, its annotations and detected polymorphisms and simple sequence repeats. For each contig, sequence alignment is presented in both bird’s-eye view and nucleotide level resolution, with colorfully highlighted regions of mismatches, indels and repeats that facilitate close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors.
Conclusion
As a public genomic database that integrates peanut transcriptome data from different sources, PeanutDB (http://bioinfolab.muohio.edu/txid3818v1) provides the Peanut research community with an easy-to-use web portal that will definitely facilitate genomics research and molecular breeding in this less-studied crop.
Keywords: Peanut, Arachis hypogaea, Transcriptome sequencing, Transcriptome assembly, Database, PeanutDB, SNP, SSR, Functional annotation
Articles from BMC Plant Biology are provided here courtesy of
BioMed Central