PMCC PMCC

Search tips
Search criteria

Advanced
Results 1-3 (3)
 

Clipboard (0)
None

Select a Filter Below

Journals
Year of Publication
Document Types
1.  Using Prior Information from the Medical Literature in GWAS of Oral Cancer Identifies Novel Susceptibility Variant on Chromosome 4 - the AdAPT Method 
PLoS ONE  2012;7(5):e36888.
Background
Genome-wide association studies (GWAS) require large sample sizes to obtain adequate statistical power, but it may be possible to increase the power by incorporating complementary data. In this study we investigated the feasibility of automatically retrieving information from the medical literature and leveraging this information in GWAS.
Methods
We developed a method that searches through PubMed abstracts for pre-assigned keywords and key concepts, and uses this information to assign prior probabilities of association for each single nucleotide polymorphism (SNP) with the phenotype of interest - the Adjusting Association Priors with Text (AdAPT) method. Association results from a GWAS can subsequently be ranked in the context of these priors using the Bayes False Discovery Probability (BFDP) framework. We initially tested AdAPT by comparing rankings of known susceptibility alleles in a previous lung cancer GWAS, and subsequently applied it in a two-phase GWAS of oral cancer.
Results
Known lung cancer susceptibility SNPs were consistently ranked higher by AdAPT BFDPs than by p-values. In the oral cancer GWAS, we sought to replicate the top five SNPs as ranked by AdAPT BFDPs, of which rs991316, located in the ADH gene region of 4q23, displayed a statistically significant association with oral cancer risk in the replication phase (per-rare-allele log additive p-value [ptrend] = 2.5×10−3). The combined OR for having one additional rare allele was 0.83 (95% CI: 0.76–0.90), and this association was independent of previously identified susceptibility SNPs that are associated with overall UADT cancer in this gene region. We also investigated if rs991316 was associated with other cancers of the upper aerodigestive tract (UADT), but no additional association signal was found.
Conclusion
This study highlights the potential utility of systematically incorporating prior knowledge from the medical literature in genome-wide analyses using the AdAPT methodology. AdAPT is available online (url: http://services.gate.ac.uk/lld/gwas/service/config).
doi:10.1371/journal.pone.0036888
PMCID: PMC3360735  PMID: 22662130
2.  A Naïve Bayes Approach to Classifying Topics in Suicide Notes 
Biomedical Informatics Insights  2012;5(Suppl. 1):87-97.
The authors present a system developed for the 2011 i2b2 Challenge on Sentiment Classification, whose aim was to automatically classify sentences in suicide notes using a scheme of 15 topics, mostly emotions. The system combines machine learning with a rule-based methodology. The features used to represent a problem were based on lexico–semantic properties of individual words in addition to regular expressions used to represent patterns of word usage across different topics. A naïve Bayes classifier was trained using the features extracted from the training data consisting of 600 manually annotated suicide notes. Classification was then performed using the naïve Bayes classifier as well as a set of pattern–matching rules. The classification performance was evaluated against a manually prepared gold standard consisting of 300 suicide notes, in which 1,091 out of a total of 2,037 sentences were associated with a total of 1,272 annotations. The competing systems were ranked using the micro-averaged F-measure as the primary evaluation metric. Our system achieved the F-measure of 53% (with 55% precision and 52% recall), which was significantly better than the average performance of 48.75% achieved by the 26 participating teams.
doi:10.4137/BII.S8945
PMCID: PMC3409485  PMID: 22879764
natural language processing; sentiment analysis; topic classification; naïve Bayes classifier
3.  Temperature effect on tert-butyl alcohol (TBA) biodegradation kinetics in hyporheic zone soils 
Background
Remediation of tert-butyl alcohol (TBA) in subsurface waters should be taken into consideration at reformulated gasoline contaminated sites since it is a biodegradation intermediate of methyl tert-butyl ether (MTBE), ethyl tert-butyl ether (ETBE), and tert-butyl formate (TBF). The effect of temperature on TBA biodegradation has not been not been published in the literature.
Methods
Biodegradation of [U 14C] TBA was determined using hyporheic zone soil microcosms.
Results
First order mineralization rate constants of TBA at 5°C, 15°C and 25°C were 7.84 ± 0.14 × 10-3, 9.07 ± 0.09 × 10-3, and 15.3 ± 0.3 × 10-3 days-1, respectively (or 2.86 ± 0.05, 3.31 ± 0.03, 5.60 ± 0.14 years-1, respectively). Temperature had a statistically significant effect on the mineralization rates and was modelled using the Arrhenius equation with frequency factor (A) and activation energy (Ea) of 154 day-1 and 23,006 mol/J, respectively.
Conclusion
Results of this study are the first to determine mineralization rates of TBA for different temperatures. The kinetic rates determined in this study can be used in groundwater fate and transport modelling of TBA at the Ronan, MT site and provide an estimate for TBA removal at other similar shallow aquifer sites and hyporheic zones as a function of seasonal change in temperature.
doi:10.1186/1475-925X-6-34
PMCID: PMC2174489  PMID: 17877835

Results 1-3 (3)