Search tips
Search criteria

Results 1-3 (3)

Clipboard (0)

Select a Filter Below

Year of Publication
Document Types
1.  Efficient Gaussian process regression for large datasets 
Biometrika  2012;100(1):75-89.
Gaussian processes are widely used in nonparametric regression, classification and spatiotemporal modelling, facilitated in part by a rich literature on their theoretical properties. However, one of their practical limitations is expensive computation, typically on the order of n3 where n is the number of data points, in performing the necessary matrix inversions. For large datasets, storage and processing also lead to computational bottlenecks, and numerical stability of the estimates and predicted values degrades with increasing n. Various methods have been proposed to address these problems, including predictive processes in spatial data analysis and the subset-of-regressors technique in machine learning. The idea underlying these approaches is to use a subset of the data, but this raises questions concerning sensitivity to the choice of subset and limitations in estimating fine-scale structure in regions that are not well covered by the subset. Motivated by the literature on compressive sensing, we propose an alternative approach that involves linear projection of all the data points onto a lower-dimensional subspace. We demonstrate the superiority of this approach from a theoretical perspective and through simulated and real data examples.
PMCID: PMC3712798  PMID: 23869109
Bayesian regression; Compressive sensing; Dimensionality reduction; Gaussian process; Random projection
2.  The genetic landscape of mutations in Burkitt lymphoma 
Nature genetics  2012;44(12):1321-1325.
Burkitt lymphoma is characterized by deregulation of MYC, but the contribution of other genetic mutations to the disease is largely unknown. Here, we describe the first completely sequenced genome from a Burkitt lymphoma tumor and germline DNA from the same affected individual. We further sequenced the exomes of 59 Burkitt lymphoma tumors and compared them to sequenced exomes from 94 diffuse large B-cell lymphoma (DLBCL) tumors. We identified 70 genes that were recurrently mutated in Burkitt lymphomas, including ID3, GNA13, RET, PIK3R1 and the SWI/SNF genes ARID1A and SMARCA4. Our data implicate a number of genes in cancer for the first time, including CCT6B, SALL3, FTCD and PC. ID3 mutations occurred in 34% of Burkitt lymphomas and not in DLBCLs. We show experimentally that ID3 mutations promote cell cycle progression and proliferation. Our work thus elucidates commonly occurring gene-coding mutations in Burkitt lymphoma and implicates ID3 as a new tumor suppressor gene.
PMCID: PMC3674561  PMID: 23143597

Results 1-3 (3)