Sun, Bingyun | Ma, Li | Yan, Xiaowei | Lee, Denis | Alexander, Vinita | Hohmann, Laura J. | Lorang, Cynthia | Chandrasena, Lalangi | Tian, Qiang | Hood, Leroy | Hong, Wanjin
E14.Tg2a mouse embryonic stem (mES) cells are a widely used host in gene trap and gene targeting techniques. Molecular characterization of host cells will provide background information for a better understanding of functions of the knockout genes. Using a highly selective glycopeptide-capture approach but ordinary liquid chromatography coupled mass spectrometry (LC-MS), we characterized the N-glycoproteins of E14.Tg2a cells and analyzed the close relationship between the obtained N-glycoproteome and cell-surface proteomes. Our results provide a global view of cell surface protein molecular properties, in which receptors seem to be much more diverse but lower in abundance than transporters on average. In addition, our results provide a systematic view of the E14.Tg2a N-glycosylation, from which we discovered some striking patterns, including an evolutionarily preserved and maybe functionally selected complementarity between N-glycosylation and the transmembrane structure in protein sequences. We also observed an environmentally influenced N-glycosylation pattern among glycoenzymes and extracellular matrix proteins. We hope that the acquired information enhances our molecular understanding of mES E14.Tg2a as well as the biological roles played by N-glycosylation in cell biology in general.
doi:10.1371/journal.pone.0055722
PMCID: PMC3565968
PMID: 23405203
Rheumatoid arthritis (RA) is a chronic autoimmune disease that primarily attacks synovial joints. Despite the advances in diagnosis and treatment of RA, novel molecular targets are still needed to improve the accuracy of diagnosis and the therapeutic outcomes. Here, we present a systems approach that can effectively 1) identify core RA-associated genes (RAGs), 2) reconstruct RA-perturbed networks, and 3) select potential targets for diagnosis and treatments of RA. By integrating multiple gene expression datasets previously reported, we first identified 983 core RAGs that show RA dominant differential expression, compared to osteoarthritis (OA), in the multiple datasets. Using the core RAGs, we then reconstructed RA-perturbed networks that delineate key RA associated cellular processes and transcriptional regulation. The networks revealed that synovial fibroblasts play major roles in defining RA-perturbed processes, anti-TNF-α therapy restored many RA-perturbed processes, and 19 transcription factors (TFs) have major contribution to deregulation of the core RAGs in the RA-perturbed networks. Finally, we selected a list of potential molecular targets that can act as metrics or modulators of the RA-perturbed networks. Therefore, these network models identify a panel of potential targets that will serve as an important resource for the discovery of therapeutic targets and diagnostic markers, as well as providing novel insights into RA pathogenesis.
doi:10.1371/journal.pone.0051508
PMCID: PMC3519858
PMID: 23240033
Summary: With the rapidly expanding availability of data from personal genomes, exomes and transcriptomes, medical researchers will frequently need to test whether observed genomic variants are novel or known. This task requires downloading and handling large and diverse datasets from a variety of sources, and processing them with bioinformatics tools and pipelines. Alternatively, researchers can upload data to online tools, which may conflict with privacy requirements. We present here Kaviar, a tool that greatly simplifies the assessment of novel variants. Kaviar includes: (i) an integrated and growing database of genomic variation from diverse sources, including over 55 million variants from personal genomes, family genomes, transcriptomes, SNV databases and population surveys; and (ii) software for querying the database efficiently.
Availability: Kaviar is programmed in Perl and offered free of charge as Open Source Software. Kaviar may be used online as a programmatic web service or downloaded for local use from http://db.systemsbiology.net/kaviar. The database is also provided.
Contact: gustavo@systemsbiology.org
Supplementary Information: Supplementary data are available at Bioinformatics online.
doi:10.1093/bioinformatics/btr540
PMCID: PMC3208392
PMID: 21965822
Han, Xu | Fang, Xuefeng | Lou, Xiaoyan | Hua, Dasong | Ding, Wenchao | Foltz, Gregory | Hood, Leroy | Yuan, Ying | Lin, Biaoyang | Klymkowsky, Michael
SOX2 is an important stem cell marker and plays important roles in development and carcinogenesis. However, the role of SOX2 in Epithelial-Mesenchymal Transition has not been investigated. We demonstrated, for the first time, that SOX2 is involved in the Epithelial-Mesenchymal Transition (EMT) process as knock downof SOX2 in colorectal cancer (CRC) SW620 cells induced a Mesenchymal-Epithelial Transition (MET) process with recognized changes in the expression of key genes involved in the EMT process including E-cadherin and vimentin. In addition, we provided a link between SOX2 activity and the WNT pathway by showing that knock down of SOX2 reduced the WNT pathway activity in colorectal cancer (CRC) cells. We further demonstrated that SOX2 is involved in cell migration and invasion in vitro and in metastasis in vivo for CRC cells, and that the process might be mediated through the MMP2 activity. Finally, an IHC analysis of 44 cases of colorectal cancer patients suggested that SOX2 is a prognosis marker for metastasis of colorectal cancers.
doi:10.1371/journal.pone.0041335
PMCID: PMC3422347
PMID: 22912670
Cell
2011;144(6):860-863.
Complexity is the grand challenge for science and engineering in the 21st century. We suggest that biology is a discipline that is uniquely situated to tackle complexity, through a diverse array of technologies for characterizing molecular structure, interactions and function. A major difficulty in the analysis of complex biological systems is dealing with the low signal-to-noise inherent to nearly all large-scale biological data sets. We discuss powerful bioinformatic concepts for boosting signal-to-noise through external knowledge incorporated in processing units we call Filters and Integrators. These concepts are illustrated in four landmark studies that have provided model implementations of Filters, Integrators, or both.
doi:10.1016/j.cell.2011.03.007
PMCID: PMC3102020
PMID: 21414478
Kim, Yongsoo | Kim, Taek-Kyun | Kim, Yungu | Yoo, Jiho | You, Sungyong | Lee, Inyoul | Carlson, George | Hood, Leroy | Choi, Seungjin | Hwang, Daehee
Motivation: Systems biology attempts to describe complex systems behaviors in terms of dynamic operations of biological networks. However, there is lack of tools that can effectively decode complex network dynamics over multiple conditions.
Results: We present principal network analysis (PNA) that can automatically capture major dynamic activation patterns over multiple conditions and then generate protein and metabolic subnetworks for the captured patterns. We first demonstrated the utility of this method by applying it to a synthetic dataset. The results showed that PNA correctly captured the subnetworks representing dynamics in the data. We further applied PNA to two time-course gene expression profiles collected from (i) MCF7 cells after treatments of HRG at multiple doses and (ii) brain samples of four strains of mice infected with two prion strains. The resulting subnetworks and their interactions revealed network dynamics associated with HRG dose-dependent regulation of cell proliferation and differentiation and early PrPSc accumulation during prion infection.
Availability: The web-based software is available at: http://sbm.postech.ac.kr/pna.
Contact: dhhwang@postech.ac.kr; seungjin@postech.ac.kr
Supplementary information: Supplementary data are available at Bioinformatics online.
doi:10.1093/bioinformatics/btq670
PMCID: PMC3031040
PMID: 21193522
Next-generation sequencing (NGS) technologies-based transcriptomic profiling method often called RNA-seq has been widely used to study global gene expression, alternative exon usage, new exon discovery, novel transcriptional isoforms and genomic sequence variations. However, this technique also poses many biological and informatics challenges to extracting meaningful biological information. The RNA-seq data analysis is built on the foundation of high quality initial genome localization and alignment information for RNA-seq sequences. Toward this goal, we have developed RNASEQR to accurately and effectively map millions of RNA-seq sequences. We have systematically compared RNASEQR with four of the most widely used tools using a simulated data set created from the Consensus CDS project and two experimental RNA-seq data sets generated from a human glioblastoma patient. Our results showed that RNASEQR yields more accurate estimates for gene expression, complete gene structures and new transcript isoforms, as well as more accurate detection of single nucleotide variants (SNVs). RNASEQR analyzes raw data from RNA-seq experiments effectively and outputs results in a manner that is compatible with a wide variety of specialized downstream analyses on desktop computers.
doi:10.1093/nar/gkr1248
PMCID: PMC3315322
PMID: 22199257
MicroRNAs (miRNAs) are a recently discovered class of small, non-coding RNAs that regulate protein levels post-transcriptionally. miRNAs play important regulatory roles in many cellular processes, including differentiation, neoplastic transformation, and cell replication and regeneration. Because of these regulatory roles, it is not surprising that aberrant miRNA expression has been implicated in several diseases. Recent studies have reported significant levels of miRNAs in serum and other body fluids, raising the possibility that circulating miRNAs could serve as useful clinical biomarkers. Here, we provide a brief overview of miRNA biogenesis and function, the identification and potential roles of circulating extracellular miRNAs, and the prospective uses of miRNAs as clinical biomarkers. Finally, we address several issues associated with the accurate measurement of miRNAs from biological samples.
doi:10.1016/j.mrfmmm.2011.03.004
PMCID: PMC3199035
PMID: 21402084
Westaway, David | Genovesi, Sacha | Daude, Nathalie | Brown, Rebecca | Lau, Agnes | Lee, Inyoul | Mays, Charles E. | Coomaraswamy, Janaky | Canine, Brenda | Pitstick, Rose | Herbst, Allen | Yang, Jing | Ko, Kerry W. S. | Schmitt-Ulms, Gerold | DeArmond, Stephen J. | McKenzie, Debbie | Hood, Leroy | Carlson, George A. | Mabbott, Neil A.
During prion infections of the central nervous system (CNS) the cellular prion protein, PrPC, is templated to a conformationally distinct form, PrPSc. Recent studies have demonstrated that the Sprn gene encodes a GPI-linked glycoprotein Shadoo (Sho), which localizes to a similar membrane environment as PrPC and is reduced in the brains of rodents with terminal prion disease. Here, analyses of prion-infected mice revealed that down-regulation of Sho protein was not related to Sprn mRNA abundance at any stage in prion infection. Down-regulation was robust upon propagation of a variety of prion strains in Prnpa and Prnpb mice, with the exception of the mouse-adapted BSE strain 301 V. In addition, Sho encoded by a TgSprn transgene was down-regulated to the same extent as endogenous Sho. Reduced Sho levels were not seen in a tauopathy, in chemically induced spongiform degeneration or in transgenic mice expressing the extracellular ADan amyloid peptide of familial Danish dementia. Insofar as prion-infected Prnp hemizygous mice exhibited accumulation of PrPSc and down-regulation of Sho hundreds of days prior to onset of neurologic symptoms, Sho depletion can be excluded as an important trigger for clinical disease or as a simple consequence of neuronal damage. These studies instead define a disease-specific effect, and we hypothesize that membrane-associated Sho comprises a bystander substrate for processes degrading PrPSc. Thus, while protease-resistant PrP detected by in vitro digestion allows post mortem diagnosis, decreased levels of endogenous Sho may trace an early response to PrPSc accumulation that operates in the CNS in vivo. This cellular response may offer new insights into the homeostatic mechanisms involved in detection and clearance of the misfolded proteins that drive prion disease pathogenesis.
Author Summary
In prion infections of the nervous system the cellular prion protein, PrPC, changes to a distinct form, PrPSc. Recent studies have demonstrated that another glycoprotein Shadoo (Sho), which occupies a similar membrane environment as PrPC, is reduced in the brains of rodents with terminal prion disease. Our analyses of prion-infected mice revealed that reduction of Sho protein was not due to reductions in the corresponding messenger RNA. Reduction in Sho was clearly evident upon propagation of a variety of prion strains, but was not seen in mice with other types of neurodegenerative disease. Also, as prion-infected mice with only one copy of the PrP gene exhibited both accumulation of PrPSc and a reduction of Sho protein hundreds of days prior to onset of neurologic symptoms, the drop in Sho protein level can be excluded as an important trigger for clinical disease, or a non-specific consequence of brain cell damage. Instead, our studies define a effect restricted to prion disease and we hypothesize that Sho protein is a “bystander” for degradative processes aimed at destroying PrPSc.
doi:10.1371/journal.ppat.1002391
PMCID: PMC3219720
PMID: 22114562
Summary
An endogenous molecular-cellular network for both normal and abnormal functions is assumed to exist. This endogenous network forms a nonlinear stochastic dynamical system, with many stable attractors in its functional landscape. Normal or abnormal robust states can be decided by this network in a manner similar to the neural network. In this context cancer is hypothesized as one of its robust intrinsic states.
This hypothesis implies that a nonlinear stochastic mathematical cancer model is constructible based on available experimental data and its quantitative prediction is directly testable. Within such model the genesis and progression of cancer may be viewed as stochastic transitions between different attractors. Thus it further suggests that progressions are not arbitrary. Other important issues on cancer, such as genetic vs epigenetics, double-edge effect, dormancy, are discussed in the light of present hypothesis. A different set of strategies for cancer prevention, cure, and care, is therefore suggested.
doi:10.1016/j.mehy.2007.03.043
PMCID: PMC3166219
PMID: 17766049
The pioneering work of Jean Dausset on the HLA system established several principles that were later reflected in the Human Genome Project and contributed to the foundations of predictive, preventive, personalized and participatory (P4) medicine. To effectively develop systems medicine, we should take advantage of the lessons of the HLA saga, emphasizing the importance of exploring a fascinating but mysterious biology, now using systems principles, pioneering new technology developments and creating shared biological and information resources.
doi:10.1186/gm178
PMCID: PMC2945014
PMID: 20804580
Objective
To understand the chemotherapy response program in ovarian cancer cells at deep transcript sequencing levels.
Methods
Two next-generation sequencing technologies—MPSS (massively parallel signature sequencing) and SBS (sequencing by synthesis) — were used to sequence the transcripts of IGROV1 and IGROV1-CP cells, and to sequence the transcripts of a highly chemotherapy responsive and a highly chemotherapy resistant ovarian cancer tissue.
Results
We identified 3,422 signatures (2957 genes) that are significantly different between IGROV1 and IGROV1-CP cells (P <0.001). Gene Ontology (GO) term GO:0001837 (epithelial to mesenchymal transition) and GO:0034330 (cell junction assembly and maintenance) are enriched in genes that are over expressed in IGROV1-CP cells while apoptosis related GO terms are enriched in genes over expressed in IGROV1 cells. We identified 1,187 tags (corresponding to 1,040 genes) that are differentially expressed between the chemotherapy responsive and the persistently chemotherapy resistant ovarian cancer tissues. GO term GO:0050673 (epithelial cell proliferation) and GO:0050678 (regulation of epithelial cell proliferation) are enriched in the genes over expressed in the chemotherapy resistant tissue while the GO:0007229 (integrin-mediated signaling pathway) is enriched in the genes over expressed in the chemotherapy sensitive tissue. An integrative analysis identified 111 common differentially expressed genes including two bone morphogenetic proteins (BMP4 and BMP7), six solute carrier proteins (SLC10A3, SLC16A3, SLC25A1, SLC35B3, SLC7A5 and SLC7A7), transcription factor POU5F1 (POU class 5 homeobox 1), and KLK10 (kallikrein-related peptidase 10). A network analysis revealed a subnetwork with three gene BMP7, NR2F2 and AP2B1 that were consistently over expressed in the chemoresistant tissue or cells compared to the chemosensitive tissue or cells.
Conclusion
Our database offers the first comprehensive view of the digital transcriptomes of ovarian cancer cell lines and tissues with different chemotherapy response phenotypes.
doi:10.1016/j.ygyno.2010.01.041
PMCID: PMC2849907
PMID: 20181382
Roach, Jared C. | Glusman, Gustavo | Smit, Arian F.A. | Huff, Chad D. | Hubley, Robert | Shannon, Paul T. | Rowen, Lee | Pant, Krishna P. | Goodman, Nathan | Bamshad, Michael | Shendure, Jay | Drmanac, Radoje | Jorde, Lynn B. | Hood, Leroy | Galas, David J.
We analyzed the whole genome sequences of a family of four, consisting of two siblings and their parents. Family-based sequencing allowed us to delineate recombination sites precisely, identify 70% of the sequencing errors, and identify very rare SNVs. We also directly estimated a human intergeneration mutation rate of ∼1.1×10-8 per position per haploid genome. Both offspring in this family have two recessive disorders--Miller syndrome, for which the gene was concurrently identified, and primary ciliary dyskinesia, for which causative genes have been previously identified. Family-based genome analysis enabled us to narrow the candidate genes for both of these Mendelian disorders to only four. Our results demonstrate the unique value of complete genome sequencing in families.
doi:10.1126/science.1186802
PMCID: PMC3037280
PMID: 20220176
whole genome sequencing; rare genetic disease; inheritance analysis; recessive models; de novo mutations; recombination hotspot; crossover; haploidentity; haploidentical block; inheritance state; inheritance vector; HMM; haplotype; Miller syndrome; POADS; DHODH; DNAH5; KIAA0556; CES1
Cancer stem cells (CSC), also called tumor initiating cells (TIC), are considered to be the origin of replicating malignant tumor cells in a variety of human cancers. Their presence in the tumor may herald malignancy potential, mediate resistance to conventional chemotherapy or radiotherapy, and confer poor survival outcomes. Thus, CSC may serve as critical cellular targets for treatment. The ability to therapeutically target CSC hinges upon identifying their unique cell surface markers and the underlying survival signaling pathways. While accumulating evidence suggests cell-surface antigens (such as CD44, CD133) as CSC markers for several tumor tissues, emerging clinical needs exist for the identification of new markers to completely separate CSC from normal stem cells. Recent studies have demonstrated the critical role of the tumor suppressor PTEN/PI3 kinase pathway in regulating TIC in leukemia, brain, and intestinal tissues. The successful eradication of tumors by therapies targeting CSC will require an in-depth understanding of the molecular mechanisms governing CSC self renewal, differentiation, and escape from conventional therapy. Here we review recent progress from brain tumor and intestinal stem cell research with a focus on the PTEN-Akt-Wnt pathway, and how the components of CSC pathways may serve as biomarkers for diagnosis, prognosis, and therapeutics.
PMCID: PMC3034385
PMID: 19442199
cancer stem cells; cell surface marker; CD133; PTEN; Akt; Wnt
Systems biology is an approach to the science that views biology as an information science, studies biological systems as a whole and their interactions with the environment. This approach, for the reasons described here, has particular power in the search for informative diagnostic biomarkers of diseases because it focuses on the fundamental causes and keys on the identification and understanding of disease- perturbed molecular networks. In this review, we describe some recent developments that have used systems biology to address complex diseases – prion disease and drug induced liver injury- and use these as examples to illustrate the importance of understanding network structure and dynamics. The knowledge of network dynamics through in vitro experimental perturbation and modeling allows us to determine the state of the networks, to identify molecular correlates, and to derive new disease treatment approaches to reverse the pathology or prevent its progress into a more severe state through the manipulation of network states. This general approach, including diagnostics and therapeutics, is becoming known as systems medicine.
doi:10.3233/DMA-2010-0697
PMCID: PMC3021550
PMID: 20534905
Systems biology; biomarkers; systems medicine; prion disease; drug induced liver injury; microRNA; organ-specific proteins
Fang, Xuefeng | Yoon, Jae-Geun | Li, Lisha | Yu, Wei | Shao, Jiaofang | Hua, Dasong | Zheng, Shu | Hood, Leroy | Goodlett, David R | Foltz, Gregory | Lin, Biaoyang
Background
SOX2 is a key gene implicated in maintaining the stemness of embryonic and adult stem cells. SOX2 appears to re-activate in several human cancers including glioblastoma multiforme (GBM), however, the detailed response program of SOX2 in GBM has not yet been defined.
Results
We show that knockdown of the SOX2 gene in LN229 GBM cells reduces cell proliferation and colony formation. We then comprehensively characterize the SOX2 response program by an integrated analysis using several advanced genomic technologies including ChIP-seq, microarray profiling, and microRNA sequencing. Using ChIP-seq technology, we identified 4883 SOX2 binding regions in the GBM cancer genome. SOX2 binding regions contain the consensus sequence wwTGnwTw that occurred 3931 instances in 2312 SOX2 binding regions. Microarray analysis identified 489 genes whose expression altered in response to SOX2 knockdown. Interesting findings include that SOX2 regulates the expression of SOX family proteins SOX1 and SOX18, and that SOX2 down regulates BEX1 (brain expressed X-linked 1) and BEX2 (brain expressed X-linked 2), two genes with tumor suppressor activity in GBM. Using next generation sequencing, we identified 105 precursor microRNAs (corresponding to 95 mature miRNAs) regulated by SOX2, including down regulation of miR-143, -145, -253-5p and miR-452. We also show that miR-145 and SOX2 form a double negative feedback loop in GBM cells, potentially creating a bistable system in GBM cells.
Conclusions
We present an integrated dataset of ChIP-seq, expression microarrays and microRNA sequencing representing the SOX2 response program in LN229 GBM cells. The insights gained from our integrated analysis further our understanding of the potential actions of SOX2 in carcinogenesis and serves as a useful resource for the research community.
doi:10.1186/1471-2164-12-11
PMCID: PMC3022822
PMID: 21211035
Li, Runsheng | Guo, Yan | Han, Bang Ming | Yan, Xiaowei | Utleg, Angelita G. | Li, Wei | Tu, Lan Chun | Wang, Jian | Hood, Leroy | Xia, Shujie | Lin, Biaoyang
Expressed prostatic secretions (EPS) contain proteins of prostate origin that may reflect the health status of the prostate and be used as diagnostic markers for prostate diseases including prostatitis, benign prostatic hyperplasia, and prostate cancer. Despite their importance and potential applications, a complete catalog of EPS proteins is not yet available. We, therefore, undertook a comprehensive analysis of the EPS proteome using 2-D micro-LC combined with MS/MS. Using stringent filtering criteria, we identified a list of 114 proteins with at least two unique-peptide hits and an additional 75 proteins with only a single unique-peptide hit. The proteins identified include kallikrein 2 (KLK2), KLK3 (prostate-specific antigen), KLK11, and nine cluster of differentiation (CD) molecules including CD10, CD13, CD14, CD26, CD66a, CD66c, CD 143, CD177, and CD224. To our knowledge, this list represents the first comprehensive characterization of the EPS proteome, and it provides a candidate biomarker list for targeted quantitative proteomics analysis using a multiple reaction monitoring (MRM) approach. To help prioritize candidate biomarkers, we constructed a protein–protein interaction network of the EPS proteins using Cytoscape (www.cytoscape.org), and overlaid the expression level changes from the Oncomine database onto the network.
doi:10.1002/prca.200780159
PMCID: PMC2886281
PMID: 20559448
Biomarker; Expressed prostatic secretions; Mass spectrometry; Prostate cancer
Epithelial ovarian cancer (EOC) ranks fifth as a cause of cancer deaths in women. Current diagnostic and monitoring markers have limited reliability for the detection of disease. We have tested the possibility of identifying candidate biomarkers present at low nanogram to picogram levels after removing both the 12 most abundant and 77 moderately abundant proteins from serum samples of EOC patients using antibody affinity columns. We showed that this approach allows the identification of proteins that are expressed at nanogram per liter levels in the serum. Using ICAT/MS/MS analysis, we identified 51 proteins that are differentially expressed by at least twofold. These proteins include leucine-rich α-2-glycoprotein, matrix metalloproteinase-9 (MMP-9), inter-α-trypsin inhibitor heavy chain H1, insulin-like growth factor-binding protein 6, insulin-like growth factor-binding protein 3, isoform 1 of epidermal growth factor receptor, angiopoietin-like protein 3 (ANGPTL3) and phosphatidylcholine-sterol acyltransferase. We confirmed the differential expression of MMP9 and ANGPTL3 in normal and ovarian cancer sera by ELISA assays. Further robust clinical evaluation of the candidate markers identified is necessary.
doi:10.1002/prca.200800141
PMCID: PMC2886282
PMID: 20559449
Angiopoietin-like protein 3; Matrix metalloproteinase-9; MS; Ovarian cancer; Serum depletion
A powerful way to separate signal from noise in biology is to convert the molecular data from individual genes or proteins into an analysis of comparative biological network behaviors. One of the limitations of previous network analyses is that they do not take into account the combinatorial nature of gene interactions within the network. We report here a new technique, Differential Rank Conservation (DIRAC), which permits one to assess these combinatorial interactions to quantify various biological pathways or networks in a comparative sense, and to determine how they change in different individuals experiencing the same disease process. This approach is based on the relative expression values of participating genes—i.e., the ordering of expression within network profiles. DIRAC provides quantitative measures of how network rankings differ either among networks for a selected phenotype or among phenotypes for a selected network. We examined disease phenotypes including cancer subtypes and neurological disorders and identified networks that are tightly regulated, as defined by high conservation of transcript ordering. Interestingly, we observed a strong trend to looser network regulation in more malignant phenotypes and later stages of disease. At a sample level, DIRAC can detect a change in ranking between phenotypes for any selected network. Variably expressed networks represent statistically robust differences between disease states and serve as signatures for accurate molecular classification, validating the information about expression patterns captured by DIRAC. Importantly, DIRAC can be applied not only to transcriptomic data, but to any ordinal data type.
Author Summary
The systems approach to medicine derives from the idea that diseased cells arise from one or more perturbed biological networks due to the net effect of interactions among multiple molecular agents; by measuring differences in the abundance of biomolecules (e.g., mRNA, proteins, metabolites) we can identify reporters of network states and uncover molecular signatures of disease. However, a major limitation of previously published network analyses is the focus on small numbers of individual, differentially-expressed genes, hence the failure to take into account combinatorial interactions. We report a new technique, Differential Rank Conservation, for identifying and measuring network-level perturbations. Our rank conservation index is based entirely on the relative levels of expression for participating genes and allows us to detect differences in network orderings between networks for a given phenotype and between phenotypes for a given network. In examining cancer subtypes and neurological disorders, we identified networks that are tightly and loosely regulated, as defined by the level of conservation of transcript ordering, and observed a strong trend to looser network regulation in more malignant phenotypes and later stages of disease. We also demonstrate that variably expressed networks represent robust differences between disease states.
doi:10.1371/journal.pcbi.1000792
PMCID: PMC2877722
PMID: 20523739
Background
Quantitative proteomics technologies have been developed to comprehensively identify and quantify proteins in two or more complex samples. Quantitative proteomics based on differential stable isotope labeling is one of the proteomics quantification technologies. Mass spectrometric data generated for peptide quantification are often noisy, and peak detection and definition require various smoothing filters to remove noise in order to achieve accurate peptide quantification. Many traditional smoothing filters, such as the moving average filter, Savitzky-Golay filter and Gaussian filter, have been used to reduce noise in MS peaks. However, limitations of these filtering approaches often result in inaccurate peptide quantification. Here we present the WaveletQuant program, based on wavelet theory, for better or alternative MS-based proteomic quantification.
Results
We developed a novel discrete wavelet transform (DWT) and a 'Spatial Adaptive Algorithm' to remove noise and to identify true peaks. We programmed and compiled WaveletQuant using Visual C++ 2005 Express Edition. We then incorporated the WaveletQuant program in the Trans-Proteomic Pipeline (TPP), a commonly used open source proteomics analysis pipeline.
Conclusions
We showed that WaveletQuant was able to quantify more proteins and to quantify them more accurately than the ASAPRatio, a program that performs quantification in the TPP pipeline, first using known mixed ratios of yeast extracts and then using a data set from ovarian cancer cell lysates. The program and its documentation can be downloaded from our website at http://systemsbiozju.org/data/WaveletQuant.
doi:10.1186/1471-2105-11-219
PMCID: PMC2878310
PMID: 20429928
Lin, Biaoyang | Madan, Anup | Yoon, Jae-Geun | Fang, Xuefeng | Yan, Xiaowei | Kim, Taek-Kyun | Hwang, Daehee | Hood, Leroy | Foltz, Gregory | Jones, Chris
Background
A comprehensive network-based understanding of molecular pathways abnormally altered in glioblastoma multiforme (GBM) is essential for developing effective therapeutic approaches for this deadly disease.
Methodology/Principal Findings
Applying a next generation sequencing technology, massively parallel signature sequencing (MPSS), we identified a total of 4535 genes that are differentially expressed between normal brain and GBM tissue. The expression changes of three up-regulated genes, CHI3L1, CHI3L2, and FOXM1, and two down-regulated genes, neurogranin and L1CAM, were confirmed by quantitative PCR. Pathway analysis revealed that TGF- β pathway related genes were significantly up-regulated in GBM tumor samples. An integrative pathway analysis of the TGF β signaling network identified two alternative TGF−β signaling pathways mediated by SOX4 (sex determining region Y-box 4) and TGFBI (Transforming growth factor beta induced). Quantitative RT-PCR and immunohistochemistry staining demonstrated that SOX4 and TGFBI expression is elevated in GBM tissues compared with normal brain tissues at both the RNA and protein levels. In vitro functional studies confirmed that TGFBI and SOX4 expression is increased by TGF- β stimulation and decreased by a specific inhibitor of TGF- β receptor 1 kinase.
Conclusions/Significance
Our MPSS database for GBM and normal brain tissues provides a useful resource for the scientific community. The identification of non-SMAD mediated TGF−β signaling pathways acting through SOX4 and TGFBI (GENE ID:7045) in GBM indicates that these alternative pathways should be considered, in addition to the canonical SMAD mediated pathway, in the development of new therapeutic strategies targeting TGF−β signaling in GBM. Finally, the construction of an extended TGF- β signaling network with overlaid gene expression changes between GBM and normal brain extends our understanding of the biology of GBM.
doi:10.1371/journal.pone.0010210
PMCID: PMC2856677
PMID: 20419098
Summary: We describe an integrative software platform, Prequips, for comparative proteomics-based systems biology analysis that: (i) integrates all information generated from mass spectrometry (MS)-based proteomics as well as from basic proteomics data analysis tools, (ii) visualizes such information for various proteomic analyses via graphical interfaces and (iii) links peptide and protein abundances to external tools often used in systems biology studies.
Availability: http://prequips.sourceforge.net
Contact: dhhwang@postech.ac.kr
doi:10.1093/bioinformatics/btp005
PMCID: PMC2720692
PMID: 19129212
Epigenetic inactivation of tumor suppressor genes is common in human cancer. Using a large-scale whole-genome approach in an earlier study, the authors identified epigenetically silenced genes with potential tumor suppressor function in glioblastoma (GBM). Three genes identified in this analysis—DKK1, SFRP1, and WIF1—are potent inhibitors of the Wnt signal transduction pathway. Here, the authors confirm decreased expression of these genes in GBM tumor tissue samples relative to nontumor brain tissue samples using real-time PCR. They then show that expression of all 3 genes is restored in T98 GBM cells by treatment with the histone deacetylase inhibitor Trichostatin A (TSA), but only DKK1 expression is restored by treatment with the demethylating agent 5-azacytidine. Bisulfite sequencing did not reveal significant methylation in the promoter region of DKK1, whereas histone acetylation and chromatin accessibility increased significantly for all 3 genes after TSA treatment. Ectopic expression of DKK1 significantly reduces colony formation and increases chemotherapy-induced apoptosis in T98 cells. Ectopic expression of the canonical Wnt pathway inhibitors WIF1 and SFRP1 shows a relative lack of response. Chronic Wnt3a stimulation only partially reverses growth suppression after DKK1 reexpression, whereas a specific inhibitor of the JNK pathway significantly reverses the effect of DKK1 reexpression on colony formation and apoptosis in T98 cells. These results support a potential growth-suppressive function for epigenetically silenced DKK1 in GBM and suggest that DKK1 restoration could modulate Wnt signaling through both canonical and noncanonical pathways.
doi:10.1177/1947601909356103
PMCID: PMC3092177
PMID: 21779426
epigenetics; histone modifications; methylation; glioblastoma
Pascal, Laura E | Vêncio, Ricardo ZN | Page, Laura S | Liebeskind, Emily S | Shadle, Christina P | Troisch, Pamela | Marzolf, Bruz | True, Lawrence D | Hood, Leroy E | Liu, Alvin Y
Background
Prostate cancer cells in primary tumors have been typed CD10-/CD13-/CD24hi/CD26+/CD38lo/CD44-/CD104-. This CD phenotype suggests a lineage relationship between cancer cells and luminal cells. The Gleason grade of tumors is a descriptive of tumor glandular differentiation. Higher Gleason scores are associated with treatment failure.
Methods
CD26+ cancer cells were isolated from Gleason 3+3 (G3) and Gleason 4+4 (G4) tumors by cell sorting, and their gene expression or transcriptome was determined by Affymetrix DNA array analysis. Dataset analysis was used to determine gene expression similarities and differences between G3 and G4 as well as to prostate cancer cell lines and histologically normal prostate luminal cells.
Results
The G3 and G4 transcriptomes were compared to those of prostatic cell types of non-cancer, which included luminal, basal, stromal fibromuscular, and endothelial. A principal components analysis of the various transcriptome datasets indicated a closer relationship between luminal and G3 than luminal and G4. Dataset comparison also showed that the cancer transcriptomes differed substantially from those of prostate cancer cell lines.
Conclusions
Genes differentially expressed in cancer are potential biomarkers for cancer detection, and those differentially expressed between G3 and G4 are potential biomarkers for disease stratification given that G4 cancer is associated with poor outcomes. Differentially expressed genes likely contribute to the prostate cancer phenotype and constitute the signatures of these particular cancer cell types.
doi:10.1186/1471-2407-9-452
PMCID: PMC2809079
PMID: 20021671
Summary: MS-BID (MS Biomarker Discovery Platform) is an integrative computational pipeline for biomarker discovery using LC-MS-based comparative proteomic analysis. This platform consists of several computational tools for: (i) detecting peptides in the collected patterns; (ii) matching detected peptides across a number of LC-MS datasets and (iii) selecting discriminatory peptides between classes of samples.
Availability: MS-BID source codes, binaries and documentations are freely available under LGPL from http://tools.proteomecenter.org/msBID.php.
Contact: dhhwang@postech.ac.kr
doi:10.1093/bioinformatics/btn491
PMCID: PMC2579714
PMID: 18805798