Search tips
Search criteria

Results 1-11 (11)

Clipboard (0)

Select a Filter Below

Year of Publication
Document Types
author:("Qian, jiliang")
1.  Genomic and transcriptome profiling identified both human and HBV genetic variations and their interactions in Chinese hepatocellular carcinoma 
Genomics Data  2015;6:1-3.
Interaction between HBV and host genome integrations in hepatocellular carcinoma (HCC) development is a complex process and the mechanism is still unclear. Here we described in details the quality controls and data mining of aCGH and transcriptome sequencing data on 50 HCC samples from the Chinese patients, published by Dong et al. (2015) (GEO#: GSE65486). In additional to the HBV-MLL4 integration discovered, we also investigated the genetic aberrations of HBV and host genes as well as their genetic interactions. We reported human genome copy number changes and frequent transcriptome variations (e.g. TP53, CTNNB1 mutation, especially MLL family mutations) in this cohort of the patients. For HBV genotype C, we identified a novel linkage disequilibrium region covering HBV replication regulatory elements, including basal core promoter, DR1, epsilon and poly-A regions, which is associated with HBV core antigen over-expression and almost exclusive to HBV-MLL4 integration.
PMCID: PMC4664659  PMID: 26697315
2.  Identification and validation of dysregulated MAPK7 (ERK5) as a novel oncogenic target in squamous cell lung and esophageal carcinoma 
BMC Cancer  2015;15:454.
MAPK7/ERK5 (extracellular-signal-regulated kinase 5) functions within a canonical three-tiered MAPK (mitogen activated protein kinase) signaling cascade comprising MEK (MAPK/ERK kinase) 5, MEKK(MEK kinase) 2/3 and ERK5 itself. Despite being the least well studied of the MAPK-modules, evidence supports a role for MAPK7-signaling in the pathology of several cancer types.
Methods and results
Fluorescence in situ hybridization (FISH) analysis identified MAPK7 gene amplification in 4 % (3/74) of non-small cell lung cancers (NSCLC) (enriched to 6 % (3/49) in squamous cell carcinoma) and 2 % (2/95) of squamous esophageal cancers (sqEC). Immunohistochemical (IHC) analysis revealed a good correlation between MAPK7 gene amplification and protein expression. MAPK7 was validated as a proliferative oncogenic driver by performing in vitro siRNA knockdown of MAPK7 in tumor cell lines. Finally, a novel MEK5/MAPK7 co-transfected HEK293 cell line was developed and used for routine cell-based pharmacodynamic screening. Phosphorylation antibody microarray analysis also identified novel downstream pharmacodynamic (PD) biomarkers of MAPK7 kinase inhibition in tumor cells (pMEF2A and pMEF2D).
Together, these data highlight a broader role for dysregulated MAPK7 in driving tumorigenesis within niche populations of highly prevalent tumor types, and describe current efforts in establishing a robust drug discovery screening cascade.
Electronic supplementary material
The online version of this article (doi:10.1186/s12885-015-1455-y) contains supplementary material, which is available to authorized users.
PMCID: PMC4453990  PMID: 26040563
MAPK7; ERK5; Oncogene; Kinase; Inhibitor
3.  Identification of HBV-MLL4 Integration and Its Molecular Basis in Chinese Hepatocellular Carcinoma 
PLoS ONE  2015;10(4):e0123175.
To gain molecular insights of HBV integration that may contribute to HCC tumorigenesis, we performed whole transcriptome sequencing and whole genome copy number profiling of hepatocellular carcinoma (HCC) samples from 50 Chinese patients. We identified a total of 33 HBV-human integration sites in 16 of 44 HBV-positive HCC tissues, which were enriched in HBV genotype C-infected patients. In addition, significantly recurrent HBV-MLL4 integration (18%; 8/44) was found in this cohort of patients. Using long-range PCR and Sanger sequencing, we comprehensively characterized gDNA and cDNA sequences that encode for the HBV-MLL4 transcripts, and we revealed that HBV integration into MLL4 exons led to much higher mRNA expression of MLL4 than the integration into MLL4 introns due to an alternative splicing mechanism. Moreover, the HBV-MLL4 integration occurred almost exclusively in CTNNB1 and TP53 wild-type patients. The integration was also associated with a distinct gene expression profile. In conclusion, this is the first report on the molecular basis of the MLL4 integration driving MLL4 over-expression. HBV-MLL4 integration occurred frequently in Chinese HCC patients, representing a unique molecular segment for HCC with HBV infection.
PMCID: PMC4406717  PMID: 25901726
4.  Oncogenic HER2 fusions in gastric cancer 
Genetic amplification of HER2 drives tumorigenesis and cancer progression in a subset of patients with gastric cancer (GC), and treatment with trastuzumab, a humanized HER2-neutralizing antibody, improves the overall survival rate of HER2-positive patients. However, a considerable portion of the patients does not respond to trastuzumab and the molecular mechanisms underlying the intrinsic resistance to anti-HER2 therapy in GC is not fully understood.
We performed whole-transcriptome sequencing on 21 HER2-positive tumor specimens from Chinese GC patients. Whole genome sequencing was performed on the three samples with HER2 fusion to discover the DNA integration structure. A multicolor FISH assay for HER2 split screening was conducted to confirm HER2 fusion and IHC (HercepTest™) was used to detect the membranous expression of HER2. Fusion cDNA were transfected into NIH/3T3 cells and generate stable cell line by lentivirus. The expression of exogenous HER2 fusion proteins and pHER2 were examined by western blot analysis. In vitro efficacy studies were also conducted by PD assay and softagar assay in cell line expression wild type and fusion HER2. T-DM1 was used to assess its binding to NIH/3T3 cells ectopically expressing wild-type and fusion HER2. Finally, the anti-tumor efficacy of trastuzumab was tested in NIH/3 T3 xenografts expressing the HER2 fusion variants.
We identified three new HER2 fusions with ZNF207, MDK, or NOS2 in 21 HER2-amplified GC samples (14%; 3/21). Two of the fusions, ZNF207-HER2, and MDK-HER2, which are oncogenic, lead to aberrant activation of HER2 kinase. Treatment with trastuzumab inhibited tumor growth significantly in xenografts expressing MDK-HER2 fusion. In contrast, trastuzumab had no effect on the growth of xenografts expressing ZNF207-HER2 fusion, due to its inability to bind to trastuzumab.
Our results provide the molecular basis of a novel resistance mechanism to trastuzumab-based anti-HER2 therapy, supporting additional molecule stratification within HER2-positive GC patients for more effective therapy options.
Electronic supplementary material
The online version of this article (doi:10.1186/s12967-015-0476-2) contains supplementary material, which is available to authorized users.
PMCID: PMC4396883  PMID: 25889497
HER2; Fusion-gene; Gastric cancer; Trastuzumab; Lapatinib
5.  Genetic amplification of PPME1 in gastric and lung cancer and its potential as a novel therapeutic target 
Cancer Biology & Therapy  2013;15(1):128-134.
Protein phosphatase methylesterase 1 (PPME1) is a protein phosphatase 2A (PP2A)-specific methyl esterase that negatively regulates PP2A through demethylation at its carboxy terminal leucine 309 residue. Emerging evidence shows that the upregulation of PPME1 is associated with poor prognosis in glioblastoma patients. By performing an array comparative genomic hybridization analysis to detect copy number changes, we have been the first to identify PPME1 gene amplification in 3.8% (5/131) of Chinese gastric cancer (GC) samples and 3.1% (4/124) of Chinese lung cancer (LC) samples. This PPME1 gene amplification was confirmed by fluorescence in situ hybridization analysis and is correlated with elevated protein expression, as determined by immunohistochemistry analysis. To further investigate the role of PPME1 amplification in tumor growth, short-hairpin RNA-mediated gene silencing was employed. A knockdown of PPME1 expression resulted in a significant inhibition of cell proliferation and induction of cell apoptosis in PPME1-amplified human cancer cell lines SNU668 (GC) and Oka-C1 (LC), but not in nonamplified MKN1 (GC) and HCC95 (LC) cells. The PPME1 gene knockdown also led to a consistent decrease in PP2A demethylation at leucine 309, which was correlated with the downregulation of cellular Erk and AKT phosphorylation. Our data indicate that PPME1 could be an attractive therapeutic target for a subset of GCs and LCs.
PMCID: PMC3938515  PMID: 24253382
gastric cancer; lung cancer; PPME1 amplification; PP2A; shRNA-knockdown
6.  Using GeneReg to construct time delay gene regulatory networks 
BMC Research Notes  2010;3:142.
Understanding gene expression and regulation is essential for understanding biological mechanisms. Because gene expression profiling has been widely used in basic biological research, especially in transcription regulation studies, we have developed GeneReg, an easy-to-use R package, to construct gene regulatory networks from time course gene expression profiling data; More importantly, this package can provide information about time delays between expression change in a regulator and that of its target genes.
The R package GeneReg is based on time delay linear regression, which can generate a model of the expression levels of regulators at a given time point against the expression levels of their target genes at a later time point. There are two parameters in the model, time delay and regulation coefficient. Time delay is the time lag during which expression change of the regulator is transmitted to change in target gene expression. Regulation coefficient expresses the regulation effect: a positive regulation coefficient indicates activation and negative indicates repression. GeneReg was implemented on a real Saccharomyces cerevisiae cell cycle dataset; more than thirty percent of the modeled regulations, based entirely on gene expression files, were found to be consistent with previous discoveries from known databases.
GeneReg is an easy-to-use, simple, fast R package for gene regulatory network construction from short time course gene expression data. It may be applied to study time-related biological processes such as cell cycle, cell differentiation, or causal inference.
PMCID: PMC2892504  PMID: 20500822
7.  The combination approach of SVM and ECOC for powerful identification and classification of transcription factor 
BMC Bioinformatics  2008;9:282.
Transcription factors (TFs) are core functional proteins which play important roles in gene expression control, and they are key factors for gene regulation network construction. Traditionally, they were identified and classified through experimental approaches. In order to save time and reduce costs, many computational methods have been developed to identify TFs from new proteins and to classify the resulted TFs. Though these methods have facilitated screening of TFs to some extent, low accuracy is still a common problem. With the fast growing number of new proteins, more precise algorithms for identifying TFs from new proteins and classifying the consequent TFs are in a high demand.
The support vector machine (SVM) algorithm was utilized to construct an automatic detector for TF identification, where protein domains and functional sites were employed as feature vectors. Error-correcting output coding (ECOC) algorithm, which was originated from information and communication engineering fields, was introduced to combine with support vector machine (SVM) methodology for TF classification. The overall success rates of identification and classification achieved 88.22% and 97.83% respectively. Finally, a web site was constructed to let users access our tools (see Availability and requirements section for URL).
The SVM method was a valid and stable means for TFs identification with protein domains and functional sites as feature vectors. Error-correcting output coding (ECOC) algorithm is a powerful method for multi-class classification problem. When combined with SVM method, it can remarkably increase the accuracy of TF classification using protein domains and functional sites as feature vectors. In addition, our work implied that ECOC algorithm may succeed in a broad range of applications in biological data mining.
PMCID: PMC2440765  PMID: 18554421
8.  Gene-Centric Characteristics of Genome-Wide Association Studies 
PLoS ONE  2007;2(12):e1262.
The high-throughput genotyping chips have contributed greatly to genome-wide association (GWA) studies to identify novel disease susceptibility single nucleotide polymorphisms (SNPs). The high-density chips are designed using two different SNP selection approaches, the direct gene-centric approach, and the indirect quasi-random SNPs or linkage disequilibrium (LD)-based tagSNPs approaches. Although all these approaches can provide high genome coverage and ascertain variants in genes, it is not clear to which extent these approaches could capture the common genic variants. It is also important to characterize and compare the differences between these approaches.
Methodology/Principal Findings
In our study, by using both the Phase II HapMap data and the disease variants extracted from OMIM, a gene-centric evaluation was first performed to evaluate the ability of the approaches in capturing the disease variants in Caucasian population. Then the distribution patterns of SNPs were also characterized in genic regions, evolutionarily conserved introns and nongenic regions, ontologies and pathways. The results show that, no mater which SNP selection approach is used, the current high-density SNP chips provide very high coverage in genic regions and can capture most of known common disease variants under HapMap frame. The results also show that the differences between the direct and the indirect approaches are relatively small. Both have similar SNP distribution patterns in these gene-centric characteristics.
This study suggests that the indirect approaches not only have the advantage of high coverage but also are useful for studies focusing on various functional SNPs either in genes or in the conserved regions that the direct approach supports. The study and the annotation of characteristics will be helpful for designing and analyzing GWA studies that aim to identify genetic risk factors involved in common diseases, especially variants in genes and conserved regions.
PMCID: PMC2092383  PMID: 18060058
9.  The use of global transcriptional analysis to reveal the biological and cellular events involved in distinct development phases of Trichophyton rubrum conidial germination 
BMC Genomics  2007;8:100.
Conidia are considered to be the primary cause of infections by Trichophyton rubrum.
We have developed a cDNA microarray containing 10250 ESTs to monitor the transcriptional strategy of conidial germination. A total of 1561 genes that had their expression levels specially altered in the process were obtained and hierarchically clustered with respect to their expression profiles. By functional analysis, we provided a global view of an important biological system related to conidial germination, including characterization of the pattern of gene expression at sequential developmental phases, and changes of gene expression profiles corresponding to morphological transitions. We matched the EST sequences to GO terms in the Saccharomyces Genome Database (SGD). A number of homologues of Saccharomyces cerevisiae genes related to signalling pathways and some important cellular processes were found to be involved in T. rubrum germination. These genes and signalling pathways may play roles in distinct steps, such as activating conidial germination, maintenance of isotropic growth, establishment of cell polarity and morphological transitions.
Our results may provide insights into molecular mechanisms of conidial germination at the cell level, and may enhance our understanding of regulation of gene expression related to the morphological construction of T. rubrum.
PMCID: PMC1871584  PMID: 17428342
10.  An efficient method for statistical significance calculation of transcription factor binding sites 
Bioinformation  2007;2(5):169-174.
Various statistical models have been developed to describe the DNA binding preference of transcription factors, by which putative transcription factor binding sites (TFBS) can be identified according to scores assigned. Statistical significance of these scores, usually known as the p-value, play a critical role in identification. We developed an efficient algorithm to provide precise calculation of the statistical significance, remarkably enhancing the calculation efficiency by reducing the time complexity from an exponent scale to a linear scale, and successfully extended the application of this algorithm to a wide range of models, from the commonly used position weight matrix models to the complicated Bayesian Network models. Further, we calculated p-values of all transcription factor DNA binding sites recorded in the database, JASPAR, and based on these, we investigated some unseen properties of p-values as a whole, such as the p-value distribution of different models and the p-value variance according to changed scoring schemes. We hope that our algorithm and the result of computational experiments would offer an improved solution to the statistical significance of transcription factor binding sites. The software to implement our method can be downloaded from
PMCID: PMC2241927  PMID: 18305824
transcription factor; DNA; binding sites; Bayesian network
11.  Genomic characterization of ribitol teichoic acid synthesis in Staphylococcus aureus: genes, genomic organization and gene duplication 
BMC Genomics  2006;7:74.
Staphylococcus aureus or MRSA (Methicillin Resistant S. aureus), is an acquired pathogen and the primary cause of nosocomial infections worldwide. In S. aureus, teichoic acid is an essential component of the cell wall, and its biosynthesis is not yet well characterized. Studies in Bacillus subtilis have discovered two different pathways of teichoic acid biosynthesis, in two strains W23 and 168 respectively, namely teichoic acid ribitol (tar) and teichoic acid glycerol (tag). The genes involved in these two pathways are also characterized, tarA, tarB, tarD, tarI, tarJ, tarK, tarL for the tar pathway, and tagA, tagB, tagD, tagE, tagF for the tag pathway. With the genome sequences of several MRSA strains: Mu50, MW2, N315, MRSA252, COL as well as methicillin susceptible strain MSSA476 available, a comparative genomic analysis was performed to characterize teichoic acid biosynthesis in these S. aureus strains.
We identified all S. aureus tar and tag gene orthologs in the selected S. aureus strains which would contribute to teichoic acids sythesis.Based on our identification of genes orthologous to tarI, tarJ, tarL, which are specific to tar pathway in B. subtilis W23, we also concluded that tar is the major teichoic acid biogenesis pathway in S. aureus. Further analyses indicated that the S. aureus tar genes, different from the divergon organization in B. subtilis, are organized into several clusters in cis. Most interesting, compared with genes in B. subtilis tar pathway, the S. aureus tar specific genes (tarI,J,L) are duplicated in all six S. aureus genomes.
In the S. aureus strains we analyzed, tar (teichoic acid ribitol) is the main teichoic acid biogenesis pathway. The tar genes are organized into several genomic groups in cis and the genes specific to tar (relative to tag): tarI, tarJ, tarL are duplicated. The genomic organization of the S. aureus tar pathway suggests their regulations are different when compared to B. subtilis tar or tag pathway, which are grouped in two operons in a divergon structure.
PMCID: PMC1458327  PMID: 16595020

Results 1-11 (11)