BEACH (named after ‘Beige and Chediak-Higashi’) is a conserved ~280 residue domain, present in nine human BEACH domain containing proteins (BDCPs). Most BDCPs are large, containing a PH-like domain for membrane association preceding their BEACH domain, and containing WD40 and other domains for ligand binding. Recent studies found that mutations in individual BDCPs cause several human diseases. BDCP alterations affect lysosome size (LYST and NSMAF), apoptosis (NSMAF), autophagy (LYST, WDFY3, LRBA), granule size (LYST, NBEAL2, NBEA), or synapse formation (NBEA). However, the roles of each BDCP in these membrane events remain controversial. After reviewing studies on individual BDCPs, we propose a unifying hypothesis that BDCPs act as scaffolding proteins that facilitate membrane events, including both fission and fusion, determined by their binding partners. BDCPs may also bind each other, enabling fusion or fission of vesicles that are not necessarily of the same type. Such mechanisms explain why different BDCPs may have roles in autophagy; each BDCP is specific for the cell type or the cargo, but not necessarily specific for attaching to the autophagosome. Further elucidation of these mechanisms, preferably carrying out the same experiment on multiple BDCPs, and possibly using patients’ cells, may identify potential targets for therapy.
We describe a pedigree of 71 individuals from the Republic of Cameroon in which at least 33 individuals have a clinical diagnosis of stuttering. The high concentration of stuttering individuals suggests that the pedigree either contains a single highly penetrant gene variant or that assortative mating led to multiple stuttering-associated variants being transmitted in different parts of the pedigree. No single locus displayed significant linkage to stuttering in initial genome-wide scans with microsatellite and SNP markers. By dividing the pedigree into five sub-pedigrees, we found evidence for linkage to previously reported loci on 3q and 15q, and to novel loci on 2p, 3p, 14q, and a different region of 15q. Using the two-locus mode of Superlink, we showed that combining the recessive locus on 2p and a single-locus additive representation of the 15q loci is sufficient to achieve a two-locus score over 6 on the entire pedigree. For this 2p+15q analysis, we show LOD scores ranging from 4.69 to 6.57, and the scores are sensitive to which marker is chosen for 15q. Our findings provide strong evidence for linkage at several loci.
PSEUDOMARKER is a software package that performs joint linkage and linkage disequilibrium analysis between a marker and a putative disease locus. A key feature of PSEUDOMARKER is that it can combine case-controls and pedigrees of varying structure into a single unified analysis. Thus it maximizes the full likelihood of the data over marker allele frequencies or conditional allele frequencies on disease and recombination fraction.
The new version 2.0 uses the software package NOMAD to maximize likelihoods, resulting in generally comparable or better optima with many fewer evaluations of the likelihood functions.
After being modified substantially to use modern optimization methods, PSEUDOMARKER version 2.0 is more robust and substantially faster than version 1.0. NOMAD may be useful in other bioinformatics problems where complex likelihood functions are optimized.
In vivo animal models have proven very useful to understand basic biological pathways of the immune system, a prerequisite for the development of innovate therapies. This manuscript addresses currently available models for defined human monogenetic defects of neutrophil granulocytes, including murine, zebrafish and larger mammalian species. Strengths and weaknesses of each system are summarized, and clinical investigators may thus be inspired to develop further lines of research to improve diagnosis and therapy by use of the appropriate animal model system.
Chronic granulomatous disease; leukocyte adhesion deficiency; severe congenital neutropenia; neutrophils; mouse models; zebrafish models
This study investigates the association between bereavement and the mortality of a surviving spouse among Amish couples. We hypothesised that the bereavement effect would be relatively small in the Amish due to the unusually cohesive social structure of the Amish that might attenuate the loss of spousal support.
Population-based cohort study.
10 892 Amish couples born during 1725–1900 located in Pennsylvania, Ohio and Indiana. All the participants are deceased.
The survival time is ‘age’; event is ‘death’. Hazard ratios (HRs) of widowed individuals with respect to gender, age at widowhood, remarriage, the number of surviving children and time since bereavement.
We observed HRs for widowhood ranging from 1.06 to 1.26 over the study period (nearly all differences significant at p<0.05). Mortality risks tended to be higher in men than in women and in younger compared with older bereaved spouses. There were significantly increased mortality risks in widows and widowers who did not remarry. We observed a higher number of surviving children to be associated with increased mortality in men and women. Mortality risk following bereavement was higher in the first 6 months among men and women.
We conclude that bereavement effects remain apparent even in this socially cohesive Amish community. Remarriage is associated with a significant decrease in the mortality risk among Amish individuals. Contrary to results from previous studies, an increase in the number of surviving children was associated with decreased survival rate.
Epidemiology; Geriatric Medicine; Mental Health; Public Health
Digenic inheritance (DI) is the simplest form of inheritance for genetically complex diseases. In contrast to the thousands of reports that mutations in single genes cause human diseases, there are only dozens of human disease phenotypes with evidence for DI in some pedigrees. The advent of high-throughput sequencing (HTS) has made it simpler to identify monogenic disease causes and could similarly simplify proving DI because one can simultaneously find mutations in two genes in the same sample. However, through 2012, I could find only one example of human DI in which HTS was used; in that example, HTS found only the second of the two genes. To explore the gap between expectation and reality, I tried to collect all examples of human DI with a narrow definition and characterize them according to the types of evidence collected and whether there has been replication. Two strong trends are that knowledge of candidate genes and knowledge of protein-protein interactions have been helpful in most published examples of human DI. In contrast, the positional method of genetic linkage analysis, has been mostly unsuccessful in identifying genes underlying human DI. Based on the empirical data, I suggest that combining HTS with growing networks of established protein-protein interactions may expedite future discoveries of human DI and strengthen the evidence for them.
Digenic inheritance; protein-protein interactions; high-throughput sequencing; epistasis; facioscapulohumeral muscular dystrophy; deafness; Bardet-Biedl syndrome; nephrotic syndrome; hypogonadotropic hypogonadism; ciliopathies; genetic linkage analysis
The largest chromosome in the river buffalo karyotype, BBU1, is a submetacentric chromosome with reported homology between BBU1q and bovine chromosome 1 and between BBU1p and BTA27. We present the first radiation hybrid map of this chromosome containing 69 cattle derived markers including 48 coding genes, 17 microsatellites and four ESTs distributed in two linkage groups spanning a total length of 1330.1 cR5000. The RH map was constructed based on analysis of a recently developed river buffalo-hamster whole genome radiation hybrid (BBURH5000) panel. The retention frequency of individual markers across the panel ranged from 17.8% to 52.2%. With few exceptions, the order of markers within linkage groups is identical to the order established for corresponding cattle RH maps. The BBU1 map provides a starting point for comparison of gene order rearrangements between river buffalo chromosome 1 and its bovine homologs.
A primary immunodeficiency syndrome caused by loss-of-function mutations in the IL-21 receptor exhibits impaired B, T, and NK cell function.
Primary immunodeficiencies (PIDs) represent exquisite models for studying mechanisms of human host defense. In this study, we report on two unrelated kindreds, with two patients each, who had cryptosporidial infections associated with chronic cholangitis and liver disease. Using exome and candidate gene sequencing, we identified two distinct homozygous loss-of-function mutations in the interleukin-21 receptor gene (IL21R; c.G602T, p.Arg201Leu and c.240_245delCTGCCA, p.C81_H82del). The IL-21RArg201Leu mutation causes aberrant trafficking of the IL-21R to the plasma membrane, abrogates IL-21 ligand binding, and leads to defective phosphorylation of signal transducer and activator of transcription 1 (STAT1), STAT3, and STAT5. We observed impaired IL-21–induced proliferation and immunoglobulin class-switching in B cells, cytokine production in T cells, and NK cell cytotoxicity. Our study indicates that human IL-21R deficiency causes an immunodeficiency and highlights the need for early diagnosis and allogeneic hematopoietic stem cell transplantation in affected children.
We report the construction of a 1.5 Mb resolution radiation hybrid map of the domestic cat genome. This new map includes novel microsatellite loci and markers derived from the 2X genome sequence that target previous gaps in the feline-human comparative map. Ninety-six percent of the 1793 cat markers we mapped have identifiable orthologues in the canine and human genome sequences. The updated autosomal and X chromosome comparative maps identify 152 cat-human and 134 cat-dog homologous synteny blocks. Comparative analysis shows the marked change in chromosomal evolution in the canid lineage relative to the felid lineage since divergence from their carnivoran ancestor. The canid lineage has a thirty-fold difference in the number of interchromosomal rearrangments relative to felids, while the felid lineage has primarily undergone intrachromosomal rearrangements. We have also refined the pseudoautosomal region and boundary in the cat and show that it is markedly longer than those of human or mouse. This improved RH comparative map provides a useful tool to facilitate positional cloning studies in the feline model.
domestic cat; radiation hybrid map; canine genome; genome evolution; synteny; chromosome rearrangement
To determine the genetic regulation of hair length in the domestic cat, a whole genome scan was performed in a multi-generational pedigree in which the long-haired phenotype was segregating. The two markers that demonstrated the greatest linkage to the long-haired trait (LOD≥6), flanked an estimated 10 Mb region on cat chromosome B1 containing the Fibroblast Growth Factor 5 gene (FGF5), a candidate gene implicated in regulating hair follicle growth cycle in other species. Sequence analyses of FGF5 in 26 cat breeds and two pedigrees of non-breed cats, revealed four separate mutations predicted to disrupt the biological activity of the FGF5 protein. Pedigree analyses demonstrated that different combinations of paired mutant FGF5 alleles segregated with the long-haired phenotype in an autosomal recessive manner. Association analyses of over 380 genotyped breed and non-breed cats were consistent with mutations in the FGF5 gene causing the long-haired phenotype in an autosomal recessive manner. In combination, these genomic approaches demonstrated that FGF5 is the major genetic determinant of hair length in the domestic cat.
Motivation: Development and progression of solid tumors can be attributed to a process of mutations, which typically includes changes in the number of copies of genes or genomic regions. Although comparisons of cells within single tumors show extensive heterogeneity, recurring features of their evolutionary process may be discerned by comparing multiple regions or cells of a tumor. A useful source of data for studying likely progression of individual tumors is fluorescence in situ hybridization (FISH), which allows one to count copy numbers of several genes in hundreds of single cells. Novel algorithms for interpreting such data phylogenetically are needed, however, to reconstruct likely evolutionary trajectories from states of single cells and facilitate analysis of tumor evolution.
Results: In this article, we develop phylogenetic methods to infer likely models of tumor progression using FISH copy number data and apply them to a study of FISH data from two cancer types. Statistical analyses of topological characteristics of the tree-based model provide insights into likely tumor progression pathways consistent with the prior literature. Furthermore, tree statistics from the resulting phylogenies can be used as features for prediction methods. This results in improved accuracy, relative to unstructured gene copy number data, at predicting tumor state and future metastasis.
Availability: Source code for software that does FISH tree building (FISHtrees) and the data on cervical and breast cancer examined here are available at ftp://ftp.ncbi.nlm.nih.gov/pub/FISHtrees.
Supplementary data are available at Bioinformatics online.
The molecular mechanisms contributing to the development and progression of gingivobuccal complex (GBC) cancers–a sub-site of oral cancer, comprising the buccal mucosa, the gingivobuccal sulcus, the lower gingival region and the retromolar trigone-remain poorly understood. Identifying the GBC cancer-related gene expression signature and the driver genes residing on the altered chromosomal regions is critical for understanding the molecular basis of its pathogenesis. Genome-wide expression profiling of 27 GBC cancers with known chromosomal alterations was performed to reveal differentially expressed genes. Putative driver genes were identified by integrating copy number and gene expression data. A total of 315 genes were found differentially expressed (P≤0.05, logFC>2.0) of which eleven genes were validated by real-time quantitative reverse transcriptase-PCR (qRT-PCR) in tumors (n=57) and normal GBC tissues (n=18). Overexpression of LY6K, in chromosome band 8q24.3, was validated by immunohistochemical (IHC) analysis. We found that 78.5% (2,417/3,079) of the genes located in regions of recurrent chromosomal alterations show copy number dependent expression indicating that copy number alteration has a direct effect on global gene expression. The integrative analysis revealed BIRC3 in 11q22.2 as a candidate driver gene associated with poor clinical outcome. Our study identified previously unreported differentially expressed genes in a homogeneous subtype of oral cancer and the candidate driver genes that may contribute to the development and progression of the disease.
A common approach to genetic mapping of loci for complex diseases is to perform a genome-wide association study (GWAS) by analyzing a vast number of SNP markers in cohorts of unrelated cases and controls. A direct motivation for the case–control design is that unrelated, affected individuals can be easier to collect than large families with multiple affected persons in the Western world. Despite its higher potential power, investigators have not actively pursued family ascertainment in part because of a dearth of methods for analyzing such correlated data on a large scale. We examine the statistical properties of several commonly used family-based association tests, as to their performance using real-life mixtures of families and singletons taken from our own migraine and schizophrenia studies, as well as population-based data for a complex trait simulated with the evolutionary phenogenetic simulator, ForSim. In virtually every situation, the full likelihood-based methods in the PSEUDOMARKER program outperformed those implemented in FBAT, GENEHUNTER TDT, PLINK (family-based options), HRR/HHRR, QTDT, TRANSMIT, UNPHASED, MENDEL, and LAMP. We further show that GWAS is much more powerful when family samples are used rather than unrelateds, on a genotype-by-genotype basis.
power; type-I error; genetic linkage analysis; linkage disequilibrium; family-based association; genome-wide association studies
Lifespan increases observed in the United States and elsewhere throughout the developed world, have been attributed in part to improvements in medical care access and technology and to healthier lifestyles. To differentiate the relative contributions of these two factors, we have compared lifespan in the Old Order Amish (OOA), a population with historically low use of medical care, with that of Caucasian participants from the Framingham Heart Study (FHS), focusing on individuals who have reached at least age 30 years.
Analyses were based on 2,108 OOA individuals from the Lancaster County, PA community born between 1890 and 1921 and 5,079 FHS participants born approximately the same time. Vital status was ascertained on 96.9% of the OOA cohort through 2011 and through systematic follow-up of the FHS cohort. The lifespan part of the study included an enlargement of the Anabaptist Genealogy Database to 539,822 individuals, which will be of use in other studies of the Amish. Mortality comparisons revealed that OOA men experienced better longevity (p<0.001) and OOA women comparable longevity than their FHS counterparts.
We further documented all OOA hospital discharges in Lancaster County, PA during 2002–2004 and compared OOA discharge rates to Caucasian national rates obtained from the National Hospital Discharge Survey for the same time period. Both OOA men and women experienced markedly lower rates of hospital discharges than their non-Amish counterparts, despite the increased lifespan.
We speculate that lifestyle factors may predispose the OOA to greater longevity and perhaps to lesser hospital use. Identifying these factors, which might include behaviors such as lesser tobacco use, greater physical activity, and/or enhanced community assimilation, and assessing their transferability to non-Amish communities may produce significant gains to the public health.
The regions encoding the coordinately regulated Th2 cytokines IL5, IL4 and IL13 are located on chromosomes 5 of man and 11 of mouse. They have been intensively studied because these interleukins have protective roles in helminth infections, but may lead to detrimental effects such as allergy, asthma, and fibrosis in lung and liver. We added to previous studies by comparing sequences of syntenic regions on chromosome 3 of the rabbit (Oryctolagus cuniculus) genome OryCun 2.0 assembly from a tuberculosis-susceptible strain, with the corresponding region of ENCODE ENm002 from a normal rabbit as well as with 9 other mammalian species. We searched for rabbit transcription factor binding sites in putative promoter and other non-coding regions of IL5, RAD50, IL13 and IL4. Although we identified several differences between the two donor rabbits in coding and non-coding regions of potential functional significance, confirmation awaits additional sequencing of other rabbits.
A decade ago, there was widespread enthusiasm for the prospects of genome-wide association studies to identify common variants related to common chronic diseases using samples of unrelated individuals from populations. Although technological advancements allow us to query more than a million SNPs across the genome at low cost, a disappointingly small fraction of the genetic portion of common disease etiology has been uncovered. This has led to the hypothesis that less frequent variants might be involved, stimulating a renaissance of the traditional approach of seeking genes using multiplex families from less diverse populations. However, by using the modern genotyping and sequencing technology, we can now look not just at linkage, but jointly at linkage and linkage disequilibrium (LD) in such samples. Software methods that can look simultaneously at linkage and LD in a powerful and robust manner have been lacking. Most algorithms cannot jointly analyze datasets involving families of varying structures in a statistically or computationally efficient manner. We have implemented previously proposed statistical algorithms in a user-friendly software package, PSEUDOMARKER. This paper is an announcement of this software package. We describe the motivation behind the approach, the statistical methods, and software, and we briefly demonstrate PSEUDOMARKER's advantages over other packages by example.
Computer software; Family-based association; Genome-wide association; Likelihood methods; Linkage analysis; Linkage disequilibrium; Study design
DNA aberrations that cause colorectal cancer (CRC) occur in multiple steps that involve microsatellite instability (MSI) and chromosomal instability (CIN). Herein, we studied CRCs from AA patients for their CIN and MSI status.
Array CGH was performed on 30 AA colon tumors. The MSI status was established. The CGH data from AA were compared to published lists of 41 TSG and oncogenes in Caucasians and 68 cancer genes, proposed via systematic sequencing for somatic mutations in colon and breast tumors. The patient-by-patient CGH profiles were organized into a maximum parsimony cladogram to give insights into the tumors' aberrations lineage.
The CGH analysis revealed that CIN was independent of age, gender, stage or location. However, both the number and nature of aberrations seem to depend on the MSI status. MSI-H tumors clustered together in the cladogram. The chromosomes with the highest rates of CGH aberrations were 3, 5, 7, 8, 20 and X. Chromosome X was primarily amplified in male patients. A comparison with Caucasians revealed an overall similar aberration profile with few exceptions for the following genes; THRB, RAF1, LPL, DCC, XIST, PCNT, STS and genes on the 20q12-q13 cytoband. Among the 68 CAN genes, all showed some level of alteration in our cohort.
Chromosome X amplification in male patients with CRC merits follow-up. The observed CIN may play a distinctive role in CRC in AAs. The clustering of MSI-H tumors in global CGH data analysis suggests that chromosomal aberrations are not random.
Simulation of genotypes in pedigrees is an important tool to evaluate the power of a linkage or an association study and to assess the empirical significance of results. SLINK is a widely-used package for pedigree simulations, but its implementation has not previously been described in a published paper. SLINK was initially derived from the LINKAGE programs. Over the 20 years since its release, SLINK has been modified to incorporate faster algorithms, notably from the linkage analysis package FASTLINK, also derived from LINKAGE. While SLINK can simulate genotypes on pedigrees of high complexity, one limitation of SLINK, as with most methods based on peeling algorithms to evaluate pedigree likelihoods, is the small number of linked markers that can be generated. The software package SUP includes an elegant wrapper for SLINK that circumvents the limitation on number of markers by using descent markers generated by SLINK to simulate a much larger number of markers on the same chromosome, linked and possibly associated with a trait locus. We have released new coordinated versions of SLINK (3.0; available from http://watson.hgen.pitt.edu) and SUP (v090804; available from http://mlemire.freeshell.org/software or http://watson.hgen.pitt.edu) that integrate the two software packages. Thereby, we have removed some of the previous limitations on the joint functionality of the programs, such as the number of founders in a pedigree. We review the history of SLINK and describe how SLINK and SUP are now coordinated to permit the simulation of large numbers of markers linked and possibly associated with a trait in large pedigrees.
Coordinated conditional simulation; SLINK; SUP; Linkage study; Association study; Pedigree, large; Pedigree, complex
BLAST is a commonly-used software package for comparing a query sequence to a database of known sequences; in this study, we focus on protein sequences. Position-specific-iterated BLAST (PSI-BLAST) iteratively searches a protein sequence database, using the matches in round i to construct a position-specific score matrix (PSSM) for searching the database in round i + 1. Biegert and Söding developed Context-sensitive BLAST (CS-BLAST), which combines information from searching the sequence database with information derived from a library of short protein profiles to achieve better homology detection than PSI-BLAST, which builds its PSSMs from scratch.
We describe a new method, called domain enhanced lookup time accelerated BLAST (DELTA-BLAST), which searches a database of pre-constructed PSSMs before searching a protein-sequence database, to yield better homology detection. For its PSSMs, DELTA-BLAST employs a subset of NCBI’s Conserved Domain Database (CDD). On a test set derived from ASTRAL, with one round of searching, DELTA-BLAST achieves a ROC5000 of 0.270 vs. 0.116 for CS-BLAST. The performance advantage diminishes in iterated searches, but DELTA-BLAST continues to achieve better ROC scores than CS-BLAST.
DELTA-BLAST is a useful program for the detection of remote protein homologs. It is available under the “Protein BLAST” link at http://blast.ncbi.nlm.nih.gov.
This article was reviewed by Arcady Mushegian, Nick V. Grishin, and Frank Eisenhaber.
There are sparse data on genetic, epigenetic and vitamin D exposure in African Americans (AA) with colon polyp. Consequently, we evaluated serum 25(OH) D levels, vitamin D receptor (VDR) polymorphisms and the methylation status of the tumor suppressor gene dickkopf homolog 1 (DKK1) as risk factors for colon polyp in this population.
The case-control study consisted of 93 patients with colon polyp (cases) and 187 healthy individuals (controls) at Howard University Hospital. Serum levels of 25(OH)D (including D3, D2, and total) were measured by liquid chromatography-mass spectrometry. DNA analysis focused on 49 single nucleotide polymorphisms (SNPs) in the VDR gene. Promoter methylation analysis of DKK1 was also performed. The resulting data were processed in unadjusted and multivariable logistic regression analyses.
Cases and controls differed in vitamin D status (D3<50 nmol/L: Median of 35.5 in cases vs. 36.8 in controls nmol/L; P = 0.05). Low levels of 25(OH)D3 (<50 nmol/L) were observed in 86% of cases and 68% of controls and it was associated with higher risks of colon polyp (odds ratio of 2.7, 95% confidence interval 1.3–3.4). The SNP analysis showed no association between 46 VDR polymorphisms and colon polyp. The promoter of the DKK1 gene was unmethylated in 96% of the samples.
We found an inverse association between serum 25(OH)D3 and colon polyp in AAs. VDR SNPs and DKK1 methylation were not associated with colon polyp. Vitamin D levels may in part explain the higher incidence of polyp in AAs.
We recently reported autosomal recessive fetal-onset neuroaxonal dystrophy (FNAD) in a large family of dogs that is not caused by mutation in the PLA2G6 locus (2010 J Comp Neurol 518:3771–3784). Here we report a genome-wide linkage analysis using 333 microsatellite markers to map canine FNAD to the telomeric end of chromosome 2. The interval of zero recombination was refined by single nucleotide polymorphism (SNP) haplotype analysis to ~ 200 kb, and the included genes were sequenced. We found a homozygous 3-nucleotide deletion in exon 13 of mitofusin 2 (MFN2), predicting loss of a glutamate residue at position 539 in the protein of affected dogs. RT-PCR demonstrated near normal expression of the mutant mRNA, but MFN2 expression was undetectable to very low on western blots of affected dog brainstem, cerebrum, kidney, and cultured fibroblasts and by immunohistochemistry on brainstem sections. MFN2 is a multifunctional, membrane-bound GTPase of mitochondria and endoplasmic reticulum most commonly associated with human Charcot-Marie-Tooth disease type 2A2. The canine disorder extends the range of MFN2-associated phenotypes and suggests MFN2 as a candidate gene for rare cases of human FNAD.
Neuroaxonal dystrophy; mitofusin 2; linkage analysis; animal model; disease gene
Biallelic mutations in the gene encoding HCLS-associated protein X-1 (HAX1) cause autosomal recessive severe congenital neutropenia. Some of these patients display neurological abnormalities including developmental delay, cognitive impairment and/or epilepsy. Recent genotype-phenotype studies have shown that mutations in HAX1 affecting transcripts A (NM_006118.3) and B (NM_001018837.1) cause the phenotype of SCN with neurological impairment, while mutations affecting isoform A but not B lead to SCN without neurological aberrations.
In this study, we identified a consanguineous family with two patients suffering from SCN and neurological disease caused by a novel, homozygous genomic deletion including exons 4–7 of the HAX1 gene. Quantitative MRI analyses revealed general alterations in cerebral proton density in both of the patients, as well as in an additional unrelated patient with another HAX1 mutation (Arg86X) known to be associated with neurological manifestations. This study provides first in vivo evidence of general neurodegeneration associated with HAX1 deficiency in SCN patients.
This study evaluated the clinicopathological and prognostic implications of genetic alterations characterizing oral squamous cell carcinoma(OSCC). Comparative genomic hybridization(CGH) was used to identify chromosomal alterations present in primary OSCCs obtained from 97 pateints. In this population, tobacco use was a significant risk factor for OSCC. By contrast, all 97 of our samples are negative for human papillomavirus (HPV) DNA integration, which is another known risk factor for OSCC in certain populations. Results of the Fisher’s exact test followed by Benjamini-Hochberg correction for multiple testing, showed a correlation of 7p gain and 8p loss with node-positive OSCC (p≤0.04 for both genetic alterations) and association of 11q13 gain with high-grade OSCC (p≤0.05). Univariate Cox-proportional hazard models, also corrected for multiple testing, showed significant association of 11q13 gain and 18q loss with decreased survival (p≤0.05). These findings were supported by multivariate analysis which revealed that 11q13 gain and 18q loss together serve as a strong bivariate predictor of poor prognosis. In conclusion, our study has identified genetic alterations that correlate significantly with nodal status, grade, and poor survival status of OSCC. These potential biomarkers may aid the current TNM system for better prediction of clinical outcome.
Oral cancer; CGH; prognosis; survival analysis; gain of 11q13; loss of 18q
This study evaluated the clinicopathological and prognostic implications of genetic alterations characterizing oral squamous cell carcinoma (OSCC). Comparative genomic hybridization was used to identify chromosomal alterations present in primary OSCCs obtained from 97 patients. In this population, tobacco use was a significant risk factor for OSCC. By contrast, the 97 samples were negative for human papillomavirus (HPV) DNA integration, another known risk factor for OSCC in certain populations. Results of the Fisher’s exact test, followed by the Benjamini-Hochberg correction for multiple testing, showed a correlation of 7p gain and 8p loss with node-positive OSCC (p≤0.04 for both genetic alterations) and an association of 11q13 gain with high-grade OSCC (p≤0.05). Univariate Cox-proportional hazard models, also corrected for multiple testing, showed a significant association of 11q13 gain and 18q loss with decreased survival (p≤0.05). The findings were supported by multivariate analysis, which revealed that 11q13 gain and 18q loss together serve as a strong bivariate predictor of poor prognosis. In conclusion, our study identified genetic alterations that correlate significantly with nodal status, grade and the poor survival status of OSCC. These potential biomarkers may aid the current tumor node metastasis system for better prediction of clinical outcome.
oral cancer; comparative genomic hybridization; prognosis; survival analysis; gain of 11q13; loss of 18q
Identifying oral cancer lesions associated with high risk of relapse and predicting clinical outcome remain challenging questions in clinical practice. Genomic alterations may add prognostic information and indicate biological aggressiveness thereby emphasizing the need for genome-wide profiling of oral cancers. High-resolution array comparative genomic hybridization was performed to delineate the genomic alterations in clinically annotated primary gingivo-buccal complex and tongue cancers (n = 60). The specific genomic alterations so identified were evaluated for their potential clinical relevance. Copy-number changes were observed on chromosomal arms with most frequent gains on 3q (60%), 5p (50%), 7p (50%), 8q (73%), 11q13 (47%), 14q11.2 (47%), and 19p13.3 (58%) and losses on 3p14.2 (55%) and 8p (83%). Univariate statistical analysis with correction for multiple testing revealed chromosomal gain of region 11q22.1–q22.2 and losses of 17p13.3 and 11q23–q25 to be associated with loco-regional recurrence (P = 0.004, P = 0.003, and P = 0.0003) and shorter survival (P = 0.009, P = 0.003, and P 0.0001) respectively. The gain of 11q22 and loss of 11q23-q25 were validated by interphase fluorescent in situ hybridization (I-FISH). This study identifies a tractable number of genomic alterations with few underlying genes that may potentially be utilized as biological markers for prognosis and treatment decisions in oral cancers.