Wiebers, Joyce L. (Purdue University, West Lafayette, Ind.), and Harold R. Garner. Use of S-methylcysteine and cystathionine by methionineless Neurospora mutants. J. Bacteriol. 88:1798–1804. 1964. Radioactive methionine was found in hydrolysates of various strains of Neurospora crassa when either S-methylcysteine (SMC)-C14H3 or SMC-S35 was the sole addition to minimal medium. Isotope product-precursor specific activity ratios were very similar for the two sources of label. Wild-type and methionineless mutants used sulfur from SMC in the biosynthesis of methionine, but not of cysteine, when grown in regular medium. With a medium nearly free from sulfate, SMC served as a source of sulfur for both cysteine and methionine. Suppressed methionineless mutants incorporated sulfur from SMC into cellular cysteine even in the presence of normal amounts of sulfate. SMC as a possible metabolic precursor of methionine was compared to cystathionine in an experiment with wild-type Neurospora. The four sources of label used were: SMC-C14H3, SMC-S35, cystathionine-U-C14, and cystathionine-S35. In each flask, the organism was offered one of the labeled compounds plus an equivalent amount of the other compound without label. The amount of each compound was sufficient for either to supply its contribution to all of the cellular methionine, if it were successful in competing with endogenous sources. To avoid adaptive breakdown of substrates, the compounds were added continuously at a rate consistent with the amount of growth present. The ratio of specific activity of cellular methionine to precursor was determined for each labeled compound. The results show that SMC sulfur and methyl carbon are used equally well. Cystathionine carbon and sulfur also appear to be equally utilized. A preference for cystathionine is indicated.
Ovarian cancer (OV) ranks fifth in cancer deaths among women, yet there remain few informative biomarkers for this disease. Microsatellites are repetitive genomic regions which we hypothesize could be a source of novel biomarkers for OV and have traditionally been under-appreciated relative to Single Nucleotide Polymorphisms (SNPs). In this study, we explore microsatellite variation as a potential novel source of genomic variation associated with OV. Exomes from 305 OV patient germline samples and 54 tumors, sequenced as part of The Cancer Genome Atlas, were analyzed for microsatellite variation and compared to healthy females sequenced as part of the 1,000 Genomes Project. We identified a subset of 60 microsatellite loci with genotypes that varied significantly between the OV and healthy female populations. Using these loci as a signature set, we classified germline genomes as ‘at risk’ for OV with a sensitivity of 90.1% and a specificity of 87.6%. Cross-analysis with a similar set of breast cancer associated loci identified individuals ‘at risk’ for both diseases. This study revealed a genotype-based microsatellite signature present in the germlines of individuals diagnosed with OV, and provides the basis for a potential novel risk assessment diagnostic for OV and new personal genomics targets in tumors.
Ovarian Cancer; Biomarkers; The Cancer Genome Atlas; Breast Cancer; 1,000 Genomes Project
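The sensitivity and specificity quoted in the abstract above are standard confusion-matrix quantities. The following is a minimal sketch of how they are computed, not the authors' pipeline; the function name and the counts in the usage example are hypothetical and are not taken from the study.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts.

    tp: affected individuals classified 'at risk'
    fn: affected individuals classified 'not at risk'
    tn: healthy individuals classified 'not at risk'
    fp: healthy individuals classified 'at risk'
    """
    sensitivity = tp / (tp + fn)  # fraction of true cases that are detected
    specificity = tn / (tn + fp)  # fraction of healthy controls correctly cleared
    return sensitivity, specificity


# Hypothetical counts, for illustration only (not the study's data).
sens, spec = sensitivity_specificity(tp=90, fn=10, tn=85, fp=15)
print(f"sensitivity={sens:.3f}, specificity={spec:.3f}")
```

Both values depend on the decision threshold applied to the signature set, so they are usually reported as a pair.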
Microsatellites (MSTs) are short tandem repeated genetic motifs that comprise ~3% of the genome. MST instability (MSI), defined as acquired/lost primary alleles at a small subset of microsatellite loci (e.g. Bethesda markers), is a clinically relevant marker for colorectal cancer. However, these markers are not applicable to other types of cancers, specifically, for liver cancer which has a high mortality rate. Here we show that somatic MST variability (SMV), defined as the presence of additional, non-primary (aka minor) alleles at MST loci, is a complementary measure of MSI, and a genetic marker for colorectal and liver cancer. Re-analysis of Illumina sequenced exomes from The Cancer Genome Atlas indicates that SMV may distinguish a subpopulation of African American patients with colorectal cancer, which represents ~33% of the population in this study. Further, for liver cancer, a higher rate of SMV may be indicative of an earlier age of onset. The work presented here suggests that classical MSI should be expanded to include SMV, going beyond alterations of the primary alleles at a small number of microsatellite loci. This measure of SMV may represent a potential new diagnostic for a variety of cancers and may provide new information for colorectal cancer patients.
Microsatellites; Colorectal cancer; Liver cancer; Somatic Variability
Genomic studies of glioma sub-types have amassed new disease specific mutations, yet these only partially explain how mutations are linked to predisposition or progression. We hypothesized that microsatellite variation could expand the understanding of glioma etiology. Furthermore, germline markers for gliomas are typically undetectable; therefore we also hypothesize that the predictability of cancer-associated microsatellite loci in germline DNA may support the current hypothesis of a glioma cell of origin.
In this study, “normal” germline exome sequenced DNA from the 1000 Genomes Project (n=390) were compared with exome sequences from germlines of subjects with WHO grade II and III lower-grade glioma (LGG, n=136) and WHO grade IV glioblastoma (GBM, n=252) from The Cancer Genome Atlas to identify microsatellite loci non-randomly associated with glioma. From germline data, we identified 48 GBM-specific loci, 42 LGG-specific loci and 29 loci that distinguish GBM from LGG (p ≤ 0.01). We then attempted to distinguish WHO grade II glioma (n=67) from GBM, resulting in 8 informative loci. Significantly, in all glioma grades, comparisons between tumor and matched germline sequences demonstrated no significant differences in these variants (p ≥ 0.01). Therefore, these microsatellite loci are considered to be components of grade-specific signatures for glioma which distinguish germline sequences of individuals with cancer from those of individuals that are “normal”. In order to better understand the significance of these loci, we identified biological processes enriched in genes with these variants. Most strikingly, six helicase genes were enriched in the GBM cohort (p ≤ 1.0 × 10^−3). The preservation of these glioma-specific loci could therefore serve as valuable diagnostic and therapeutic markers, especially since the heterogeneity of tumor cell populations can obscure the identification of mutations preceding a metastatic phenotype.
GBM; microsatellite; glioma; oligodendroglioma; helicase; ubiquitin proteasome system
Although the connection between cancer and cigarette smoke is well established, nicotine is not characterized as a carcinogen. Here, we used exome sequencing to identify nicotine and oxidative stress-induced somatic mutations in normal human epithelial cells and their correlation with cancer. We identified over 6,400 SNVs, indels and microsatellites in each of the stress exposed cells relative to the control, of which 2,159 were consistently observed at all nicotine doses. These included 429 nsSNVs, of which 158 were novel and 79 cancer-associated. Over 80% of consistently nicotine induced variants overlap with variations detected in oxidative stressed cells, indicating that nicotine induced genomic alterations could be mediated through oxidative stress. Nicotine induced mutations were distributed across 1,585 genes, of which 49% were associated with cancer. MUC family genes were among the top mutated genes. Analysis of 591 lung carcinoma tumor exomes from The Cancer Genome Atlas (TCGA) revealed that 20% of non-small-cell lung cancer tumors in smokers have mutations in at least one of the MUC4, MUC6 or MUC12 genes, in contrast to only 6% in non-smokers. These results indicate that nicotine induces genomic variations and promotes instability, potentially mediated by oxidative stress; they implicate nicotine in carcinogenesis and establish MUC genes as potential targets.
Nicotine; Exome sequencing; MUC4; Biomarker; Mutation targets
Using a single genome for inference in population-based studies is standard practice in genomics. Recent studies show that spontaneous genomic variants can propagate into new generations and these changes can contribute to individual cell aging with environmental and evolutionary elements contributing to cumulative genomic variation. However, the contribution of aging to genomic changes in tissue samples remains uncharacterized. Here, we report the impact of aging on individual human exomes and their implications. We found the human genome to be dynamic, acquiring a varying number of mutations with age (5,000 to 50,000 in 9 to 16 years). This equates to a variation rate of 9.6×10^−7 to 8.4×10^−6 bp^−1 year^−1 for nonsynonymous single nucleotide variants and 2.0×10^−4 to 1.0×10^−3 locus^−1 year^−1 for microsatellite loci in these individuals. These mutations span 3,000 to 13,000 genes, which commonly showed association with Wnt signaling and Gonadotropin releasing hormone receptor pathways, and indicated for individuals a specific and significant enrichment for increased risk for diabetes, kidney failure, cancer, rheumatoid arthritis, and Alzheimer's disease, conditions usually associated with aging. The results suggest that “age” is an important variable when analyzing an individual human genome to extract individual-specific clinically significant information necessary for personalized genomics.
Aging; dynamic exome; exome sequencing; personalized genomics; personalized medicine
Sequencing data analysis remains limiting and problematic, especially for low complexity repeat sequences and transposon elements due to inherent sequencing errors and short sequence read lengths. We have developed a program, ReviSeq, which uses a hybrid method comprised of iterative remapping and local assembly upon a bacterial sequence backbone. Application of this method to six Brucella suis field isolates compared to the newly revised Brucella suis 1330 reference genome identified on average 13, 15, 19 and 9 more variants per sample than STAMPY/SAMtools, BWA/SAMtools, iCORN and BWA/PINDEL pipelines, and excluded on average 4, 2, 3 and 19 variants per sample, respectively. In total, using this iterative approach, we identified on average 87 variants including SNVs, short INDELs and long INDELs per strain when compared to the reference. Our program outperforms other methods especially for long INDEL calling.
The program is available at http://reviseq.sourceforge.net.
Brucella; sequence assembly; resequencing; variant calling; comparative genomics; iterative mapping
Brucella suis is the causative agent of swine brucellosis and is known to be able to infect several different hosts, including cattle, dogs, and horses, without causing disease symptoms. Here we report the complete genome sequence of Brucella suis VBI22, which was isolated from raw milk from an infected cow.
The poor performance of current tests for predicting the onset, progression and treatment response of diabetic nephropathy has engendered a search for more sensitive and specific urinary biomarkers. Our goal was to develop a new method for protein biomarker discovery in urine from these patients.
We analyzed urine from normal subjects and patients with early and advanced nephropathy. Proteins were separated using a novel analysis process including immunodepletion of high abundance proteins followed by two stage LC fractionation of low abundance proteins. The proteins in the fractions were sequenced using MS/MS.
Immunodepletion of selected high abundance proteins followed by two stage LC produced approximately 700 fractions, each less complex and more amenable to analysis than the mixture and requiring minimal processing for MS identification. Comparison of fractions between normal and diabetic nephropathy subjects revealed several low abundance proteins that reproducibly distinguished low glomerular filtration rate (GFR) from both high GFR diabetic and normal subjects, including uteroglobin, a protein previously associated with renal scarring.
Conclusions and clinical relevance
We developed a novel method to identify low abundance urinary proteins that enables the discovery of potential biomarkers to improve the diagnosis and management of patients with diabetic nephropathy.
diabetic nephropathy; immunodepletion; liquid chromatography; proteomic methods; urinary biomarkers
Using a custom CGH-like oligonucleotide array to measure the global microsatellite content in the genomes of 72 cancer, cancer-free, and high risk patient and cell line samples (56 germline DNA and 16 in tumor or tumor cell line DNA) we found a unique, reproducible, and statistically significant pattern of 18 motif-specific microsatellite families (out of 962 possible 1-6 mer repeats) in breast cancer patient germline and tumor DNA, but not in germline DNA of cancer-free volunteer controls or in breast cancer patients with BRCA1/2 mutations. These high-similarity A/T rich repetitive motifs were also more pronounced in the germlines and tumors of colon cancer tumor patients (3/6 samples) and microsatellite unstable colon cancer cell lines; however, germline DNA of sporadic breast cancer patients exhibited the largest global content shift for those motifs with extreme AT/GC ratios. These results indicate that global microsatellite variability is complex, suggest the existence of a previously unknown genomic destabilization mechanism in breast cancer patients' germline DNA, and warrant further testing of such microsatellite variability as a predictor of future breast cancer development.
Motivation: Inferring lengths of inherited microsatellite alleles with single base pair resolution from short sequence reads is challenging due to several sources of noise caused by the repetitive nature of microsatellites and the technologies used to generate raw sequence data.
Results: We have developed a program, GenoTan, using a discretized Gaussian mixture model combined with a rules-based approach to identify inherited variation of microsatellite loci from short sequence reads without paired-end information. It effectively distinguishes length variants from noise including insertion/deletion errors in homopolymer runs by addressing the bidirectional aspect of insertion and deletion errors in sequence reads. Here we first introduce a homopolymer decomposition method which estimates error bias toward insertion or deletion in homopolymer sequence runs. Combining these approaches, GenoTan was able to genotype 94.9% of microsatellite loci accurately from simulated data with 40x sequence coverage quickly while the other programs showed <90% correct calls for the same data and required 5∼30× more computational time than GenoTan. It also showed the highest true-positive rate for real data using mixed sequence data of two Drosophila inbred lines, which was a novel validation approach for genotyping.
Availability: GenoTan is open-source software available at http://genotan.sourceforge.net.
Supplementary data are available at Bioinformatics online
Microsatellites (MST), tandem repeats of 1–6 nucleotide motifs, are mutational hot-spots with a bias for insertions and deletions (INDELs) rather than single nucleotide polymorphisms (SNPs). The majority of MST instability studies are limited to a small number of loci, the Bethesda markers, which are only informative for a subset of colorectal cancers. In this paper we evaluate non-haplotype alleles present within next-gen sequencing data to evaluate somatic MST variation (SMV) within DNA repair proficient and DNA repair defective cell lines. We confirm that alleles present within next-gen data that do not contribute to the haplotype can be reliably quantified and utilized to evaluate the SMV without requiring comparisons of matched samples. We observed that SMV patterns found in DNA repair proficient cell lines without DNA repair defects, MCF10A, HEK293 and PD20 RV:D2, had consistent patterns among samples. Further, we were able to confirm that changes in SMV patterns in cell lines lacking functional BRCA2, FANCD2 and mismatch repair were consistent with the different pathways perturbed. Using this new exome sequencing analysis approach we show that DNA instability can be identified in a sample and that patterns of instability vary depending on the impaired DNA repair mechanism, and that genes harboring minor alleles are strongly associated with cancer pathways. The MST Minor Allele Caller used for this study is available at https://github.com/zalmanv/MST_minor_allele_caller.
Motivation: Simple tandem repeats are highly variable genetic elements and widespread in genomes of many organisms. Next-generation sequencing technologies have enabled a robust comparison of large numbers of simple tandem repeat loci; however, analysis of their variation using traditional sequence analysis approaches still remains limiting and problematic due to variants occurring in repeat sequences confusing alignment programs into mapping sequence reads to incorrect loci when the sequence reads are significantly different from the reference sequence.
Results: We have developed a program, ReviSTER, which is an automated pipeline using a ‘local mapping reference reconstruction method’ to revise mismapped or partially misaligned reads at simple tandem repeat loci. ReviSTER estimates alleles of repeat loci using a local alignment method, creates temporary local mapping reference sequences, and finally remaps reads to the local mapping references. Using this approach, ReviSTER was able to successfully revise reads misaligned to repeat loci from both simulated data and real data.
Availability: ReviSTER is open-source software available at http://revister.sourceforge.net.
Supplementary data are available at Bioinformatics online.
The wealth of publicly available gene expression and genomic data provides unique opportunities for computational inference to discover groups of genes that function to control specific cellular processes. Such genes are likely to have co-evolved and be expressed in the same tissues and cells. Unfortunately, the expertise and computational resources required to compare tens of genomes and gene expression data sets make this type of analysis difficult for the average end-user. Here, we describe the implementation of a web server that predicts genes involved in affecting specific cellular processes together with a gene of interest. We termed the server ‘EvoCor’, to denote that it detects functional relationships among genes through evolutionary analysis and gene expression correlation. This web server integrates profiles of sequence divergence derived by a Hidden Markov Model (HMM) and tissue-wide gene expression patterns to determine putative functional linkages between pairs of genes. This server is easy to use and freely available at http://pilot-hmm.vbi.vt.edu/.
microsatellites; n-globin; evolution; microsatellite life cycle; gorilla
Nicotine is a known risk factor for cancer development and has been shown to alter gene expression in cells and tissue upon exposure. We used Illumina® Next Generation Sequencing (NGS) technology to gain unbiased biological insight into the transcriptomic response of normal epithelial cells (MCF-10A) to nicotine exposure. We generated expression data from 54,699 transcripts using triplicates of control and nicotine stressed cells. As a result, we identified 138 differentially expressed transcripts, including 39 uncharacterized genes. Additionally, 173 transcripts that are primarily associated with DNA replication, recombination, and repair showed evidence for alternative splicing. We discovered the greatest nicotine stress response by HPCAL4 (up-regulated by 4.71 fold) and NPAS3 (down-regulated by -2.73 fold); both are genes that have not been previously implicated in nicotine exposure but are linked to cancer. We also discovered significant down-regulation (-2.3 fold) and alternative splicing of NEAT1 (lncRNA) that may have an important, yet undiscovered regulatory role. Gene ontology analysis revealed that nicotine exposure influenced genes involved in cellular and metabolic processes. This study reveals previously unknown consequences of nicotine stress on the transcriptome of normal breast epithelial cells and provides insight into the underlying biological influence of nicotine on normal cells, laying the foundation for future studies.
We have developed a hyperspectral microscopic imaging (HMI) platform that can precisely identify and quantify 10 molecular markers in individual cancer cells in a single pass. Exploitation of an improved separation of circulating tumor cells and the application of HMI has provided an opportunity to identify molecular changes in these cells, the recognition of co-expression of these markers, and poses an important opportunity for non-invasive diagnosis, and the use of targeted therapy. We have balanced the intensity of 10 fluorochromes bound to 10 different antibodies, each specific to a particular tumor marker, so that the intensity of each fluorochrome can be discerned from overlapping emissions. Using 2 touch preps from each primary breast cancer, the average molecular marker-intensities of 25 tumor cells gave a representative molecular signature for the tumor despite some cellular heterogeneity. The intensities determined by the HMI correlate well with the conventional 0-3+ analysis by experts in cellular pathology. Since additional multiplexes can be developed using the same fluorochromes but different antibodies, this analysis allows quantification of a large number of molecular markers on individual tumor cells. HMI can be completely automated and, eventually, could allow standardization of protein biomarkers and improve reproducibility among clinical pathology laboratories.
Brucella spp. infect hosts primarily by adhering and penetrating mucosal surfaces; however the initial molecular phenomena of this host:pathogen interaction remain poorly understood. Using cDNA microarray analysis, we characterized the transcriptional profile of the intracellular pathogen Brucella melitensis at 4 h (adaptational period) and 12 h (replicative phase) following HeLa cells infection. The intracellular pathogen transcriptome was determined using initially enriched and then amplified B. melitensis RNA from total RNA of B. melitensis-infected HeLa cells. Analysis of microarray results identified 161 and 115 pathogen genes differentially expressed at 4 and 12 h p.i., respectively. In concordance with phenotypic studies, most of the genes expressed were involved in pathogen growth and metabolism, and were down-regulated at the earliest time point (78%), but up-regulated at 12 h p.i. (75%). Further characterization of specific genes identified in this study will elucidate biological processes and pathways to help understand how both host and Brucella interact during the early infectious process to the eventual benefit of the pathogen and to the detriment of the naïve host.
Brucella melitensis; Gene expression; Microarray; Bacterial pathogenesis
We have developed a method for the parallel analysis of multiple CpG sites in genomic DNA for their state of methylation. Hypermethylation of CpG islands within the promoters and 5′ exons of genes has been found to be a mechanism of transcriptional inactivation associated with a variety of tumors. The method that we developed relies on the differential reactivity of methylated and unmethylated cytosines with sodium bisulfite, which exclusively converts unmethylated cytosines to deoxyuracils. The resulting sequence changes are determined with single-nucleotide resolution by hybridization to an oligonucleotide array. Cohybridization with a reference sample containing a different label provides an internal standard for assessment of methylation state. This method provides advantages in parallelism over existing methods of methylation analysis. We have demonstrated this technique with a region from the promoter of the tumor suppressor gene p16, which is hypermethylated in many cancers.
Hypermethylation; CpG island; Oligonucleotide array; Sodium bisulfite; Tumor suppressor
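The bisulfite chemistry described in the abstract above (unmethylated cytosines are converted to uracil, which reads as T after amplification, while methylated cytosines are protected) can be illustrated in silico. This is a minimal sketch under simplifying assumptions: conversion is treated as complete and error-free, only one strand is modeled, and the function names and the toy sequence are hypothetical, not part of the published method.

```python
def bisulfite_convert(seq, methylated_positions):
    """Simulate the sequence read after bisulfite treatment and amplification.

    Unmethylated C is read as T; C at a methylated position (0-based) stays C.
    """
    methylated = set(methylated_positions)
    converted = []
    for i, base in enumerate(seq.upper()):
        if base == "C" and i not in methylated:
            converted.append("T")  # unmethylated cytosine is deaminated
        else:
            converted.append(base)  # methylated C and all other bases unchanged
    return "".join(converted)


def infer_methylated_positions(original, converted):
    """Return 0-based positions where a reference C is still read as C."""
    pairs = zip(original.upper(), converted.upper())
    return [i for i, (ref, obs) in enumerate(pairs) if ref == "C" and obs == "C"]


# Toy example: only the cytosine at position 1 is methylated.
reference = "ACGTCGAC"
treated = bisulfite_convert(reference, methylated_positions=[1])
print(treated)
print(infer_methylated_positions(reference, treated))
```

On an array, each CpG site would be interrogated by probes matching the converted and unconverted variants; comparing the two signals plays the role of `infer_methylated_positions` here.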
Survival and persistence of Mycobacterium avium subsp. paratuberculosis (MAP) in the intestinal mucosa is associated with host immune tolerance. However, the initial events during MAP interaction with its host that lead to pathogen survival, granulomatous inflammation, and clinical disease progression are poorly defined. We hypothesize that immune tolerance is initiated upon initial contact of MAP with the intestinal Peyer's patch. To test our hypothesis, ligated ileal loops in neonatal calves were infected with MAP. Intestinal tissue RNAs were collected (0.5, 1, 2, 4, 8 and 12 hrs post-infection), processed, and hybridized to bovine gene expression microarrays. Comparison of the gene transcription responses of MAP-infected calves revealed informative, complex patterns of expression. To interpret these complex data, changes in gene expression were further analyzed by dynamic Bayesian analysis, and genes were grouped into specific pathways and gene ontology categories to create a holistic model. This model revealed three different phases of responses: i) early (30 min and 1 hr post-infection), ii) intermediate (2, 4 and 8 hrs post-infection), and iii) late (12 hrs post-infection). We describe here the data that include expression profiles for perturbed pathways, as well as mechanistic genes (genes predicted to have regulatory influence) that are associated with immune tolerance. In the Early Phase of MAP infection, multiple pathways were initiated in response to MAP invasion via receptor mediated endocytosis and changes in intestinal permeability. During the Intermediate Phase, perturbed pathways involved the inflammatory responses, cytokine-cytokine receptor interaction, and cell-cell signaling. During the Late Phase of infection, gene responses associated with immune tolerance were initiated at the level of T-cell signaling.
Our study provides evidence that MAP infection resulted in differentially regulated genes, perturbed pathways and specifically modified mechanistic genes contributing to the colonization of Peyer's patch.
Brucella suis is a causative agent of porcine brucellosis. We report the resequencing of the original sample upon which the published sequence of Brucella suis 1330 is based and describe the differences between the published assembly and our assembly at 12 loci.
Aortic valve calcification is the most common form of valvular heart disease, but the mechanisms of calcific aortic valve disease (CAVD) are unknown. NOTCH1 mutations are associated with aortic valve malformations and adult-onset calcification in families with inherited disease. The Notch signaling pathway is critical for multiple cell differentiation processes, but its role in the development of CAVD is not well understood. The aim of this study was to investigate the molecular changes that occur with inhibition of Notch signaling in the aortic valve. Notch signaling pathway members are expressed in adult aortic valve cusps, and examination of diseased human aortic valves revealed decreased expression of NOTCH1 in areas of calcium deposition. To identify downstream mediators of Notch1, we examined gene expression changes that occur with chemical inhibition of Notch signaling in rat aortic valve interstitial cells (AVICs). We found significant downregulation of Sox9 along with several cartilage-specific genes that were direct targets of the transcription factor, Sox9. Loss of Sox9 expression has been published to be associated with aortic valve calcification. Utilizing an in vitro porcine aortic valve calcification model system, inhibition of Notch activity resulted in accelerated calcification while stimulation of Notch signaling attenuated the calcific process. Finally, the addition of Sox9 was able to prevent the calcification of porcine AVICs that occurs with Notch inhibition. In conclusion, loss of Notch signaling contributes to aortic valve calcification via a Sox9-dependent mechanism.
Salmonella enterica Serovar Typhimurium (S. Typhimurium) causes enterocolitis with diarrhea and polymorphonuclear cell (PMN) influx into the intestinal mucosa in humans and calves. The Salmonella Type III Secretion System (T3SS) encoded at Pathogenicity Island I translocates Salmonella effector proteins SipA, SopA, SopB, SopD, and SopE2 into epithelial cells and is required for induction of diarrhea. These effector proteins act together to induce intestinal fluid secretion and transcription of C-X-C chemokines, recruiting PMNs to the infection site. While individual molecular interactions of the effectors with cultured host cells have been characterized, their combined role in intestinal fluid secretion and inflammation is less understood. We hypothesized that comparison of the bovine intestinal mucosal response to wild type Salmonella and a SipA, SopABDE2 effector mutant relative to uninfected bovine ileum would reveal heretofore unidentified diarrhea-associated host cellular pathways. To determine the coordinated effects of these virulence factors, a bovine ligated ileal loop model was used to measure responses to wild type S. Typhimurium (WT) and a ΔsipA, sopABDE2 mutant (MUT) across 12 hours of infection using a bovine microarray. Data were analyzed using standard microarray analysis and a dynamic Bayesian network modeling approach (DBN). Both analytical methods confirmed increased expression of immune response genes to Salmonella infection and novel gene expression. Gene expression changes mapped to 219 molecular interaction pathways and 1620 gene ontology groups. Bayesian network modeling identified effects of infection on several interrelated signaling pathways including MAPK, Phosphatidylinositol, mTOR, Calcium, Toll-like Receptor, CCR3, Wnt, TGF-β, and Regulation of Actin Cytoskeleton and Apoptosis that were used to model host-pathogen interactions.
Comparison of WT and MUT demonstrated significantly different patterns of host response at early time points of infection (15 minutes, 30 minutes and one hour) within phosphatidylinositol, CCR3, Wnt, and TGF-β signaling pathways and the regulation of actin cytoskeleton pathway.
The ability to differentiate a bioterrorist attack or an accidental release of a research pathogen from a naturally occurring pandemic or disease event is crucial to the safety and security of this nation by enabling an appropriate and rapid response. It is critical in samples from an infected patient, the environment, or a laboratory to quickly and accurately identify the precise pathogen including natural or engineered variants and to classify new pathogens in relation to those that are known. Current approaches for pathogen detection rely on prior genomic sequence information. Given the enormous spectrum of genetic possibilities, a field deployable, robust technology, such as a universal (any species) microarray has near-term potential to address these needs.
A new and comprehensive sequence-independent array (Universal Bio-Signature Detection Array) was designed with approximately 373,000 probes. The main feature of this array is that the probes are computationally derived and sequence independent. There is one probe for each possible 9-mer sequence, thus 4^9 (262,144) probes. Each genome hybridized on this array has a unique pattern of signal intensities corresponding to each of these probes. These signal intensities were used to generate an un-biased cluster analysis of signal intensity hybridization patterns that can easily distinguish species into accepted and known phylogenomic relationships. Within limits, the array is highly sensitive and is able to detect synthetically mixed pathogens. Examples of unique hybridization signal intensity patterns are presented for different Brucella species as well as relevant host species and other pathogens. These results demonstrate the utility of the UBDA array as a diagnostic tool in pathogen forensics.
This pathogen detection system is fast, accurate and can be applied to any species. Hybridization patterns are unique to a specific genome and these can be used to decipher the identity of a mixed pathogen sample and can separate hosts and pathogens into their respective phylogenomic relationships. This technology can also differentiate between different species and classify genomes into their known clades. The development of this technology will result in the creation of an integrated biomarker-specific bio-signature, multiple select agent specific detection system.
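The probe design described above, one probe for each possible 9-mer over the four DNA bases, can be checked with a short enumeration. This is a minimal sketch, not the array design software; the function names are hypothetical, and counting k-mers in a sequence is used here only as a rough in silico analogy for a hybridization intensity pattern, which the real array measures experimentally.

```python
from collections import Counter
from itertools import product


def all_kmers(k, alphabet="ACGT"):
    """Yield every possible k-mer over the alphabet in lexicographic order."""
    for letters in product(alphabet, repeat=k):
        yield "".join(letters)


def kmer_counts(sequence, k):
    """Count overlapping k-mers in a sequence (an analogy for a signal pattern)."""
    sequence = sequence.upper()
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))


# Number of distinct 9-mers: 4**9.
n_probes = sum(1 for _ in all_kmers(9))
print(n_probes)

# Toy profile on a short, made-up sequence with k=3.
print(kmer_counts("ACGTACGT", 3))
```

Two genomes can then be compared by treating their count vectors over the same ordered k-mer list as points in a 262,144-dimensional space, which is the kind of input a cluster analysis operates on.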