Different classes of small RNAs (sRNAs) refine the expression of numerous genes in higher eukaryotes by directing protein partners to complementary nucleic acids, where they mediate gene silencing. Plants encode a unique class of sRNAs, called trans-acting small interfering RNAs (tasiRNAs), which post-transcriptionally regulate protein-coding transcripts, as do microRNAs (miRNAs), and both sRNA classes control development through their targets. TasiRNA biogenesis requires multiple components of the siRNA pathway and also miRNAs. But while 21mer siRNAs originating from transgenes can mediate silencing across several cell layers, miRNA action seems spatially restricted to the producing or closely surrounding cells.
We have previously described the isolation of a genetrap reporter line for TAS3a, the major locus producing AUXIN RESPONS FACTOR (ARF)-regulating tasiRNAs in the Arabidopsis shoot. Its activity is limited to the adaxial (upper) side of leaf primordia, thus spatially isolated from ARF-activities, which are located in the abaxial (lower) side. We show here by in situ hybridization and reporter fusions that the silencing activities of ARF-regulating tasiRNAs are indeed manifested non-cell autonomously to spatially control ARF activities.
Endogenous tasiRNAs are thus mediators of a mobile developmental signal and might provide effective gene silencing at a distance beyond the reach of most miRNAs.
In plants, small interfering RNAs (siRNAs) can trigger a silencing signal that may spread within a tissue to adjacent cells or even systemically to other organs. Movement of the signal is initially limited to a few cells, but in some cases the signal can be amplified and travel over larger distances. How far silencing initiated by other classes of plant small RNAs (sRNAs) than siRNAs can extend has been less clear. Using a system based on the silencing of the CH42 gene, we have tracked the mobility of silencing signals initiated in phloem companion cells by artificial microRNAs (miRNA) and trans-acting siRNA (tasiRNA) that have the same primary sequence. In this system, both the ta-siRNA and the miRNA act at a distance. Non-autonomous effects of the miRNA can be triggered by several different miRNA precursors deployed as backbones. While the tasiRNA also acts non-autonomously, it has a much greater range than the miRNA or hairpin-derived siRNAs directed against CH42, indicating that biogenesis can determine the non-autonomous effects of sRNAs. In agreement with this hypothesis, the silencing signals initiated by different sRNAs differ in their genetic requirements.
Transcription factors (TFs), microRNAs (miRNAs), small interfering RNAs (siRNAs) and other functional non-coding small RNAs (sRNAs) are important gene regulators. Comparison of sRNA expression profiles between transgenic barley over-expressing a drought tolerant TF (TaDREB3) and non-transgenic control barley revealed many group-specific sRNAs. In addition, 42% of the shared sRNAs were differentially expressed between the two groups (|log2| >1). Furthermore, TaDREB3-derived sRNAs were only detected in transgenic barley despite the existence of homologous genes in non-transgenic barley. These results demonstrate that the TF strongly affects the expression of sRNAs and siRNAs could in turn affect the TF stability. The TF also affects size distribution and abundance of sRNAs including miRNAs. About half of the sRNAs in each group were derived from chloroplast. A sRNA derived from tRNA-His(GUG) encoded by the chloroplast genome is the most abundant sRNA, accounting for 42.2% of the total sRNAs in transgenic barley and 28.9% in non-transgenic barley. This sRNA, which targets a gene (TC245676) involved in biological processes, was only present in barley leaves but not roots. 124 and 136 miRNAs were detected in transgenic and non-transgenic barley, respectively. miR156 was the most abundant miRNA and up-regulated in transgenic barley, while miR168 was the most abundant miRNA and up-regulated in non-transgenic barley. Eight out of 20 predicted novel miRNAs were differentially expressed between the two groups. All the predicted novel miRNA targets were validated using a degradome library. Our data provide an insight into the effect of TF on the expression of sRNAs in barley.
MicroRNAs (miRNAs) are small RNAs (sRNAs) approximately 21 nucleotides in length that negatively control gene expression by cleaving or inhibiting the translation of target gene transcripts. Within this context, miRNAs and siRNAs are coming to the forefront as molecular mediators of gene regulation in plant responses to annual temperature cycling and cold stress. For this reason, we chose to identify and characterize the conserved and non-conserved miRNA component of peach (Prunus persica (L.) Batsch) focusing our efforts on both the recently released whole genome sequence of peach and sRNA transcriptome sequences from two tissues representing non-dormant leaves and dormant leaf buds. Conserved and non-conserved miRNAs, and their targets were identified. These sRNA resources were used to identify cold-responsive miRNAs whose gene targets co-localize with previously described QTLs for chilling requirement (CR).
Analysis of 21 million peach sRNA reads allowed us to identify 157 and 230 conserved and non-conserved miRNA sequences. Among the non-conserved miRNAs, we identified 205 that seem to be specific to peach. Comparative genome analysis between peach and Arabidopsis showed that conserved miRNA families, with the exception of miR5021, are similar in size. Sixteen of these conserved miRNA families are deeply rooted in land plant phylogeny as they are present in mosses and/or lycophytes. Within the other conserved miRNA families, five families (miR1446, miR473, miR479, miR3629, and miR3627) were reported only in tree species (Populustrichocarpa, Citrus trifolia, and Prunus persica). Expression analysis identified several up-regulated or down-regulated miRNAs in winter buds versus young leaves. A search of the peach proteome allowed the prediction of target genes for most of the conserved miRNAs and a large fraction of non-conserved miRNAs. A fraction of predicted targets in peach have not been previously reported in other species. Several conserved and non-conserved miRNAs and miRNA-regulated genes co-localize with Quantitative Trait Loci (QTLs) for chilling requirement (CR-QTL) and bloom date (BD-QTL).
In this work, we identified a large set of conserved and non-conserved miRNAs and describe their evolutionary footprint in angiosperm lineages. Several of these miRNAs were induced in winter buds and co-localized with QTLs for chilling requirement and bloom date thus making their gene targets potential candidates for mediating plant responses to cold stress. Several peach homologs of genes participating in the regulation of vernalization in Arabidopsis were identified as differentially expressed miRNAs targets, potentially linking these gene activities to cold responses in peach dormant buds. The non-conserved miRNAs may regulate cellular, physiological or developmental processes specific to peach and/or other tree species.
microRNAs; Distribution; Expression; Cold stress; Chilling requirement; Bud development
Small non-coding RNAs (21 to 24 nucleotides) regulate a number of developmental processes in plants and animals by silencing genes using multiple mechanisms. Among these, the most conserved classes are microRNAs (miRNAs) and small interfering RNAs (siRNAs), both of which are produced by RNase III-like enzymes called Dicers. Many plant miRNAs play critical roles in nutrient homeostasis, developmental processes, abiotic stress and pathogen responses. Currently, only 70 miRNA have been identified in soybean.
We utilized Illumina's SBS sequencing technology to generate high-quality small RNA (sRNA) data from four soybean (Glycine max) tissues, including root, seed, flower, and nodules, to expand the collection of currently known soybean miRNAs. We developed a bioinformatics pipeline using in-house scripts and publicly available structure prediction tools to differentiate the authentic mature miRNA sequences from other sRNAs and short RNA fragments represented in the public sequencing data.
The combined sequencing and bioinformatics analyses identified 129 miRNAs based on hairpin secondary structure features in the predicted precursors. Out of these, 42 miRNAs matched known miRNAs in soybean or other species, while 87 novel miRNAs were identified. We also predicted the putative target genes of all identified miRNAs with computational methods and verified the predicted cleavage sites in vivo for a subset of these targets using the 5' RACE method. Finally, we also studied the relationship between the abundance of miRNA and that of the respective target genes by comparison to Solexa cDNA sequencing data.
Our study significantly increased the number of miRNAs known to be expressed in soybean. The bioinformatics analysis provided insight on regulation patterns between the miRNAs and their predicted target genes expression. We also deposited the data in a soybean genome browser based on the UCSC Genome Browser architecture. Using the browser, we annotated the soybean data with miRNA sequences from four tissues and cDNA sequencing data. Overlaying these two datasets in the browser allows researchers to analyze the miRNA expression levels relative to that of the associated target genes. The browser can be accessed at http://digbio.missouri.edu/soybean_mirna/.
Small RNAs (sRNA), including microRNAs (miRNA) and small interfering RNAs (siRNA), are produced abundantly in plants and animals and function in regulating gene expression or in defense against virus or viroid infection. Analysis of siRNA profiles upon virus infection in plant may allow for virus identification, strain differentiation, and de novo assembly of virus genomes. In the present study, four suspected virus-infected tomato samples collected in the U.S. and Mexico were used for sRNA library construction and deep sequencing. Each library generated between 5–7 million sRNA reads, of which more than 90% were from the tomato genome. Upon in-silico subtraction of the tomato sRNAs, the remaining highly enriched, virus-like siRNA pools were assembled with or without reference virus or viroid genomes. A complete genome was assembled for Potato spindle tuber viroid (PSTVd) using siRNA alone. In addition, a near complete virus genome (98%) also was assembled for Pepino mosaic virus (PepMV). A common mixed infection of two strains of PepMV (EU and US1), which shared 82% of genome nucleotide sequence identity, also could be differentially assembled into their respective genomes. Using de novo assembly, a novel potyvirus with less than 60% overall genome nucleotide sequence identity to other known viruses was discovered and its full genome sequence obtained. Taken together, these data suggest that the sRNA deep sequencing technology will likely become an efficient and powerful generic tool for virus identification in plants and animals.
Direct cloning and parallel sequencing, an extremely powerful method for microRNA (miRNA) discovery, has not yet been applied to bacterial transcriptomes. Here we present sRNA-Seq, an unbiased method that allows for interrogation of the entire small, non-coding RNA (sRNA) repertoire in any prokaryotic or eukaryotic organism. This method includes a novel treatment that depletes total RNA fractions of highly abundant tRNAs and small subunit rRNA, thereby enriching the starting pool for sRNA transcripts with novel functionality. As a proof-of-principle, we applied sRNA-Seq to the human pathogen Vibrio cholerae. Our results provide information, at unprecedented depth, on the complexity of the sRNA component of a bacterial transcriptome. From 407 039 sequence reads, all 20 known V. cholerae sRNAs, 500 new, putative intergenic sRNAs and 127 putative antisense sRNAs were identified in a limited number of growth conditions examined. In addition, characterization of a subset of the newly identified transcripts led to the identification of a novel sRNA regulator of carbon metabolism. Collectively, these results strongly suggest that the number of sRNAs in bacteria has been greatly underestimated and that future efforts to analyze bacterial transcriptomes will benefit from direct cloning and parallel sequencing experiments aided by 5S/tRNA depletion.
Small RNAs (sRNAs) are a class of short (20–25 nt) non-coding RNAs that play important regulatory roles in gene expression. An essential first step in understanding their function is to confidently identify sRNA targets. In plants, several classes of sRNAs such as microRNAs (miRNAs) and trans-acting small interfering RNAs have been shown to bind with near-perfect complementarity to their messenger RNA (mRNA) targets, generally leading to cleavage of the mRNA. Recently, a high-throughput technique known as Parallel Analysis of RNA Ends (PARE) has made it possible to sequence mRNA cleavage products on a large-scale. Computational methods now exist to use these data to find targets of conserved and newly identified miRNAs. Due to speed limitations such methods rely on the user knowing which sRNA sequences are likely to target a transcript. By limiting the search to a tiny subset of sRNAs it is likely that many other sRNA/mRNA interactions will be missed. Here, we describe a new software tool called PAREsnip that allows users to search for potential targets of all sRNAs obtained from high-throughput sequencing experiments. By searching for targets of a complete ‘sRNAome’ we can facilitate large-scale identification of sRNA targets, allowing us to discover regulatory interaction networks.
Tobamoviruses encode a silencing suppressor that binds small RNA (sRNA) duplexes
in vitro and supposedly in vivo to
counteract antiviral silencing. Here, we used sRNA deep-sequencing combined with
transcriptome profiling to determine the global impact of tobamovirus infection
on Arabidopsis sRNAs and their mRNA targets. We found that
infection of Arabidopsis plants with Oilseed rape
mosaic tobamovirus causes a global size-specific enrichment of
miRNAs, ta-siRNAs, and other phased siRNAs. The observed patterns of sRNA
enrichment suggest that in addition to a role of the viral silencing suppressor,
the stabilization of sRNAs might also occur through association with unknown
host effector complexes induced upon infection. Indeed, sRNA enrichment concerns
primarily 21-nucleotide RNAs with a 5′-terminal guanine. Interestingly,
ORMV infection also leads to accumulation of novel miRNA-like sRNAs from miRNA
precursors. Thus, in addition to canonical miRNAs and miRNA*s, miRNA
precursors can encode additional sRNAs that may be functional under specific
conditions like pathogen infection. Virus-induced sRNA enrichment does not
correlate with defects in miRNA-dependent ta-siRNA biogenesis nor with global
changes in the levels of mRNA and ta-siRNA targets suggesting that the enriched
sRNAs may not be able to significantly contribute to the normal activity of
pre-loaded RISC complexes. We conclude that tobamovirus infection induces the
stabilization of a specific sRNA pool by yet unknown effector complexes. These
complexes may sequester viral and host sRNAs to engage them in yet unknown
mechanisms involved in plant:virus interactions.
Small non-coding RNAs (sRNAs) that act as regulators of gene expression have been identified in all kingdoms of life, including microRNA (miRNA) and small interfering RNA (siRNA) in eukaryotic cells. Numerous sRNAs identified in Salmonella are encoded by genes located at Salmonella pathogenicity islands (SPIs) that are commonly found in pathogenic strains. Whether these sRNAs are important for Salmonella pathogenesis and virulence in animals has not been reported. In this study, we provide the first direct evidence that a pathogenicity island-encoded sRNA, IsrM, is important for Salmonella invasion of epithelial cells, intracellular replication inside macrophages, and virulence and colonization in mice. IsrM RNA is expressed in vitro under conditions resembling those during infection in the gastrointestinal tract. Furthermore, IsrM is found to be differentially expressed in vivo, with higher expression in the ileum than in the spleen. IsrM targets the mRNAs coding for SopA, a SPI-1 effector, and HilE, a global regulator of the expression of SPI-1 proteins, which are major virulence factors essential for bacterial invasion. Mutations in IsrM result in disregulation of expression of HilE and SopA, as well as other SPI-1 genes whose expression is regulated by HilE. Salmonella with deletion of isrM is defective in bacteria invasion of epithelial cells and intracellular replication/survival in macrophages. Moreover, Salmonella with mutations in isrM is attenuated in killing animals and defective in growth in the ileum and spleen in mice. Our study has shown that IsrM sRNA functions as a pathogenicity island-encoded sRNA directly involved in Salmonella pathogenesis in animals. Our results also suggest that sRNAs may represent a distinct class of virulence factors that are important for bacterial infection in vivo.
Regulated expression of virulence factors is essential for infection by human pathogens such as Salmonella. Small non-coding RNAs (sRNAs) that act as regulators of gene expression have been identified in all kingdoms of life, and many sRNAs in Salmonella are encoded by genes located at Salmonella pathogenicity islands commonly found in pathogenic strains. In this study, we demonstrated that a pathogenicity island-encoded sRNA directly targets the expression of both a global regulator of virulence genes as well as a specific virulence factor critical for Salmonella pathogenesis. The sRNA is important for Salmonella invasion of epithelial cells, replication inside macrophages, and virulence/colonization in mice, representing the first example of a pathogenicity island-encoded sRNA that is directly involved in Salmonella pathogenesis in vivo. Our study suggests that sRNA may function as a distinct class of virulence factors that significantly contribute to bacterial infection in vivo. Furthermore, our results raise the possibility of developing new strategies against bacterial infection by preventing the expression of regulatory sRNAs.
Small non-coding RNAs (sRNAs) are regarded as important regulators in prokaryotes and play essential roles in diverse cellular processes. Xanthomonas oryzae pathovar oryzae (Xoo) is an important plant pathogenic bacterium which causes serious bacterial blight of rice. However, little is known about the number, genomic distribution and biological functions of sRNAs in Xoo.
Here, we performed a systematic screen to identify sRNAs in the Xoo strain PXO99. A total of 850 putative non-coding RNA sequences originated from intergenic and gene antisense regions were identified by cloning, of which 63 were also identified as sRNA candidates by computational prediction, thus were considered as Xoo sRNA candidates. Northern blot hybridization confirmed the size and expression of 6 sRNA candidates and other 2 cloned small RNA sequences, which were then added to the sRNA candidate list. We further examined the expression profiles of the eight sRNAs in an hfq deletion mutant and found that two of them showed drastically decreased expression levels, and another exhibited an Hfq-dependent transcript processing pattern. Deletion mutants were obtained for seven of the Northern confirmed sRNAs, but none of them exhibited obvious phenotypes. Comparison of the proteomic differences between three of the ΔsRNA mutants and the wild-type strain by two-dimensional gel electrophoresis (2-DE) analysis showed that these sRNAs are involved in multiple physiological and biochemical processes.
We experimentally verified eight sRNAs in a genome-wide screen and uncovered three Hfq-dependent sRNAs in Xoo. Proteomics analysis revealed Xoo sRNAs may take part in various metabolic processes. Taken together, this work represents the first comprehensive screen and functional analysis of sRNAs in rice pathogenic bacteria and facilitates future studies on sRNA-mediated regulatory networks in this important phytopathogen.
Small non-coding RNAs (smRNAs) are known to have major roles in gene regulation in eukaryotes. In plants, knowledge of the biogenesis and mechanisms of action of smRNA classes including microRNAs (miRNAs), short interfering RNAs (siRNAs), and trans-acting siRNAs (tasiRNAs) has been gained mostly through studies with Arabidopsis. In recent years, high throughput sequencing of smRNA populations has enabled extension of knowledge from model systems to plants with larger, more complex genomes. Soybean (Glycine max) now has many genomics resources available including a complete genome sequence and predicted gene models. Relatively little is known, however, about the full complement of its endogenous smRNAs populations and the silenced genes.
Using Illumina sequencing and computational analysis, we characterized eight smRNA populations from multiple tissues and organs of soybean including developing seed and vegetative tissues. A total of 41 million raw sequence reads collapsed into 135,055 unique reads were mapped to the soybean genome and its predicted cDNA gene models. Bioinformatic analyses were used to distinguish miRNAs and siRNAs and to determine their genomic origins and potential target genes. In addition, we identified two soybean TAS3 gene homologs, the miRNAs that putatively guide cleavage of their transcripts, and the derived tasiRNAs that could target soybean genes annotated as auxin response factors. Tissue-differential expression based on the flux of normalized miRNA and siRNA abundances in the eight smRNA libraries was evident, some of which was confirmed by smRNA blotting. Our global view of these smRNA populations also revealed that the size classes of smRNAs varied amongst different tissues, with the developing seed and seed coat having greater numbers of unique smRNAs of the 24-nt class compared to the vegetative tissues of germinating seedlings. The 24-nt class is known to be derived from repetitive elements including transposons. Detailed analysis of the size classes associated with ribosomal RNAs and transposable element families showed greater diversity of smRNAs in the 22- and 24-nt size classes.
The flux of endogenous smRNAs within multiple stages and tissues of seed development was contrasted with vegetative tissues of soybean, one of the dominant sources of protein and oil in world markets. The smRNAs varied in size class, complexity of origins, and possible targets. Sequencing revealed tissue-preferential expression for certain smRNAs and expression differences among closely related miRNA family members.
The importance of small RNA (sRNA) regulators has been recognized across all domains of life. In bacteria, sRNAs typically control the expression of virulence and stress response genes via antisense base pairing with mRNA targets. Originally dubbed “non-coding RNAs,” a number of bacterial antisense sRNAs have been found to encode functional proteins. Although very few of these dual-function sRNAs have been characterized, they have been found in both gram-negative and gram-positive organisms. Among the few known examples, the functions and mechanisms of regulation by dual-function sRNAs are variable. Some dual-function sRNAs depend on the RNA chaperone Hfq for base pairing-dependent regulation (riboregulation); this feature appears so far exclusive to gram-negative bacterial sRNAs. Other variations can be found in the spatial organization of the coding region with respect to the riboregulation determinants. How the functions of encoded proteins relate to riboregulation is for the most part not understood. However, in one case it appears that there is physiological redundancy between protein and riboregulation functions. This mini-review focuses on the two best-studied bacterial dual-function sRNAs: RNAIII from Staphylococcus aureus and SgrS from Escherichia coli and includes a discussion of what is known about the structure, function and physiological roles of these sRNAs as well as what questions remain outstanding.
Hfq; RNAIII; glucose phosphate stress; SgrS; riboregulation
Small non-coding RNAs (sRNAs) are widespread effectors of post-transcriptional gene regulation in bacteria. Currently extensive information exists on the sRNAs of Listeria monocytogenes expressed during growth in extracellular environments. We used deep sequencing of cDNAs obtained from fractioned RNA (<500 nt) isolated from extracellularly growing bacteria and from L. monocytogenes infected macrophages to catalog the sRNA repertoire during intracellular bacterial growth. Here, we report on the discovery of 150 putative regulatory RNAs of which 71 have not been previously described. A total of 29 regulatory RNAs, including small non-coding antisense RNAs, are specifically expressed intracellularly. We validated highly expressed sRNAs by northern blotting and demonstrated by the construction and characterization of isogenic mutants of rli31, rli33-1 and rli50* for intracellular expressed sRNA candidates, that their expression is required for efficient growth of bacteria in macrophages. All three mutants were attenuated when assessed for growth in mouse and insect models of infection. Comparative genomic analysis revealed the presence of lineage specific sRNA candidates and the absence of sRNA loci in genomes of naturally occurring infection-attenuated bacteria, with additional loss in non-pathogenic listerial genomes. Our analyses reveal extensive sRNA expression as an important feature of bacterial regulation during intracellular growth.
Plant microRNAs (miRNAs) are involved in post-transcriptional regulatory mechanisms of several processes, including the response to biotic and abiotic stress, often contributing to the adaptive response of the plant to adverse conditions. In addition to conserved miRNAs, found in a wide range of plant species a number of novel species-specific miRNAs, displaying lower levels of expression can be found. Due to low abundance, non conserved miRNAs are difficult to identify and isolate using conventional approaches. Conversely, deep-sequencing of small RNA (sRNA) libraries can detect even poorly expressed miRNAs.
No miRNAs from globe artichoke have been described to date. We analyzed the miRNAome from artichoke by deep sequencing four sRNA libraries obtained from NaCl stressed and control leaves and roots.
Conserved and novel miRNAs were discovered using accepted criteria. The expression level of selected miRNAs was monitored by quantitative real-time PCR. Targets were predicted and validated for their cleavage site. A total of 122 artichoke miRNAs were identified, 98 (25 families) of which were conserved with other plant species, and 24 were novel. Some miRNAs were differentially expressed according to tissue or condition, magnitude of variation after salt stress being more pronounced in roots. Target function was predicted by comparison to Arabidopsis proteins; the 43 targets (23 for novel miRNAs) identified included transcription factors and other genes, most of which involved in the response to various stresses. An unusual cleaved transcript was detected for miR393 target, transport inhibitor response 1.
The miRNAome from artichoke, including novel miRNAs, was unveiled, providing useful information on the expression in different organs and conditions. New target genes were identified. We suggest that the generation of secondary short-interfering RNAs from miR393 target can be a general rule in the plant kingdom.
Chinese fir (Cunninghamia lanceolata) is an important timber species that accounts for 20–30% of the total commercial timber production in China. However, the available genomic information of Chinese fir is limited, and this severely encumbers functional genomic analysis and molecular breeding in Chinese fir. Recently, major advances in transcriptome sequencing have provided fast and cost-effective approaches to generate large expression datasets that have proven to be powerful tools to profile the transcriptomes of non-model organisms with undetermined genomes.
In this study, the transcriptomes of nine tissues from Chinese fir were analyzed using the Illumina HiSeq™ 2000 sequencing platform. Approximately 40 million paired-end reads were obtained, generating 3.62 gigabase pairs of sequencing data. These reads were assembled into 83,248 unique sequences (i.e. Unigenes) with an average length of 449 bp, amounting to 37.40 Mb. A total of 73,779 Unigenes were supported by more than 5 reads, 42,663 (57.83%) had homologs in the NCBI non-redundant and Swiss-Prot protein databases, corresponding to 27,224 unique protein entries. Of these Unigenes, 16,750 were assigned to Gene Ontology classes, and 14,877 were clustered into orthologous groups. A total of 21,689 (29.40%) were mapped to 119 pathways by BLAST comparison against the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The majority of the genes encoding the enzymes in the biosynthetic pathways of cellulose and lignin were identified in the Unigene dataset by targeted searches of their annotations. And a number of candidate Chinese fir genes in the two metabolic pathways were discovered firstly. Eighteen genes related to cellulose and lignin biosynthesis were cloned for experimental validating of transcriptome data. Overall 49 Unigenes, covering different regions of these selected genes, were found by alignment. Their expression patterns in different tissues were analyzed by qRT-PCR to explore their putative functions.
A substantial fraction of transcript sequences was obtained from the deep sequencing of Chinese fir. The assembled Unigene dataset was used to discover candidate genes of cellulose and lignin biosynthesis. This transcriptome dataset will provide a comprehensive sequence resource for molecular genetics research of C. lanceolata.
Chinese fir; De novo assembly; RNA-Seq; Transcriptome; Cellulose and lignin biosynthesis; Gene expression
Bacterial small non-coding RNA (sRNA) plays an important role in post-transcriptional gene regulation. Although the number of annotated sRNA is steadily increasing, their functional characterization is still lagging behind. Various computational strategies for finding sRNA–mRNA interactions, and thus putative sRNA targets, were developed. Most of them suffer from a high false positive rate. Here, we present a qualitative model to simulate the effect of an sRNA on the translation initiation of a potential target. Information about the ribosome–mRNA interaction, sRNA–mRNA interaction and expression information from deep sequencing experiments is integrated to calculate the change in translation initiation complex formation, as a proxy for translational activity. This model can be used to post-evaluate predicted targets, hence condensing the list of potential targets. We show that our translation initiation model, under the influence of an sRNA, can successfully simulate thirteen out of fifteen tested sRNA–mRNA interactions in a qualitative manner. To show the gain in specificity, we applied our method to a target search for the Escherichia coli sRNA RyhB. Compared with simple target prediction without post-evaluation, we reduce the number of targets to less than one fourth potential targets, considerably reducing the burden of experimental validation.
sRNA; sRNA target prediction; translation initiation
The Arabidopsis thaliana (Arabidopsis) DOUBLE-STRANDED RNA BINDING (DRB) protein family consists of five members, DRB1 to DRB5. The biogenesis of two developmentally important small RNA (sRNA) species, the microRNAs (miRNAs) and trans-acting small interfering RNAs (tasiRNAs) by DICER-LIKE (DCL) endonucleases requires the assistance of DRB1 and DRB4 respectively. The importance of miRNA-directed target gene expression in plant development is exemplified by the phenotypic consequence of loss of DRB1 activity (drb1 plants).
Here we report that the developmental phenotype of the drb235 triple mutant plant is the result of deregulated miRNA biogenesis in the shoot apical meristem (SAM) region. The expression of DRB2, DRB3 and DRB5 in wild-type seedlings is restricted to the SAM region. Small RNA sequencing of the corresponding tissue of drb235 plants revealed altered miRNA accumulation. Approximately half of the miRNAs detected remained at levels equivalent to those of wild-type plants. However, the accumulation of the remaining miRNAs was either elevated or reduced in the triple mutant. Examination of different single and multiple drb mutants revealed a clear association between the loss of DRB2 activity and altered accumulation for both the elevated and reduced miRNA classes. Furthermore, we show that the constitutive over-expression of DRB2 outside of its wild-type expression domain can compensate for the loss of DRB1 activity in drb1 plants.
Our results suggest that in the SAM region, DRB2 is both antagonistic and synergistic to the role of DRB1 in miRNA biogenesis, adding an additional layer of gene regulatory complexity in this developmentally important tissue.
The regulatory role of small RNAs (sRNAs) in various biological processes is an active area of investigation; however, there has been limited information available on the role of sRNAs in plant-insect interactions. This study was designed to identify sRNAs in cotton-melon aphid (Aphis gossypii) during the Vat-mediated resistance interaction with melon (Cucumis melo).
The role of miRNAs was investigated in response to aphid herbivory, during both resistant and susceptible interactions. sRNA libraries made from A. gossypii tissues feeding on Vat+ and Vat− plants revealed an unexpected abundance of 27 nt long sRNA sequences in the aphids feeding on Vat+ plants. Eighty-one conserved microRNAs (miRNAs), twelve aphid-specific miRNAs, and nine novel candidate miRNAs were also identified. Plant miRNAs found in the aphid libraries were most likely ingested during phloem feeding. The presence of novel miRNAs was verified by qPCR experiments in both resistant Vat+ and susceptible Vat− interactions. The comparative analyses revealed that novel miRNAs were differentially regulated during the resistant and susceptible interactions. Gene targets predicted for the miRNAs identified in this study by in silico analyses revealed their involvement in morphogenesis and anatomical structure determination, signal transduction pathways, cell differentiation and catabolic processes.
In this study, conserved and novel miRNAs were reported in A. gossypii. Deep sequencing data showed differences in the abundance of miRNAs and piRNA-like sequences in A. gossypii. Quantitative RT-PCR revealed that A. gossypii miRNAs were differentially regulated during resistant and susceptible interactions. Aphids can also ingest plant miRNAs during phloem feeding that are stable in the insect.
Endogenous 24 nt short interfering RNAs (siRNAs), derived mostly from intergenic and repetitive genomic regions, constitute a major class of endogenous small RNAs in flowering plants. Accumulation of Arabidopsis thaliana 24 nt siRNAs requires the Dicer family member DCL3, and clear homologs of DCL3 exist in both flowering and non-flowering plants. However, the absence of a conspicuous 24 nt peak in the total RNA populations of several non-flowering plants has raised the question of whether this class of siRNAs might, in contrast to the ancient 21 nt microRNAs (miRNAs) and 21–22 nt trans-acting siRNAs (tasiRNAs), be an angiosperm-specific innovation. Analysis of non-miRNA, non-tasiRNA hotspots of small RNA production within the genome of the moss Physcomitrella patens revealed multiple loci that consistently produced a mixture of 21–24 nt siRNAs with a peak at 23 nt. These Pp23SR loci were significantly enriched in transposon content, depleted in overlap with annotated genes, and typified by dense concentrations of the 5-methyl cytosine (5 mC) DNA modification. Deep sequencing of small RNAs from two independent Ppdcl3 mutants showed that the P. patens DCL3 homolog is required for the accumulation of 22–24 nt siRNAs, but not 21 nt siRNAs, at Pp23SR loci. The 21 nt component of Pp23SR-derived siRNAs was also unaffected by a mutation in the RNA-dependent RNA polymerase mutant Pprdr6. Transcriptome-wide, Ppdcl3 mutants failed to accumulate 22–24 nt small RNAs from repetitive regions while transcripts from two abundant families of long terminal repeat (LTR) retrotransposon-associated reverse transcriptases were up-regulated. Ppdcl3 mutants also displayed an acceleration of leafy gametophore production, suggesting that repetitive siRNAs may play a role in the development of P. patens. We conclude that intergenic/repeat-derived siRNAs are indeed a broadly conserved, distinct class of small regulatory RNAs within land plants.
Very small RNAs (between ∼21 and ∼30 single-stranded bases) are a ubiquitous component of gene regulation in nearly all eukaryotic organisms. The small RNA repertoire of angiosperms (the flowering plants) is exceptionally diverse and includes conspicuous populations of 21 nt microRNAs, as well a diverse set of 24 nt short, interfering RNAs (siRNAs). The 24 nt siRNAs have well-documented roles in enforcing the silence of parasitic regions of the genome, but are not readily apparent in the small RNA populations of several lineages of ancient, non-flowering plants. We found numerous “hotspots” of small RNA production from the genome of the moss P. patens that produced a mix of 21–24 nt siRNAs. Except for their broad mix of sizes, these hotspots were reminiscent of the 24 nt siRNA loci of angiosperms: they tended to associate with decayed transposons, to avoid annotated genes, and to be densely modified with the epigenetic mark 5-methyl cytosine. Deletion of a P. patens Dicer gene abolished production of 22–24 nt siRNAs both from these loci and transcriptome-wide, especially from repetitive regions. We conclude that both microRNAs and intergenic/repeat-associated siRNAs are ancient small RNA regulators in plants, but that the sizes of the siRNAs themselves have drifted over time.
Small untranslated RNAs (sRNAs) are widespread regulators of gene expression in bacteria. This study reports on a comprehensive screen for sRNAs in the symbiotic nitrogen-fixing alpha-proteobacterium Sinorhizobium meliloti applying deep sequencing of cDNAs and microarray hybridizations.
A total of 1,125 sRNA candidates that were classified as trans-encoded sRNAs (173), cis-encoded antisense sRNAs (117), mRNA leader transcripts (379), and sense sRNAs overlapping coding regions (456) were identified in a size range of 50 to 348 nucleotides. Among these were transcripts corresponding to 82 previously reported sRNA candidates. Enrichment for RNAs with primary 5'-ends prior to sequencing of cDNAs suggested transcriptional start sites corresponding to 466 predicted sRNA regions. The consensus σ70 promoter motif CTTGAC-N17-CTATAT was found upstream of 101 sRNA candidates. Expression patterns derived from microarray hybridizations provided further information on conditions of expression of a number of sRNA candidates. Furthermore, GenBank, EMBL, DDBJ, PDB, and Rfam databases were searched for homologs of the sRNA candidates identified in this study. Searching Rfam family models with over 1,000 sRNA candidates, re-discovered only those sequences from S. meliloti already known and stored in Rfam, whereas BLAST searches suggested a number of homologs in related alpha-proteobacteria.
The screening data suggests that in S. meliloti about 3% of the genes encode trans-encoded sRNAs and about 2% antisense transcripts. Thus, this first comprehensive screen for sRNAs applying deep sequencing in an alpha-proteobacterium shows that sRNAs also occur in high number in this group of bacteria.
The small RNAs (sRNA) are a regulatory class of RNA mainly represented by the 21 and 24-nucleotide size classes. The cellular sRNAs are processed by RNase III family enzyme dicer (Dicer like in plant) from a self-complementary hairpin loop or other type of RNA duplexes. The papaya genome has been sequenced, but its microRNAs and other regulatory RNAs are yet to be analyzed.
We analyzed the genomic features of the papaya sRNA population from three sRNA deep sequencing libraries made from leaves, flowers, and leaves infected with Papaya Ringspot Virus (PRSV). We also used the deep sequencing data to annotate the micro RNA (miRNA) in papaya. We identified 60 miRNAs, 24 of which were conserved in other species, and 36 of which were novel miRNAs specific to papaya. In contrast to the Chargaff’s purine-pyrimidine equilibrium, cellular sRNA was significantly biased towards a purine rich population. Of the two purine bases, higher frequency of adenine was present in 23nt or longer sRNAs, while 22nt or shorter sRNAs were over represented by guanine bases. However, this bias was not observed in the annotated miRNAs in plants. The 21nt species were expressed from fewer loci but expressed at higher levels relative to the 24nt species. The highly expressed 21nt species were clustered in a few isolated locations of the genome. The PRSV infected leaves showed higher accumulation of 21 and 22nt sRNA compared to uninfected leaves. We observed higher accumulation of miRNA* of seven annotated miRNAs in virus-infected tissue, indicating the potential function of miRNA* under stressed conditions.
We have identified 60 miRNAs in papaya. Our study revealed the asymmetric purine-pyrimidine distribution in cellular sRNA population. The 21nt species of sRNAs have higher expression levels than 24nt sRNA. The miRNA* of some miRNAs shows higher accumulation in PRSV infected tissues, suggesting that these strands are not totally functionally redundant. The findings open a new avenue for further investigation of the sRNA silencing pathway in plants.
miRNA; siRNA; Papaya Ringspot Virus (PRSV); Small RNA strand selection; Transgene silencing
MicroRNAs (miRNAs) play a pivotal role in plant development. The expression patterns of the miRNA genes significantly influence their regulatory activities. By utilizing small RNA (sRNA) high-throughput sequencing (HTS) data, the miRNA expression patterns were investigated in four organs (flowers, leaves, roots and seedlings) of Arabidopsis. Based on a set of criteria, dozens of organ-specific miRNAs were discovered. A dominant portion of the organ-specific miRNAs identified from the ARGONAUTE 4-enriched sRNA HTS libraries were highly expressed in flowers. Additionally, the expression of the precursors of the organ-specific miRNAs was analyzed. Degradome sequencing data-based approach was employed to identify the targets of the organ-specific miRNAs. The miRNA–target interactions were used for network construction. Subnetwork analysis unraveled some novel regulatory cascades, such as the feedback regulation mediated by miR161, the potential self-regulation of the genes miR172, miR396, miR398 and miR860, and the miR863-guided cleavage of the SERRATE transcript. Our bioinformatics survey expanded the organ-specific miRNA–target list in Arabidopsis, and could deepen the biological view of the miRNA expression and their regulatory roles.
Only few small RNAs (sRNAs) have been characterized in Mycobacterium tuberculosis and their role in regulatory networks is still poorly understood. Here we report a genome-wide characterization of sRNAs in M. tuberculosis integrating experimental and computational analyses. Global RNA-seq analysis of exponentially growing cultures of M. tuberculosis H37Rv had previously identified 1373 sRNA species. In the present report we show that 258 (19%) of these were also identified by microarray expression. This set included 22 intergenic sRNAs, 84 sRNAs mapping within 5′/3′ UTRs, and 152 antisense sRNAs. Analysis of promoter and terminator consensus sequences identified sigma A promoter consensus sequences for 121 sRNAs (47%), terminator consensus motifs for 22 sRNAs (8.5%), and both motifs for 35 sRNAs (14%). Additionally, 20/23 candidates were visualized by Northern blot analysis and 5′ end mapping by primer extension confirmed the RNA-seq data. We also used a computational approach utilizing functional enrichment to identify the pathways targeted by sRNA regulation. We found that antisense sRNAs preferentially regulated transcription of membrane-bound proteins. Genes putatively regulated by novel cis-encoded sRNAs were enriched for two-component systems and for functional pathways involved in hydrogen transport on the membrane.
In bacteria, small regulatory non-coding RNAs (sRNAs) are the most abundant class of post-transcriptional regulators. They are involved in diverse processes including quorum sensing, stress response, virulence and carbon metabolism. Recent developments in high-throughput techniques, such as genomic tiling arrays and RNA-Seq, have allowed efficient detection and characterization of bacterial sRNAs. However, a comprehensive repository to host sRNAs and their annotations is not available. Existing databases suffer from a limited number of bacterial species or sRNAs included. In addition, these databases do not have tools to integrate or analyse high-throughput sequencing data. Here, we have developed BSRD (http://kwanlab.bio.cuhk.edu.hk/BSRD), a comprehensive bacterial sRNAs database, as a repository for published bacterial sRNA sequences with annotations and expression profiles. BSRD contains over nine times more experimentally validated sRNAs than any other available databases. BSRD also provides combinatorial regulatory networks of transcription factors and sRNAs with their common targets. We have built and implemented in BSRD a novel RNA-Seq analysis platform, sRNADeep, to characterize sRNAs in large-scale transcriptome sequencing projects. We will update BSRD regularly.