Transcription factors (TFs) regulate gene expression by binding selectively to DNA sequences in promoters, and genes regulated by the same TFs have been assumed to share the common binding sites in their promoter regions and exhibit similar expression patterns [
1]. Numerous experimental and computational studies [
2] have been done on locating transcriptional regulator DNA binding sequences and understanding their working mechanisms. These binding motifs can be used as building blocks of gene regulatory networks and several approaches were developed to identify how a set of
cis-regulatory elements in a gene's promoter region governed its behavior and explained the observed expression profiles [
3-
5]. Using different approaches, Segal et al. [
3] and Beer and Tavazoie [
4] both showed that a substantial fraction of yeast gene expression profiles could be explained in terms of the combination of
cis-regulatory elements. However, a limitation of such approaches is that many genes are posttranscriptionally regulated [
3]. The progresses have been made in mapping transcriptional regulatory networks in recent years, whereas posttranscriptional regulatory roles just begin to be uncovered [
6,
7]. Posttranscriptional regulation through RNA-RNA interaction has arrested much attention due to the discovery of microRNAs (miRNAs).
miRNAs regulate gene expression by inducing mRNA cleavage or translational repression of their targets [
8]. Plant miRNAs are usually perfectly complementary to their targets and cause the cleavage of their targets by RNA-induced silencing complex (RISC), whereas in animals targets with weaker complementarities appear to have decreased translational efficacy [
9]. However, the role of miRNA in regulatory networks needs to be further explored [
7]. To address this need, we introduced a combinatorial approach to determine the transcriptional and posttranscriptional regulatory elements based on gene expression profiles.
Various plant growth and development processes are critically influenced by light [
10-
12]. Wild type
Arabidopsis seedling development follows two patterns, etiolation in darkness and photomorphogenesis in the light [
13].
COP/DET/FUS (CONSTITUTIVE PHOTOMORPHOGENIC/DE-ETIOLATED/FUSCA) is a class of genes which were identified as downstream signalling components of all photoreceptors [
14-
16]. Mutation in
COP/DET/FUS causes constitutive photomorphogenic development even in the dark [
14,
17]. One important light-signalling component involved in plant light responses is COP1 [
14], which regulates not only photomorphogenesis but also other developmental processes. The constitutive photomorphogenic phenotype of
cop1 mutation indicates that COP1 acts as a negative regulator of photomorphogenesis [
13,
18].
We applied this approach to a
CONSTITUTIVE PHOTOMORPHOGENIC1 (
COP1) mutant time course microarray dataset to detect sequence elements that selectively bind to TFs and miRNAs in the process. Inspired by Beer and Tavazoie [
4], we used Bayesian network – a probabilistic model to integrate gene expression profiles, transcription factor binding sites (TFBSs) as well as miRNA target motifs to deduce the combination of sequence elements that modulate gene expression, and we tried to explain the observed gene expression profiles in terms of the adopted motifs. Firstly, we conducted a genome-wide screening to detect potential miRNA target motifs in
Arabidopsis based on an inhomogeneous Hidden Markov model (HMM), and cross-species conservation as well as minimum binding energy of miRNA/mRNA duplex were used as additional filters to reduce the rate of false positives. Secondly, genes in the
cop1 mutant time course microarray dataset were clustered into 12 expression patterns and overrepresented sequence elements in the upstream of the genes belonged to the same cluster were detected using AlignACE [
19]. Thirdly, Bayesian network strategy was applied to selecting these motifs in both upstream sequences and transcript sequences that were most related to the gene expression patterns. Lastly, we measured the degree to which gene expression could be determined merely by these adopted regulatory motifs. Figure illustrated the flow diagram of the approach.