Sox6 is a multi-faceted transcription factor involved in the terminal differentiation of many different cell types in vertebrates. It has been suggested that in mice as well as in zebrafish Sox6 plays a role in the terminal differentiation of skeletal muscle by suppressing transcription of slow fiber specific genes. In order to understand how Sox6 coordinately regulates the transcription of multiple fiber type specific genes during muscle development, we have performed ChIP-seq analyses to identify Sox6 target genes in mouse fetal myotubes and generated muscle-specific Sox6 knockout (KO) mice to determine the Sox6 null muscle phenotype in adult mice.
We have identified 1,066 Sox6 binding sites using mouse fetal myotubes. The Sox6 binding sites were found to be associated with slow fiber-specific, cardiac, and embryonic isoform genes that are expressed in the sarcomere as well as transcription factor genes known to play roles in muscle development. The concurrently performed RNA polymerase II (Pol II) ChIP-seq analysis revealed that 84% of the Sox6 peak-associated genes exhibited little to no binding of Pol II, suggesting that the majority of the Sox6 target genes are transcriptionally inactive. These results indicate that Sox6 directly regulates terminal differentiation of muscle by affecting the expression of sarcomere protein genes as well as indirectly through influencing the expression of transcription factors relevant to muscle development. Gene expression profiling of Sox6 KO skeletal and cardiac muscle revealed a significant increase in the expression of the genes associated with Sox6 binding. In the absence of the Sox6 gene, there was dramatic upregulation of slow fiber-specific, cardiac, and embryonic isoform gene expression in Sox6 KO skeletal muscle and fetal isoform gene expression in Sox6 KO cardiac muscle, thus confirming the role Sox6 plays as a transcriptional suppressor in muscle development.
Our present data indicate that during development, Sox6 functions as a transcriptional suppressor of fiber type-specific and developmental isoform genes to promote functional specification of muscle which is critical for optimum muscle performance and health.
Skeletal muscle in vertebrates has evolved to be a major organ system with great adaptability in order to respond to constantly changing physical demands placed upon it. This adaptability is achieved by the ability of muscle fibers to change their contractile and metabolic properties. Adult skeletal muscle consists of two major fiber groups, slow-twitch and fast-twitch. In general, slow fibers are best fit for long-lasting aerobic activity whereas fast fibers are best fit for short bouts of anaerobic activity . At the molecular level, a coordinated expression of multiple fiber type-specific genes, both structural and enzymatic, is required to give each fiber type its unique characteristics. Slow and fast muscle fibers are operationally defined by the expression of the isoforms of myosin heavy chain (MyHC) . In adult rodent skeletal muscle, slow fibers are defined by the expression of MyHC-β, whereas fast fibers are defined by the expression of three MyHC isoforms, IIa, IIx/d, and IIb (contractive speed: IIa<IIx/d<IIb) . In developing fetal rodent muscle, instead of MyHC-IIa, IIx/d, and IIb, there are two developmental MyHC isoforms (embryonic and perinatal) that are expressed, along with MyHC-β, at different stages of development [4,5]. After birth, expression of embryonic and perinatal MyHC isoforms as well as MyHC-β is significantly downregulated and the majority of the rodent muscle becomes fast MyHC-expressing fibers with exception of weight bearing core muscles such as soleus where slow MyHC-β is highly expressed [5-7]. In adult muscle, the main determinant of muscle fiber type is motoneuron input [3,8-11]. Several mediators and transcription factors have been identified for the nerve dependent fiber type regulation in adult skeletal muscle . In contrast, our knowledge about factors that regulate fiber type differentiation during skeletal muscle development is still limited. 
We have previously reported that Sox6 mutant fetal and perinatal skeletal muscle exhibits a significant increase in slow fiber type-specific gene expression accompanied by a significant decrease in fast fiber type-specific gene expression [12,13]. Based on these observations, we have proposed that Sox6 functions as a transcriptional suppressor of slow fiber specific genes in developing skeletal muscle.
Sox6 is a member of the evolutionarily highly conserved Sox transcription factor family [14-17]. Between mice and humans, the overall amino acid sequence of the Sox6 protein is approximately 95% conserved, and the functional domains are 100% conserved . The Sox proteins contain the Sry-related HMG box domain which mediates sequence-specific DNA binding [16,17]. In general, the specificity of Sox protein targets in each cell type is regulated by their cofactors [16,19], a property that is especially important for the Sox6 protein since it lacks a regulatory domain (activator or repressor). Therefore, when Sox6 is involved in transcriptional regulation, cofactors of Sox6 dictate whether the outcome is activation or repression [15,16]. For example, Sox6 activates cartilage specific gene transcription as part of the Sox trio proteins (Sox5, Sox6 and Sox9) [20-22]. In other cell types, Sox6 suppresses transcription of the fgf3 gene or the cyclinD1 gene by associating with repressors [23,24]. In the case of skeletal muscle, we have shown that Sox6 suppresses transcription of slow fiber specific genes during development, thus playing a critical role in initial muscle fiber type differentiation [12,13].
In the present study, to start to uncover how Sox6 regulates transcription of fiber type specific genes at the molecular level, we used a conditional Sox6 allele  to inactivate Sox6 in developing skeletal muscle. The muscle specific inactivation of Sox6 allowed us to overcome the perinatal lethality of Sox6 mutant mice [26,27] and obtain Sox6 knockout (KO) adult skeletal muscle for in-depth analysis. To identify Sox6 target genes and assess their transcriptional status, we conducted ChIP-seq analyses using Sox6 and RNA polymerase II (Pol II) antibodies. Combining these methods, we demonstrate that: (1) Inactivation of Sox6 results in an extreme upregulation in expression of slow fiber specific, cardiac and fetal isoform genes, suggesting that Sox6 is required for the functional maturation of skeletal muscle, and (2) Sox6 binds to the DNA sequences in the vicinity of these genes, and thus is directly involved in the transcriptional suppression of its target genes. These results indicate that Sox6 plays a critical role in functional specification of muscle during development.
We have previously shown that in the Sox6 null fetal skeletal muscle, nascent fast muscles maintain slow MyHC-β expression . In addition to MyHC-β, other slow fiber specific genes (e.g. Tnnc1, Tnni1, Tnnt1, and Myl2) are also upregulated in the Sox6 null muscle, along with significant downregulation of multiple fast fiber specific genes [12,13]. Based on these results, we proposed that Sox6 functions as a suppressor of slow fiber specific genes, thus the loss of Sox6 leads to an increase in slow muscle fibers. Since Sox6 null mutations cause early postnatal lethality [26,27], we were unable to determine whether this Sox6 null fetal phenotype is maintained through postnatal development. To overcome the lethal phenotype, we utilized mice carrying a Sox6 conditional allele  to inactivate Sox6 specifically in skeletal muscle. To start assessing the phenotype of adult Sox6 KO muscle, we first used the Myf5-Cre mouse . In this Cre-transgenic mouse, the Cre recombinase under the control of the Myf5 promoter is expressed very early in the skeletal muscle lineage (starting at approximately E8 in somites); therefore, the inactivation of Sox6 occurs significantly earlier than the beginning of fiber type specification [4,5]. To conduct a comprehensive analysis of the Sox6 KO muscle phenotype, we examined four different muscles in the hindlimb, the tibialis anterior (TA, fast), extensor digitorum longus (EDL, fast), gastrocnemius (fast), and soleus (slow) . The mRNA expression of the following four genes: slow MyHC-β (Myh7), fast MyHC-IIb (Myh4), peroxisome proliferative activated receptor γ coactivator 1α (Ppargc1a), and succinate dehydrogenase complex subunit A (Sdha) were determined by reverse transcription-quantitative PCR (RT-qPCR) and compared between Sox6 KO (Sox6f/f; Myf5-Cre) and control (Sox6f/f) mice. 
As summarized in Table 1, Sox6 inactivation caused a significant increase in the mRNA expression of Myh7 and a concurrent decrease in Myh4 in the TA, EDL, and gastrocnemius muscles. The Sox6 KO soleus muscle showed the least change in expression of these two MyHC isoforms (Table 1). This result likely reflects the observation that Sox6 expression in soleus is significantly lower than in the three fast muscles (Additional file 1, Figure S1A); therefore, Sox6 inactivation may have had less impact in soleus than in the other muscles. Regarding the Sox6 inactivation levels in adult muscle, we noticed that a higher level of Sox6 inactivation, determined by Sox6 mRNA level, did not necessarily correlate with an increase in Myh7 mRNA level. There are a few possible hypotheses to explain this observation. First, Sox6 is not a muscle specific gene and is also expressed in fibroblasts, which can obscure an accurate quantification of Sox6 mRNA specific to muscle cells. Second, the Sox6 mutation is recessive in nature. Therefore, even if two independent Sox6 KO muscle samples both show a 50% reduction in Sox6 mRNA level, one sample may have more homozygous Sox6 null cells and the other may have more heterozygous cells, leading to a significant difference in Myh7 expression. Third, skeletal muscle is multinucleated, which adds another layer of complexity as to how Sox6 inactivation in each nucleus influences Myh7 expression in a myotube as a whole.
To sort out these issues, we performed immunohistochemistry to examine the Sox6 and Myh7 (MyHC-β) protein expression at the cellular level in fetal, early postnatal and adult muscle. We focused our observation on the TA-EDL region, composed of fast-twitch myofibers in the adult mouse. As shown in Figure 1A, in E18.5 control (Sox6f/f), nuclear Sox6 staining was well correlated with the absence of cytoplasmic MyHC-β staining. Also at P7, the presence of Sox6 nuclear staining corresponded to MyHC-β negative myotubes (Figure 1B). In Sox6 KO muscle (Sox6f/f; Myf5-Cre), at both stages, nearly 100% of myofibers displayed MyHC-β expression (Figures 1A and 1B). These data show that Sox6 expression does not coincide with slow-twitch fiber gene expression, supporting our idea that Sox6 functions as a suppressor of the slow-twitch fiber gene program. Therefore, at the protein level, the loss of Sox6 expression clearly leads to upregulation of MyHC-β during the early stages of muscle development. During normal mouse fast muscle development, the number of MyHC-β positive slow-twitch fibers significantly decreases as postnatal skeletal muscles functionally mature. We observed this trend in the developing control mouse muscles (Figures 1A and 1B), resulting in adult TA-EDL muscle with extremely rare MyHC-β positive myofibers (Figure 1C). In contrast to the control, at E18.5 and P7, nearly all Sox6 KO myofibers were MyHC-β positive (Figures 1A and 1B), indicating that at these early stages, muscle-specific Sox6 inactivation led to extensive upregulation of MyHC-β expression in the entire Sox6 KO muscle. In the adult Sox6 KO muscle, on the other hand, approximately 50% of myofibers were MyHC-β positive, a significant increase compared to the control (Figure 1C, Sox6f/f) but a significant decrease compared to the P7 Sox6 KO muscle (Figure 1B, Sox6f/f; Myf5-Cre).
When Sox6 staining signals in the control and Sox6 KO adult muscles were compared, overall Sox6 staining signals were lower in Sox6 KO; however, it was hard to make a clear correlation with MyHC-β staining, since Sox6 staining in adult muscle was quite diffuse (Figure 1C, Sox6f/f). In light of this, we noticed that in control P7 muscle, some MyHC-β-negative myofibers did not show nuclear Sox6 staining, but rather dispersed cytoplasmic Sox6 staining (Figure 1B, marked with * in Sox6f/f). This observation may suggest an unknown additional mechanism to relocate the Sox6 protein from the nucleus and/or degrade it in differentiated, more mature myotubes. A recent report on Six1/Six4 double KO muscle suggests that these two proteins positively regulate fast-twitch fiber differentiation and may also influence Sox6 nuclear localization during fetal muscle development (E18.5). In adult muscle, therefore, not only Sox6 expression but also other mechanisms, such as the Six1/Six4 regulated Sox6 shuttling, may be in place to finalize fiber type gene expression in response to environmental cues.
In addition to the muscle structural protein genes, we also examined mRNA expression of the genes playing a role in muscle metabolism, Ppargc1a (PGC-1α) and Sdha, in adult muscle. Ppargc1a is a co-regulator of mitochondrial biogenesis and oxidative phosphorylation [30,31] and Sdha is a component of the TCA cycle and complex II of the mitochondrial respiratory chain, whose expression is activated by Ppargc1a. We speculated that Ppargc1a and Sdha mRNA expression would also be upregulated in Sox6 KO muscle, because of a correlation between oxidative metabolism and slow fiber content reported in adult skeletal muscle [1,32,33]. In spite of this expectation, neither Ppargc1a nor Sdha mRNA showed a noticeable increase in the Sox6 KO muscles (Table 1). This lack of correlation of the two gene programs was also observed in Sox6 KO muscles generated using MCK-Cre transgenic mice (Table 2, discussed later in the text). In a recent report on the adult Sox6 KO muscle phenotype, Quiat et al. also reported that expression of Ppargc1a was not changed. Therefore, these results suggest that Sox6 plays a role in transcriptional regulation of the structural protein genes which define muscle fiber types, but not of the genes which define the metabolic state of skeletal muscle. In order to uncover the mechanisms of muscle differentiation that are regulated by Sox6 at the molecular level, we next performed Sox6 ChIP-seq analysis.
To identify genome-wide binding of Sox6 in mouse skeletal muscle, we performed ChIP-seq analysis. As the chromatin source, we chose wild type fetal (E18.5) myotubes differentiated for 48 hours in vitro, because at this time point, a significant differential expression of slow fiber specific genes was observed between Sox6 null and wild type myotubes, suggesting an ideal time point to capture Sox6 acting as a transcriptional suppressor of those genes. Also, since Sox6 is highly expressed in fibroblasts (unpublished data), using a pure muscle cell population was necessary to identify muscle-specific Sox6 binding. We conducted two independent ChIP-seq experiments and obtained 3 and 1.5 million reads unambiguously mapped to the mouse genome for each experiment (out of ~20 million total reads). As a result, we identified 1,066 Sox6 peaks common to the two ChIP-seq data sets. These peaks were assigned to a total of 867 mouse RefSeq genes. The largest share of the Sox6 binding sites was located in intronic regions (48.4%), followed by intergenic regions (more than 20 kb away from the transcription start site (TSS) or transcript end) (29.2%) and the 5'-upstream region (within 20 kb of the TSS, including the promoter) (13.6%) (Figure 2).
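The location categories described above can be illustrated with a minimal sketch. This is not the pipeline used in the study: the `Gene` record, the `classify_peak` function, and the "exonic" and "3'-downstream" labels are hypothetical names introduced here for illustration, and the sketch assumes each peak is represented by a single summit coordinate and compared against one gene. Only the 20 kb window and the intronic, intergenic, and 5'-upstream definitions come from the text.

```python
from dataclasses import dataclass
from typing import List, Tuple

WINDOW = 20_000  # 20 kb window around the gene, as stated in the text


@dataclass
class Gene:
    """Hypothetical minimal gene record (coordinates on one chromosome)."""
    start: int                     # leftmost transcript coordinate
    end: int                       # rightmost transcript coordinate
    strand: str                    # "+" or "-"
    exons: List[Tuple[int, int]]   # inclusive (start, end) pairs


def classify_peak(pos: int, gene: Gene) -> str:
    """Assign a peak summit to a location category relative to one gene."""
    # Inside the transcript: exonic if it overlaps an exon, otherwise intronic.
    if gene.start <= pos <= gene.end:
        in_exon = any(s <= pos <= e for s, e in gene.exons)
        return "exonic" if in_exon else "intronic"

    # Outside the transcript: distance to the nearest transcript boundary.
    dist = min(abs(pos - gene.start), abs(pos - gene.end))
    if dist > WINDOW:
        return "intergenic"

    # Upstream means before the TSS, which depends on the strand.
    upstream = pos < gene.start if gene.strand == "+" else pos > gene.end
    return "5'-upstream" if upstream else "3'-downstream"
```

In a genome-wide setting, one would first pick the nearest RefSeq gene for each peak and then apply a rule of this kind; how ties and overlapping genes were handled is not stated in the text.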
To determine whether any known transcription factor consensus sequences are over-represented within the Sox6 peak regions, a motif search was performed. Motif analysis using MEME (Multiple EM for Motif Elicitation) identified four known transcription factor consensus motifs in the Sox6 peaks (Figure 3). When the occurrence of a single motif was set to 0 or 1 per peak, 723 Sox motifs (P < 10⁻⁴) and 636 E-box motifs (P < 10⁻⁴) were identified. The fact that the Sox consensus motifs were found in the majority of the Sox6 peaks (723 out of 1,066) suggests that the Sox6 binding sites identified here are bona fide Sox6 targets. The E-box motifs (CAG[C/G]TG) identified using the in silico method here were identical to the E-box motifs which were enriched in MyoD binding sites detected using C2C12 myotubes. Comparing our data with the MyoD ChIP-seq data obtained from adult mouse primary myotubes revealed that 96% of the Sox6 peaks were localized within 50 bp of the MyoD peaks (data not shown).
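As a rough illustration of the "0 or 1 occurrence per peak" counting, the E-box consensus CAG[C/G]TG can be matched with a regular expression. This is a simplified sketch and not the MEME procedure, which fits a probabilistic motif model and reports P values. The function names are hypothetical, and scanning both strands is an assumption made here.

```python
import re
from typing import Iterable

# E-box consensus from the text: CAG[C/G]TG
EBOX = re.compile(r"CAG[CG]TG")
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def has_ebox(seq: str) -> bool:
    """True if the consensus occurs on either strand (assumption: both strands scanned)."""
    seq = seq.upper()
    return bool(EBOX.search(seq) or EBOX.search(revcomp(seq)))


def count_peaks_with_ebox(peak_seqs: Iterable[str]) -> int:
    """Count peaks with at least one match; each peak contributes 0 or 1."""
    return sum(has_ebox(s) for s in peak_seqs)
```

For example, `count_peaks_with_ebox(["ACAGGTGT", "AAAA", "CAGCTGCAGCTG"])` counts the third sequence once even though it contains the motif twice.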
In addition to the Sox motif and E-box, Runx and Tead/MCAT motifs were also found in the Sox6 peaks. When the occurrence of a single motif was set to 1, we identified 559 Runx motifs (P < 10⁻⁴) and 203 Tead/MCAT motifs (P < 10⁻⁴). A recent report has shown that Runx1 has a role in skeletal muscle terminal differentiation; therefore, Runx transcription factors might be involved in muscle specific gene expression together with Sox6. Tead/MCAT elements are known to play an important role in transcriptional regulation of many skeletal and cardiac muscle-specific genes. A significant presence of Tead/MCAT motifs in the Sox6 peaks, therefore, implies possible interactions between Sox6 and the Tead transcription factors during muscle differentiation.
In order to determine the transcriptional status of Sox6 peak-associated genes in differentiating fetal myotubes, we performed ChIP-seq analysis using an antibody recognizing a phosphorylated form of Pol II, which is considered to be a transcriptionally active form and associated with highly transcribed genes. To quantify Pol II binding levels of RefSeq genes associated with Sox6 binding sites, Pol II binding events in the corresponding gene regions were measured in RPKM (reads per kilobase of gene region per million reads), a unit used to quantify transcriptional levels in RNA-seq analysis. RPKM was calculated from read (tag) numbers in peak regions, length of RefSeq gene regions, and total number of uniquely mapped reads (details in Methods). By this method, the Pol II binding level of the β-actin gene, an abundantly expressed housekeeping gene, was calculated as 8.60 RPKM. Figure 4 summarizes the fold enrichment of the Sox6 peaks and the corresponding Pol II binding of the 867 RefSeq genes associated with Sox6 peaks. We found that the majority of the Sox6 binding site(s)-associated genes were inferred to be transcriptionally inactive (zero to a very low level of Pol II binding). As shown in Figure 4 and Additional file 2, Table S1, of the 867 genes associated with Sox6 binding sites, 442 genes (51%) showed no Pol II binding (0 RPKM) and 289 genes (33%) showed less than one tenth of the Pol II binding to the β-actin gene (<0.86 RPKM); thus 84% of the genes associated with Sox6 binding sites are considered to be transcriptionally inactive or transcribed at a very low level in myotubes. These data strongly suggest that the binding of Sox6 to its targets mostly results in transcriptional suppression. The rest of the Sox6 peak associated genes were transcribed mostly at a range of low to moderate levels (less than half of the Pol II binding to the β-actin gene).
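The RPKM unit and the thresholds quoted above can be made concrete with a short sketch. It assumes the standard RPKM definition (reads, scaled per kilobase of gene region and per million uniquely mapped reads); the exact procedure is in the Methods and is not reproduced here. The function names and category labels are hypothetical, while the β-actin reference of 8.60 RPKM and the gene counts (442, 289, 867) are taken from the text.

```python
BETA_ACTIN_RPKM = 8.60  # reference Pol II binding level reported in the text


def rpkm(reads: int, region_length_bp: int, total_mapped_reads: int) -> float:
    """Standard RPKM: reads per kilobase of region per million mapped reads."""
    return reads * 1e9 / (region_length_bp * total_mapped_reads)


def pol2_category(value: float, reference: float = BETA_ACTIN_RPKM) -> str:
    """Bin a gene by Pol II binding relative to the beta-actin reference."""
    if value == 0:
        return "none"
    if value < reference / 10:   # below one tenth of beta-actin (<0.86 RPKM)
        return "very low"
    if value < reference / 2:    # below half of beta-actin (<4.30 RPKM)
        return "low to moderate"
    return "high"


# Arithmetic behind the percentages quoted in the text.
total_genes = 867
no_binding = 442
very_low = 289
pct_none = round(100 * no_binding / total_genes)                    # 51
pct_very_low = round(100 * very_low / total_genes)                  # 33
pct_inactive = round(100 * (no_binding + very_low) / total_genes)   # 84
```

With these bins, Tnnc1 (4.72 RPKM) falls in the "high" category and Tnni1 (3.52 RPKM) in "low to moderate".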
There were, however, a small number of Sox6 peak-associated genes that exhibited a high level of Pol II binding. For example, Myl4 (embryonic MyLC isoform), Tnnc1, and Myh3 (embryonic MyHC isoform) showed a relatively high level of Pol II binding (>4.30 RPKM). In the case of Tnnc1, one of the two Sox6 peaks was identified in the first intron (Additional file 3, Figure S2D), where a muscle enhancer element was reported . Therefore, an unidentified enhancer element may exist in the vicinity of the Sox6 binding sites in Myl4 and Myh3.
Gene Ontology (GO) analysis revealed that the Sox6 peak-associated genes showed the highest enrichment for the GO categories relevant to muscle cytoskeleton and myofibril establishment (Table 3). Many of these genes encode muscle sarcomeric proteins which define fiber types, cardiac isoforms, and developmental isoforms in muscle. For instance, Myh1 (fast MyHC-IIx/d), Myh2 (fast MyHC-IIa), Myh6 (cardiac isoform, MyHC-α), Myh7 (slow MyHC-β), Myh7b (myosin, heavy chain 7B, cardiac muscle, beta), Tnnc1 (troponin C, cardiac/slow skeletal), and Tnni1 (troponin I, skeletal, slow 1) were represented. The profiles of Sox6 binding and Pol II binding for these genes are summarized in Additional file 3, Figure S2A-E. Except for Tnnc1 (4.72 RPKM) and Tnni1 (3.52 RPKM), Pol II binding levels of these genes were very low (Additional file 3, Figure S2A-E). It should be noted that Sox6 peaks were not detected for Myh4 (Additional file 3, Figure S2A), which encodes the fastest adult myosin isoform MyHC-IIb [2,3], nor for Myh8 (data not shown), which encodes the perinatal fast MyHC isoform. This suggests that Sox6 is not directly involved in transcriptional regulation of the fastest MyHC isoforms expressed in fetal or adult skeletal muscle.
Another noticeable GO term category enriched in the genes associated with Sox6 peaks involved regulation of transcription (Table 3). For instance, Sox6 peaks were found in the vicinity or in the gene region of transcriptional regulators including (but not limited to) Prox1, Sox6, Tead1, Tead4, Tcf4, Hdac9, Hdac11, and Nfatc3 (Additional file 3, Figure S2F-M). These genes (except for Hdac11) are known to play a role in not only skeletal muscle development, but also heart development [12,13,38,42-49]. In spite of its high expression in skeletal muscle, the role of the class IV histone deacetylase Hdac11 in muscle development is yet to be discovered.
Prox1 encodes a transcription factor expressed in slow muscle in zebrafish . Though its role in mammalian skeletal muscle development is yet to be reported, we hypothesize that the Prox1 protein also plays a role in slow muscle fiber differentiation in mice. To support this, we have found that Prox1 mRNA is preferentially expressed in the slow soleus muscle compared to the EDL, TA, and gastrocnemius muscles in adult (Additional file 1, Figure S1B). Therefore, the Prox1 protein may play a role in slow fiber differentiation during muscle development as well as maintenance of slow muscle in adult. In the Sox6 gene region, two Sox6 peaks were detected in the fifth intron (Additional file 3, Figure S2G). Existence of Sox6 binding sites and very low levels of Pol II binding in the Sox6 gene region may suggest a self-regulatory mechanism of Sox6 transcription during skeletal muscle development, as has been recently reported for erythrocyte development .
We also examined whether Tead1, Tead4, Tcf4, Hdac9 and Hdac11 are differentially expressed between slow and fast muscles. We found that Tead1, Tead4, Tcf4 and Hdac9 were all expressed higher in the slow soleus muscle than the group of fast muscles, EDL, TA, and gastrocnemius (Additional file 1, Figure S1C-F). Hdac11, on the other hand, was expressed slightly higher in the fast muscles than soleus (Additional file 1, Figure S1G). These results suggest that Tead1, Tead4, Tcf4 and Hdac9 may also positively regulate slow fiber specific genes. The association of Sox6 peaks to these transcriptional regulatory genes suggests that Sox6 may be indirectly regulating muscle development through these key transcription regulators.
Sox6 binding to the genes described above was validated by ChIP-qPCR (Additional file 4, Figure S3).
The observation that the majority (84%) of the genes associated with Sox6 binding sites show little or no Pol II binding (Figure 4) supports our hypothesis that a major function of Sox6 during myogenesis is transcriptional suppression. To further evaluate this hypothesis, we next analyzed mRNA expression of selected genes associated with Sox6 binding in Sox6 KO muscle. For this, we used MCK-Cre mice (harboring the Cre gene under the control of the muscle creatine kinase promoter) to assess the effect of Sox6 inactivation in skeletal muscle as well as in cardiac muscle [53,54], since many of the putative Sox6 target genes are also expressed in the heart.
First, mRNA levels of eighteen genes (eight fiber type-specific genes, two cardiac isoform genes, one developmental isoform gene, five transcription factor genes, and two histone modification enzyme genes) were compared between control and Sox6 KO mice using newborn skeletal muscle (Figure 5). Sixteen of the eighteen genes tested showed a significant increase in mRNA expression in the newborn Sox6 KO skeletal muscle (Figure 5). Nfatc3 and Hdac11 showed a tendency to be increased in Sox6 KO muscle, even though the difference was not statistically significant (Figure 5). These results indicate that Sox6 functions as a suppressor for these genes in developing muscle.
Next, we assessed mRNA expression of fifteen genes out of the eighteen tested above (Myl4, Nfatc3, and Hdac11 were excluded) as well as fast fiber specific genes, myogenic regulatory factors, and metabolism related genes in adult Sox6 KO muscle (Table 2). The slow fiber specific sarcomeric protein genes which had shown increased expression in Sox6 KO newborn muscle (Myh7, Myl2, Tnnc1, Tnni1, and Tnnt1) displayed an even greater fold increase in mRNA expression in adult Sox6 KO muscles compared to control (Table 2). Among the four muscle groups tested, the TA and EDL Sox6 KO muscles showed the most dramatic increase in slow fiber specific gene expression, with the soleus exhibiting the least fold increase (Table 2), again likely reflecting the lower Sox6 expression in the soleus than in the fast muscles TA, EDL, and gastrocnemius (Additional file 1, Figure S1A). The fast fiber specific genes (Myh4, Tnni2, and Tnnt3) exhibited a significant decrease in their mRNA expression in Sox6 KO muscles (Table 2). Myh1 (IIx/d) and Myh2 (IIa) were either increased or decreased in different Sox6 KO muscle groups (Table 2), which may reflect the fluid nature of MyHC-IIa and MyHC-IIx/d expression in adult skeletal muscle. These two MyHC isoforms are intermediates between MyHC-β and MyHC-IIb when fiber type shift occurs in skeletal muscle. Therefore, they could be more sensitive to the timing and level of the Sox6 gene inactivation, leading to varied expression in the individual Sox6 KO muscles. Upregulation of the cardiac isoform genes, Myh6 and Tnnt2, was also observed in the adult Sox6 KO muscle (Table 2).
The significant upregulation in the slow fiber and cardiac isoform gene expression in adult Sox6 KO skeletal muscle suggests that inactivation of the Sox6 gene early in myogenic development inhibited the postnatal maturation of the skeletal muscle. Postnatal development of skeletal muscle is characterized by the progressive decline of slow fiber specific gene expression in fast muscles [6,7,55]. As a result, control EDL and TA muscles express only a trace amount of the MyHC-β protein [6,56,57]. The extreme upregulation of the slow fiber specific genes such as Myh7, Tnnc1, and Tnnt1 in the Sox6 KO fast muscles may reflect their suspended postnatal maturation. This delayed maturation hypothesis is supported by the observation that the embryonic isoform acetylcholine receptor (Ach-R) γ (Chrng) is expressed at a higher level than the adult isoform Ach-R ε (Chrne) in Sox6 KO muscles (Table 2). During postnatal maturation of skeletal muscle, Ach-R γ is replaced by the adult isoform Ach-R ε. In the adult Sox6 KO muscles, silencing of Chrng was not seen and Chrne expression did not reach the control level (Table 2). Since we have located one Sox6 peak in the Chrng promoter region (approximately 135 bp upstream of the TSS), Sox6 may be directly suppressing transcription of Chrng during normal skeletal muscle development.
In addition to the sarcomeric protein genes, mRNA levels of some of the transcriptional regulatory genes associated with Sox6 peaks were upregulated in the adult Sox6 KO skeletal muscles. Prox1 expression was significantly increased in Sox6 KO muscles, with the highest fold increase in the TA and EDL, followed by the gastrocnemius (Table 2). It should be noted that Prox1 expression is highest in the soleus in the adult control (Sox6f/f) muscles (Additional file 1, Figure S1B). These observations suggest that Prox1 may play a role in sustaining slow fiber specific gene expression in adult muscle. Tead4 and Hdac9 also showed a slight increase in their expression in the Sox6 KO TA, EDL, and gastrocnemius muscles (Table 2). Expression of Tcf4 and Tead1, on the other hand, showed no clear difference between Sox6 KO and control adult muscles (Table 2), in spite of their higher expression in the Sox6 KO newborn muscle (Figure 5). This result suggests that Sox6 may regulate transcription of Tcf4 and Tead1 in developing muscle, but this regulation may not be maintained into adulthood.
Since it has previously been reported that MyoD and Myogenin are differentially expressed between slow and fast muscles (MyoD higher in fast than slow; Myogenin higher in slow than fast) [59,60], we also examined mRNA expression of these genes in Sox6 KO muscle. As shown in Table 2, there was no discernible change in MyoD mRNA expression, whereas there was a small increase in Myogenin mRNA expression in Sox6 KO muscles. An increase in Myogenin expression in Sox6 KO muscle suggests that Myogenin may play some role in maintaining the slow fiber phenotype in adult skeletal muscle, as previously proposed.
Since a close coupling between the slow fiber gene program and the oxidative metabolism gene program in adult skeletal muscle has been reported [1,3], we also examined mRNA expression of the genes whose high expression is correlated with the oxidative state of skeletal muscle metabolism in Sox6 KO muscle (myoglobin, Sdha and Ppargc1a). In MCK-Cre induced Sox6 KO muscle, mRNA levels of Ppargc1a and Sdha were, in general, lower than control (Table 2). These results replicated the data obtained using Myf5-Cre induced Sox6 KO muscle (Table 1). Myoglobin expression showed a slight increase in the Sox6 KO TA, EDL, and gastrocnemius (Table 2). When the color of gastrocnemius and soleus muscles was visually inspected, the characteristic color difference between the two muscles in control mice (soleus being redder than gastrocnemius) was less clear in Sox6 KO mice, because the Sox6 KO gastrocnemius was redder (Figure 6). This may reflect a small but consistent increase in myoglobin expression in the Sox6 KO gastrocnemius muscle (Table 2). Redder Sox6 KO muscle has also been reported by Quiat et al. Interestingly, both Quiat et al. and our current report observed reduced expression of Ppargc1a in Sox6 KO muscle, which may suggest that there could be a pathway independent of Ppargc1a regulating myoglobin expression in Sox6 KO muscle. An alternative explanation for the increased redness in the Sox6 KO muscle could be a change in capillary density. In the Sox6 peak associated RefSeq genes, GO terms related to vasculature development and angiogenesis were also enriched (P < 2 × 10⁻³) (Additional file 5, Table S2). Thus, increased capillary density could be the cause of the redder color of the Sox6 KO gastrocnemius.
Since the two cardiac MyHC isoform genes, α and β (Myh6 and Myh7), were associated with Sox6 binding (Additional file 3, Figure S2B) and their expression was upregulated in Sox6 KO skeletal muscle (Figure 5, Table 2), we next examined their expression in the Sox6 KO heart. In the mouse heart, expression of MyHC-α and MyHC-β is developmentally regulated. MyHC-β is the fetal isoform in the heart and is replaced by the adult isoform MyHC-α within the first week after birth. As summarized in Table 4, it appears that this isoform transition, fetal to adult, is incomplete in the Sox6 KO myocardium. In the Sox6 KO heart, MyHC-β expression was sustained at a level equal to or slightly higher than that of the control heart, whereas MyHC-α expression decreased to approximately half of the control level (Table 4). To test if this is a developmental defect in the postnatal heart, we also examined expression of the developmentally regulated skeletal α-actin gene, which is expressed in the fetal heart and silenced later in the adult. Indeed, skeletal α-actin mRNA expression was consistently higher in the Sox6 KO heart (Table 4), suggesting that the Sox6 KO heart is developmentally more immature than the control heart. Interestingly, the expression of Ppargc1 was also lower in the Sox6 KO heart (Table 4). Since Ppargc1 plays an important role in maturation of the metabolic state and mitochondrial biogenesis in the postnatal heart [63-65], this result suggests that the loss of Sox6 caused a delay in the postnatal maturation of the heart; thus Sox6 may also be necessary for the functional maturation of cardiac muscle.
It has been reported that Nfatc3 stimulates myogenic differentiation both in vivo and in vitro [43,45]; however, its implication in muscle fiber type specification has not been noted. Calcineurin-directed dephosphorylation of NFAT factors results in their nuclear localization and transcriptional activation of their target genes. We have located one Sox6 peak in the last intron of Nfatc3 (Additional file 3, Figure S2J). As shown in Figure 5, Sox6 KO newborn skeletal muscle showed a small increase (not statistically significant) in Nfatc3 mRNA expression. To assess whether Nfatc3 activity increases in Sox6 null myotubes, we examined the subcellular localization of the Nfatc3 protein by Western blot. We took advantage of the Sox6 null mouse (p100H-Sox6 null mutant allele) in our laboratory to obtain a pure population of Sox6 null myotubes [12,13,26]. Fetal myoblasts were prepared from E18.5 p100H-Sox6 null and wild type littermates and were differentiated in differentiation medium (DM). In undifferentiated myoblast cultures, the amounts of nuclear and cytoplasmic Nfatc3 protein were comparable between Sox6 null and wild type (Figure 7). Once myotube differentiation was induced, in wild type cultures, the Nfatc3 protein was detected only in the nuclear fraction, whereas in the p100H cultures, a continuous presence of the cytoplasmic Nfatc3 protein and a higher level of the nuclear Nfatc3 protein (compared to wild type) were observed (Figure 7). We have previously reported that Sox6 expression is significantly increased upon induction of myotube differentiation. Therefore, these results suggest that a higher level of Sox6 expression in wild type myotubes likely suppressed new synthesis of Nfatc3, while the absence of Sox6 in p100H myotubes allowed continuous Nfatc3 synthesis. They also suggest that Nfatc3 activity is upregulated in the Sox6 null myotubes, which show a higher level of slow fiber specific gene expression.
In order to characterize the functional nature of the Sox6 binding sites in transcriptional regulation, we next performed reporter gene assays. We chose five Sox6 peak-associated genes, Myh7 (MyHC-β), Myh7b, Tnnc1, Tnni1, and Hdac11, in which Sox6 binding was validated by ChIP-qPCR (Additional file 4, Figure S3). All of the Sox6 peaks tested contained a Sox consensus motif. Firefly luciferase vectors containing each of the following sequences were generated: ~3.5 kb Myh7 5'-upstream sequence (two Sox6 peaks; MHCβ3500), ~6 kb Myh7b 5'-upstream sequence (one Sox6 peak), ~1.3 kb of the Tnnc1 first intron (one Sox6 peak), ~5.2 kb Tnni1 5'-upstream region (two Sox6 peaks), and ~1 kb Hdac11 5'-upstream sequence (one Sox6 peak) (Figure 8A; see Additional file 3, Figure S2B-E and S2M for the location of Sox6 peaks). It should be noted that the proximal Sox6 peak in the Tnni1 5'-upstream region (approximately -800 bp from TSS) overlapped with the previously reported slow upstream regulatory element (SURE) containing an enhancer element.
To assess whether these Sox6 binding sites function as negative or positive regulatory elements, the luciferase reporter gene constructs described above were transiently transfected into p100H-Sox6 null and wild type myoblasts, which were then differentiated in DM for 48 hours, after which firefly luciferase activities were compared between the Sox6 null and wild type myotube cultures. If these Sox6 binding sequences function as negative regulatory regions, the luciferase activity is expected to be higher in p100H myotube cultures, in which no functional Sox6 protein is produced. As summarized in Figure 8B, four out of the five sequences tested drove a higher firefly luciferase activity in Sox6 null myotube cultures compared to wild type, indicating that these Sox6 binding sites function as negative regulatory sequences. The Myh7b 5'-sequence did not drive a statistically higher luciferase activity in Sox6 null myotube cultures (Figure 8B). Since the endogenous Myh7b expression was higher in Sox6 KO muscle (Figure 5), it is possible that in vitro culture may not be the best approach to assess the effect of the Myh7b Sox6 binding regions. Intriguingly, Bell et al. have shown that Sox6 protein overexpression in C2C12 cells could suppress transcription from the 1 kb Myh7b 5'-upstream sequence. Therefore, there could be a Sox6 binding site not detected in our ChIP-seq analysis which may still be functioning as a negative regulatory element in a different context.
We have previously shown that the proximal Sox6 binding site (-200 bp from the Myh7 TSS) functions as a negative regulatory element in reporter gene assays. In the present report, we have identified an additional distal Sox6 binding site (-2,900 bp from TSS) which overlaps with a known muscle enhancer element [69,70]. To distinguish the contributions of the two Sox6 binding sites in the Myh7 5'-upstream region (see Additional file 3, Figure S2B for the peak locations), the distal Sox consensus sequence (-2,900 bp) was mutated in MHCβ3500 (designated as MHCβ3500 m) (Figure 8A). As shown in Figure 8B, the loss of the distal Sox motif did not affect the luciferase activity in either Sox6 null or wild type myotubes. This result suggests that the distal Sox motif has little effect on transcriptional suppression from the 3.5 kb Myh7 5'-region in transient assays, and therefore, at least in the current in vitro assay conditions, the proximal Sox6 binding site is sufficient to suppress the transcription driven by the 3.5 kb Myh7 5'-upstream region.
The function of the Sox6 binding site in the Tnnc1 first intron was determined using a hybrid luciferase reporter construct whose transcription is driven by the chicken β-actin promoter. The Tnnc1 first intron contains an enhancer element which was previously identified using C2C12 and Sol8 skeletal muscle cell lines. The presence of this intron alone significantly increased luciferase activity in wild type myotubes (Actb-p vs. Actb-p+Tnnc1, p < 0.0001), confirming the enhancer activity (Figure 8B). The luciferase activity of the construct, Actb-p+Tnnc1, in Sox6 null myotubes was significantly higher than wild type, indicating that Sox6 binding hindered the enhancer activity in this intron (Figure 8B). Unexpectedly, the construct containing only the chicken β-actin promoter exhibited a small but statistically significant increase in luciferase activity in Sox6 null myotube cultures compared to wild type (Figure 8B). This was likely caused by the fortuitous presence of a couple of Sox motif sequences in the chicken β-actin promoter and intron sequences in the vector (data not shown), which could have functioned as a weak silencer element. The 5'-upstream sequences of both Tnni1 and Hdac11 showed a moderate but statistically significant increase in luciferase activity in Sox6 null myotubes (Figure 8).
In order to understand how Sox6 regulates muscle differentiation at the molecular level, we have performed ChIP-seq analysis to identify Sox6 targets in skeletal myotubes and extended the characterization of the Sox6 null muscle phenotype using muscle specific Sox6 inactivation. Among the 867 RefSeq genes found to be associated with Sox6 peaks, the overrepresented GO terms included muscle structure and function, skeletal muscle and heart development, as well as transcriptional regulation. In a concurrently conducted Pol II ChIP-seq analysis, we found that the majority of the Sox6 peak-associated genes exhibited few or no recognizable Pol II binding peaks, suggesting that Sox6 mainly functions as a transcriptional suppressor in developing muscle.
How does Sox6 suppress its target genes? Based on evidence from this and other labs, we can speculate on two possible mechanisms (1 and 2) and, based on evidence accumulated in this report we also demonstrate two other likely mechanisms (3 and 4): (1) Sox6 may fine-tune the transcription of the genes that have been marked by MyoD binding, (2) Sox6 may modulate transcription of its target genes in concert with Tead and Runx factors, (3) Sox6 suppresses transcription by hindering the muscle-specific enhancer activity, and (4) Sox6 also indirectly influences downstream gene expression by regulating the expression of other transcription factors and chromatin modifying enzymes. Below, we will discuss each of these proposed mechanisms in more detail.
MyoD is one of the myogenic regulatory factors and defines the myogenic lineage during development [71,72]. In myotubes, MyoD binding events are frequent (~26,000 peaks with a higher cut off, ~60,000 peaks with a lower cut off) and are associated with histone H4 acetylation (H4Ac), which is a marker of an active chromatin state. We found that 96% of the Sox6 peaks in fetal myotubes overlapped with, or were in close vicinity to (within 50 bp), the reported MyoD peaks. The E-box motifs we found in the Sox6 peak regions were enriched for the CAGCTG E-box sequence (Figure 3). Previously, it has been shown that this motif is represented in the peaks more strongly bound in C2C12 myotubes compared to myoblasts, indicating that this E-box motif is mainly associated with the genes regulating muscle differentiation. Taking this observation together with ours, we speculate that MyoD binding in the myotube would change the chromatin environment, by recruiting chromatin modifying enzymes, in such a way as to allow the approach of additional transcriptional regulators, thus allowing the fine-tuning of muscle specific gene expression necessary for the formation of mature skeletal muscle. Sox6 could be one of these additional transcriptional regulators and specify fiber type characteristics during muscle terminal differentiation.
We have previously reported that Sox6 interferes with an MCAT enhancer located in close proximity to the Sox consensus motif in Myh7, causing suppression of Myh7 transcription. Tead/MCAT motifs are frequently found in enhancer or promoter regions of muscle specific genes and it has been demonstrated that binding of TEF-1/Tead1 to the MCAT motifs activates transcription of these muscle-specific genes [38,75]. In our analysis of the 1,066 Sox6 peaks, we found 203 MCAT motifs. This suggests that the mechanism of Myh7 transcriptional suppression by Sox6 (possibly via physical interference) we reported earlier may be a common mechanism Sox6 uses to suppress genes whose transcription is activated via Tead/MCAT motifs. Our analysis also revealed 559 Runx motifs in the 1,066 Sox6 peaks. Currently, the roles of Runx motif binding factors (Runx1, Runx2, and Runx3) in muscle development are not well known, though there are reports showing that Runx1 plays a role in skeletal muscle differentiation [37,76,77]. In adult skeletal muscle, Runx1 expression is induced by denervation, and muscle-specific Runx1 inactivation leads to accelerated muscle wasting in denervated muscle. In an earlier stage of muscle differentiation, it has been reported that Runx1 directly interacts with MyoD preferentially in proliferating myoblasts to inhibit terminal differentiation of skeletal muscle. The authors showed that the Runx1/CBFβ complex recruits suppressive chromatin modifying enzymes (e.g. HDACs), thus inactivating transcription of the MyoD target genes that are necessary for cell cycle exit and differentiation. Since the Runx proteins have been shown to function as transcriptional suppressors or activators in different circumstances (similar to Sox6), the transcriptional outcome of the possible interaction between the Sox6 and Runx proteins needs further investigation.
As demonstrated in the Results section, the Sox6 binding sites in the Tnnc1 first intron and the Tnni1 5'-upstream region both effectively reduced the activity of the enhancer elements (Figure 8). The molecular mechanisms by which Sox6 overrides muscle enhancers are currently under investigation; however, the skeletal muscle MyHC gene clusters may help shed light on this role of Sox6. Among the six MyHC isoform genes clustered on mouse chromosome 11 [Myh3 (emb), Myh2 (IIa), Myh1 (IIx/d), Myh4 (IIb), Myh8 (peri), Myh13 (eo)], only the Myh4 and Myh8 genes were not associated with Sox6 peaks (a Sox6 peak was detected in the 5'-upstream region of Myh13 in one of the two ChIP-seq data sets; data not shown). Therefore, Sox6 may be involved in sequential expression of the MyHC loci, possibly in collaboration with an enhancer element similar to the locus control region (LCR) reported for the globin gene cluster. This is an appealing hypothesis, because it has been shown that Sox6 (acting as a transcriptional suppressor) regulates sequential expression of the β-globin genes during erythropoiesis in concert with BCL11A, which binds to the globin gene LCR. There have been reports on transcription factories that unite transcriptionally active genes on separate chromosome regions for coordinated transcription. It is possible that association of Sox6 with its target sequences inhibits transcriptional initiation by Pol II, thus causing dissociation of Sox6 target genes from transcription factories.
We demonstrated that expression of Tead1, Tead4, Hdac9, and Prox1 was upregulated in Sox6 KO skeletal muscle (Figure 5), suggesting that Sox6 is a suppressor of these transcriptional regulatory genes. Tead1 (TEF-1) and Tead4 (RTEF-1) are highly expressed in muscle tissues and have been reported to activate muscle specific gene transcription [83-85]. Hdac9 is a class IIa HDAC and functions as a mediator of motor neuron input to skeletal muscle. Prox1 is expressed in slow muscle in zebrafish. Since Prox1 is preferentially expressed in slow fiber muscle in control mice (Additional file 1, Figure S1B) and Sox6 inactivation caused a sizable increase in Prox1 mRNA expression in Sox6 KO muscle, we propose that Prox1 also plays a role in regulation of slow muscle fiber specific gene expression in mice. This observation presents further evidence of evolutionary conservation in the mechanisms regulating muscle fiber type differentiation in vertebrates [19,88]. Since there are more transcriptional regulator genes that are closely associated with Sox6 peaks, which we did not have space to discuss in this report, it is likely that Sox6 is part of the transcriptional networks that shape the characteristics of both muscle development and mature muscle functions.
The most striking phenotype of Sox6 null skeletal muscle is the dramatic increase in the expression of multiple slow fiber specific genes. This observation originally led us to hypothesize that Sox6 functions as a transcriptional suppressor of slow fiber specific genes [12,13]. In this report, we expanded the gene expression profiling of Sox6 KO skeletal muscle by including cardiac and embryonic muscle isoform genes. Cardiac isoforms Myh6 and Tnnt2, as well as embryonic isoforms Myl4 and Chrng, were upregulated in the Sox6 KO muscle (Figure 5, Table 2). It has been reported that Tnnt2 is upregulated in regenerating dystrophic muscle. Myh6 is expressed in specialized craniofacial muscle, such as jaw and extraocular muscle, but not in limb or other body muscle [90,91]. These observations suggest that Sox6 may play a role in not only determining fiber types, but also defining developmental maturity and highly specialized functions of skeletal muscle.
In Sox6 KO muscle, a significant decrease in fast fiber specific gene expression was also observed. This Sox6 KO phenotype could be a secondary effect of the increased slow fiber gene products, or could be regulated indirectly by Sox6. Since we did not find Sox6 peaks associated with fast fiber specific genes, both mechanisms are equally plausible. With regard to indirect regulation, a few possible mechanisms can be hypothesized. For example, expression of the transcription factors Six1 and Six4, activators of fast fiber specific gene expression [29,92], could be indirectly suppressed in Sox6 KO muscle during development. Alternatively, downregulation of fast fiber specific genes in Sox6 KO muscle could be caused by changes in microRNA expression. MicroRNAs are known to function as posttranscriptional regulators of gene expression. A recent report indicates that microRNAs suppress target gene expression predominantly through mRNA degradation; thus, it is plausible that an increase in microRNAs targeting fast fiber specific genes in Sox6 KO muscle leads to reduced fast fiber specific gene mRNA levels. As described above, we found Sox6 binding peaks associated with Myh6 and Myh7 (Additional file 3, Figure S2B). In the intron sequences of Myh6 and Myh7, miR-208a and miR-208b are encoded, respectively. It has been reported that miR-208 suppresses expression of THRAP1, which promotes fast fiber specific gene expression. The increased transcription of Myh6 and Myh7 in Sox6 KO muscle, therefore, could lead to upregulation of miR-208, which, in turn, suppresses fast fiber specific gene expression. However, the actual situation is likely to be more complex. It should be noted that miR-208, along with miR-499, also targets the 3'-UTR region of Sox6 [68,97,98]. MiR-499 is encoded in the intron of Myh7b [95,99], which has a Sox6 binding site in its 5'-upstream region (Additional file 3, Figure S2C). Since Myh6, Myh7, and Myh7b are all negatively regulated by Sox6 (Figure 5), these data suggest that Sox6 and these miRNAs constitute two-way feedback loops.
Figure 9 summarizes both our current results and the reported regulatory mechanisms for Sox6 expression. A recent report on the regulation of Sox6 expression in zebrafish skeletal muscle has demonstrated that Sox6 transcription is positively regulated by MyoD and Myf5, and repression of Sox6 activity in slow fibers is maintained by miR-499, which targets the Sox6 3'-UTR. We have reported that Sox6 transcription is upregulated when myotube differentiation is induced; therefore, MyoD and Myf5 might also be activating Sox6 transcription during mammalian muscle development. Since MyoD is preferentially expressed in fast fibers in adult mice [59,60], it may sustain the higher level of Sox6 expression in adult fast fiber muscles reported here as well as by Quiat et al. Although the negative regulation of Sox6 by miR-499 has already been reported in mice [68,97,98], how suppression of Sox6 expression in slow fibers is initiated is not yet understood. Alternatively, it is also possible that Sox6 expression is activated when fast-twitch myotubes emerge during fetal muscle development. Since fiber type-specific gene expression in mammalian skeletal muscle during development as well as in adult life is very fluid [2-4], how Sox6 expression is regulated will be an increasingly important question as we try to understand how muscle fiber type is initially specified, maintained and changed in response to external signaling.
We have shown that: (1) Sox6 directly suppresses the transcription of slow fiber-specific, cardiac, and embryonic isoform genes through binding to the transcriptional regulatory regions, (2) Sox6 regulates expression of transcriptional regulators critical for muscle development, thereby extending its effect on muscle development by cross-talking with other regulatory pathways, (3) loss of Sox6 in skeletal muscle results in a significant increase in expression of slow fiber-specific, cardiac, and embryonic isoform genes which are associated with Sox6 binding peaks, accompanied by a decrease in fast fiber-specific gene expression, and (4) loss of Sox6 in cardiac muscle results in increased expression of fetal isoform genes in the adult heart, which suggests that Sox6 is required for the postnatal maturation of cardiac muscle as well. Since the Sox6 KO phenotypes reported here have relevance to muscle degenerative diseases [102-105] as well as heart failure, uncovering the many functions of Sox6 in muscle development will likely contribute to the understanding of mechanisms of human muscular diseases.
Isolation, culture, and induction of myotube differentiation in differentiation medium (DM) of fetal myoblasts isolated from mouse E18.5 limb were described previously.
ChIP experiments were performed using the Imprint Chromatin Immunoprecipitation Kit (Sigma-Aldrich) according to the manufacturer's instructions. Primary myotubes differentiated in DM for 48 h were washed once with phosphate-buffered saline with 1 mM MgCl2 (PBS-Mg), and then fixed with 2 mM disuccinimidyl glutarate (DSG; Thermo Fisher Scientific) for 45 min at room temperature as described previously. Cells were washed twice with PBS-Mg, and fixed further using 1% formaldehyde for 10 min at room temperature. Antibodies used were rabbit polyclonal antibody to Sox6 (ab30455, Abcam), mouse monoclonal antibody to RNA Polymerase II (Pol II) CTD repeat YSPTSPS (4H8) (ab5408, Abcam), and normal mouse IgG from the ChIP kit. The ChIP-seq library was prepared as described previously. Briefly, the immunoprecipitated material was end-repaired, A-tailed, ligated to the sequencing adapters, amplified by 18 cycles of PCR and size selected (300-600 bp), followed by single end sequencing on an Illumina Genome Analyzer II by the DNA Technologies Core Facility at University of California, Davis (http://dnatech.genomecenter.ucdavis.edu/). ChIP-seq data are available at Gene Expression Omnibus (accession number GSE32627).
ChIP-seq reads and input sample reads were aligned using Bowtie (version 0.12.5) with default parameters to the mouse NCBI Build 37 genome assembly. We obtained 3 and 1.5 million uniquely mapped reads from two independent Sox6 ChIP experiments, and 1.5 and 2.8 million uniquely mapped reads from two independent Pol II ChIP experiments, respectively. Peak calling was performed using SISSRs (version 1.4) with the following parameters: -s (genome size), 2,716,965,481 bp; -F (average length of DNA fragments), 450 bp; -b (background file), input DNA data file of each ChIP experiment. Common peaks from the two Sox6 data sets were identified using ChIP-Seq Tool Set (version 1.0). Corresponding peaks within 50 bp were considered as overlapping. Peak annotation was carried out using PeakAnalyzer with the default mouse mm9 annotation file. ChIP-seq data were visualized on the UCSC Genome Browser. Motif discovery was conducted using MEME (version 4.5.0) with default parameters, followed by comparison against three motif databases (JASPAR, TRANSFAC, and UNIPROBE) using TOMTOM (version 4.5.0). Gene ontology (GO) analysis was performed using the Database for Annotation, Visualization and Integrated Discovery (DAVID; http://david.abcc.ncifcrf.gov/) [115,116]. Pol II binding was represented as RPKM (reads per kilobase of RefSeq gene region per million mapped reads) based on read (tag) numbers in peak regions. We used the 2.8 million read data set for calculation and visualization to maximize the accuracy of the results. When there were multiple RefSeq gene models per gene, the longest gene model was used for the calculations. When Pol II peaks extended into extragenic regions, the length of the extended regions was added to that of the RefSeq gene models.
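The 50 bp overlap rule and the RPKM definition described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the ChIP-Seq Tool Set or the pipeline used in this study: the function names, the (chrom, start, end) tuple layout, and the reading of "within 50 bp" as a gap of at most 50 bp between two peak intervals are all hypothetical, and the example coordinates and read counts are invented.

```python
def peaks_overlap(a, b, max_gap=50):
    """Return True if two peaks overlap or lie within max_gap bp of each other.

    Each peak is a (chrom, start, end) tuple; this layout is an assumption.
    """
    if a[0] != b[0]:
        return False
    # The gap is zero or negative when the two intervals touch or overlap.
    gap = max(a[1], b[1]) - min(a[2], b[2])
    return gap <= max_gap


def common_peaks(set1, set2, max_gap=50):
    """Peaks from set1 with at least one partner in set2 (simple pairwise scan)."""
    return [p for p in set1 if any(peaks_overlap(p, q, max_gap) for q in set2)]


def rpkm(reads_in_region, region_length_bp, total_mapped_reads):
    """Reads per kilobase of region per million mapped reads."""
    kilobases = region_length_bp / 1_000
    millions = total_mapped_reads / 1_000_000
    return reads_in_region / kilobases / millions


# Invented example data: two replicate peak lists and one gene region.
rep1 = [("chr11", 1000, 1200), ("chr14", 5000, 5300)]
rep2 = [("chr11", 1240, 1400), ("chr14", 9000, 9200)]
shared = common_peaks(rep1, rep2)             # only the chr11 peak has a partner
pol2_level = rpkm(56, 20_000, 2_800_000)      # 56 reads, 20 kb region, 2.8 M reads
```

In this sketch the region length passed to `rpkm` would be the longest RefSeq gene model plus any extragenic extension of the Pol II peak, following the rule stated in the text.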
All measurements were conducted with ABI Prism 7900 HT Sequence Detection System (Applied Biosystems). ChIP-qPCR was performed using Maxima SYBR Green/ROX qPCR Master Mix (2X) (Fermentas) and specific primers listed in Additional file 6, Table S3. Single products were confirmed by dissociation curve analysis. Results were normalized to input, and fold enrichment was calculated by normalizing to enrichment at a negative control region (intergenic). RT-qPCR was performed using TaqMan Gene Expression Assays (Applied Biosystems). Total RNA was extracted with TRIzol reagent (Invitrogen). Following DNase treatment with DNA-free Kit (Ambion), cDNA was synthesized using High Capacity cDNA Reverse Transcription Kits (Applied Biosystems) with SUPERase-In (Ambion).
TaqMan probes used are provided in Additional file 7, Table S4. Results were normalized to β-actin (Actb) transcript level. All statistical analyses were performed using the two-tailed Student's t-tests. Relative mRNA levels against β-actin are shown in Additional files 8 and 9, Figures S4 and S5.
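The two qPCR normalizations described above can be written out as a short sketch. The code assumes perfect doubling per cycle (100% amplification efficiency) and that the target and negative control regions are measured against the same input sample, so any input dilution factor cancels; the function names and the Ct values are hypothetical, and the exact spreadsheet or software used for these calculations is not specified in the text.

```python
def relative_expression(ct_target, ct_reference):
    """Relative mRNA level by the 2^-dCt method, dCt = Ct(target) - Ct(reference).

    In the RT-qPCR experiments the reference is the beta-actin (Actb) transcript.
    """
    delta_ct = ct_target - ct_reference
    return 2.0 ** (-delta_ct)


def fold_enrichment(ct_ip_target, ct_input_target, ct_ip_control, ct_input_control):
    """ChIP-qPCR enrichment at a target site relative to a negative control region.

    Each region's IP signal is first normalized to its own input,
    then the target value is divided by the control (intergenic) value.
    """
    target = 2.0 ** (-(ct_ip_target - ct_input_target))
    control = 2.0 ** (-(ct_ip_control - ct_input_control))
    return target / control


# Invented Ct values for illustration only.
rel = relative_expression(ct_target=24.0, ct_reference=18.0)   # 2^-6
enrich = fold_enrichment(25.0, 22.0, 29.0, 22.0)               # 2^-3 / 2^-7
```

With these invented values the target transcript is at 1/64 of the reference level, and the target site is enriched 16-fold over the control region.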
Nuclear and cytoplasmic fractionation of primary myoblasts and myotubes was carried out using NE-PER Nuclear and Cytoplasmic Extraction Reagents (Thermo Fisher Scientific). A total of 30 μg of each protein sample was separated by 7.5% SDS-polyacrylamide gel electrophoresis and transferred to a nitrocellulose membrane. The blot was incubated with anti-Nfatc3 mouse monoclonal antibody (sc-8405; Santa Cruz Biotechnology) at 1:100, and the signal was detected by Pierce ECL Western Blotting Substrate (Thermo Fisher Scientific). To estimate the amount of protein loaded in each lane, the same blot was stripped and then incubated with anti-TATA binding protein (TBP) mouse monoclonal antibody (ab818; Abcam) or anti-α-Tubulin mouse monoclonal antibody (sc-8035; Santa Cruz Biotechnology) at 1:1000.
A firefly luciferase expression vector driven by the MyHC-β promoter (MHCβ3500, which contains 3,500 bp of the 5' upstream sequence of the rat MyHC-β gene) was kindly provided by Dr. Baldwin at University of California, Irvine [117,118]. Using this vector as a template, a Sox motif in the distal Sox6 binding region found by our ChIP-seq experiments (approx. -2.9 kb in mouse) was mutated (TACAAAG to TCAGAAG) by an inverse PCR method using KAPA HiFi HotStart DNA Polymerase (KAPA Biosystems) to generate MHCβ3500 m ("m" stands for mutation). Firefly luciferase expression vectors driven by the upstream regions of the Myh7b, Tnni1, and Hdac11 genes (~6.0 kb, ~5.2 kb, and ~1.0 kb, respectively) were generated by inserting restriction enzyme-digested PCR products into appropriate restriction sites of the pGL3-basic vector (Promega). A firefly luciferase expression vector driven by the first intron of the Tnnc1 gene was constructed by replacing the CMV enhancer of the pTriEx-1.1 vector (Novagen) with the first intron of Tnnc1 (~1.3 kb), followed by insertion of a restriction enzyme-digested PCR product of the firefly luciferase gene from the pGL3-basic vector into the appropriate restriction sites. A Renilla luciferase expression vector driven by the CMV promoter (pcDNA-Rluc) was produced by inserting the Renilla luciferase gene, which was obtained by digesting the pRL-TK vector (Promega) with NheI and XbaI, into the pcDNA3.1/Zeo(+) vector (Invitrogen).
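When designing such a point mutation, it can help to check the intended sequence change in silico before ordering inverse PCR primers. The sketch below applies the TACAAAG to TCAGAAG substitution at a given position and refuses to proceed if the wild type motif is not found there. The function name, the 0-based coordinate convention, the assumption that the motif is read on the given strand, and the short example sequence are all hypothetical; the actual rat MyHC-β upstream sequence is not reproduced here.

```python
def mutate_motif(sequence, position, wild_type="TACAAAG", mutant="TCAGAAG"):
    """Replace the motif starting at a 0-based position with its mutant form.

    Raises ValueError if the wild type motif is not present at that position,
    so a wrong coordinate cannot silently alter the sequence.
    """
    if len(wild_type) != len(mutant):
        raise ValueError("wild type and mutant motifs must have the same length")
    end = position + len(wild_type)
    if sequence[position:end].upper() != wild_type:
        raise ValueError("wild type motif not found at the given position")
    return sequence[:position] + mutant + sequence[end:]


# Invented toy sequence containing one copy of the Sox motif.
toy = "GGCTACAAAGTTC"
site = toy.find("TACAAAG")          # 0-based start of the motif
mutated = mutate_motif(toy, site)
```

The substitution keeps the sequence length unchanged, so downstream coordinates relative to the TSS are unaffected.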
Primers used are listed in Additional file 10, Table S5.
The reporter plasmids (see above) were co-transfected with the Renilla luciferase vector (pcDNA-Rluc) into mouse fetal primary myoblasts using Lipofectamine 2000 (Invitrogen) at 1:1.5 ratio of DNA and Lipofectamine. Twenty four hours after transfection, cells were washed with PBS once, and switched to DM. Cells were incubated further for 48 h, and firefly and Renilla luciferase activities were measured using Dual-Glo Luciferase Assay System (Promega) and a luminometer (LumiCount; Packard) according to the manufacturer's instructions. Statistical analyses were performed using the two-tailed Student's t-tests. As a negative control, pGL3-basic was used.
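The analysis implied by this dual-reporter design can be sketched as follows: firefly readings are divided by the co-transfected Renilla readings from the same well, and the normalized values from Sox6 null and wild type cultures are then compared with a two-sample Student's t statistic. The per-well normalization is the usual way a Renilla control is used but is an assumption here, as are the function names and the invented luminometer readings. The sketch returns the t statistic and degrees of freedom only; obtaining a two-tailed p-value additionally requires the t distribution (for example via `scipy.stats.ttest_ind`, which is not used here to keep the example dependency-free).

```python
from math import sqrt
from statistics import mean, variance


def normalized_activity(firefly, renilla):
    """Firefly/Renilla ratio for each well (corrects for transfection efficiency)."""
    return [f / r for f, r in zip(firefly, renilla)]


def student_t(a, b):
    """Two-sample Student's t statistic with pooled variance; returns (t, df)."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / sqrt(pooled * (1 / na + 1 / nb))
    return t, df


# Invented readings: three wells per genotype.
wild_type = normalized_activity([100, 120, 110], [50, 50, 50])
sox6_null = normalized_activity([200, 220, 210], [50, 50, 50])
t_stat, dof = student_t(sox6_null, wild_type)
```

A positive `t_stat` here means the reporter was more active in the Sox6 null cultures, which is the direction expected if the cloned Sox6 binding region acts as a negative regulatory element.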
Lower hindlimbs were collected from E18.5 embryos and one week old mice (P7), and TA and EDL muscles were collected from four month-old mice and embedded in Tissue Freezing Medium (Triangle Biomedical Sciences) for cryostat sectioning. Sections (10 μm) were fixed in 4% paraformaldehyde and processed for immunohistochemistry. As primary antibodies, rabbit polyclonal Sox6 antibody (ab30455, Abcam) at 400 fold dilution and mouse monoclonal MyHC-β antibody (NOQ7.5.4D, Sigma-Aldrich) at 100 fold dilution were used. As secondary antibodies, Alexa Fluor 555-conjugated goat anti-rabbit IgG (A-21429, Invitrogen) and Alexa Fluor 488-conjugated goat anti-mouse IgG (A-11059, Invitrogen) were used. DAPI staining was performed to visualize nuclei. Images were obtained at the confocal microscope facility at the UC Davis Genome and Biomedical Sciences Facility.
The animal studies were carried out under the guidance issued by the University of California, Davis.
CIA performed ChIP-seq, ChIP-qPCR, RT-qPCR, plasmid construction, reporter assays, and contributed to writing the manuscript. YD performed immunostaining, RT-qPCR, and Western blotting. NH conceived, designed and supervised the study, and contributed to writing the manuscript. All authors read and approved the final manuscript.
Figure S1 Relative mRNA levels of Sox6, Prox1, Tead1, Tead4, Tcf4, Hdac9, and Hdac11 in Sox6f/f muscles. A. Sox6 mRNA levels were determined in the adult EDL, TA, Gas, and Sol muscles using RT-qPCR, and expression levels relative to the soleus in individual animals were calculated. Three 2 month-old and two 3 month-old Sox6f/f mice were examined (n = 5). The error bars indicate standard error of the mean. The p-value for differential expression between the EDL and the soleus was 0.07. B. Prox1 mRNA levels were determined as described for Sox6 (n = 3; two 2 month-old and one 3 month-old Sox6f/f mice). C-G. For Tead1, Tead4, Tcf4, Hdac9, and Hdac11, data from EDL, TA, and Gas were pooled and compared against soleus (n = 3).
Table S1 Pol II binding data presented in Figure 4. Full list of Pol II binding levels to the Sox6 peak-associated genes measured in RPKM are shown.
Figure S2 Examples of Sox6 and Pol II binding events detected by ChIP-seq. ChIP-seq tracks from two data sets of Sox6 (Sox6-1 and Sox6-2) are shown together with the Pol II track (Pol II) of the 2.8 million read data (see the Methods section for details). Common Sox6 binding peaks between the two data sets are indicated as black bars (Sox6 peak). Chromosomal positions (mouse NCBI37/mm9 assembly) as well as sequence conservation (Vertebrate Cons) are presented above and below the ChIP-seq plots, respectively. A. Myh1 and Myh2, B. Myh6 and Myh7, C. Myh7b, D. Tnnc1, E. Tnni1, F. Prox1, G. Sox6, H. Tead1, I. Tead4, J. Nfatc3, K. Tcf4, L. Hdac9, and M. Hdac11.
Figure S3 Validation of Sox6 binding. A total of 28 Sox6 peaks identified for the 19 genes discussed in the text were verified by ChIP-qPCR (ChIP followed by quantitative PCR). The peak profiles are summarized in Additional file 3, Figure S2A-M. ChIP was performed using wild type myotubes and Sox6 antibody as described in the Methods section, and enrichment was quantified by qPCR using the primers designed to amplify each Sox6 binding site (Additional file 6, Table S3). As a negative control, an intergenic region without a Sox6 peak was used. Fold enrichment over the negative control region (Intergenic) is shown. The intergenic region showed no enrichment. Data are represented as mean ± SD (n = 3). (*) P < 0.05; (**) P < 0.005. A. Enrichment of the Sox6 binding sites associated with sarcomeric protein genes. B. Enrichment of the Sox6 binding sites associated with transcription regulatory genes. Numbers in the parentheses below each gene symbol indicate relative positions (5' to 3') of multiple Sox6 binding sites associated with the gene. For Tcf4, only intragenic binding sites (see Additional file 3, Figure S2K) were tested.
Table S2 Biological processes enriched among genes associated with Sox6 peaks. Full list of Gene Ontology (GO) biological process terms identified by DAVID are shown.
Table S3 Primers used for ChIP-qPCR.
Table S4 TaqMan probes used for RT-qPCR.
Figure S4 Relative mRNA levels of the genes presented in Table 1. Relative mRNA levels against β-actin in TA, EDL, gastrocnemius (Gas), and soleus (Sol) of control (Sox6f/f) and Sox6 knockout (KO, Sox6f/f; Myf5-Cre) mice were calculated using the formula 2^-ΔCt. Two- and three-month-old mice (2 mo and 3 mo, respectively) were analyzed. A. Relative mRNA level of Sox6. B. Relative mRNA level of Myh7, Myh4, Ppargc1a, and Sdha.
Figure S5 Relative mRNA levels of the genes presented in Tables 2 and 4. Relative mRNA levels against β-actin in control (Sox6f/f) and Sox6 knockout (KO, Sox6f/f; MCK-Cre) mice were calculated using the formula 2^-ΔCt. Two 2 month-old mice (mouse ID# 1 and 2) and one 3 month-old mouse (mouse ID# 3) were analyzed. A. Relative mRNA level of Sox6 in TA, EDL, gastrocnemius (Gas), soleus (Sol), and the heart. B. Relative mRNA level of contractile protein genes in TA, EDL, Gas, and Sol. C. Relative mRNA level of transcriptional regulatory genes in TA, EDL, Gas, and Sol. D. Relative mRNA level of metabolism related genes and acetylcholine receptor genes in TA, EDL, Gas, and Sol. E. Relative mRNA level of Myh6, Myh7, Acta1, and Ppargc1a in the heart. *: not determined. #: undetected.
Table S5 Primers used for plasmid construction.
We thank the members of Hagiwara laboratory and Mr. Adam Jenkins for helpful discussions, and Dr. Charles Nicolet at the DNA Technologies and Expression Analysis Core Facilities of the UC Davis Genome Center for assistance with the ChIP-seq experiments. We are also grateful to Dr. Véronique Lefebvre at Cleveland Clinic Lerner Research Institute for providing Sox6f/f mice. This work was supported by Expression Analysis Core Seed Grant, Muscular Dystrophy Association (MDA 4135), and the National Institutes of Health (AR055209) (to N.H.).