The ENCODE tiling array analysis of 5′ capped RNA identified 47 annotated genes
with increased expression (fold change >1.5, Bonferroni adjusted p value
<0.05) () and 22
genes with decreased expression in a chronic hepatitis C (HCV) cirrhotic as compared
to an uninfected control liver specimen (). Analysis of poly(A)+ RNA
identified 37 genes with increased expression and 15 genes with decreased expression
in HCV cirrhotic as compared to control liver. Twenty of the upregulated genes and
six of the downregulated genes were identified in both 5′ capped and
poly(A)+ RNA populations (). Of note, 8 out of 17 upregulated genes (47%) and 2 out of 9
down-regulated genes (22%) unique to poly(A)+ RNA did have a
statistically significant expression difference (p<0.05) in 5′ capped RNA,
but were excluded from our list because their fold change did not meet the inclusion
criteria (<1.5). None of 27 upregulated or 17 downregulated genes unique to
5′ capped RNA were found to have significant differential expression in the
poly(A)+ RNA (see
Tables S1,
S2,
S3,
S4,
S5,
S6). This is
likely due to less variation in signal intensity observed with 5′ capped RNA
among experimental replicates as compared to poly(A)+ RNA (the average SEM per
gene was 28.1 for 5′ capped; poly(A)+ average SEM per gene was 77.2).
Thirteen of the 64 genes (20%) found to be upregulated in HCV cirrhotic liver
were identified by GO-term enrichment analysis to have biologic functions related to
the immune response ()
[24],
[25]. Several of
these genes were selected for qPCR analysis of liver tissue from multiple patients,
including interferon regulatory factor 1 (
IRF1), a transcription
factor involved in the interferon response to HCV infection
in-vitro
[26]. On ENCODE
analysis,
IRF1 was found to be upregulated in HCV cirrhotic liver
in 5′ capped RNA only (fold change 2.1, p

=

0.03, ). qPCR analysis of
IRF1 in multiple patient samples confirmed the ENCODE findings
of increased expression in HCV cirrhotic liver (n

=

7) compared
to control liver (n

=

10) (fold change 2.4,
p

=

0.03, ). Percutaneous liver biopsies (n

=

7) from
patients with chronic hepatitis C with mild inflammation and no fibrosis (Metavir
grade 1, stage 0) were also analyzed and showed a significant increase in
IRF1 expression (fold change 3.5,
p

=

0.001, ). These patients were not being treated with recombinant interferon at
the time of liver biospecimen acquisition.
| Table 1Differentially Expressed Genes Associated with Immune Response. |
The tripartite motif-containing 22 (
TRIM22) mRNA, encoded by an
interferon regulated gene previously reported to be upregulated with HIV and
hepatitis B viral infections
[27],
[28], was also found to have increased expression in our
analysis of HCV cirrhotic liver. TRIM22 was increased on ENCODE analysis in both
5′ capped (fold change 4.1, p<0.001) and poly(A)+ RNA (fold change
6.2, p

=

0.001, ); we also identified increased transcription in intronic
regions within the
TRIM22 gene. qPCR analysis of
TRIM22 showed a large increased expression (fold change 9.7,
p<0.001) in HCV cirrhotic liver (). Liver biopsies from patients with mild chronic hepatitis C
and no fibrosis showed a marked increase in
TRIM22 expression (fold
change 17.0, p<0.001, ).
Connective tissue growth factor (
CTGF) mRNA is upregulated during
liver fibrosis and its protein product has recently been suggested as a non-invasive
biomarker of liver fibrosis in patients infected with hepatitis C
[29].
CTGF mRNA was upregulated in both 5′ capped (fold change
5.5, p<0.001) and poly(A)+ RNA (fold change 13, p<0.001,
Figure S1A).
qPCR analysis of liver tissue from multiple control and HCV cirrhotic liver
specimens showed a large mean increase in
CTGF expression (fold
change 17, p

=

0.004) (
Figure S1B). Interestingly, qPCR analysis of
liver biopsies with mild chronic hepatitis C with no fibrosis showed no increase in
CTGF expression (fold change −5.7,
p

=

0.03,
Figure S1B).
Several other differentially expressed genes identified in the ENCODE tiling array
analysis were selected for qPCR analysis using additional patient samples. The Met
proto-oncogene (
MET) is the hepatocyte growth factor receptor and
has been implicated in the development of multiple tumor types. ENCODE array
analysis of 5′ capped RNA showed decreased expression of
MET
in hepatitis C cirrhotic liver (fold change −1.8,
p

=

0.015,
Figure S2A). However, qPCR analysis of multiple
patients did not demonstrate a significant difference in expression in HCV cirrhotic
liver (
Figure
S2B). However, qPCR analysis of mild chronic hepatitis C biopsies with no
fibrosis showed a significant increase in
MET expression (fold
change 3.7, p

=

0.002,
Figure S2B).
Cathepsin D (
CTSD), a lysosomal aspartyl protease associated with
several disease processes, was significantly upregulated in the ENCODE array
analysis of 5′ capped RNA from hepatitis C cirrhotic liver (fold increase 1.9,
p<0.001). However, when multiple patient samples were analyzed by qPCR no
significant difference was identified (data not shown). The FUN14 domain containing
2 (
FUNDC2) mRNA, encoding a protein suggested to interact with
hepatitis C core protein, was downregulated in 5′ capped RNA from hepatitis C
cirrhotic liver (fold change −2.1, p<0.001)
[30]. However, when specimens of liver
from additional patients with hepatitis C cirrhosis were analyzed by RT-PCR no
significant difference was seen (data not shown). Nevertheless, RT-PCR of cDNA
prepared from the same liver specimens (HCV cirrhotic 1 and Control 1) used in the
ENCODE tiling array analysis did verify that
MET,
CTSD and
FUNDC2 were both significantly
different in those specimens (data shown for MET only).
Differential gene expression associated with HCV cirrhotic liver was also observed in
non-protein coding genomic regions including intronic and intergenic regions. illustrates the
differential gene expression observed in exonic, intronic, and intergenic regions in
5′ capped and poly(A)+ RNA. Fifty percent of the upregulated and
78% of the downregulated nucleotides were found in intronic regions using
5′ capped RNA, as compared to 23% of the upregulated and 49% of
the reduced nucleotides when poly(A)+ RNA was analyzed. A similar percentage of
differentially expressed nucleotides were observed in intergenic regions when
5′ capped (4–10%) and poly(A)+ (8–10%) RNA
samples were analyzed. Four annotated genes provide examples of higher intronic
expression in 5′ capped as compared to poly(A)+ RNA: hepatocyte growth
factor receptor (
MET); tissue inhibitor of metalloproteinase 3
(
TIMP3); mitogen-activated kinase kinase kinase 1
(
MAP3K1); MyoD family inhibitor domain containing
(
MDFIC); and (
Figures S2 and
S3).
Our ENCODE tiling array analysis of both 5′ capped and poly(A)+ RNA
identified three differentially expressed transcriptionally active genomic regions
where no annotated genes exist. These 5′ capped RNA transcript regions
originated from chromosome 14 (, coordinates 53,254,000–53,256,899), chromosome 9 (, coordinates
131,088,154–131,089,262) and chromosome 21 (
Figure S4A,
coordinates 39,253,256–39,413,500). All three of these genomic regions
demonstrated RNA transcription at least 10 kb from annotated genes. Although these
regions were found to be transcribed in previous ENCODE tiling array analysis of
HL60, HeLa, and Gm06990 human cell lines (
Figure S5,
http://genome.ucsc.edu/ENCODE/pilot.html), there was no prior
evidence that they represented 5′ capped Pol II RNAs. Further investigation
using the UCSC genome browser (
http://genome.ucsc.edu)
identified several ESTs (spliced and unspliced) originating from the transcribed
region identified on chromosome 9 (
Figure S6A). Not surprisingly, the 160kb region
on chromosome 21 included several ESTs (BE870595, BG459638, BG460250, BI011795) and
overlapped with one hypothetical protein (AJ011409, unpublished). It also included
previously identified 5′ Rapid Amplification of cDNA Ends (RACE) products
[31]. The 5′
capped RNA transcript(s) originating from the unannotated region of chromosome 14
did not overlap with any known genes in the UCSC database but was associated with a
SNP (rs2884435) and several unspliced ESTs (
Figure S6B).
qPCR analysis of the 5′ capped Pol II RNA(s) originating from the unannotated
region on chromosome 14 confirmed the presence of a RNA transcript that was
increased 7.8 fold in seven HCV cirrhotic as compared to ten control liver specimens
(p<0.001) ().
Moreover, qPCR analysis of mild chronic hepatitis C biopsies with no fibrosis showed
a similar increase in expression of the transcript (fold change 11.3,
p

=

0.003, ).
The 5′ capped RNA originating from the unannotated region on chromosome 9 was
also confirmed to have a 4.5 fold increase by qPCR analysis of seven HCV cirrhotic
as compared to ten control liver specimens (p<0.001) (). qPCR analysis of mild chronic
hepatitis C with no fibrosis showed a similar increase in expression of the
transcript (fold change 4.1, p

=

0.002, ).
The region on chromosome 21 did not differ significantly among multiple patients with
HCV cirrhosis using qPCR.
To further define the structure of differentially expressed RNA(s) originating from
chromosome 9 and 14, we used 5′ and 3′ rapid amplification of cDNA ends
(RACE) and DNA sequencing over one region on chromosome 9 and three separate regions
of chromosome 14 (Chr14a, Chr14b, and Chr14c) where differential expression was
observed by ENCODE analysis. Using 5' RACE we identified the 5′ end of an
unannotated RNA transcript found on chromosome 9 (
Figure S6A).
The 5′ end of this transcript was at a similar chromosome coordinate as other
previously identified ESTs. Using 3′ RACE we identified the 3′ ends of
independent RNA transcripts on the minus strand of Chr14a and Chr14b regions,
respectively (
Figure
S6B). We also sequenced approximately 600–800 nucleotides including
the poly(A)+ end of each transcript. The existence of at least a 1.50 kb
transcript in the chromosome 14c region was confirmed by standard PCR product
sequencing. To compare relative expression of RNA transcripts on the Chr14a and
Chr14c regions we performed qPCR analysis. Interestingly, gene expression was
similar in the HCV cirrhotic and control human liver samples but more than 10-fold
higher in the Chr14a region compared to the Chr14c region in Huh7.5 cells, a human
hepatoma cell line (data not shown).