|Home | About | Journals | Submit | Contact Us | Français|
Identification of specific gene expression signatures characteristic of oncogenic pathways is an important step toward molecular classification of human malignancies. Aberrant activation of the Met signaling pathway is frequently associated with tumor progression and metastasis. In this study, we defined the Met-dependent gene expression signature using global gene expression profiling of WT and Met-deficient primary mouse hepatocytes. Newly identified transcriptional targets of the Met pathway included genes involved in the regulation of oxidative stress responses as well as cell motility, cytoskeletal organization, and angiogenesis. To assess the importance of a Met-regulated gene expression signature, a comparative functional genomic approach was applied to 242 human hepatocellular carcinomas (HCCs) and 7 metastatic liver lesions. Cluster analysis revealed that a subset of human HCCs and all liver metastases shared the Met-induced expression signature. Furthermore, the presence of the Met signature showed significant correlation with increased vascular invasion rate and microvessel density as well as with decreased mean survival time of HCC patients. We conclude that the genetically defined gene expression signatures in combination with comparative functional genomics constitute an attractive paradigm for defining both the function of oncogenic pathways and the clinically relevant subgroups of human cancers.
The application of microarray-based gene expression profiling in cancer research has provided mechanistic insights into the oncogenic process and contributed to the molecular classification of malignancies (1). Transcription profiles from hundreds of microarray experiments have recently been integrated into large, multiplex data sets (2), and various reverse engineering methods have been used to decipher functionally significant gene networks (3). An important contribution from these studies is the identification of well-defined, coregulated transcriptional modules that are beginning to provide a molecular explanation of disease pathogenesis and consequently may hold great therapeutic significance (4).
An alternative approach to identifying relevant molecular events of malignant transformation and tumor progression is to characterize both known and suspected oncogenic pathways and to establish their specific gene expression signatures (5). Although the presence of these expression signatures is frequently obscured by the etiological complexity of the human tumors, they could be revealed using in vitro experimental systems and genetically modified animal models, in which the number of experimental variables could be rigorously controlled (6). Cross-comparison of the well-defined expression signatures with transcription profiles of human tumors may also improve the current understanding of the oncogenic process.
The HGF/Met signaling pathway regulates multiple cellular functions, including cell proliferation, motility, differentiation, tubulogenesis, and angiogenesis (7, 8). The importance of intact HGF/Met signaling during embryogenesis is clearly demonstrated in mouse models, where homozygous deletion of either Hgf (9) or Met (10) is embryonic lethal. HGF/Met signaling also affects liver biology at several levels (11). In Hgf KO mice the hepatic plate is underdeveloped (9), while in adult livers Met activation alleviates chemically induced fibrosis (12) and protects hepatocytes from CD95-mediated apoptosis (13, 14). Increased HGF levels after partial hepatectomy promote liver regeneration by enhancing proliferation of mature hepatocytes and hepatic progenitor cells (15).
In addition to its physiological functions, the protooncogene MET is a master regulator of metastasis formation, tumor invasion, and angiogenesis (8, 16). In various types of human carcinomas, including papillary renal cancer and gastric and small cell lung cancer, activating mutation, amplification, and overexpression of the MET gene have been associated with “metastatic phenotype” and poor prognosis (17). Consequently, Met is regarded as a promising molecular target for antimetastasis therapies (18). In human hepatocellular carcinoma (HCC), overexpression and mutation of the MET gene are associated with intrahepatic metastases and vascular invasion, 2 of the most important clinical findings determining disease outcome (17, 19).
In the present study we have adopted a global genomic approach to comprehensively define the effect of HGF/Met signaling on the hepatocyte transcriptome. First, we assessed the HGF-induced gene expression patterns in primary hepatocytes isolated from Met KO (20) and WT mice and identified several potential Met target genes as well as novel regulatory functions of Met. We next applied comparative functional genomic analysis to evaluate the importance of the Met-regulated gene expression signature in the pathogenesis of human HCC. A study of 2 independent human data sets revealed that the Met-regulated gene expression signature characterizes a subgroup of HCC with aggressive phenotypic traits and poor progosis.
To identify HGF/Met–regulated genes, we performed expression microarray analysis after inducible activation of Met receptor in primary cultures of hepatocytes established from WT and Met conditional KO mice. Total RNA was isolated from untreated hepatocyte cultures as well as from cultures treated with 50 ng/ml of HGF for 0.5, 2, 12, or 24 hours. RNA collected from these experiments was converted to fluorescently labeled cDNA and used for hybridizations of oligonucleotide microarrays containing 21,997 features representing 19,140 unique mouse genes. After normalization of the data, 13,477 features with a sufficient number of valid expression values were selected for further analysis as described in Methods.
To define the set of HGF/Met–regulated genes, we compared experiments using a multivariate permutation t test at each time point. In total, 730 unique features showed significant (P < 0.001) and at least 1.5-fold expression differences between the 2 genotypes. As the only variable at these comparisons was the presence or absence of intact Met receptor, we could conclude that the expression of significant genes was regulated in a Met-dependent manner. The diagram in Figure Figure11 gives a summary of the data analysis strategy applied to select the significant HGF-regulated genes.
The set of differentially expressed genes could be further divided into 2 major categories. The first category was represented by genes that showed permanent transcriptional changes in Met KO primary hepatocytes. Thus, expression of 60 genes was found to be altered in Met KO cultures at the 0 time point (after overnight incubation), and 57 of them remained differentially expressed during 24 hours of HGF exposure as compared with control cultures (P < 0.005) (Figure (Figure2A).2A). The presence of the permanent gene expression changes implies that in the absence of Met signaling, KO cells undergo a genotype-specific transcriptional adaptation.
As expected, the majority of significant genes (672/730) were only detected in control hepatocytes after HGF treatment, since Met KO cells did not exhibit a specific response to HGF. The genes in this second category could be further separated on the basis of their temporal expression patterns. Up- or downregulated genes were divided into early and late target gene clusters, as they displayed maximal expression differences between the genotypes after a short-term (a half hour or 2 hours) or long-term (12 or 24 hours) HGF treatment. The heat map image created with the mean-centered log2-transformed expression ratios of the significant Met-regulated genes clearly demonstrates the presence of clusters with the distinctive temporal regulation and the reproducibility of the data in replicate experiments (Figure (Figure2,2, B and C).
To validate the specificity of the Met targets, we also compared gene expression between HGF-treated and untreated control primary hepatocytes as well as between control cells treated at consecutive time points. This approach yielded 1,383 differentially expressed genes using the same selection criteria as in the previous comparisons. Notably, some of these genes did not show significant expression differences between the WT and KO hepatocytes, as they probably reflect common adaptive responses to the culture conditions with time. However, 353 from the previously determined Met targets were also identified with both selection strategies. In most cases, the timing and magnitude of the most significant responses overlapped in the horizontal and vertical comparisons, indicating that the majority of the differentially expressed genes represented a specific response to HGF induction.
Expression differences observed with microarray profiling were verified by quantitative RT-PCR analysis. Good correlation with microarray data was found for all 10 randomly selected significant genes (Figure (Figure3). 3).
In accordance with previous studies, we detected significant changes in the expression levels of known Met target genes, including Hmga1, Spp1 (21), Itgβ1 (22), Egr1 (23), and Cldn2 (24). Consecutive functional analysis of the Met target genes allowed a more detailed insight into the cellular machinery associated with the Met-induced phenotype. A significant number of genes induced at 12 and 24 hours were involved in cell motility (Cxcl10, Capn2, Spp1, Fn1), angiogenesis (Vcam1, Anptl4, Ctgf, Neo1, Robo1), cell adhesion (Cldn2, Tjp3, Cdh17), and cytoskeletal organization (Hspa5, Arpc1b, Cap1, Nck2, Tpm2, Msn, Mid1, Vim, Dnm3, Tubb3, Tubb6, Krt2-8, Tuba1). Some of these significant genes, arranged by their postulated functions, are listed in Table Table1.1. We also observed an early induction of several transcription factors (Hmga1, Egr1, JunB, MafF) after HGF treatment. A number of these immediate early targets of the Met pathway could regulate the expression of other differentially expressed genes at the later treatment points. This type of multistep regulation is well documented in the case of the HGF-Egr1-fibronectin (Fn) sequence (25).
HGF treatment had an especially prominent effect on the expression of genes involved in actin cytoskeleton organization and lamellipodium formation. Thus, HGF significantly upregulated Arpc1b and Nck2, a member and an important activator, respectively, of the Arp2/3 complex, which is involved in the regulation of the actin polymerization, particularly at the leading edge of moving cells (26). Furthermore, genes such as the Ras-responsive adenylyl cyclase–associated protein (Cap1), a key regulator of actin and cofilin turnover (27), as well as moesin (Msn), which connects actin filaments to the cell membrane (28), were induced by HGF treatment. Similarly, upregulation of tubulin-α1, -β3, and -β6 demonstrated that microtubular elements are transcriptional targets of Met signaling. The peak expression of the cell motility–related genes occurred at 12 and 24 hours coincidently with the onset of HGF-induced scattering in hepatocyte cultures. Differential expression of genes involved in actin cytoskeleton and microtubular organization as well as cell adhesion was consistent with the phenotypic differences revealed by immunofluorescence staining of WT and KO hepatocytes with antibodies against F-actin, α-tubulin, and vinculin after 24 hours of HGF treatment (Supplemental Figure 1, A–F; supplemental material available online with this article; doi:10.1172/JCI27236DS1).
We also found that osteopontin (Spp1), a secreted glycoprotein, was upregulated by HGF in primary hepatocytes, in agreement with published data (21). Previously, CD44v6, a surface receptor for osteopontin, was also identified as a Met target gene (23). Although CD44 was not differentially expressed in our model, we detected a concomitant induction of several integrin family members, including integrin-αV (ItgαV), integrin-α3 (Itgα3), and integrin-β1 (Itgβ1) (29). Interestingly, αvβ3 integrin binding to osteopontin forms a complex that facilitates angiogenesis and tumor cell migration (30). Thus our data provide genetic evidence that transcriptional induction of genes controlling migratory and angiogenesis programs represents an essential part of Met signaling.
Among the most striking observations was a profound misregulation of genes involved in antioxidative stress response and glutathione metabolism in Met KO hepatocytes. Notably, the transcription factor nuclear factor (erythroid-derived 2)–like 2 (Nfe2l2), as well as numerous oxidative stress response genes (Aldh1a1, Aldh1a7, Adh1, Ephx1, Ephx2), glutathione-S-transferase isotypes (Gsta1, Gsta3, Gstm6, Mgst1, Gstm2, Gstm3), and glutamate-cysteine ligase (Gclc), a regulator of glutathione metabolism (31), showed significant overexpression in the Met KO hepatocytes. The majority of these genes are well-documented targets of the basic helix-loop-helix transcription factor Nfe2l2 (32), a key regulator of a detoxifying pathway activated by increased oxidative or xenobiotic stress in cells. Consistent with the microarray data, increased nuclear levels of Nfe2l2 protein were detected both in cultured hepatocytes and in intact livers from Met KO mice (our unpublished observations). In contrast, nuclear dimerization partners and possible antagonists of Nfe2l2, the nuclear factors MafF and MafK (33), were more abundant in the control hepatocytes. Upregulation of the antioxidant genes may reflect the altered redox homeostasis of the KO cells. This was also evidenced by decreased oxidized/reduced glutathione ratios (Supplemental Figure 1G) as well as by increased staining with the oxidation-sensitive probe 2′,7′-dichlorofluorescin (Supplemental Figure 1H) in the Met KO hepatocytes.
Based on the previous reports (5, 34), we hypothesized that a considerable part of the Met gene expression signature may be conserved between mouse and human hepatocytes. Therefore, the expression signature generated using mouse hepatocytes could be applied to identify human HCCs with a prominent activation of the Met pathway. To test this hypothesis, we directly compared the expression profiles of the Met-regulated genes in mouse hepatocytes with those from 242 human HCC samples. Based on a list of curated homologous UniGene clusters (http://ncbi.nlm.nih.gov/UniGene/), we collected available human orthologs of the 730 Met-dependent mouse genes in 2 human HCC data sets. The first set, composed of 139 HCC samples (35, 36), has been previously analyzed in our laboratory (LEC [Laboratory of Experimental Carcinogenesis, National Cancer Institute] set), while the second set, containing expression profiles of 103 HCCs and 7 liver metastases, was obtained from the Stanford University microarray database (37).
Expression values were standardized for each gene by adjustment of SD to 1 and mean to 0 separately across all samples independently in the 3 different platforms as previously described (38). Next, we constructed 2 composite mouse-human data sets. One contained the expression profiles of 440 common genes from the LEC HCCs and from the mouse hepatocyte samples, whereas the second data set included 303 orthologous genes from the Stanford HCC tumors and from the mouse samples. Cluster analysis identified 2 distinctive clusters in both independent mouse-human data sets (Figures (Figures4A4A and and5A),5A), which divided HCC samples into 2 subgroups based on the presence (Met+) or absence (Met–) of Met gene expression signature. Tumors in the Met+ group showed an expression pattern highly similar to that of HGF-induced control hepatocytes. Strikingly, all 7 extrahepatic liver metastases included in the Stanford data set also exhibited a clear Met activation pattern (Figures (Figures4B4B and and5B).5B). These results indicate that the Met gene expression signature identified in the mouse primary hepatocytes may successfully discriminate a significant subset of human HCC.
Kaplan-Meier plots and log-rank survival statistics showed that patients with tumors from the Met+ expression subgroup had a significantly shortened mean survival time (35.1 ± 7.15 months) compared with other HCC patients (70.3 ± 9.67 months) (Figure (Figure6A).6A). Hierarchical cluster analysis also revealed significant overlap between the Met+ group and the previously identified (35) bad-prognosis HCC group (cluster A) as shown in Figure Figure4B.4B. Distribution of other clinicopathological variables among the Met+ and Met– clusters is summarized in Table Table2. 2.
Since Met signaling has long been regarded as a promoter of tumor invasion and angiogenesis, we determined the vascular invasion status in HCCs available for histological analysis. As expected, the vascular invasion rate was significantly higher in the HCCs with prominent Met expression signature (χ2 = 4.01, P ≤ 0.05) than in the rest of the tumors (Figure (Figure6B).6B). Previous studies found a good correlation between the expression levels of HGF or Met receptor and microvessel density (MVD) in various human carcinomas (39, 40). Indeed, when MVD was assessed by CD34 immunohistochemistry in the representative HCC samples from each group, the results showed a significant correlation between the presence of Met signature and increased MVD. Accordingly, the average OD of CD34+ vascular features was significantly higher (P < 0.001) in the Met+ (90.78 ± 6.71) than in the Met– (44.55 ± 6.16) HCC subgroup (Figure (Figure6,6, C–E).
Notably, in the LEC data set, average expression level of the Met was not significantly different between the Met– and Met+ clusters as detected by either microarray analysis or immunohistochemistry (data not shown). However, at least 2-fold upregulation of the Met receptor was found more frequently in the Met+ tumors (5/54) compared with the Met– group (2/85) in the LEC set. These data overlap well with the expression profile–based classification and suggest that, in some HCCs, overexpression of the Met receptor is the driving force behind the Met-dependent expression signature.
To test the predictive value of the Met expression signature regarding prognosis of HCC patients, the expression patterns of the human homologs of the mouse Met-regulated genes were used to construct a classifier with 6 different supervised prediction algorithms, including the compound covariate predictor (CCP), nearest neighbors 1 and 3 (NN1 and NN3), nearest centroid (NC), support vector machine (SVM), and linear discriminator analysis (LDA) methods. Since survival data were only available for the patients from the LEC group, we randomly divided these samples into a training set (60 samples) and a validation set (79 samples). Next, we selected the common target genes with the matching expression pattern between tumors displaying the high and low Met gene expression signature in the primary hepatocytes and in the training set. Using these genes and all 6 algorithms, classifiers were built according to a leave-one-out cross-validation (LOOCV) strategy. The optimal classifier producing the highest correct classification rate in the training set contained 111 genes. When the classifier was applied to the validation set, all 6 algorithms could identify the subgroups of Met+ tumors. Moreover, membership of the Met+ group showed little fluctuation using different statistical methods (Table (Table3).3). Kaplan-Meier survival curves and results of the log-rank tests with all predictors showed that HCC patients with tumors harboring prominent Met gene expression signature have a worse survival rate compared with other patients (Figure (Figure7,7, A–E). We also applied the prediction algorithms to the Stanford arrays, using the same LEC training set and only genes that were represented in both platforms. In the Stanford data set, the prediction rate of the metastatic liver lesions was 100% with 5 of 6 algorithms.
Beyond predicting the disease outcome in HCC patients, the classifier genes may also represent the most conserved cross-species part of prominent HGF/Met–regulated expression signature that plays a critical role in the Met-induced cellular transformation. Several of these genes either were previously defined as important contributors to metastasis formation, including HIG2 (41), EPHA2 (42), MAPK3 (43), P85α (44), ITGαV (29), and ITGβ1 (29), or could be related to cell motility and invasiveness by their postulated functions (CAP1, ARPC1B, NCK2)(Table )(Table4)4) (26, 27, 45).
In the current study we used global gene expression profiling to identify the Met-regulated genes in primary mouse hepatocytes. Analysis of microarray data identified several new Met target genes. Some of these genes indicate a novel role for Met in the regulation of hepatocyte homeostasis. Furthermore, comparison of the Met-dependent gene expression signature between mouse hepatocytes and human HCC samples revealed the presence of a clinically significant subgroup of HCC patients with a pronounced Met gene expression signature.
Although many etiological risk factors leading to HCC development (HBV and HCV hepatitis or alcoholic liver disease) are well documented (46), the molecular events of HCC pathogenesis are still not clearly understood. Application of high-throughput techniques including expression microarrays could significantly improve our knowledge in this field. Indeed, recent studies successfully identified the differentially expressed gene sets associated with phenotypic markers or clinicopathological conditions such as viral etiology (47), tumor recurrence (48), and intrahepatic metastases (49) in HCC samples. Furthermore, the use of expression profiling made it possible to divide the HCC patients into subgroups with a significantly different disease outcome and survival rate (35). However, the complexity of the expression profiles in the human tumors often prevents the identification of primary regulatory events. To overcome this limitation, we used a genetically modified mouse model and a functional genomic approach to identify a subgroup of HCCs characterized by a pronounced Met gene expression signature.
Multiple findings support the prominent clinical significance of the HCC classification based on the presence or absence of the Met expression signature. First, the prevalence of the Met expression signature showed a significant association with aggressive phenotypic traits including increased MVD and rate of vascular invasion. These findings are in agreement with the observations that designated HGF/Met signaling as one of the key promoters of metastasis formation and tumor angiogenesis (16). Second, the presence of Met signature had major prognostic significance for the HCC patients as evidenced by 6 different supervised prediction algorithms applied to the independent training and validation sets of HCC samples. The analysis revealed a significantly lower survival rate for patients with the predicted presence of Met expression signature compared with the group with the absence of Met signature. Third, the Met signature included the target genes with strong cross-species conservation. These genes support the Met-regulated cellular programs associated with neoplastic transformation and may therefore constitute important therapeutic targets. Genes involved in focal adhesion formation, as well as in the organization of actin cytoskeleton at the leading edge of a motile cell, are good examples of these functionally related target genes. Finally, the observation that the colon metastatic tumor samples shared the hepatocyte-specific Met expression signature suggests a considerable overlap of Met target genes induced during the progression of carcinomas with a different cellular origin.
The use of WT and Met KO primary hepatocyte cultures provided a significant advantage over experiments conducted on cell lines. Transcriptional response of cultured hepatocytes closely mimicked the in vivo events following HGF stimulation. Moreover, the comparison with Met KO cells allowed us to eliminate the HGF-independent cross-activation of the Met receptor by semaphorin/plexin B1 (50) or EGF/EGFR complexes (51) or by other mechanisms and therefore led to a more stringent detection of HGF target genes.
Nevertheless, it is highly likely that in human liver tumors the Met-regulated expression signature is not always associated with the alteration of the receptor itself. Activation of the HGF/Met pathway at different levels could produce the same expression patterns. There is also a possibility that distinct but functionally homologous signaling pathways could induce the same effector genes as HGF/Met. In this case, the shared expression signatures would reflect common phenotypic changes but different regulatory mechanisms. EGF and TGF-α are likely candidates, since they induce similar intracellular signaling events and cellular responses to Met (51). Comparison of LEC human HCCs with HCC samples from the different transgenic mouse models showed that human tumors with a bad prognosis were the most similar to MYC/TGF-α mouse tumors, suggesting the dominance of TGF-α/EGFR signaling in these samples (38). Thus, some of the Met target genes may be cross-regulated by the TGF-α/EGFR pathway or other tyrosine kinases. Currently, little information is available on the extent of overlap between the expression signatures of different tyrosine kinase pathways. However, previous studies suggested that activation of EGFR receptor could lead to direct transactivation of the Met in transformed cells. It is therefore possible, at least under certain conditions, that target genes specific for Met activation could be induced by the EGFR signaling (52). On the other hand, when similarities between HGF- and VEGF-induced expression profiles were investigated in endothelial cells, the data did not show a close resemblance between the 2 gene sets (53).
We believe that changes detected in gene expression can also provide mechanistic explanations for the phenotypic alterations observed in KO hepatocytes. Absence of induction of genes involved in motility and cytoskeletal organization correlates well with previously reported defects in KO cells such as impaired motility and decreased proliferation potential (20). A significant new finding in this study was the identification of the effect of Met activation on cellular homeostasis. Numerous genes participating in oxidative and xenobiotic stress response (Ephx1, Aldh1a1, Aldh1a7) or related to glutathione metabolism (Gsta3, Gstm3, Gclc) had elevated expression levels in Met KO cells. Most of them are well-established targets of Nfe2l2 transcription factor (54), as they contain antioxidant response element in their promoter sequence. Although this pathway is essential for the detoxification of toxic metabolic intermediates, the permanent activation of Nfe2l2 in a mouse model is lethal (32). A connection between the Nfe2l2 and Met pathways could exist at multiple levels. Activation of ERK signaling by Met may inhibit retinoic X receptor α (RXRα) (55), which together with PPARγ positively regulates Nfe2l2 targets and lipid metabolic enzymes (56). Alternatively, elevated oxidative stress together with accumulation of toxic metabolic intermediates in Met KO animals may cause the high Nfe2l2 activation (32). Together these data implicate HGF/Met signaling as an important regulator of metabolic homeostasis in hepatocytes.
In conclusion, our study demonstrates that characterization of pathway-specific gene expression signatures in well-suited experimental models facilitates the discovery of novel intracellular regulatory mechanisms and can improve the molecular classification of human cancers.
Hepatocyte-specific deletion of exon 16, essential for Met function, was achieved by crossing of the albumin-Cre transgene against the floxed Met allele as described previously (18). The genotypes of the control and mutant mice used in this study were Met+/+AlbCre+/– and Met–/–AlbCre+/–, respectively. All animal experiments were conducted according to the NIH guidelines for animal care. All animal experiments and procedures were approved by the National Cancer Institute Animal Care and Use Committee.
Hepatocytes were isolated by 2-step collagenase perfusion of the mouse livers followed by isodensity purification in Percoll gradient (57). Cells were seeded at 2 × 106 in 10-cm dishes in the plating medium supplemented with 10% FBS as described in ref. 57. After 4 hours, the plating medium was replaced with serum-free medium. The following day, cells were treated with 50 ng/ml of recombinant human HGF (PeproTech) for 0.5, 2, 12, and 24 hours. Triplicate cell cultures were established from 3 individual mice. Cells without treatment were collected at the beginning of each time course experiment and served as 0 time control.
The Mouse OligoLibrary Release 1 plus Extension oligonucleotide set containing 21,997 65-mer oligonucleotides representing 19,740 unique genes was purchased from Compugene Inc. Microarrays were produced in the Laboratory of Molecular Technology, National Cancer Institute. Total RNA was extracted from the primary cultures using TRIZOL (Invitrogen Corp.) protocol. Each sample was hybridized against a common reference sample isolated from B6/129 WT primary hepatocytes, in a reverse-fluor design, and gene expression values were defined as a target-per-reference ratio. Twenty micrograms of total RNA was used to synthesize fluorescently labeled (Amersham Biosciences) cDNA probes. Preparation of the labeled cDNA samples and hybridization of oligonucleotide microarrays were performed with small modifications (37). Briefly, RNA samples were denaturated with 4 μg (d)T20 primer at 70°C for 5 minutes, chilled on ice, and incubated for 1 hour at 42°C in a 50-μl reaction mix containing 10 μl 10× First-Strand buffer (Invitrogen Corp.), 2.5 μl 20× deoxynucleotide triphosphate (dNTP) mix (10 mM dATP, dGTP, and CTP; 4 mM dTTP; and 6 mM aminoallyl-dUTP; Sigma-Aldrich), 5 μl of 0.1 M DTT, 2 μl SuperScript II (Invitrogen Corp.) reverse transcriptase, and 1 μl SUPERaseIn (Ambion Inc.). After digestion with 4 U RNase H (Ambion Inc.) at 37°C for 20 minutes, the samples were cleaned up with a MinElute PCR Purification Kit (Qiagen). The eluted cDNA samples were concentrated by Speed Vac (Global Medical Instrumentation Inc.) and resolved in 20 μl 0.1 M NaHCO3 solution. The Cy3 and Cy5 dyes (Amersham Biosciences) were also dissolved in 20 μl of water. For the coupling reaction, 2 μl of dye was added to each sample, and the mix was left in a dark box for an hour. Labeled cDNA was purified again with a MinElute kit (Qiagen), and Cy3- and Cy5-labeled target and reference samples were combined. Before the hybridization, oligonucleotide microarrays were pretreated with a 5× SSC, 1% BSA, 0.1% SDS blocking solution. The samples were applied on the microarray slides in a hybridization mix containing 25% formamide, 0.1% SDS, and 5× SSC in 28 μl final volume. After overnight incubation, the slides were washed in a 1× SSC 0.1% SDS solution for 2 minutes and then consecutively in 1× SSC and 0.2× SSC solutions. Air-dried arrays were scanned in a GenePix 4000A scanner (Molecular Devices) in such a way that photomultiplier tube voltage was adjusted to achieve optimal signal intensity at both channels with less than 1% saturated spots. Image analysis was performed with GenePix Pro 22.214.171.124 software (Molecular Devices), following the manufacturer’s recommendations.
Real-time PCR quantification of mRNA levels for 10 selected HGF/Met target genes was performed on the same RNA samples that we used for microarray hybridization. After DNase digestion with a DNA-free kit (Ambion Inc.), 3 μg total RNA was reverse transcribed in 20 μl reaction volume with a SuperScript (Invitrogen Corp.) first-strand synthesis kit according to the manufacturer’s instructions. The real-time quantitative PCR analysis was performed with an ABI PRISM 7900HT (Applied Biosystems) thermal cycler in a 96-well reaction plate. The 25-μl PCR reaction mix contained 12.5 μl 2× SYBR Green PCR Master Mix (Applied Biosystems), 400 nM of each primer, and 1 μl cDNA template. Reactions were incubated for 10 minutes at 95°C followed by 40 cycles of 30 seconds at 95°C and 60 seconds at 60°C. Melting analysis of the PCR products was also conducted to validate the amplification of the specific product. The expression level of mouse β2-microglobulin was used as an internal reference. Relative gene expression levels were calculated with the 2–ΔδCT method (58). Primer sequences together with expected product length are listed in Table Table5. 5.
Detection of protein expression levels of human Met receptor and CD34 surface antigen was performed on representative human HCC samples from the subgroups with or without Met activation signature (12 samples each). For immunohistochemistry, 5-μm-thick sections were cut from 10% formalin-fixed, paraffin-embedded tissues. Sections were routinely deparaffinized, and endogenous peroxidase was blocked with 3% H2O2 for 20 minutes. Antigen retrieval was performed by placing samples into boiling citrate buffer (pH 6.0) for 20 minutes. Sections were blocked for 30 minutes with 10% serum, then incubated overnight at 4°C with a 1:100 dilution of anti–human MET (Santa Cruz Biotechnology Inc.) or anti-CD34 (Zymed Laboratories Inc.) antibodies. For visualization of the specific staining, VECTASTAIN ABC Elite kits (Vector Laboratories) and diaminobenzidine peroxidase substrate (Dako) were used. Slides were counterstained with hematoxylin. The staining intensities for Met were classified as weak (+), intermediate (++), or strong (+++). At least 10 visual fields were assessed for each specimen with ×200 magnification.
MVD was determined by CD34 immunostaining in 12 representative samples from both the Met+ and Met– subclasses in the LEC set. The density of CD34+ features was determined in 5 angiogenetic “hot spots” in each sample using Image-Pro Plus version 3 (MediaCybernetics) software. MVD was expressed as mean OD ± SEM per visual field. Two-tailed Student’s t test was used to compare tumor vessel density. The differences were considered statistically significant with P < 0.01.
First, image spots with diameter less than 10 μm or more than 300 μm or signal intensity below background intensity for any of the 2 fluorescent channels were excluded. Only genes with at least 4 data points out of 6 experiments in at least 2 experimental groups in the mouse primary hepatocyte data set were selected for further data analysis. Gene expression values were normalized by median-centering log ratios across all primary hepatocyte samples. For each spot, the target-per-reference intensity ratio was log2 transformed and averaged between duplicate experiments.
Genes differentially expressed between groups of Met KO and Cre-control hepatocyte samples were identified by a multivariate permutation t test using the BRB ArrayTools 3.3 software package (Biometric Research Branch, National Cancer Institute; http://linus.nci.nih.gov/BRB-ArrayTools.html). We set parameters in the multivariate permutation test to provide 90% confidence level at false discovery rate less than 10% after 1,000 random permutations. Selection criteria for individual genes included a significance level of P < 0.001 in the univariate t test plus more than 1.5-fold expression difference between the compared groups. Selected significant genes were further divided into subgroups based on their temporal expression pattern. The group of permanently different genes contained genes whose expression was significantly different (P < 0.005) between the 2 genotypes by univariate t test at 0 hour plus at least 1 early (0.5 or 2 hours) and 1 late (12 or 24 hours) treatment point. The rest of the significant genes were classified as early or late up- or downregulated genes based on the treatment point at which they showed the most significant regulation by HGF.
For comparison of the Met signature in human HCC and mouse hepatocytes, we used 2 independent human HCC microarray data sets. The LEC set was generated in our laboratory (37) and contained expression profiles from 139 HCC samples. The other data set was obtained from the Stanford University microarray database (http://genome-www5.stanford.edu/) and included 103 human HCC samples and 7 liver metastases from extrahepatic tumors. Detailed analysis of the human data sets, as well as available clinical information, is available in previous publications (35–37). In our analysis we selected genes represented on both the mouse and the human platform using curated mammalian orthologs from Jackson Laboratory. We found 440 common orthologs of the Met-regulated genes in the LEC set and 303 genes in the Stanford sets. Before collation of data into 2 mixed mouse-human sets, gene expression ratios were normalized separately on the different microarray platforms with mean set to 0 and SD to 1 for each gene (38). Hierarchical cluster analysis based on Pearson correlation was performed with Cluster 2.11, and results were visualized with TreeView programs (Michael Eisen Laboratory, Lawrence Berkeley National Laboratory and University of California, Berkeley; http://rana.lbl.gov/EisenSoftware.htm).
We used 6 algorithms, the CCP, NN1, NN3, NC, SVM, and LDA (BRB ArrayTools; Biometric Research Branch, National Cancer Institute), to perform the class prediction analysis with the Met target genes. The LEC data set was randomly divided into a training set (60 samples) and a prediction set (79 samples). To build an optimized classifier list, which could estimate the probability of the identity of a particular sample, we used an LOOCV approach. During the cross-validation step, 1 sample was removed from the analysis, and the remaining samples were used to identify the most differentially expressed genes between the groups. Based on expression of these genes, identity of the left-out sample was predicted with a given algorithm. This process was repeated until each sample was left out once. The number of genes in the classifier was varied to provide the highest correct prediction rate in the training set. To estimate accuracy of the prediction model, class labels were randomly permuted, and the LOOCV process was repeated 1,000 times. The significance level is the proportion of the random permutations that gave a cross-validated error rate no greater than the cross-validated error rate obtained with the real data. The 6 classification methods were also applied to the LEC validation set, and survival analysis was performed to assess the clinical significance of predicted HCC groups. Prediction of the metastases in the Stanford data set was performed in a similar manner using the same LEC training set but only with Met target genes commonly available on both platforms. Survival analysis was based on predicted presence or absence of Met activation–based hierarchical clustering or different prediction algorithms. Kaplan-Meier survival analysis and log-rank test were performed with the R 1.8.1 statistical package (http://www.r-project.org/).
We thank Joe Grisham and Liz Conner for their comments and critical review of the manuscript and Stephen Wincovitch for his help in image analysis. This research was supported in part by the Intramural Research Program of the Center for Cancer Research.
Nonstandard abbreviations used: CCP, compound covariate predictor; HCC, hepatocellular carcinoma; LDA, linear discriminator analysis; LEC, Laboratory of Experimental Carcinogenesis, National Cancer Institute; LOOCV, leave-one-out cross-validation; MVD, microvessel density; NC, nearest centroid; NN, nearest neighbor; SVM, support vector machine.
Conflict of interest: The authors have declared that no conflict of interest exists.
Citation for this article: J. Clin. Invest. 116:1582–1595 (2006). doi:10.1172/JCI27236.