We identified a gene expression signature from blood samples consisting of 8 genes (
RUNX3,
TGFBR3,
TRGC2,
TRGV9,
TARP,
ACP1, VCAN and
TSTA3) that differentiates stage I lung adenocarcinoma cases from controls and mirrors cancer-related gene expression changes in the target tissue. Results were validated in additional independent sets of tissue-based and blood-based gene expression analyses of adenocarcinoma cases and controls. Although present in all stages, expression changes were weaker in advanced stages, possibly because of secondary changes due to the spread of the disease. Similarly, the changes were stronger in current smokers but present in all smoking categories. The accuracy in discriminating between stage I lung adenocarcinoma cases and controls was good for most genes when considered separately, in particular those that were down-regulated between cases and controls. A multiplex model based on the expression of all 8 genes combined showed a high diagnostic accuracy of 81% (). If further validated in prospective studies using PWB of cases drawn prior to lung cancer diagnosis (
28), this gene expression signature may be used as a blood-based biomarker for early detection of lung adenocarcinoma in heavy smokers at high risk of lung cancer. We validated its use in current smokers. Further study in never and former smokers is warranted. Moreover, it will be important to test the identified biomarkers in other lung cancer histologies.
The identified genes are promising with regard to potential mechanistic relevance.
RUNX3 (runt-related transcription factor 3), down-regulated in our analyses and with an AUC of 0.69, is involved in the negative regulation of epithelial cell proliferation, functions as a tumor suppressor, and is frequently deleted or transcriptionally silenced in cancer. Hypermethylation of
RUNX3 has also been associated with the evolution of lung cancer (
29) and specifically of lung adenocarcinoma (
30). In addition, higher protein expression of
RUNX3 has been associated with increased survival from lung adenocarcinoma (
31).
TGFBR3 (transforming growth factor beta receptor III) encodes a glycoprotein that binds TGFB, a cytokine that modulates several tissue development and repair processes.
TGFBR3 is the TGF-beta component most commonly down-regulated at both the message and protein levels in several cancers (
32–
36), including non-small cell lung cancer (
37). Our study is the first to show down-regulation of
TGFBR3 mRNA expression in both blood and tumor tissue cells of lung adenocarcinoma patients.
TGFBR3 showed the highest accuracy among the single gene models in discriminating cases from controls (AUC = 0.73).
TRGC2 (T cell receptor
gamma constant 2),
TRGV9 (T cell receptor
gamma variable 9), and
TARP (T cell receptor
gamma alternate reading frame protein) are colocalized at chromosome locus 7p14.1, close to the 7p14.3 chromosomal region that frequently shows allelic loss in non-small cell lung cancer (
38).
TARP is embedded within the TCR
gamma locus and cDNA that detect
TCR gamma mRNA also detect
TARP mRNA. Accordingly, probes in
TRGC2,
TRGV9, and
TARP showed very similar results in our study.
TRGV9 cells have been shown to contribute to the natural immune surveillance against colon cancers (
39).
TARP has been previously studied as a prostate-specific gene and an androgen-regulated protein that may carry out its biological functions via action on mitochondria (
40). Down-regulation in cases with respect to controls and in tumor compared to non-involved tissues of
TRGC2,
TRGV9, and
TARP points to an immune-related alteration as a possible contribution to lung adenocarcinoma development. Case-control discrimination based on
TRGC2,
TRGV9, and
TARP was also good (average AUC = 0.70).
ACP1 (acid phosphatase 1) gene, up-regulated in our analysis, is polymorphic and encodes at least two electrophoretically different isozymes. An increase of fast isozyme concentration increases cancer cells’ invasiveness, whereas a decrease of slow isozyme concentration in cancer results in cancer cell proliferation (
41). In the validation set
ACP1 showed the poorest accuracy in discriminating cases and controls (AUC = 0.55).
VCAN (versican) encodes a protein involved in cell adhesion, proliferation, migration, angiogenesis, tissue morphogenesis and maintenance.
VCAN was initially identified in cultures of lung fibroblasts (
42) and has been recognized to play a role in the invasion of several cancers (
43) including lung cancer (
44).
VCAN mRNA expression was up-regulated in both lung tumor tissue and PWB of adenocarcinoma cases in our study.
TSTA3 (tissue specific transplantation antigen P35B) gene, also up-regulated in our analysis, is involved in the expression of many glycoconjugates. Intriguingly,
TSTA3 is located in chromosomal region 8q24, which contains several polymorphic variants recently associated with several cancers (
45–
47).
VCAN and
TSTA3 also showed a reasonable performance in discriminating between cases and controls (AUC = 0.61 and 0.59, respectively). In addition to the described eight genes, we also identified 42 additional genes whose expression in PWB distinguishes stage I lung adenocarcinoma from controls ( and
Supplemental Material 2) and was stronger among subjects who currently smoked. If further confirmed in additional blood-based analyses, these genes could also contribute to the detection of early lung adenocarcinoma lesions.
In conclusion, gene expression changes from peripheral blood samples can differentiate early stage lung adenocarcinoma cases from controls and resemble gene expression changes in early stage lung adenocarcinoma tissue. This finding suggests that early processes of lung adenocarcinoma development may lead to systemic alterations that can be detected in peripheral blood tests. Gene expression from PWB can provide an important tool for the identification of early detection markers of cancer in the future.