Search tips
Search criteria 


Logo of bmcmedicineBioMed Centralsearchsubmit a manuscriptregisterthis articleBMC Medicine
BMC Med. 2012; 10: 103.
Published online 2012 September 12. doi:  10.1186/1741-7015-10-103
PMCID: PMC3523031

A tool to facilitate clinical biomarker studies - a tissue dictionary based on the Human Protein Atlas


The complexity of tissue and the alterations that distinguish normal from cancer remain a challenge for translating results from tumor biological studies into clinical medicine. This has generated an unmet need to exploit the findings from studies based on cell lines and model organisms to develop, validate and clinically apply novel diagnostic, prognostic and treatment predictive markers. As one step to meet this challenge, the Human Protein Atlas project has been set up to produce antibodies towards human protein targets corresponding to all human protein coding genes and to map protein expression in normal human tissues, cancer and cells. Here, we present a dictionary based on microscopy images created as an amendment to the Human Protein Atlas. The aim of the dictionary is to facilitate the interpretation and use of the image-based data available in the Human Protein Atlas, but also to serve as a tool for training and understanding tissue histology, pathology and cell biology. The dictionary contains three main parts, normal tissues, cancer tissues and cells, and is based on high-resolution images at different magnifications of full tissue sections stained with H & E. The cell atlas is centered on immunofluorescence and confocal microscopy images, using different color channels to highlight the organelle structure of a cell. Here, we explain how this dictionary can be used as a tool to aid clinicians and scientists in understanding the use of tissue histology and cancer pathology in diagnostics and biomarker studies.

Keywords: Antibody-based proteomics, cancer biomarkers, tissue and cell dictionary, immunohistochemistry, protein expression, histology, pathology


The Human Protein Atlas project, launched in 2003, was initiated as a natural extension of the Human Genome Project, with the objective to explore the proteins encoded by the human genome. The primary focus was to analyze the distribution and relative abundance of all proteins in human normal cells and tissues, and to determine the subcellular localization of each protein. One main goal in this effort was to contribute to biomedical and clinical research, and because cancer is a major disease where diagnostics, classification and prognostic stratification is based on tissue morphology, a multitude of clinical cancer tissue samples were included in the comprehensive protein profiling. This has allowed researchers to utilize the protein profiling data for both biomarker discovery efforts and for validation of altered gene expression patterns at the protein level in both normal and cancer tissue.

The Human Protein Atlas project pursues a systematic high-throughput generation of affinity-purified polyclonal antibodies with the aim of generating a map of protein expression patterns on a proteome-wide scale in both human normal cells, tissues and organs, and in cancer tissues [1]. Immunohistochemistry (IHC) is performed on tissue micro arrays (TMA), containing a multitude of different normal tissues and tumors, to enable a comprehensive mapping of protein expression patterns at cellular resolution in a tissue context. Altogether 144 different normal tissues are analyzed together with 216 different tumors representing the 20 most common forms of human cancer [2]. Immunofluorescence (IF)-based profiling of protein expression in cell lines is performed to generate a map of subcellular localization patterns [3]. All protein expression data, including the underlying images, are made publicly available at the Human Protein Atlas web portal ( [4]. The current version of the Human Protein Atlas contains data for more than 14,000 unique proteins. This corresponds to more than 70% of all human protein encoding genes [5].

As the cell constitutes the smallest living entity, it is required to harbor specialized and distinct subcellular structures. Cells vary considerably in function and morphology and these differences form the basis for the concept of different cellular phenotypes. On a higher level, cell types with their distinct phenotypes are organized into tissues, commonly categorized as epithelial, muscle, vascular, nervous and connective tissue, and hematopoietic cells. Genetic changes leading to dysregulated signaling pathways with altered protein expression patterns cause a transformation from normal to the phenotypes and morphology that signifies cancer. Cancer is a heterogeneous disease associated with alterations in protein expression patterns leading to cell growth and 'anti-social behavior' of tumor cells. The deregulated expression patterns in tumor cells are caused by genetic and epigenetic alterations leading to distortion of multiple proteins and signaling pathways. Despite the complexity of cancer, microscopic evaluation of tissue morphology remains the gold standard for determining a cancer diagnosis in a clinical setting. Although morphology is crucial, adding a layer of information regarding protein expression on top of morphology appears to be beneficial for the stratification of different tumor types. Immunohistochemistry prevails as an invaluable method to provide such a tool for visualization of protein expression patterns in cells from a section of tumor tissue.

The Dictionary - a tool for biomarker studies

The dictionary contains three main parts: normal tissues, cancer tissues, and cells ( (Figure (Figure1).1). All images and examples include descriptive text boxes and supporting text with background information, to facilitate interpretation of the complex patterns underlying normal tissue histology, tumor pathology and cell biology. H & E-stained tissue sections have been scanned at 40 × magnification and both normal and cancer tissues are shown at three different levels of magnification.

Figure 1
Schematic showing the starting page for the dictionary. The three main parts, normal tissues, cancer tissues and cell structures, are displayed side by side with alphabetical lists below showing the contents of each part to facilitate navigation. All ...

Altogether 45 normal tissue types (represented by 173 images), 20 different cancer types (represented by 193 images) and 18 subcellular structures (represented by 103 images) are included in the dictionary. Examples of normal tissue show colon (Figure (Figure2A)2A) and breast (Figure (Figure2B)2B) at the three levels of magnification. For cancer, one case of low-grade (Figure (Figure2C)2C) and one case of high-grade (Figure (Figure2D)2D) ductal breast cancer is shown. IF and IHC images representing antibodies targeting proteins in the nucleoli and the mitochondria demonstrate the cell structure part (Figure (Figure3).3). In addition to high resolution images, there are summarizing descriptive text paragraphs to supplement the images.

Figure 2
Images showing examples of H & E-stained tissues, including descriptive text boxes, at three different magnifications. Normal tissue is exemplified by two tissue types. The top overview shows the major components of a normal human colon, followed ...
Figure 3
Examples of images demonstrating different organelles in cells. The upper panel shows IF (left and middle) and IHC (right) images representing the nucleoli, visualized by antibodies targeting proteins expressed in the nucleoli. The nucleoli are shown ...

As one of the main goals of this project is to identify novel biomarkers that can be developed for clinical use, the 20 types of human cancers illustrated in the dictionary have also been used for protein profiling in the Human Protein Atlas. Using the search function at the Human Protein Atlas portal [6], search strings can be created to identify candidates for cell or tumor type specific markers and also proteins differentially expressed within a given tumor type, thus representing potential prognostic indicators.

Clinical impact

Successful identification and translation of informative biomarkers to aid clinical decision-making is a prerequisite for the implementation of personalized cancer therapeutic regimens. The antibody-based proteomics strategy employed in the Human Protein Atlas plays a key role in the cancer biomarker discovery and validation pipeline, facilitating evaluation of candidate markers [7]. The newly launched dictionary provides a useful tool to interpret and evaluate biomarker candidates identified through various search strategies in the Human Protein Atlas. The appraisal of protein expression patterns in tumor tissue is a crucial step to select the most promising candidates for extended experiments, including clinical studies in larger cohorts, functional studies and in-depth validation of expression patterns.

The Human Protein Atlas has already been used in several clinical biomarker studies as a starting point for exploring both diagnostic and prognostic factors. Cell and tumor type specific protein expression, critical for developing diagnostic markers, is exceedingly rare [8], and only a few such markers exist for clinical use. As an example, the DNA-binding protein SATB2 was identified in the Human Protein Atlas as a potential novel diagnostic marker for colorectal cancer and in an extended study including more than 2,400 tumors, SATB2 was found to be both a sensitive and highly specific marker for colorectal cancer [9]. The basic protein profiling data available in the Human Protein Atlas has also allowed for several potential prognostic cancer biomarkers to be identified for different types of cancer. This is exemplified by the RNA-binding protein RBM3, discovered to be a prognostic marker for several different forms of cancer [10-12], and also a potential treatment predictive marker for platinum-based therapies [13]. Understanding of tumor tissue composition is also fundamental for studies regarding tumor stroma compartments. In a recent tumor biology study [14] using a mouse model, large numbers of granulin-expressing bone marrow-derived hematopoietic cells were found in the tumor stroma of breast cancers responding to instigating signals. This study also showed that the expression of granulin in human breast cancer was strongly correlated with the triple negative/basal-like breast tumor subtypes, and that breast cancer patients with tumors positive for granulin staining had a significantly worse outcome in terms of overall survival. The presented dictionary and Human Protein Atlas can also be utilized in other clinical research fields, exemplified by the identification of targets for in vivo imaging of pancreatic beta-cells cells in diabetes research [15,16].

Concluding remarks

Here we present a freely available cell and tissue dictionary as an amendment to the Human Protein Atlas (reviewed in [17]) that can be used to facilitate the interpretation of clinical tissue biomarkers. A multitude of high quality images with supporting short text paragraphs are displayed on the Human Protein Atlas web portal ( to provide a useful guide for researchers who are not familiar with the microscopical landscape that forms the home ground for histologists and pathologists. In this first version of the dictionary, H & E stained tissue sections are presented for visualization of normal tissue and cancer morphology. The essential background needed to interpret and understand expression data retrieved from tissues and cells is presented. The aim is to expand the contents of the dictionary to also include additional levels of information regarding protein expression so that various cell populations not distinguishable from morphology alone can be visualized. Established antibodies can be used for IHC on consecutive sections of selected tissues to demonstrate different cell types, for example, B-lymphocytes, T-lymphocytes and endothelial cells, and different cell states, for example, proliferation and differentiation. Moreover, updates with additional links and text paragraphs can be added as well as including more examples of both normal and diseased tissues. For educational purposes, show/hide functionality for text boxes could be further developed together with sets of relevant 'Questions & Answers'. We anticipate that a content rich and knowledge-based cell and tissue dictionary, combined with the comprehensive map of protein expression patterns in normal and cancer tissues available through the Human Protein Atlas, will provide a significant foundation for both basic and clinical research projects.


H & E: haematoxylin and eosin; IF: immunofluorescence; IHC: immunohistochemistry; TMA: tissue micro arrays.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

CK, MU and FP contributed to the conception and the supervision of the project. CK, JB, AA and FP wrote the manuscript. JB, MW, EL, SN and FP wrote the text corresponding to all normal, cancer and cell images. PO developed the database and website. All authors were involved in editing and the final approval of the paper.

Authors' information

CK: associate professor and site Director for the tissue protein profiling facility, JB: PhD student (tissue biomarkers), PO: IT developer, AA: post-doctoral (immunohistochemistry-based cell profiling), SN: senior pathologist (immunohistochemistry-based tissue profiling), MW: PhD student (immunofluorescence-based cell profiling), EL: associate professor and responsible for the subcellular profiling unit, MU: professor and program Director for the Human Protein Atlas and FP: professor, senior pathologist and clinical Director for the Human Protein Atlas.

Pre-publication history

The pre-publication history for this paper can be accessed here:


This work was supported by grants from the Knut and Alice Wallenberg Foundation. The pathologists and staff at the Pathology Clinic, Uppsala University Hospital, Uppsala, Sweden, are gratefully acknowledged for all efforts regarding the handling and diagnostics of the tissues used in the Human Protein Atlas. The authors wish to especially thank Dijana Djureinovic, Sofie Gustafsson, Elene Karlberg, Dijana Cerjan and John Juter for assistance with the dictionary work.


  • Uhlen M, Ponten F. Antibody-based proteomics for human tissue profiling. Mol Cell Proteomics. 2005;4:384–393. doi: 10.1074/mcp.R500009-MCP200. [PubMed] [Cross Ref]
  • Kampf C, Olsson I, Ryberg U, Sjostedt E, Ponten F. Production of tissue microarrays, immunohistochemistry staining and digitalization within the human protein atlas. J Vis Exp. 2012. p. e3620. DOI: 10.3791/3620. [PMC free article] [PubMed]
  • Barbe L, Lundberg E, Oksvold P, Stenius A, Lewin E, Bjorling E, Asplund A, Ponten F, Brismar H, Uhlen M, Andersson-Svahn H. Toward a confocal subcellular atlas of the human proteome. Mol Cell Proteomics. 2008;7:499–508. [PubMed]
  • Uhlen M, Bjorling E, Agaton C, Szigyarto CA, Amini B, Andersen E, Andersson AC, Angelidou P, Asplund A, Asplund C, Berglund L, Bergström K, Brumer H, Cerjan D, Ekström M, Elobeid A, Eriksson C, Fagerberg L, Falk R, Fall J, Forsberg M, Björklund MG, Gumbel K, Halimi A, Hallin I, Hamsten C, Hansson M, Hedhammar M, Hercules G. et al. A human protein atlas for normal and cancer tissues based on antibody proteomics. Mol Cell Proteomics. 2005;4:1920–1932. doi: 10.1074/mcp.M500279-MCP200. [PubMed] [Cross Ref]
  • Uhlen M, Oksvold P, Fagerberg L, Lundberg E, Jonasson K, Forsberg M, Zwahlen M, Kampf C, Wester K, Hober S, Wernerus H, Björling L, Ponten F. Towards a knowledge-based Human Protein Atlas. Nat Biotechnol. 2010;28:1248–1250. doi: 10.1038/nbt1210-1248. [PubMed] [Cross Ref]
  • Bjorling E, Lindskog C, Oksvold P, Linne J, Kampf C, Hober S, Uhlen M, Ponten F. A web-based tool for in silico biomarker discovery based on tissue-specific protein profiles in normal and cancer tissues. Mol Cell Proteomics. 2007;7:825–44. [PubMed]
  • Brennan DJ, O'Connor DP, Rexhepaj E, Ponten F, Gallagher WM. Antibody-based proteomics: fast-tracking molecular diagnostics in oncology. Nat Rev Cancer. 2010;10:605–617. doi: 10.1038/nrc2902. [PubMed] [Cross Ref]
  • Ponten F, Gry M, Fagerberg L, Lundberg E, Asplund A, Berglund L, Oksvold P, Bjorling E, Hober S, Kampf C, Navani S, Nilsson P, Ottosson J, Persson A, Wernérus H, Wester K, Uhlén M. A global view of protein expression in human cells, tissues, and organs. Mol Syst Biol. 2009;5:337. [PMC free article] [PubMed]
  • Magnusson K, De Wit M, Brennan DJ, Johnson LB, McGee SF, Lundberg E, Naicker K, Klinger R, Kampf C, Asplund A, Wester K, Gry M, Bjartell A, Gallagher WM, Rexhepaj E, Kilpinen S, Kallioniemi OP, Belt E, Goos J, Meijer G, Birgisson H, Glimelius B, Borrebaeck CA, Navani S, Uhlén M, O'Connor DP, Jirström K, Pontén F. SATB2 in combination with cytokeratin 20 identifies over 95% of all colorectal carcinomas. Am J Surg Pathol. 2011;35:937–948. doi: 10.1097/PAS.0b013e31821c3dae. [PubMed] [Cross Ref]
  • Jogi A, Brennan DJ, Ryden L, Magnusson K, Ferno M, Stal O, Borgquist S, Uhlen M, Landberg G, Pahlman S, Pontén F, Jirström K. Nuclear expression of the RNA-binding protein RBM3 is associated with an improved clinical outcome in breast cancer. Mod Pathol. 2009;22:1564–1574. doi: 10.1038/modpathol.2009.124. [PubMed] [Cross Ref]
  • Hjelm B, Brennan DJ, Zendehrokh N, Eberhard J, Nodin B, Gaber A, Ponten F, Johannesson H, Smaragdi K, Frantz C, Hober S, Johnson LB, Påhlman S, Jirström K, Uhlen M. High nuclear RBM3 expression is associated with an improved prognosis in colorectal cancer. Proteomics Clin Appl. 2011;5:624–635. doi: 10.1002/prca.201100020. [PubMed] [Cross Ref]
  • Jonsson L, Bergman J, Nodin B, Manjer J, Ponten F, Uhlen M, Jirstrom K. Low RBM3 protein expression correlates with tumour progression and poor prognosis in malignant melanoma: an analysis of 215 cases from the Malmo Diet and Cancer Study. J Transl Med. 2011;9:114. doi: 10.1186/1479-5876-9-114. [PMC free article] [PubMed] [Cross Ref]
  • Ehlen A, Brennan DJ, Nodin B, O'Connor DP, Eberhard J, Alvarado-Kristensson M, Jeffrey IB, Manjer J, Brandstedt J, Uhlen M, Pontén F, Jirström K. Expression of the RNA-binding protein RBM3 is associated with a favourable prognosis and cisplatin sensitivity in epithelial ovarian cancer. J Transl Med. 2010;8:78. doi: 10.1186/1479-5876-8-78. [PMC free article] [PubMed] [Cross Ref]
  • Elkabets M, Gifford AM, Scheel C, Nilsson B, Reinhardt F, Bray MA, Carpenter AE, Jirstrom K, Magnusson K, Ebert BL, Pontén F, Weinberg RA, McAllister SS. Human tumors instigate granulin-expressing hematopoietic cells that promote malignancy by activating stromal fibroblasts in mice. J Clin Invest. 2011;121:784–799. doi: 10.1172/JCI43757. [PMC free article] [PubMed] [Cross Ref]
  • Lindskog C, Asplund A, Engkvist M, Uhlen M, Korsgren O, Ponten F. Antibody-based proteomics for discovery and exploration of proteins expressed in pancreatic islets. Discov Med. 2010;9:565–578. [PubMed]
  • Lindskog C, Korsgren O, Ponten F, Eriksson JW, Johansson L, Danielsson A. Novel pancreatic beta cell-specific proteins: antibody-based proteomics for identification of new biomarker candidates. J Proteomics. 2012;75:2611–2620. doi: 10.1016/j.jprot.2012.03.008. [PubMed] [Cross Ref]
  • Ponten F, Schwenk JM, Asplund A, Edqvist PH. The Human Protein Atlas as a proteomic resource for biomarker discovery. J Intern Med. 2011;270:428–446. doi: 10.1111/j.1365-2796.2011.02427.x. [PubMed] [Cross Ref]

Articles from BMC Medicine are provided here courtesy of BioMed Central