HHS Public Access; Author Manuscript; accepted for publication in a peer-reviewed journal.
Cancer. Author manuscript; available in PMC 2009 October 1.
Published in final edited form as:
PMCID: PMC2745185

A multi-disciplinary approach to honest broker services for tissue banks and clinical data: a pragmatic and practical model



Honest broker services are essential for tissue- and data-based research. The honest broker provides a firewall between clinical and research activities. Clinical information is stripped of Health Insurance Portability and Accountability Act-denoted personal health identifiers. Research material may have linkage codes, precluding the identification of patients to researchers. The honest broker provides data derived from clinical and research sources. These data are for research use only, and there are rules in place that prohibit reidentification. Very rarely, the institutional review board (IRB) may allow recontact and develop a recontact plan with the honest broker. Certain databases are structured to serve a clinical and research function and incorporate ‘real-time’ updating of information. This complex process requires resolution of a variety of issues regarding the precise role of honest brokers and their interaction with data. There also is an obvious need for software solutions to make the task of deidentification easier.


The University of Pittsburgh has implemented a novel, IRB-approved mechanism to address honest broker functions to meet the specimen and data needs of researchers. The Tissue Bank stores biologic specimens. The Cancer Registry culls data and annotating information as part of state- and federal-mandated functions and collects data on the clinical progression, treatment, and outcomes of cancer patients. The Cancer Registry also has additional IRB approval to collect data elements only for research purposes. The Clinical Outcomes Group is involved in patient safety and health services research. Radiation Oncology and Medical Oncology provide critical treatment related information. Pathology and Oncology Informatics have designed software tools for querying availability of specimens, extracting data, and deidentifying specimens and annotating data for clinical and translational research. These entities partnered and submitted a joint IRB proposal to create an institutional honest broker facility. The employees of this conglomerate have honest broker agreements with the University of Pittsburgh and the Medical Center. This provides a large group of honest brokers, ensuring availability for projects without any conflict of interest.


The honest broker system has been an IRB-approved institutional entity at the University of Pittsburgh since 2003. The honest broker system currently includes 33 certified honest brokers encompassing the multiple partners of this system. The honest broker system has handled >1600 requests over the past 4 years with a 25% increase in volume each year.


The current results indicate that the collaborative honest broker model described herein is robust and provides a highly functional solution to the specimen and data needs for critical clinical and translational research activities.

Keywords: honest broker, biologic specimens, data annotation, Institutional Review Board, tissue bank, translational research, Health Insurance Portability and Accountability Act of 1996

The last decade has seen significant advances in molecular biology (genomics and proteomics) and translational research. These new initiatives have resulted in a growing demand for specific, highly annotated human tissues and other biological specimens [1–3]. This growing demand has reinforced the importance of tissue banks as a major part of the necessary infrastructure of any institution or research initiative seeking to address biologically and clinically relevant issues [4–6]. The NCI has an office dedicated to biospecimens [7] that has provided a document detailing best practices for repositories [8]. In addition, the International Society for Biologic and Environmental Repositories has put together a “best practices” document incorporating suggestions from its members [9]. Similar initiatives have been undertaken in the US [10] and in the UK [11]. Finally, there are variations in laws from state to state regarding the use of tissues and annotating data [12].

In addition, many of these research initiatives require extensive annotating information that is not present in one data source.

Two areas particularly need annotating information: tissue-based research and health services research, which requires patient information for research assessment. Health services research includes outcomes-focused research; assessment of the impact of different therapeutic regimens; research focused on the quality and safety of health care; research to evaluate quality assurance, quality control, and errors; and research focused on the impact of different information systems on the overall quality of health delivery, patient education, and error reduction.

The past few years have also seen a significant structured movement to protect the confidentiality of research study participants. Although the concept of an honest broker has been around for more than a decade, the advent of the Health Insurance Portability and Accountability Act (HIPAA) [13] further emphasized the need for systems and mechanisms to identify and remove personal health identifiers (PHI) from research information. It should be noted that HIPAA does not address research information, except that such information may become PHI if it is identified and held in a covered entity. There are facility- and institution-specific regulations mandating policies on patient confidentiality. The Institutional Review Board (IRB) also provides input and direction regarding policies and procedures affecting access to patient information. Finally, institutions are also cognizant of prevailing views regarding legal and ethical issues. It is important to develop protocols to protect patient identifiers and confidentiality in the current environment.

The need for data, together with the need to protect subject confidentiality, resulted in a logjam blocking data aggregation and disbursement. This conflict exposed the lack of preparedness of major institutions to collect, collate, and disburse data elements needed for projects while maintaining patient privacy. As a result, obtaining access to well documented tissue specimens, described using normalized descriptors, became an important impediment to the progress of research projects [14].

The Cancer Registry and the Health Sciences Tissue Bank engaged in discussions to evaluate mechanisms for addressing this issue. The major players in the research field were identified. The Health Sciences Tissue Bank, the Pathology Laboratory Information System, the Cancer registry, the Clinical Research Informatics Service, the Clinical Outcomes group, Radiation and Medical Oncology, Pathology and Oncology Informatics and the “Electronic Medical Record” team were considered key players. This list might not encompass every possible entity that could play a role; nonetheless it captures the major players involved in the aggregation and provision of specimens and data. Policies and procedures were established to serve as guiding principles.


The requests for biological specimens and data for research purposes have increased significantly over the years. Data requests have become increasingly complex. This increased complexity is partly related to outcomes related initiatives to evaluate biomarkers and their role in guiding therapy or predicting outcome. In addition, awareness of confidentiality issues has increased significantly since the implementation of HIPAA.

Requests for research projects consist primarily of:

  1. Tissue and biological specimens only.
  2. Clinical (phenotype) data, most frequently pathology data.
  3. Outcomes information including treatment, progression and vital status.

We evaluated tissue and data requests at the University of Pittsburgh and found that 20% of research projects needed biological specimens only, 10% of research projects needed outcomes information, and the remaining 70% needed more fully annotated tissues requiring phenotypic (clinical) data. The annotation varied from easily accessible (e.g. pathology data) to complex (pre-therapy and post-therapy information). This breakdown of research requests is shown in figure 1. These findings suggested the need to design a system that could provide research biological specimens annotated with patient data while protecting the confidentiality of patient information and fully meeting the requirements of federal regulations [15]. This required implementation of a system that is HIPAA compliant and provides human subjects protection. The resulting system was based on the Honest Broker Concept.

Fig. 1
Breakdown of research requests received and their associated annotation requirements.

In many instances, the collection of information on clinical progression, treatment and outcomes of cancer patients may fall under human subjects’ research and therefore require specific IRB review and approval. The overall attempt is to have “informed consent” from all the patients for research use of their biological materials. In addition, it is also the attempt of the various registries in the institute to obtain “informed consent” from all patients for the research use of their data.

Human Subjects Protection – The Honest Broker Concept

The tissue/databank ensures protection of patient identity through "The Honest Broker Concept." The honest broker is an individual/organization/system which acts on, or on the behalf of, the tissue/databank. The role of the honest broker is to collect and provide health information to research investigators in such a manner whereby it would not be reasonably possible for the investigators, or other individuals, to identify the subjects directly or indirectly. The “honest broker” or “tissue/data bank trustee” acts as a well defined barrier between the clinical environment (in which fully identified confidential patient information is routinely exchanged as part of medical care) and the general research community (in which all information must be completely de-identified). The honest broker also ensures that research data, which is generally not clinically validated, is not used for clinical care [16].

In our rendition, the honest broker is not part of either the clinical or research team. The honest broker is dedicated to providing “honest broker” services only to a particular project and is not part of either the data collection team or the research team. This is to avoid any potential conflict of interest. It needs to be emphasized that these roles change from project to project for a particular individual. However the end result is that the dedicated honest broker for that project is not part of either the research team or the data/biological specimen aggregation team.

This is important to ensure confidentiality and honest research. The honest broker is the only entity that can link research identifiers and clinical identifiers. This transfers control and responsibility of the de-identification process to an independent third party, the honest broker, thereby reducing the risk of conflict of interest. Personal and clinical identifiers (names, addresses, medical record numbers etc.) are limited to the clinical space. The research identifiers (e.g. “subject 12432”) cannot be traced back to the personal or clinical identifiers except through the honest broker’s linkage codes. This concept differs from anonymization. Anonymization is a one-way process in which the linkage between personal identifiers and research identifiers is removed. Anonymization precludes any subsequent updating of data. The process of data annotation with the particular specimen stops when anonymization is performed. The process of having the honest broker assign linkage codes (re-identification codes) allows information to be updated at any time in the future. The honest broker can identify the patient by means of the linkage code, access information related to this patient from the clinical domain, and provide updated information to the researchers in a de-identified fashion, using the original linkage code. The link between codes must be retained and protected by the honest broker. Subsequent requests to update information on research protocol participants (research cohort) must be conducted through the honest broker. The honest broker system is therefore an upgrade to the process of anonymization. Anonymization essentially provides information up to the time of accrual, whereas the honest broker concept allows information to be updated in a manner that is consistent with current legal and ethical protocols.
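The contrast between linkage codes and anonymization can be illustrated with a minimal sketch. It is not the software used at the University of Pittsburgh; the class name `HonestBroker`, the field names (`mrn`, `name`, `address`), and the code format are all hypothetical and chosen only for illustration.

```python
import secrets

# Hypothetical set of direct identifiers; a real system would cover all
# HIPAA-denoted identifiers, not just these three.
DIRECT_IDENTIFIERS = {"name", "address", "mrn"}


class HonestBroker:
    """Holds the only mapping between clinical and research identifiers."""

    def __init__(self):
        self._mrn_to_code = {}  # clinical identifier -> linkage code
        self._code_to_mrn = {}  # linkage code -> clinical identifier

    def linkage_code(self, mrn):
        """Return a stable, random research identifier for a clinical identifier."""
        if mrn not in self._mrn_to_code:
            code = "subject-" + secrets.token_hex(4)
            while code in self._code_to_mrn:  # avoid the unlikely collision
                code = "subject-" + secrets.token_hex(4)
            self._mrn_to_code[mrn] = code
            self._code_to_mrn[code] = mrn
        return self._mrn_to_code[mrn]

    def release(self, clinical_record):
        """Strip direct identifiers and attach the linkage code."""
        shared = {k: v for k, v in clinical_record.items()
                  if k not in DIRECT_IDENTIFIERS}
        shared["research_id"] = self.linkage_code(clinical_record["mrn"])
        return shared

    def update(self, research_id, clinical_lookup):
        """Fetch current clinical data for an existing code and release it."""
        mrn = self._code_to_mrn[research_id]  # only the broker can do this
        return self.release(clinical_lookup(mrn))


def anonymize(clinical_record):
    """One-way removal: no linkage is kept, so later updates are impossible."""
    return {k: v for k, v in clinical_record.items()
            if k not in DIRECT_IDENTIFIERS}
```

In this sketch the researcher only ever sees the output of `release` or `update`; the two private dictionaries stand in for the linkage file that the honest broker must retain and protect.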

Discussions involving the Cancer Registry and the Health Sciences Tissue Bank identified the major sources of tissue and biological specimens and annotating data for research use. The privacy rule of the HIPAA of 1996 permits access to protected health information without patient authorization in a limited number of situations [13]. One frequent situation is where the protected health information is being used in a de-identified fashion. The honest broker plays a prominent role in this scenario, since neither the federal policy nor HIPAA regulations require prior written consent or authorization of patients when using existing health information in a de-identified fashion. The honest broker can be a part of the facility providing the data. In addition the honest broker can be a business associate of the facility [17]. The supplemental files attached include the University of Pittsburgh template business associate agreement. This approach allowed us to expand the circle of participating facilities. We decided to include division/departments involved in data aggregation as well as facilities that were creating and implementing software solutions and tools for these groups as participants for this initiative. The software groups included Pathology and Oncology Informatics and the Electronic Medical Records team. This list may not include every possible entity that could play a role; nonetheless it does capture the major players involved in aggregation and provision of specimens and data, and designing software tools for these efforts.

The facilities that are currently part of the “Honest broker facility” and their roles in this initiative are described below.

Participating facilities

1. The Health Sciences Tissue Bank

The Health Sciences Tissue Bank is the main institutional infrastructure for collecting tissue and other biological materials for research. These research specimens are stored in a de-identified fashion, annotated with linkage codes, because of confidentiality issues. However the linkage codes allow access to specific information regarding the donor. This is important since many research projects require not only tissue and biological specimens but also additional data regarding family history, treatment history, and outcomes.

2. The Pathology Laboratory Information System

This is the clinical system used for reporting pathology information. This repository contains extensive information regarding clinical evaluation of tissue and other biological specimens. This information is extremely useful to provide a better understanding of the composition of the research specimen. The system stores clinically reported information pertaining to tissue specimens (biopsy and resection reports), cytology specimens (exfoliated as well as aspirate specimens), and other biologic specimens (blood/blood products/urine/other biological specimens).

3. The Cancer Registry

The Registry performs the state-mandated function of collecting information on cancer patients. The information collected pertains to both diagnostic details as well as follow up information. The data collected by the Registry consists of a set of defined data elements that are part of a standardized set of common data elements. We have further modified this approach by adding additional data elements, of primarily research value, as part of a separate IRB approved initiative.

4. The Clinical Outcomes group

This institutional entity collects and provides information pertaining to ongoing clinical trials, health services research and patient safety research.

5. Radiation and Medical Oncology

Radiation and medical oncology are important caregivers for oncologic diseases. The clinical database of these two entities provides critical information regarding therapeutic intervention and responses to those specific therapies. Information accrued from Radiation and Medical Oncology is therefore critical in providing insight regarding patient response to therapeutic protocols.

6. Pathology and Oncology Informatics

This group is responsible for designing and maintaining the informatics infrastructure for collection, storage and disbursement of annotating information. It is important to affiliate this group with the honest broker infrastructure development since Pathology and Oncology Informatics designs, tests and maintains the tools needed for the other components of the honest broker system. Some of these include software packages needed for Inventory Management by the Health Sciences Tissue Bank, data aggregation software packages for the Cancer Registry and clinical outcomes group, clinical information and research information recording mechanisms for Medical and Radiation Oncology, and de-identification software packages needed by many participating facilities (Health Sciences Tissue Bank, Cancer Registry, the Electronic Medical Record team and others). NOTE: Our Pathology and Oncology Informatics groups were merged into the new Department of Biomedical Informatics as of June 2006.

7. The University of Pittsburgh Health Systems Information Services Division

Most clinical data is captured in an electronic form in various hospital information systems. This includes patient history, details of surgical and radiological procedures, therapeutic interventions and follow-up information. The clinical component of the electronic medical records consists of information in an identified form. However the transfer of this information into the research domain requires de-identification of this information. The electronic medical record team therefore serves as a gatekeeper for this information and oversees implementation of appropriate de-identification protocols prior to the incorporation of this data into research databases. The electronic medical record team also plays a critical role in performing queries for specific research requests. This activity helps identify appropriate patient populations for research projects. These identified patient lists then need to undergo de-identification.

In this concept at least one individual is acting as an honest broker at each of the facilities listed above. For clinical and translational research studies in oncology, the Cancer registrars are extremely valuable since their federal mandate and the job specifications allow them ready access to clinical information on cancer patients. In addition, they are not involved in specimen banking or research and thus do not have access to the data annotating tissue bank samples or the results of the research studies. The inclusion of the cancer registry into an honest broker system facilitates data accrual from this purely clinical data entity which maintains updated information on all oncology patients. This updating is done every six months and is part of the state-mandated function of the cancer registry.

The “Institutional Honest Broker” system ensures that the honest broker ("trustee") is the only person who can link a patient with the tissue bank number that identifies that patient. The Institutional Honest Broker system also provides a process via which new clinical outcome information can be added to a file identified only by a code number, rather than a name. This creates a fail-safe mechanism for communicating with patients in the extremely rare event of an IRB directed dissemination of important research data to the patient or their survivors.

It was decided to incorporate the above named groups, involved in tissue and data aggregation with possible research application, into an Institutional Honest Broker system.

The University of Pittsburgh Academic Health Center consists of two closely interacting, but legally separate, entities. These are the University of Pittsburgh, which oversees primarily the research activities, and the University of Pittsburgh Medical Center (UPMC), which oversees clinical activity and in which the clinical data resides. Potential legal/ethical issues pertaining to the creation of this system were discussed with the Institutional Review Board (IRB) of the University of Pittsburgh as well as the legal team of the UPMC. A formal IRB application for this “Honest Broker Facility”, incorporating the comments and suggestions of the IRB and the legal team of the University of Pittsburgh Medical Center Health Systems, was approved by the IRB and formally went into effect on May 8, 2003.

The employees of the Honest Broker Facility have honest broker agreements with the University of Pittsburgh and the University of Pittsburgh Health Systems. This Honest Broker Facility encompasses several separate departments and divisions. Each of these entities has contributed by providing personnel into the honest broker pool. This arrangement has provided a large task force for honest broker activities, which is important since an honest broker should not be involved with the research requiring honest broker services. This approach ensures lack of conflict for the individual engaged in honest broker activities, thereby creating an appropriate work environment.

Honest Broker Process

The honest broker certification process requires completion of IRB mandated education modules. These modules are Research Integrity, Human Subjects Research in Biomedical Sciences, and HIPAA Researchers Privacy Requirements. The education modules can be completed via the Web at the University of Pittsburgh IRB web site. A certificate of completion is generated once each module has been completed. In addition, the honest broker also has to enter into a business associate agreement [17]. An individual can become a certified honest broker once these administrative requirements have been completed.

The honest broker facility provides an update to the IRB every six months. The update is an opportunity to add or delete honest brokers. The Institutional Honest Broker system at the University of Pittsburgh has assigned overall administrative responsibility for the honest broker service to the Manager of the Cancer Registry. However, this oversight can be provided by the leaders of any of the participating entities.

The Pathology and Oncology Informatics division has designed a Data Request Tracking Tool for the honest broker system. This tool is located on a password protected website within the firewall of the University of Pittsburgh. The process, and its interaction with affiliated entities, is described on an accessible website. The tool provides the interface for entering descriptive details of a research project requiring honest broker services. After logging into the system, a menu of options is available to the honest broker. This is shown in figure 2. The honest broker handling a particular request enters all the information about the research project into the database using the initial data-entry screen of this tool. The initial data-entry screen captures information pertaining to the investigator, the nature of the request, as well as important workflow issues like requested turnaround time, IRB status and approval number. In addition, this screen also captures information pertaining to billing, in case the services provided will be compensated through an institutional account rather than grant funded mechanisms. This tool has a built-in query capability. The honest broker designates the fields required for the data sources, the disease category, the method of output for tissue/biological specimens and data, the method of distribution and the purpose of the request. A screen capture of this aspect of the tool is shown in figure 3. The honest broker alerts their supervisor once all project information has been entered into the tracking tool. The supervisor reviews project details and provides input and approval. This tracking tool is used to follow a research tissue/data request from start to finish. This provides information regarding turnaround time as well as time spent on a project.
All of this information is summarized and available in the final "complete request" snapshot of the tool. This is shown in figure 4.
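The kind of record such a tracking tool maintains can be sketched as a small data structure. This is an illustrative assumption, not the schema of the actual Data Request Tracking Tool: the class name `TrackedRequest`, every field name, and the rule that supervisor approval must precede completion are hypothetical, derived only from the fields listed in the text.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List, Optional


@dataclass
class TrackedRequest:
    # Captured on the initial data-entry screen (field names are hypothetical)
    investigator: str
    nature_of_request: str
    requested_turnaround_days: int
    irb_status: str
    irb_approval_number: Optional[str] = None
    billing_account: Optional[str] = None  # only for institutional accounts
    # Designated by the honest broker for the query
    data_sources: List[str] = field(default_factory=list)
    disease_category: str = ""
    output_method: str = ""
    distribution_method: str = ""
    purpose: str = ""
    # Workflow tracking from start to finish
    opened_on: Optional[date] = None
    completed_on: Optional[date] = None
    supervisor_approved: bool = False
    hours_spent: float = 0.0

    def approve(self) -> None:
        """Record the supervisor's approval of the project details."""
        self.supervisor_approved = True

    def log_time(self, hours: float) -> None:
        """Accumulate time spent on the project."""
        self.hours_spent += hours

    def complete(self, on: date) -> None:
        """Close the request (assumed here to require supervisor approval)."""
        if not self.supervisor_approved:
            raise ValueError("supervisor approval is required before completion")
        self.completed_on = on

    def turnaround_days(self) -> Optional[int]:
        """Days from opening to completion, or None if still open."""
        if self.opened_on is None or self.completed_on is None:
            return None
        return (self.completed_on - self.opened_on).days
```

A completed record of this shape carries what the final summary needs: who asked, under which IRB approval, which sources were queried, how long the work took, and the total time spent.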

Fig. 2
A screen capture of the login screen for the Honest Broker tool. It shows the multiple processes involved in serving a request by the Honest Broker.
Fig. 3
A screen capture of the query tool showing the multiple types of data and biological specimen requests that it can handle.
Fig. 4
A screen capture showing the “completed request” summary generated by the Honest Broker tool.

De-identification protocols

The de-identification of patient samples and data is performed using a variety of tools. The Pathology Lab Information System, CoPath, has limited de-identification capabilities. The electronic medical record system also has de-identification software systems. The honest broker system can be utilized for de-identifying specimens/data, with the honest broker retaining codes for the specimen/data provided. The Clinical Research Informatics Service in the Department of Biomedical Informatics has created a HIPAA compliant de-identification engine. Electronic mechanisms for addressing honest broker issues are described in the literature [18, 19, 20].

This de-identification engine has been certified by the IRB of the University of Pittsburgh as well as by the University of Pittsburgh Medical Center security office for generating de-identified output from a variety of free text medical reports. This engine identifies all HIPAA mandated PHI (e.g. names) and replaces each item with a de-identified tag and replacement letters. If the same person is encountered in multiple places in the same report, the same replacement letters are used for every occurrence. Similarly, dates are replaced by offset dates, so that the intervals among aggregated reports can still be determined. An example of a de-identified report generated by this engine is shown in figure 5. The system generates a linkage file for each patient. This file is stored on a secure server. A diagrammatic representation of this process is shown in figure 6.

Fig. 5
An example of a de-identified report created by the “De-identification” engine.
Fig. 6
Pictorial representation of process of data request, structure of information generation, retrieval, storage and encryption.

Data sources

The collaborative honest broker service utilizes multiple sources of data. These include clinical applications (Pathology Laboratory Information Services, Radiation Oncology Systems, Outpatient Systems and Hospital Information Systems), Clinical Trials related applications, Cancer Registry applications, and Tissue Banking Inventory and Information Systems. In addition paper-based records in physician offices and legacy records in the hospital may be used. These multiple data sources are listed in Table 1.

Oversight of the honest broker system

Sharon Winters, the director of the cancer registry, serves as the overall manager of the honest broker facility. She is primarily responsible for maintaining oversight regarding administrative and regulatory issues. She is assisted in this role by the lead supervisors of the participating facilities. This includes the manager of the Tissue Bank, the manager of the Quality Assurance facility, the manager of Pathology and Oncology Informatics, as well as the data managers for Medical Oncology and Radiation Oncology.

Oversight for tissue and biological specimen disbursement

Oversight of tissue and biological specimen disbursement is provided by a number of organ-specific Tissue Utilization Committees (TUCs). These utilization committees are within the University of Pittsburgh’s translational and clinical programs. The University of Pittsburgh has functioning Tissue Utilization Committees for the following organ sites: Lung, Head & Neck, GU, GI, Women’s Health, Melanoma, Liver and Transplant, and non-neoplastic lung diseases. Each committee provides representation to the different groups involved in decision making and research specimen usage for that particular organ type (Surgery/Oncology/Pathology/Researchers). Each of these committees makes binding recommendations to the personnel of the Tissue Bank on the priorities for distribution of tissue and biological materials. There is an Institutional Oversight Committee that oversees the different organ-specific TUCs. This Institutional Oversight Committee serves as a final arbiter in case of conflicts that are not resolved in the organ-specific TUC. The oversight committee consists of clinical and research leaders at the University of Pittsburgh. The oversight committee also serves the role of an internal scientific advisory board.

Mechanisms for prioritization of biological specimens

The prioritization protocol is consistent with institutional policies. However there are variations from organ system to organ system, depending on the different projects being taken care of. The criteria for prioritization are:

  1. SPORE projects.
  2. Exploratory pilot projects directed at SPORE project development.
  3. Projects funded by federally funded peer-reviewed agencies.
  4. Projects funded by non-federal agencies.
  5. Projects funded by industry.

The tissue utilization committees do have the authority to make exceptions, with the approval of the oversight committee.
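The prioritization criteria above amount to an ordered ranking, which can be sketched as a stable sort. This is only an illustration under stated assumptions: the category labels, the request dictionaries, and the `overrides` argument (standing in for an exception approved by the oversight committee) are hypothetical and do not describe the Tissue Bank's actual procedure.

```python
# Funding categories in descending priority, following the list above.
# The label strings are hypothetical shorthand.
PRIORITY_ORDER = [
    "SPORE project",
    "SPORE pilot project",
    "federal peer-reviewed",
    "non-federal agency",
    "industry",
]


def prioritize(requests, overrides=None):
    """Order requests by funding category.

    requests:  list of dicts with "id" and "funding" keys.
    overrides: optional {request id: rank} for committee-approved exceptions
               (an assumed mechanism; rank 0 is the highest priority).
    """
    overrides = overrides or {}

    def rank(request):
        if request["id"] in overrides:
            return overrides[request["id"]]
        return PRIORITY_ORDER.index(request["funding"])

    # sorted() is stable, so requests of equal rank keep their arrival order.
    return sorted(requests, key=rank)
```

For example, an industry-funded request submitted before a SPORE project would still be served after it, unless an exception moves it up.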


The honest broker facility received IRB approval in May 2003. A period of four months was needed for training personnel and completing the paperwork for certification of honest brokers. The existence of the system was announced to the staff and faculty of the University of Pittsburgh in October 2003.

The initial response to the facility was slow, and the last three months of 2003 generated only six requests for the honest broker facility. The volume of research requests increased significantly in 2004. The calendar year 2004 generated 148 requests. The calendar year 2005 generated 449 requests. The calendar year 2006 generated 548 requests. The first 11 months of calendar year 2007 have generated 621 requests. The volume of requests is shown in figure 7.
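The request counts reported above can be tallied with a short calculation. The sketch below uses only the figures stated in the text; the variable and function names are illustrative, and growth is computed only between the full calendar years, since the 2003 and 2007 counts cover partial years.

```python
# Request counts as reported in the text.
requests_by_period = {
    "2003 (Oct-Dec)": 6,
    "2004": 148,
    "2005": 449,
    "2006": 548,
    "2007 (Jan-Nov)": 621,
}

total_requests = sum(requests_by_period.values())


def growth(previous, current):
    """Fractional change from one period to the next."""
    return (current - previous) / previous


# Year-over-year change between the full calendar years only.
full_years = ["2004", "2005", "2006"]
yearly_growth = {
    later: growth(requests_by_period[earlier], requests_by_period[later])
    for earlier, later in zip(full_years, full_years[1:])
}

print("total requests:", total_requests)
for year, change in yearly_growth.items():
    print(f"{year}: {change:+.1%} vs. previous year")
```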

Fig. 7
Quarterly volume of cases handled by the Honest broker facility from 2004-Nov. 2007.

The requests for the honest broker facility have come from all major oncology areas and from all the major organ type groups. These include the pulmonary group, the head and neck group, the gastrointestinal diseases group, the genitourinary and prostate group, the hematology group, the skin and melanoma group and the gynecology diseases group, including the breast group. The volume of requests from the different organ types since the inception of the Honest broker facility is shown in Fig. 8. It should be noted that the Breast and Gynecologic Oncology Group began using the facility in January 2006.

Fig. 8
Details of case volume handled by the Honest broker facility, delineated by organ type.

The honest broker facility has received work requests for a variety of different tasks. These include work preparatory to research, research projects, presentations and abstracts, quality and process improvement, assessment of incidence of disease, marketing of clinical programs, as well as patient safety initiatives, clinical quality control and quality improvement. We evaluated these requests to assess distribution by organ type. The detailed breakdown of these requests is shown in figure 9.

Fig. 9
Details of the many different reasons for requesting Honest broker services by researchers focused on the various organ systems.

How does an investigator use the honest broker facility?

A researcher can approach any of the constituents of the honest broker facility with a research request. The request can be for tissue, other biological specimens, or clinical data. The component approached by the investigator evaluates the request and identifies the other components of the honest broker facility that would play a role in fulfilling it. One of the constituent facilities is designated as the primary handler of the request. This primary facility interacts with the other components involved, communicates with the researcher, ensures that all the requested tissue and biological specimens have been retrieved, and collates the data. The entire set of tissue, biological specimens, and annotating data is de-identified and then provided to the investigator.
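
The collation and de-identification steps described above can be sketched in code. This is an illustration under stated assumptions, not the facility's actual software: the class `HonestBroker`, the function `fulfill_request`, the field names, and the sample records are all hypothetical, and the identifier set shown is only a small subset of the identifier types that HIPAA requires to be removed.

```python
import secrets

# Hypothetical, illustrative subset of identifier fields. The HIPAA Safe
# Harbor list is longer (18 identifier types).
IDENTIFIER_FIELDS = {"mrn", "name", "date_of_birth", "address"}


class HonestBroker:
    """Holds the only mapping between patient identifiers and linkage codes."""

    def __init__(self):
        self._code_by_mrn = {}  # stays with the broker, never released

    def linkage_code(self, mrn):
        # Reuse one random code per patient so that records from different
        # facilities can still be collated after de-identification.
        if mrn not in self._code_by_mrn:
            self._code_by_mrn[mrn] = secrets.token_hex(8)
        return self._code_by_mrn[mrn]

    def deidentify(self, record):
        # Drop identifier fields and attach the linkage code instead.
        clean = {k: v for k, v in record.items() if k not in IDENTIFIER_FIELDS}
        clean["linkage_code"] = self.linkage_code(record["mrn"])
        return clean


def fulfill_request(broker, sources):
    """Collate records from each constituent facility, one row per patient."""
    collated = {}
    for records in sources.values():
        for record in records:
            clean = broker.deidentify(record)
            row = collated.setdefault(clean["linkage_code"], {})
            row.update(clean)
    return list(collated.values())


# Made-up example records from two constituent facilities.
sources = {
    "tissue_bank": [
        {"mrn": "A100", "name": "Jane Doe", "specimen": "frozen"},
    ],
    "cancer_registry": [
        {"mrn": "A100", "date_of_birth": "1950-01-01", "stage": "II"},
    ],
}
released = fulfill_request(HonestBroker(), sources)
```

In this sketch the investigator receives only `released`, which holds one row per patient keyed by a random linkage code. The mapping from code to medical record number stays inside the broker object, mirroring the firewall between clinical and research activities.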

Use by repositories other than the University of Pittsburgh

This honest broker system has been used by entities other than the University of Pittsburgh. A similar model has been applied by the Cooperative Prostate Cancer Tissue Resource [3, 21, 22] as well as the Pennsylvania Cancer Alliance Bioinformatics Consortium. In addition, similar protocols were adopted for case retrieval for the Shared Pathology Informatics Network (SPIN) validation studies [23].


The honest broker facility is now a well-established mechanism for disbursement of de-identified tissue and data. The facility has become very popular in a short period of time, as borne out by the steady increase in its use over the last four years. This popularity has started to create logistical issues, especially with staffing and turnaround time.

There are certain aspects of the honest broker facility that need to be considered when creating a facility similar to the one at the University of Pittsburgh.

Training of honest brokers

Training is an important aspect of maintaining uniform functionality of the honest broker facility. The number of honest brokers has increased significantly over the last four years: the facility started with five honest brokers and now has 33. The initial training focuses on explaining the compliance guidelines and objectives of the honest broker facility, discussing the philosophy behind the facility, and completing the IRB-mandated research modules. These steps provide the new honest broker with the conceptual details of the honest broker facility. The honest brokers are then trained on the software available for extracting data. This includes the honest broker tracking tool as well as the mechanisms for de-identification.

Specialization of cancer registrars

Another important parallel initiative has focused on creating a pool of specialized cancer registrars. These cancer registrars work within a specific organ system of the cancer program and collect information on patients with a specific cancer. The information collected consists of the state-mandated reporting requirements of the Cancer Registry. In addition, these "specialized" cancer registrars collect additional data elements for research purposes that have been approved by the IRB of the University of Pittsburgh. They also frequently approach the clinical caregivers to resolve data discrepancies among different sources.

These cancer registrars therefore focus on a particular organ system of the cancer program. Their work could be considered repetitive; however, this specialized approach serves to increase their knowledge base and awareness of issues related to a particular subset of tissues and tumor types. These registrars perform data entry for the state-mandated clinical function of the Cancer Registry. In addition, they handle specific requests for their area of concentration. This ensures a higher quality of data entry and retrieval.

In addition to increasing their clinical and translational research skills, the specialized cancer registrars become experts in a variety of clinical information systems from which they extract phenotypic data. They also develop a variety of informatics skills in the areas of data processing, data de-identification, and the use of data warehouses. They have developed particular skill with data mining tools, both commercial tools and their own customized algorithms for clinical and translational research.

Increased availability of Tissue/Data to investigators

Numerous annotated tissue repositories already exist in this institution and its affiliated cancer center. These include frozen as well as paraffin-embedded tissue materials and other biological materials. The overall goal is to make them available to a wider research community in a manner that is efficient, rapid, and compliant with legal and ethical requirements. There is significant awareness locally about the benefits of expanding utilization of our resources in collaborative projects. The creation of an institutional tissue resource as well as an institutional honest broker facility has served to accelerate access to tissue, biological materials, and annotating data. Furthermore, many tissue bank-focused projects do not take into account the vast resources of paraffin archives, housed in many academic pathology departments [2, 15], that are available for use. This initiative will serve to bring down barriers at the institutional level and provide access to all forms of biological materials and data.

The structure and design of the University of Pittsburgh Honest Broker system has been presented at a meeting of the International Society for Biological and Environmental Repositories. It has also been shared with many collaborating academic institutions.


Initial provision of adequate resources is required to ensure the success of this institutional facility. There has been upfront investment by the institution in terms of personnel. The honest broker facility has also been incorporated in grant submissions to provide committed funding for these activities. The principal investigator submitting the grant proposal consults the honest broker facility, and the broad outline of the project is discussed. An estimate is made of the amount of time needed to fulfill the projected needs of the project. The principal investigator then incorporates the anticipated personnel requirements in the budget of the proposal. In addition, the facility functions on a fee-for-service basis for work done on non-grant-funded initiatives. The fee is based on an hourly rate for providing honest broker and de-identification services, data accrual, database creation, and chart review. These different funding mechanisms have helped provide resources for the facility to survive and grow.


The creation of an institutional honest broker facility has created a robust mechanism for data accrual and disbursement. In addition it has led to the development of a significant informatics infrastructure to support this facility's functions. This has decreased turnaround time for providing data associated with samples provided to investigators. It is hoped that this system will promote more robust, efficient and clinically and biologically relevant studies of biomarkers. Studies resulting from the creation of this facility may allow for better classification of cancer types, more accurate assessment of disease prognosis, a better ability to identify the most appropriate individuals for clinical trial participation, and better surrogate markers of disease progression and/or response to therapy. In addition, the biomedical informatics infrastructure and the honest broker tools created to serve the honest broker facility will be made available for use by outside institutions. It is hoped that this approach focused on sharing our experience and software tools will benefit research on a more global scale.

Table 1
A list of some critical data sources utilized by the Honest Broker Facility. These data sources are part of affiliated facilities and are accessible to the Honest Broker Facility.


Funded by:

  • Cancer Center
  • National Cancer Institute (NCI)
  • Cooperative Prostate Cancer Tissue Resource


1. Becich MJ. The role of the pathologist as tissue refiner and data miner: the impact of functional genomics on the modern pathology laboratory and the critical roles of pathology informatics and bioinformatics. Mol Diagn. 2000;5(4):287–299.
2. Eiseman E. Rand Corporation. Case studies of existing human tissue repositories: "best practices" for a biospecimen resource for the genomic and proteomic era. Santa Monica, CA: RAND; 2003.
3. Patel AA, Gilbertson JR, Parwani AV, Dhir R, Datta MW, Gupta R, Berman JJ, Melamed J, Kajdacsy-Balla A, Orenstein J, Becich MJ; Cooperative Prostate Cancer Tissue Resource. An informatics model for tissue banks: lessons learned from the Cooperative Prostate Cancer Tissue Resource. BMC Cancer. 2006 May 5;6:120.
4. Thasler WE, Schlott T, Kalkuhl A, Plan T, Irrgang B, Jauch KW, Weiss TS. Human tissue for in vitro research as an alternative to animal experiments: a charitable "honest broker" model to fulfil ethical and legal regulations and to protect research participants. Altern Lab Anim. 2006 Aug;34(4):387–392.
5. Qualman SJ, Bowen J, Brewer-Swartz S, France M. The Role of Tumor Banking and Related Informatics. In: Ladanyi M, Gerald WL, editors. Expression profiling of human tumors: diagnostic and research applications. Vol. 7. Totowa, NJ: Humana Press; 2003. pp. 103–117.
6. Naber SP, Smith LL, Jr, Wolfe HJ. Role of the frozen tissue bank in molecular pathology. Diagn Mol Pathol. 1992;1(1):73–79.
7. Office of Biorepositories and Biospecimen Research.
8. NCI Best Practices for Biorepositories.
9. International Society for Biological and Environmental Repositories (ISBER). Best Practices for Repositories I: Collection, Storage, and Retrieval of Human Biological Material for Research. Cell Preservation Technology. 2005;3(1):5–48.
10. Eiseman E. Rand Corporation. Case studies of existing human tissue repositories: "best practices" for a biospecimen resource for the genomic and proteomic era. Santa Monica, CA: RAND; 2003. xxxviii, 208 p.
11. National Translational Cancer Research Network (NTRAC)
12. Hakimian R, Taube S, Bledsoe M, Aamondt R. 50-State Survey of Laws Regulating the Collection, Storage and Use of Human Tissue Specimens and Associated Data for Research. U.S. Department of Health and Human Services, National Institutes of Health, National Cancer Institute; 2004 Nov. Publication no. 05.5628.
13. Health Insurance Portability and Accountability Act of 1996.
14. Gilbertson JR, Gupta R, Nie Y, Patel AA, Becich MJ. Automated clinical annotation of tissue bank specimens. Medinfo. 2004;11(Pt 1):607–610.
15. Department of Health and Human Services. 45 CFR (Code of Federal Regulations), 164.514(b)(2)(i). Standards for Privacy of Individually Identifiable Health Information (final).
16. Mertz JF, Sankar P, Taube SE, LiVolsi V. Use of Human Tissue in Research: Clarifying Clinician and Researcher Roles and Information Flow. J Invest Med. 1997;45:252–257.
18. Dennis RA, Wang J, Huang K, Helsley A, Robinson AG. The Central Codebook (CCB) at the David Geffen School of Medicine at UCLA: an open source electronic honest broker for managing subject identifiers and sensitive external identifiers, like medical record numbers. AMIA Annu Symp Proc. 2006:908.
19. Boyd AD, Hosner C, Hunscher DA, Athey BD, Clauw DJ, Green LA. An 'Honest Broker' mechanism to maintain privacy for patient care and academic medical research. Int J Med Inform. 2007 May–Jun;76(5–6):407–411. Epub 2006 Nov 1.
20. Boyd AD, Hunscher DA, Kramer AJ, Hosner C, Saxman P, Athey BD, Greden JF, Clauw DJ. The "Honest Broker" method of integrating interdisciplinary research data. AMIA Annu Symp Proc. 2005:902.
21. Patel AA, Kajdacsy-Balla A, Berman JJ, Bosland M, Datta MW, Dhir R, Gilbertson J, Melamed J, Orenstein J, Tai KF, et al. The development of common data elements for a multi-institute prostate cancer tissue bank: the Cooperative Prostate Cancer Tissue Resource (CPCTR) experience. BMC Cancer. 2005;5:108.
22. Melamed J, Datta MW, Becich MJ, Orenstein JM, Dhir R, Silver S, Fidelia-Lambert M, Kajdacsy-Balla A, Macias V, Patel A, et al. The cooperative prostate cancer tissue resource: a specimen and data resource for cancer researchers. Clin Cancer Res. 2004;10(14):4614–4621.
23. Patel AA, Gupta D, Seligson D, Hattab EM, Balis UJ, Ulbright TM, Kohane IS, Berman JJ, Gilbertson JR, Dry S, Schirripa O, Yu H, Becich MJ, Parwani AV; Shared Pathology Informatics Network. Availability and quality of paraffin blocks identified in pathology archives: a multi-institutional study by the Shared Pathology Informatics Network (SPIN). BMC Cancer. 2007 Feb 28;7:37.