Health care professionals worldwide attend courses and workshops to learn evidence-based medicine (EBM), but evidence regarding the impact of these educational interventions is conflicting, of low methodologic quality and of limited generalizability. Furthermore, little is known about determinants of success. We sought to measure the effect of EBM short courses and workshops on knowledge and to identify course and learner characteristics associated with knowledge acquisition.
Health care professionals with varying expertise in EBM participated in an international, multicentre before–after study. The intervention consisted of short courses and workshops on EBM offered in diverse settings, formats and intensities. The primary outcome measure was the score on the Berlin Questionnaire, a validated instrument measuring EBM knowledge that the participants completed before and after the course.
A total of 15 centres participated in the study and 420 learners from North America and Europe completed the study. The baseline score across courses was 7.49 points (range 3.97–10.42 points) out of a possible 15 points. The average increase in score was 1.40 points (95% confidence interval 0.48–2.31 points), which corresponded with an effect size of 0.44 standard deviation units. Greater improvement in scores was associated (in order of greatest to least magnitude) with active participation required of the learners, a separate statistics session, fewer topics, less teaching time, fewer learners per tutor, larger overall course size and smaller group size. Clinicians and learners involved in medical publishing improved their score more than other types of learners; administrators and public health professionals improved their score less. Learners who perceived themselves to have an advanced knowledge of EBM and had prior experience as an EBM tutor also showed greater improvement than those who did not.
EBM course organizers who wish to optimize knowledge gain should require learners to actively participate in the course and should consider focusing on a small number of topics, giving particular attention to statistical concepts.
Prevalence of allergic diseases in infants, whose parents and siblings do not have allergy, is approximately 10% and reaches 20–30% in those with an allergic first-degree relative. Intestinal microbiota may modulate immunologic and inflammatory systemic responses and, thus, influence development of sensitization and allergy. Probiotics have been reported to modulate immune responses and their supplementation has been proposed as a preventive intervention.
The World Allergy Organization (WAO) convened a guideline panel to develop evidence-based recommendations about the use of probiotics in the prevention of allergy.
We identified the most relevant clinical questions and performed a systematic review of randomized controlled trials of probiotics for the prevention of allergy. We followed the Grading of Recommendations Assessment, Development and Evaluation (GRADE) approach to develop recommendations. We searched for and reviewed the evidence about health effects, patient values and preferences, and resource use (up to November 2014). We followed the GRADE evidence-to-decision framework to develop recommendations.
Currently available evidence does not indicate that probiotic supplementation reduces the risk of developing allergy in children. However, considering all critical outcomes in this context, the WAO guideline panel determined that there is a likely net benefit from using probiotics resulting primarily from prevention of eczema. The WAO guideline panel suggests: a) using probiotics in pregnant women at high risk for having an allergic child; b) using probiotics in women who breastfeed infants at high risk of developing allergy; and c) using probiotics in infants at high risk of developing allergy. All recommendations are conditional and supported by very low quality evidence.
WAO recommendations about probiotic supplementation for prevention of allergy are intended to support parents, clinicians and other health care professionals in their decisions whether to use probiotics in pregnancy and during breastfeeding, and whether to give them to infants.
Electronic supplementary material
The online version of this article (doi:10.1186/s40413-015-0055-2) contains supplementary material, which is available to authorized users.
Allergy; Prevention; Probiotics; Practice guidelines; GRADE
The GRADE Working Group is developing and evaluating a common, sensible approach to grading quality of evidence and strength of recommendations in health care. In this article, we discuss the advantages and disadvantages of using letters, numbers, symbols or words to represent grades of evidence and recommendations. Using multiple strategies, we searched for comparative studies of alternative ways of representing ordered categories in any context. In addition, we contacted experts and reviewed theoretical work and qualitative research on how best to communicate grades of any kind quickly and clearly. We were unable to identify health care research that addressed, either directly or indirectly, the best way to present grades of evidence and recommendations. We found examples of symbols used by government, commercial and consumer organizations to communicate quality of evidence or strength of recommendations, but no comparative studies. Although a number of grading systems are used in health care and other fields, there is little or no evidence of how well various presentations are understood. Before promoting the use of specific symbols, numbers, letters or words, the extent to which the intended message is comprehended should be evaluated.
Chronic lung disease is exacerbated by comorbid psychiatric issues and treatment of depression may improve disease symptoms. We sought to add to the literature as to whether depression is associated with pulmonary function in healthy adults.
In 2,551 healthy adults from New York State, USA, we studied the association of depression, measured with the Center for Epidemiologic Studies Depression scale (CES-D) score, with forced expiratory volume in 1 second (FEV1) and forced vital capacity (FVC), using general linear models in a cross-sectional design.
We identified statistically significant inverse trends in FEV1, FVC, FEV1% and FVC% by CES-D category, especially in ever smokers and men. After adjustment for covariates, the difference in FEV1 and FEV1% between CES-D scores of 0-3 and ≥16 (depressed) among smokers with >18.5 lifetime pack-years was approximately 0.25 L and 5.0%; the adjusted P values for trend were <0.001 and 0.019, respectively. In men, we also observed statistically significant inverse trends in pulmonary function with increasing CES-D.
We identified an inverse association of depressive symptoms and pulmonary function in healthy adults especially in men and individuals with a heavy smoking history. Further studies of these associations are essential for the development and tailoring of interventions for the prevention and treatment of chronic lung disease.
pulmonary disease; chronic lung disease; depression; respiratory function tests
Faculty productivity is essential for academic medical centers striving to achieve excellence and national recognition. The objective of this study was to evaluate whether and how academic Departments of Medicine in the United States measure faculty productivity for the purpose of salary compensation.
We surveyed the Chairs of academic Departments of Medicine in the United States in 2012. We sent a paper-based questionnaire along with a personalized invitation letter by postal mail. For non-responders, we sent reminder letters, then called them and faxed them the questionnaire. The questionnaire included 8 questions with 23 tabulated close-ended items about the types of productivity measured (clinical, research, teaching, administrative) and the measurement strategies used. We conducted descriptive analyses.
Chairs of 78 of 152 eligible departments responded to the survey (51% response rate). Overall, 82% of respondents reported measuring at least one type of faculty productivity for the purpose of salary compensation. Amongst those measuring faculty productivity, types measured were: clinical (98%), research (61%), teaching (62%), and administrative (64%). Percentages of respondents who reported the use of standardized measurements units (e.g., Relative Value Units (RVUs)) varied from 17% for administrative productivity to 95% for research productivity. Departments reported a wide variation of what exact activities are measured and how they are monetarily compensated. Most compensation plans take into account academic rank (77%). The majority of compensation plans are in the form of a bonus on top of a fixed salary (66%) and/or an adjustment of salary based on previous period productivity (55%).
Our survey suggests that most academic Departments of Medicine in the United States measure faculty productivity and convert it into standardized units for the purpose of salary compensation. The exact activities that are measured and how they are monetarily compensated varied substantially across departments.
Electronic supplementary material
The online version of this article (doi:10.1186/1472-6920-14-205) contains supplementary material, which is available to authorized users.
Faculty productivity; Salary compensation; Academia; Department of medicine; Survey
Although several tools to evaluate the credibility of health care guidelines exist, guidance on practical steps for developing guidelines is lacking. We systematically compiled a comprehensive checklist of items linked to relevant resources and tools that guideline developers could consider, without the expectation that every guideline would address each item.
We searched data sources, including manuals of international guideline developers, literature on guidelines for guidelines (with a focus on methodology reports from international and national agencies, and professional societies) and recent articles providing systematic guidance. We reviewed these sources in duplicate, extracted items for the checklist using a sensitive approach and developed overarching topics relevant to guidelines. In an iterative process, we reviewed items for duplication and omissions and involved experts in guideline development for revisions and suggestions for items to be added.
We developed a checklist with 18 topics and 146 items and a webpage to facilitate its use by guideline developers. The topics and included items cover all stages of the guideline enterprise, from the planning and formulation of guidelines, to their implementation and evaluation. The final checklist includes links to training materials as well as resources with suggested methodology for applying the items.
The checklist will serve as a resource for guideline developers. Consideration of items on the checklist will support the development, implementation and evaluation of guidelines. We will use crowdsourcing to revise the checklist and keep it up to date.
Although even randomization (that is, an approximately 1:1 randomization ratio between study arms) provides the greatest statistical power, designed uneven randomization (DUR; for example, 1:2 or 1:3) is used to increase participation rates. To date, no convincing data exist on the impact of DUR on participation rates in trials. The objective of this study is to evaluate the epidemiology of DUR and to explore factors associated with it.
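The statement that even allocation gives the greatest power can be illustrated with a short calculation. The following is a minimal sketch, assuming a two-arm comparison of means with equal outcome variance in both arms and a fixed total sample size; the function name `relative_efficiency` is illustrative and is not part of the study protocol. Under those assumptions the variance of the difference in means is proportional to 1/n1 + 1/n2, so a 1:k allocation has efficiency 4k/(1+k)^2 relative to 1:1.

```python
def relative_efficiency(k: float) -> float:
    """Efficiency of a 1:k allocation relative to 1:1, for a fixed total N.

    Assumption (not from the protocol): equal outcome variance in both arms,
    so the variance of the difference in means is proportional to
    1/n1 + 1/n2 with n1 = N/(1+k) and n2 = k*N/(1+k).
    """
    if k <= 0:
        raise ValueError("allocation ratio k must be positive")
    return 4 * k / (1 + k) ** 2


if __name__ == "__main__":
    # Compare the allocation ratios mentioned in the text.
    for k in (1, 2, 3):
        print(f"1:{k} allocation -> relative efficiency {relative_efficiency(k):.3f}")
```

Under these assumptions a 1:2 allocation retains about 89% of the efficiency of an even design and a 1:3 allocation retains 75%, which is the statistical cost that any gain in participation would need to offset.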
We will search for reports of RCTs published within two years in 25 general medical journals with the highest impact factor according to the Journal Citation Report (JCR)-2010. Teams of two reviewers will determine eligibility and extract relevant information from eligible RCTs in duplicate, using standardized forms. We will report the prevalence of DUR trials and the reported reasons for using DUR, and we will perform a linear regression analysis to estimate the association between the randomization ratio and potentially associated factors, including participation rate, type of informed consent and clinical area.
A clearer understanding of RCTs with DUR and its association with factors in trials, for example, participation rate, can optimize trial design and may have important implications for both researchers and users of the medical literature.
Participation rate; Designed uneven randomization trials; Trial participation
Systematic reviews and meta-analyses of randomized trials that include patient-reported outcomes (PROs) often provide crucial information for patients, clinicians and policy-makers facing challenging health care decisions. Based on emerging methods, guidance on improving the interpretability of meta-analysis of patient-reported outcomes, typically continuous in nature, is likely to enhance decision-making. The objective of this paper is to summarize approaches to enhancing the interpretability of pooled estimates of PROs in meta-analyses. When differences in PROs between groups are statistically significant, decision-makers must be able to interpret the magnitude of effect. This is challenging when, as is often the case, clinical trial investigators use different measurement instruments for the same construct within and between individual randomized trials. For such cases, in addition to pooling results as a standardized mean difference, we recommend that systematic review authors use other methods to present results, such as relative (relative risk, odds ratio) or absolute (risk difference) dichotomized treatment effects, complemented by presentation in either: natural units (e.g. overall depression reduced by 2.4 points when measured on a 50-point Hamilton Rating Scale for Depression); minimal important difference units (e.g. where 1.0 unit represents the smallest difference in depression that patients, on average, perceive as important, the depression score was 0.38 (95% CI 0.30 to 0.47) units lower than in the control group); or a ratio of means (e.g. where the mean in the treatment group is divided by the mean in the control group, the ratio of means is 1.27, representing a 27% relative reduction in the mean depression score).
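The presentation formats for continuous outcomes named above can be sketched for a single comparison as follows. This is an illustration only: the function names, the group means, the pooled standard deviation and the minimal important difference (MID) value are all hypothetical and are not taken from the paper or from any trial. The sketch also ignores pooling across trials and confidence intervals.

```python
def standardized_mean_difference(mean_treat: float, mean_ctrl: float,
                                 pooled_sd: float) -> float:
    """Difference in means expressed in pooled standard deviation units."""
    if pooled_sd <= 0:
        raise ValueError("pooled_sd must be positive")
    return (mean_treat - mean_ctrl) / pooled_sd


def mid_units(mean_treat: float, mean_ctrl: float, mid: float) -> float:
    """Difference in means expressed in minimal important difference units."""
    if mid <= 0:
        raise ValueError("mid must be positive")
    return (mean_treat - mean_ctrl) / mid


def ratio_of_means(mean_treat: float, mean_ctrl: float) -> float:
    """Mean in the treatment group divided by mean in the control group."""
    if mean_ctrl == 0:
        raise ValueError("control mean must be non-zero")
    return mean_treat / mean_ctrl


if __name__ == "__main__":
    # Hypothetical depression scores (lower is better); not real trial data.
    treat, ctrl = 12.0, 14.4   # difference of -2.4 points in natural units
    sd, mid = 6.0, 3.0         # hypothetical pooled SD and MID
    print(f"natural units: {treat - ctrl:.1f} points")
    print(f"SMD:           {standardized_mean_difference(treat, ctrl, sd):.2f}")
    print(f"MID units:     {mid_units(treat, ctrl, mid):.2f}")
    print(f"ratio of means:{ratio_of_means(treat, ctrl):.2f}")
```

The same underlying difference reads quite differently in each format, which is why reporting more than one of them alongside the standardized mean difference can help decision-makers judge the magnitude of effect.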
Clinicians, providers and guideline panels use absolute effects to weigh the advantages and downsides of treatment alternatives. Relative measures have the potential to mislead readers. However, little is known about the reporting of absolute measures in systematic reviews. The objectives of our study are to determine the proportion of systematic reviews that report absolute measures of effect for the most important outcomes, and ascertain how they are analyzed, reported and interpreted.
We will conduct a methodological survey of systematic reviews published in 2010. We will conduct a 1:1 stratified random sampling of Cochrane vs. non-Cochrane systematic reviews. We will calculate the proportion of systematic reviews reporting at least one absolute estimate of effect for the most patient-important outcome for the comparison of interest. We will conduct multivariable logistic regression analyses with the reporting of an absolute estimate of effect as the dependent variable and pre-specified study characteristics as the independent variables. For systematic reviews reporting an absolute estimate of effect, we will document the methods used for the analysis, reporting and interpretation of the absolute estimate.
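The 1:1 stratified random sampling step can be sketched as below. This is a minimal illustration, assuming that the eligible reviews have already been identified and labelled as Cochrane or non-Cochrane; the function name, the identifiers and the sample size are hypothetical, and the protocol does not specify how the random draw is implemented.

```python
import random


def stratified_sample(cochrane_ids, non_cochrane_ids, n_per_stratum, seed=0):
    """Draw the same number of reviews at random from each stratum (1:1).

    A fixed seed (hypothetical choice) makes the draw reproducible.
    """
    cochrane_ids = list(cochrane_ids)
    non_cochrane_ids = list(non_cochrane_ids)
    if n_per_stratum > min(len(cochrane_ids), len(non_cochrane_ids)):
        raise ValueError("n_per_stratum exceeds the size of a stratum")
    rng = random.Random(seed)
    return {
        # Sampling without replacement within each stratum.
        "cochrane": rng.sample(cochrane_ids, n_per_stratum),
        "non_cochrane": rng.sample(non_cochrane_ids, n_per_stratum),
    }


if __name__ == "__main__":
    # Hypothetical identifiers standing in for eligible reviews.
    cochrane = [f"C{i:03d}" for i in range(50)]
    non_cochrane = [f"N{i:03d}" for i in range(200)]
    sample = stratified_sample(cochrane, non_cochrane, n_per_stratum=10)
    print(sample["cochrane"])
    print(sample["non_cochrane"])
```

Drawing equal numbers from each stratum keeps the comparison between Cochrane and non-Cochrane reviews balanced even when one group is far larger in the sampling frame.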
Our methodological survey will inform current practices regarding reporting of absolute estimates in systematic reviews. Our findings may influence recommendations on reporting, conduct and interpretation of absolute estimates. Our results are likely to be of interest to systematic review authors, funding agencies, clinicians, guideline developers and journal editors.
Systematic reviews; Meta-analysis; Statistical data; Evidence-based medicine; Numbers needed to treat; Data reporting; Absolute effect measures
To inform clinical guidelines and patient care we need high quality evidence on the relative benefits and harms of intervention. Patient reported outcome (PRO) data from clinical trials can “empower patients to make decisions based on their values” and “level the playing field between physician and patient”. While clinicians have a good understanding of the concept of health-related quality of life and other PROs, evidence suggests that many do not feel comfortable in using the data from trials to inform discussions with patients and clinical practice. This may in part reflect concerns over the integrity of the data and difficulties in interpreting the results arising from poor reporting.
The new CONSORT PRO extension aims to improve the reporting of PROs in trials to facilitate the use of results to inform clinical practice and health policy. While the CONSORT PRO extension is an important first step in the process, we need broader engagement with the guidance to facilitate optimal reporting and maximize use of PRO data in a clinical setting. Endorsement by journal editors, authors and peer reviewers is a crucial step. Improved design, implementation and transparent reporting of PROs in clinical trials are necessary to provide high quality evidence to inform evidence synthesis and clinical practice guidelines.
Quality of life; CONSORT PRO; Reporting; Clinical trials
Cochrane Reviews are intended to help providers, practitioners and patients make informed decisions about health care. The goal of the Cochrane Applicability and Recommendation Methods Group (ARMG) is to develop approaches, strategies and guidance that facilitate the uptake of information from Cochrane Reviews and their use by a wide audience with specific focus on developers of recommendations and on healthcare decision makers. This paper is part of a series highlighting developments in systematic review methodology in the 20 years since the establishment of The Cochrane Collaboration, and its aim is to present current work and highlight future developments in assessing and presenting summaries of evidence, with special focus on Summary of Findings (SoF) tables and Plain Language Summaries.
A SoF table provides a concise and transparent summary of the key findings of a review in a tabular format. Several studies have shown that SoF tables improve accessibility and understanding of Cochrane Reviews.
The ARMG and GRADE Working Group are working on further development of the SoF tables, for example by evaluating the degree of acceptable flexibility beyond standard presentation of SoF tables, developing SoF tables for diagnostic test accuracy reviews and interactive SoF tables (iSoF).
The plain language summary (PLS) is the other main building block for dissemination of review results to end-users. The PLS aims to summarize the results of a review in such a way that health care consumers can readily understand them. Current efforts include the development of a standardized language to describe statistical results, based on effect size and quality of supporting evidence.
Producing high quality PLS and SoF tables and making them compatible and linked would make it easier to produce dissemination products targeting different audiences (for example, providers, health policy makers, guideline developers).
Current issues of debate include optimal presentation formats of SoF tables, the training required to produce SoF tables, and the extent to which the authors of Cochrane Reviews should provide explicit guidance to target audiences of patients, clinicians and policy-makers.
Systematic reviews and meta-analyses of randomized trials that include patient-reported outcomes (PROs) often provide crucial information for patients and clinicians facing challenging health care decisions. Based on emerging methods, guidance on combining PROs in meta-analysis is likely to enhance their usefulness.
The objectives of this paper are: i) to describe PROs and why they are important for health care decision-making; ii) to illustrate the key risk of bias issues that systematic reviewers should consider; and iii) to address outcome characteristics of PROs and provide guidance for combining outcomes.
We suggest a step-by-step approach to addressing issues of PROs in meta-analyses. Systematic reviewers should begin by asking themselves if trials have addressed all the important effects of treatment on patients’ quality of life. If the trials have addressed PROs, have investigators chosen the appropriate instruments? In particular, does evidence suggest the PROs used are valid and responsive, and is the review free of outcome reporting bias? Systematic reviewers must then decide how to categorize PROs and when to pool results.
Patient-reported outcomes; Health-related quality of life; Meta-analysis; Systematic review; Health care decision-making
Clinical practice guidelines (CPGs) recommend universal prenatal screening for Group B Streptococcus (GBS) to identify candidates for intrapartum antibiotic prophylaxis to prevent early onset neonatal GBS infection. Interventions to promote physician adherence to these guidelines are imperative. This study examined the effectiveness of academic detailing (AD) of obstetricians, compared with CPG mailshot and no intervention, on the screening of pregnant women for GBS.
A randomized controlled clinical trial was conducted in the medical cooperative of Porto Alegre, Brazil. All obstetricians who assisted in a delivery covered by private health insurance managed by the cooperative in the 3 months preceding the study (n = 241) were invited to participate. The obstetricians were randomized to three groups: direct mail (DM, n = 76), AD (n = 76) and control (C, n = 89, no intervention). Those in the DM group were sent guidelines on GBS. The AD group received the guidelines and an educational visit detailing the guidelines, which was conducted by a trained physician. Data on obstetrician age, gender and time since graduation, on whether patients received GBS screening during pregnancy, and on which obstetricians requested screening were collected for all participating obstetricians for 3 months before and after the intervention, using the database of the private health insurance information system.
Three months post-intervention, the data showed that the proportion of pregnant women screened for GBS was higher in the AD group (25.4%) than in the DM (15.9%) and C (17.7%) groups (P = 0.023). Similar results emerged when the three groups were taken as a cluster (pregnant women and their obstetricians), but the difference was not statistically significant (Poisson regression, P = 0.108). Additionally, when vaginal deliveries were analyzed separately, the proportion screened was higher in the AD group (75%) than in the DM group (41.9%) and the C group (30.4%) (chi-square, P < 0.001).
The results suggest that AD increased the prevalence of GBS screening in pregnant women in this population.
Guidelines; Physicians; Pregnancy; Screening; Streptococci
Randomized controlled trials (RCTs) that are inappropriately designed or executed may provide biased findings and mislead clinical practice. In view of recent interest in the treatment and prevention of thrombotic complications in cancer patients, we evaluated the characteristics and risk of bias of RCTs of anticoagulation in patients with cancer, and their trends over time.
We conducted a comprehensive search, including a search of four electronic databases (MEDLINE, EMBASE, ISI Web of Science, and CENTRAL) up to February 2010. We included RCTs in which the intervention and/or comparison consisted of: vitamin K antagonists, unfractionated heparin (UFH), low molecular weight heparin (LMWH), direct thrombin inhibitors or fondaparinux. We performed descriptive analyses and assessed the association between the variables of interest and the year of publication.
We included 67 RCTs with 24,071 participants. In 21 trials (31%), diagnosis of deep vein thrombosis (DVT) was triggered by clinical suspicion; the remaining trials either screened for DVT or were unclear about their approach. Major bleeding, minor bleeding and thrombocytopenia were reported in 41 (61%), 22 (33%) and 11 (16%) trials, respectively. The percentages of trials satisfying risk of bias criteria were: adequate sequence generation (85%), adequate allocation concealment (61%), participants’ blinding (39%), data collectors’ blinding (44%), providers’ blinding (41%), outcome assessors’ blinding (75%), data analysts’ blinding (15%), intention to treat analysis (57%), no selective outcome reporting (12%), no stopping early for benefit (97%). The mean follow-up rate was 96%. Adequate allocation concealment and the reporting of intention to treat analysis were the only two quality criteria that improved over time.
Many RCTs of anticoagulation in patients with cancer appear to use insufficiently rigorous outcome assessment methods and to have deficiencies in key methodological features. It is not clear whether this reflects a problem in the design, the conduct or the reporting of these trials, or a combination of these. Future trials should avoid the shortcomings described in this article.
China is experiencing increased health care use and expenditures, without sufficient controls to ensure quality and value. Transparent, cost-conscious and patient-centered guidelines based on the best available evidence could help establish these quality and practice measures.
We examined how guidelines could support the Chinese health reform. Specifically, we summarized the current state of the art and related challenges in guideline development and explored possible solutions in the context of the Chinese health reform.
China currently lacks capacity for evidence-based guideline development and coordination by a central agency. Most Chinese guideline users rely on recommendations developed by professional groups that lack demonstration of transparency (including conflict of interest management and evidence synthesis) and quality. These deficiencies appear larger than in other regions of the world. In addition, misperceptions about the role of guidelines in assisting practitioners as opposed to providing rules requiring adherence, and a perception that traditional Chinese medicine (TCM) cannot be appropriately incorporated in guidelines are present.
China’s capacity could be strengthened by a central guideline agency to provide or coordinate evidence synthesis for guideline development and to oversee the work of guideline developers. China can build on what is known and work with the international community to develop methods to meet the challenges of evidence-based guideline development.
Healthcare decision makers face challenges when using guidelines, including understanding the quality of the evidence or the values and preferences upon which recommendations are made, which are often not clear.
GRADE is a systematic approach towards assessing the quality of evidence and the strength of recommendations in healthcare. GRADE also gives advice on how to go from evidence to decisions. It has been developed to address the weaknesses of other grading systems and is now widely used internationally. The Developing and Evaluating Communication Strategies to Support Informed Decisions and Practice Based on Evidence (DECIDE) consortium (http://www.decide-collaboration.eu/), which includes members of the GRADE Working Group and other partners, will explore methods to ensure effective communication of evidence-based recommendations targeted at key stakeholders: healthcare professionals, policymakers, and managers, as well as patients and the general public. Surveys and interviews with guideline producers and other stakeholders will explore how presentation of the evidence could be improved to better meet their information needs. We will collect further stakeholder input from advisory groups, via consultations and user testing; this will be done across a wide range of healthcare systems in Europe, North America, and other countries. Targeted communication strategies will be developed, evaluated in randomized trials, refined, and assessed during the development of real guidelines.
Results of the DECIDE project will improve the communication of evidence-based healthcare recommendations. Building on the work of the GRADE Working Group, DECIDE will develop and evaluate methods that address communication needs of guideline users. The project will produce strategies for communicating recommendations that have been rigorously evaluated in diverse settings, and it will support the transfer of research into practice in healthcare systems globally.
Guidelines; Recommendations; Communication; Presentation formats
Venous thromboembolism (VTE) is a common preventable cause of mortality in hospitalized medical patients. Despite rigorous randomized trials generating strong recommendations for anticoagulant use to prevent VTE, nearly 40% of medical patients receive inappropriate thromboprophylaxis. Knowledge-translation strategies are needed to bridge this gap.
We conducted a 16-week pilot cluster randomized controlled trial (RCT) to determine the proportion of medical patients that were appropriately managed for thromboprophylaxis (according to the American College of Chest Physicians guidelines) within 24 hours of admission, through the use of a multicomponent knowledge-translation intervention. Our primary goal was to determine the feasibility of conducting this study on a larger scale. The intervention comprised clinician education, a paper-based VTE risk assessment algorithm, printed physicians’ orders, and audit and feedback sessions. Medical wards at six hospitals (representing clusters) in Ontario, Canada were included; three were randomized to the multicomponent intervention and three to usual care (i.e., no active strategies for thromboprophylaxis in place). Blinding was not used.
A total of 2,611 patients (1,154 in the intervention and 1,457 in the control group) were eligible and included in the analysis. This multicomponent intervention did not lead to a significant difference in appropriate VTE prophylaxis rates between intervention and control hospitals (appropriate management rate odds ratio = 0.80; 95% confidence interval: 0.50, 1.28; p = 0.36; intra-class correlation coefficient: 0.022), and thus was not considered feasible. Major barriers to effective knowledge translation were poor attendance by clinical staff at education and feedback sessions, difficulty locating preprinted orders, and lack of involvement by clinical and administrative leaders. We identified several factors that may increase uptake of a VTE prophylaxis strategy, including local champions, support from clinical and administrative leaders, mandatory use, and a simple, clinically relevant risk assessment tool.
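Two of the reported quantities lend themselves to a quick arithmetic check. The sketch below rests on two assumptions that the abstract does not state: that the confidence interval for the odds ratio was computed on the log scale (in which case the point estimate sits at the geometric midpoint of the limits), and that the six clusters were of roughly equal size, so that the standard design effect formula 1 + (m - 1) x ICC can be applied with m taken as the mean cluster size. The function names are illustrative.

```python
import math


def geometric_midpoint(lower: float, upper: float) -> float:
    """Geometric mean of two confidence limits.

    For an interval that is symmetric on the log scale, this recovers
    the point estimate.
    """
    if lower <= 0 or upper <= 0:
        raise ValueError("confidence limits for a ratio must be positive")
    return math.sqrt(lower * upper)


def design_effect(mean_cluster_size: float, icc: float) -> float:
    """Variance inflation due to clustering, assuming equal cluster sizes."""
    return 1 + (mean_cluster_size - 1) * icc


if __name__ == "__main__":
    # Reported values: OR 0.80 (95% CI 0.50 to 1.28), ICC 0.022,
    # 2,611 patients across six hospitals.
    print(f"midpoint of CI: {geometric_midpoint(0.50, 1.28):.2f}")
    deff = design_effect(2611 / 6, 0.022)
    print(f"approximate design effect: {deff:.1f}")
    print(f"approximate effective sample size: {2611 / deff:.0f}")
```

With these assumptions the midpoint agrees with the reported odds ratio of 0.80, and the design effect is roughly 10.6. Even a small intra-class correlation therefore reduces the information in 2,611 patients to that of a few hundred independent observations when clusters are this large.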
Hospitals allocated to our multicomponent intervention did not have a higher rate of medical inpatients appropriately managed for thromboprophylaxis than did hospitals that were not allocated to this strategy.
Thromboprophylaxis; Medical patients; Anticoagulants; Venous thromboembolism; Cluster randomization; Standard orders
Use of antiretroviral therapy (ART) during treatment of drug susceptible tuberculosis (TB) improves survival. However, data from HIV infected individuals with drug resistant TB are lacking. Second line TB drugs when combined with ART may increase drug interactions and lead to higher rates of toxicity and greater noncompliance. This systematic review sought to determine the benefit of ART in the setting of second line drug therapy for drug resistant TB.
We included individual patient data from studies that evaluated treatment of drug-resistant tuberculosis in HIV-1 infected individuals published between January 1980 and December of 2009. We evaluated the effect of ART on treatment outcomes, time to smear and culture conversion, and adverse events.
Ten observational studies, including data from 217 subjects, were analyzed. Patients using ART during TB treatment had increased likelihood of cure (hazard ratio (HR) 3.4, 95% CI 1.6–7.4) and decreased likelihood of death (HR 0.4, 95% CI 0.3–0.6) during treatment for drug resistant TB. These associations remained significant in patients with a CD4 count of less than 200 cells/mm³ and less than 50 cells/mm³, and when correcting for drug resistance pattern.
We identified only observational studies from which individual patient data could be drawn. Limitations in study design, and heterogeneity in a number of the outcomes of interest had the potential to introduce bias.
While there are insufficient data to determine if ART use increases adverse drug interactions when used with second line TB drugs, ART use during treatment of drug resistant TB appears to improve cure rates and decrease risk of death. All individuals with HIV appear to benefit from ART use during treatment for TB.
Many academic medical centres have introduced strategies to assess the productivity of faculty as part of compensation schemes. We conducted a systematic review of the effects of such strategies on faculty productivity.
We searched the MEDLINE, Healthstar, Embase and PsycInfo databases from their date of inception up to October 2011. We included studies that assessed academic productivity in clinical, research, teaching and administrative activities, as well as compensation, promotion processes and satisfaction.
Of 531 full-text articles assessed for eligibility, we included 9 articles reporting on eight studies. The introduction of strategies for assessing academic productivity as part of compensation schemes resulted in increases in clinical productivity (in six of six studies) in terms of clinical revenue, the work component of relative-value units (these units are nonmonetary standard units of measure used to indicate the value of services provided), patient satisfaction and other departmentally used standards. Increases in research productivity were noted (in five of six studies) in terms of funding and publications. There was no change in teaching productivity (in two of five studies) in terms of educational output. Such strategies also resulted in increases in compensation at both individual and group levels (in three studies), with two studies reporting a change in distribution of compensation in favour of junior faculty. None of the studies assessed effects on administrative productivity or promotion processes. The overall quality of evidence was low.
Strategies introduced to assess productivity as part of a compensation scheme appeared to improve productivity in research activities and possibly improved clinical productivity, but they had no effect in the area of teaching. Compensation increased at both group and individual levels, particularly among junior faculty. Higher quality evidence about the benefits and harms of such assessment strategies is needed.
Clinical practice guidelines are one of the foundations of efforts to improve healthcare. In 1999, we authored a paper about methods to develop guidelines. Since it was published, guideline development has progressed in terms of both methods and necessary procedures, and its context has changed with the emergence of guideline clearinghouses and large-scale guideline production organisations (such as the UK National Institute for Health and Clinical Excellence). It therefore seems timely to update and extend our earlier paper in a series of three articles. In this second paper, we discuss issues of identifying and synthesizing evidence: deciding what type of evidence and outcomes to include in guidelines; integrating values into a guideline; incorporating economic considerations; synthesis, grading, and presentation of evidence; and moving from evidence to recommendations.