Search tips
Search criteria 


Logo of jidLink to Publisher's site
J Infect Dis. 2011 November 15; 204(Suppl 4): S1120–S1129.
PMCID: PMC3192542

Interferon-γ Release Assays for Active Pulmonary Tuberculosis Diagnosis in Adults in Low- and Middle-Income Countries: Systematic Review and Meta-analysis


Background. The diagnostic value of interferon-γ release assays (IGRAs) for active tuberculosis in low- and middle-income countries is unclear.

Methods. We searched multiple databases for studies published through May 2010 that evaluated the diagnostic performance of QuantiFERON-TB Gold In-Tube (QFT-GIT) and T-SPOT.TB (T-SPOT) among adults with suspected active pulmonary tuberculosis or patients with confirmed cases in low- and middle-income countries. We summarized test performance characteristics with use of forest plots, hierarchical summary receiver operating characteristic (HSROC) curves, and bivariate random effects models.

Results. Our search identified 789 citations, of which 27 observational studies (17 QFT-GIT and 10 T-SPOT) evaluating 590 human immunodeficiency virus (HIV)–uninfected and 844 HIV-infected individuals met inclusion criteria. Among HIV-infected patients, HSROC/bivariate pooled sensitivity estimates (highest quality data) were 76% (95% confidence interval [CI], 45%–92%) for T-SPOT and 60% (95% CI, 34%–82%) for QFT-GIT. HSROC/bivariate pooled specificity estimates were low for both IGRA platforms among all participants (T-SPOT, 61% [95% CI, 40%–79%]; QFT-GIT, 52% [95% CI, 41%–62%]) and among HIV-infected persons (T-SPOT, 52% [95% CI, 40%–63%]; QFT-GIT, 50% [95% CI, 35%–65%]). There was no consistent evidence that either IGRA was more sensitive than the tuberculin skin test for active tuberculosis diagnosis.

Conclusions. In low- and middle-income countries, neither the tuberculin skin test nor IGRAs have value for active tuberculosis diagnosis in adults, especially in the context of HIV coinfection.

Interferon-γ release assays (IGRAs) are the first new diagnostic test for latent tuberculosis (LTBI) in >100 years. Newest generation IGRAs measure interferon (IFN)–γ secretion after exposure of whole blood (QuantiFERON-TB Gold In-Tube [QFT-GIT], Cellestis) or peripheral blood mononuclear cells (T-SPOT.TB [T-SPOT], Oxford Immunotec) to antigens encoded in the region of difference–1 (RD1), a portion of the Mycobacterium tuberculosis genome absent among all bacille Calmette-Guérin (BCG) strains and most nontuberculous mycobacteria [1]. We have shown in previous systematic reviews that compared with the tuberculin skin test (TST), IGRAs have higher specificity for LTBI in settings with low tuberculosis incidence, better correlation with surrogate measures of M. tuberculosis exposure, and less cross-reactivity with the BCG vaccine [24]. Thus, in recent years, IGRAs have become widely endorsed in high-income countries for diagnosis of LTBI [57].

However, IGRAs were explicitly designed to replace the TST in diagnosis of LTBI and were not intended for active tuberculosis, which is a microbiological diagnosis. Furthermore, diagnosis and treatment of LTBI remains limited in scope in most low- and middle-income countries, where detection and management of active tuberculosis is of highest priority for national tuberculosis programs. Because IGRAs, like the TST, cannot distinguish LTBI from active tuberculosis [810], these tests can be expected to have poor specificity for active tuberculosis in all high-burden settings because of a high background prevalence of LTBI [11]. Additional differences in patient spectrum, such as anergy due to advanced disease, malnutrition, and human immunodeficiency virus (HIV)–associated immune suppression, or characteristics of the setting, such as laboratory procedures and infrastructure, may also contribute to a lower performance of IGRAs observed in these settings [12]. However, private sector laboratories in high-burden countries increasingly use IGRAs for active tuberculosis diagnosis [13], and many investigators continue to recommend the use of IGRAs for active tuberculosis diagnosis [1417].

Because of unclear benefits and potential costs to patients and national tuberculosis programs, we conducted a systemic review and meta-analysis to determine IGRA test performance in persons with suspected or confirmed active pulmonary tuberculosis living in low- and middle-income settings.



Because of the absence of studies evaluating patient-important outcomes in persons with suspected tuberculosis who were randomized to treatment on the basis of IGRA results, we focused our review on the diagnostic accuracy of IGRAs for active tuberculosis. We observed standard guidelines and methods for systematic reviews and meta-analyses of diagnostic tests [1821].

Search Methods

We previously published systematic and narrative reviews on the accuracy and performance of IGRAs in various subgroups [24, 10, 12]. We updated the previous literature searches to identify all studies evaluating IGRAs published through May 2010. We searched PubMed, Embase, Biosis, and Web of Science for studies in all languages. The search terms used included “interferon-gamma release assay,” “T cell–based assay,” “antigen-specific T cell,” “T cell response,” “T-cell response,” “interferon,” “interferon-gamma,” “gamma-interferon,” “IFN,” “elispot,” “ESAT-6,” “CFP-10,” “culture filtrate protein,” “enzyme-linked immunosorbent spot,” “Quantiferon,” “Quantiferon-TB,” “tuberculosis,” and “Mycobacterium tuberculosis.” In addition to database searches, we reviewed bibliographies of reviews and guidelines, screened citations of all included studies, searched for ongoing studies, and contacted both experts in the field and IGRA manufacturers to identify additional published and unpublished studies. We requested pertinent information not reported in the original publication from the primary authors of all studies included in the review.

Study Selection and Data Collection

We included studies that evaluated the performance of the most recent generation of commercial, RD1 antigen-based IGRAs (QFT-GIT and T-SPOT) among adults (age ≥15 years) with suspected active pulmonary tuberculosis or confirmed tuberculosis in low- and middle-income countries [22]; the World Bank Country Classification was considered as a surrogate for national tuberculosis incidence. HIV infection was established either by documented serological testing or self-report. We excluded (1) studies that evaluated noncommercial (in-house) IGRAs, purified protein derivative–based IGRAs, QuantiFERON-TB Gold (2G), and IGRAs performed using specimens other than blood; (2) longitudinal data focused on the effect of antituberculosis treatment on IGRA response; (3) studies including <10 eligible individuals; (4) studies focused on extrapulmonary tuberculosis or children (age <15 years); (5) studies reporting insufficient data to determine diagnostic accuracy measures; and (6) conference abstracts, letters without original data, and reviews.

At least 2 reviewers (J. Z. M., C. K. E., K. R. S., or A. C.) independently screened the accumulated citations for relevance, reviewed full-text articles using the prespecified eligibility criteria, and extracted data with use of a standardized form. The reviewers resolved disagreements about study selection and data extraction by consensus.

Assessment of Study Quality

Because primary outcomes for this systematic review focus on test accuracy, we evaluated study quality with use of a subset of relevant criteria from the Quality Assessment of Diagnostic Accuracy Studies (QUADAS) tool, a validated tool for diagnostic accuracy studies [23]. Because of growing concerns about conflicts of interest in diagnostic studies and guidelines [24, 25], we also reported whether IGRA manufacturers had any involvement with the design or conduct of each study, including donation of materials, monetary support, work and/or financial relationships with study authors, and participation in data analysis.

Outcome Definitions

Well-designed diagnostic accuracy studies focus on a representative target population in whom genuine diagnostic uncertainty exists (ie, patients for whom clinicians would apply the test in the course of regular clinical practice) [26]. There is evidence that diagnostic studies that include only known patients with the condition of interest and healthy control subjects without this condition tend to overestimate test accuracy [27]. Therefore, we considered studies simultaneously evaluating IGRA sensitivity and specificity among persons with suspected active tuberculosis to represent the highest quality evidence, whereas studies evaluating IGRA performance among patients with known active tuberculosis (for sensitivity) were considered to be of lesser quality. Because of our focus on active tuberculosis diagnostic accuracy and the high prevalence of LTBI in settings with a high tuberculosis burden, IGRA specificity was estimated exclusively among studies enrolling persons with suspected active tuberculosis for whom the diagnostic examination ultimately showed no evidence of active disease.

A hierarchy of reference standards for active tuberculosis was developed a priori to judge the quality of each individual assessment of IGRA diagnostic accuracy. From most to least favorable, these reference standards included (1) culture confirmation or sputum smear positivity in settings with high tuberculosis incidence (≥50 cases/100000 population), where sputum smear microscopy has been shown to have high specificity [28]; (2) sputum smear positivity without culture confirmation in settings with low or intermediate tuberculosis incidence (<50 cases/100000 population); and (3) clinical diagnosis based on presenting symptoms, radiologic findings, and/or response to tuberculosis treatment without microbiological confirmation. Because the TST remains in widespread use and indeterminate IGRA results may affect assay performance in low-income settings, we also evaluated (1) observed differences in sensitivity for active tuberculosis diagnosis between IGRA and TST, and (2) the proportion of IGRA results among patients with active disease that were indeterminate.

We used the following definitions for primary outcomes: (1) sensitivity was defined as the proportion of individuals with a positive IGRA result among those with culture-positive tuberculosis (we included indeterminate IGRA results in the denominator if they occurred in individuals with culture-positive tuberculosis), and (2) specificity was defined as the proportion of individuals with a negative IGRA result among those who had active tuberculosis disease ruled out (indeterminate IGRA results were excluded from analysis). With use of the Grading of Recommendations Assessment, Development and Evaluation framework [26], these measures can be interpreted as surrogates for patient-important outcomes.

Data Synthesis and Meta-Analysis

Multiple sources of heterogeneity frequently exist when summarizing estimates from studies of diagnostic tests [29]. We adopted the following approach to account for expected heterogeneity. First, when possible, we separately synthesized data for each commercial IGRA and by HIV status. The prespecified subgroups minimize heterogeneity related to differences in testing platform (enzyme-linked immunosorbent assay vs enzyme-linked immunospot assay), antigens used to elicit IFN-γ release (ESAT-6/CFP-10 vs ESAT-6/CFP-10/TB 7.7), and test performance related to HIV-associated host immunosuppression. Second, we visually assessed heterogeneity with use of forest plots, characterized the variation in study results attributable to heterogeneity (I-squared value), and statistically tested for heterogeneity (χ2 test) [29]. Third, we calculated pooled sensitivity and specificity estimates with use of random effects modeling, which provides more conservative estimates than does fixed effects modeling when heterogeneity is a concern [19, 30].

For each individual study, we assessed all outcomes for which data were available. First, we generated forest plots to display the individual study estimates and their 95% confidence intervals (CIs). Second, we used bivariate random effects regression models [31] when both sensitivity and specificity could be reported from the same population of tuberculosis suspects. Because pooling sensitivity and specificity separately can produce biased estimates of test accuracy [19], we preferred to generate pooled estimates when both sensitivity and specificity were reported in a study and ranked this as higher-quality evidence. Third, we generated hierarchical summary receiver operating characteristic (HSROC) curves to summarize the global test performance [30]. Because of the need to summarize 2 correlated measures (eg, sensitivity and specificity) and because substantial between-study heterogeneity is common, meta-analysis of diagnostic accuracy requires different and more complex methods than do traditional meta-analytic techniques. Graphically illustrating the trade-off between sensitivity and specificity, HSROC curves differ from traditional ROC curves in allowing accuracy to vary by each individual study (ie, allowing for random effects and, thus, asymmetry in the plotted curve) and by discouraging extrapolation beyond the available data by plotting the curve only over the observed range of test characteristics. The HSROC approach is closely related to the bivariate random effects regression model [32]. These 2 methods generally produce similar results and are both recommended by the Cochrane Diagnostic Test Accuracy Working Methods group [20]. We calculated pooled estimates when at least 4 studies were available in any subgroup and summarized individual study results when <4 studies were available. We performed all analyses with use of Stata, version 11 (StataCorp). For bivariate random effects regression and HSROC analyses, we used the user-written “metandi” program for Stata [31].


Search Results

The initial search yielded 789 citations (Figure 1). After full-text review of 168 articles, 19 [15, 17, 3349] were determined to meet eligibility criteria for IGRA evaluation of active tuberculosis in low- and middle-income settings. Because some articles included >1 commercial IGRA, there were 27 unique evaluations (referred to as studies; 17 of QFT-GIT and 10 of T-SPOT) that included a total of 590 HIV-uninfected and 844 HIV-infected individuals.

Figure 1.
Study selection. IGRA, interferon-γ release assay; LTBI, latent tuberculosis infection.

Study Characteristics

Of the total studies, 7 (26%) were from low-income countries and 20 (74%) were from middle-income countries. Fourteen studies (52%) included HIV-infected individuals, and 21(78%) studies included ambulatory patients (Table 1). IGRAs were performed for persons suspected of having active tuberculosis in 14 studies (52%) [34, 3638, 40, 41, 46, 47, 49] and in persons with known active tuberculosis in 13 studies (48%) [15, 17, 33, 35, 39, 4245, 48]. A list of excluded studies and reasons for exclusion is available from the authors on request.

Table 1.
Characteristics of Included Studies

Study Quality

The majority of studies satisfied the QUADAS criteria assessed (Figure 2), with the exception of patient spectrum (biased sampling) and blinding. Sixteen studies (59%) did not enroll a representative spectrum of patients, and 9 (33%) did not clearly report whether assessment of the reference standard was performed with blinding to IGRA results. Industry involvement was unknown in 5 studies (19%) and acknowledged in 8 (30%), including donation of IGRA kits (6 studies) and work and/or financial relationships between authors and IGRA manufacturers (2 studies).

Figure 2.
Assessment of study quality with use of the Quality Assessment of Diagnostic Accuracy Studies (QUADAS) tool. For each QUADAS item, 2 reviewers independently determined whether a study met the quality criterion or whether it was unclear.

Sensitivity and Specificity Estimation Among Persons With Suspected Tuberculosis

We identified a total of 14 studies that simultaneously estimated sensitivity and specificity among persons with suspected tuberculosis, and test accuracy estimates were pooled using bivariate random effects and/or HSROC methods (these studies were ranked as high-quality evidence). Overall, studies enrolling persons with suspected active tuberculosis revealed a sensitivity of 83% (95% CI, 63%–94%) and specificity of 61% (95% CI, 40%–79%) for T-SPOT (6 studies) and a sensitivity of 69% (95% CI, 52%–83%) and specificity of 52% (95% CI, 41%–63%) for QFT-GIT (8 studies).


With the exception of 2 studies [36, 47], the sensitivity of IGRAs was assessed on the basis of a positive culture result (21 studies [78%]) or a positive sputum acid-fast bacilli smear result in a setting with high tuberculosis incidence (4 studies [15%]). Among studies performed in patients with known active tuberculosis, 6 (46%) included patients who had been treated for >1 week.

HIV-Infected Persons.

Nine studies assessed IGRA sensitivity among HIV-infected persons with suspected active tuberculosis. HSROC and/or bivariate pooled sensitivity estimates were higher for T-SPOT (76%; 95% CI, 45%–92%; 4 studies [34, 37, 40, 41]) than for QFT-GIT (60%; 95% CI, 34%–82%; 5 studies [37, 38, 40, 41, 49]) (Figure 3). Pooled sensitivity estimates did not change appreciably for either T-SPOT (68%; 95% CI, 56%–80%; 5 studies [15, 34, 4042]) or QFT-GIT (65%; 95% CI, 52%–77%; 7 studies [33, 38, 40, 41, 48, 49]) when studies evaluating patients with known active tuberculosis were included in the analysis (Figure 4). Pooled sensitivity estimates for both T-SPOT (I-squared, 72%; P < .01) and QFT-GIT (I-squared, 76%; P < .001) showed significant heterogeneity.

Figure 3.
A and B, Hierarchical summary receiver operating characteristic (HSROC) plot of studies that reported both sensitivity and specificity among persons with suspected active tuberculosis. The summary curves from the HSROC model contain a summary operating ...
Figure 4.
Sensitivity of QuantiFERON-TB Gold In-Tube and T-SPOT.TB in human immunodeficiency virus (HIV)–infected persons with confirmed active tuberculosis in low- and middle-income countries. The forest plots display the sensitivity estimates obtained ...

HIV-Uninfected Persons.

Five studies assessed IGRA sensitivity among HIV-uninfected persons with suspected active tuberculosis; data were insufficient to report HSROC and/or bivariate pooled sensitivity estimates for either QFT-GIT [36, 37, 47] or T-SPOT [37, 46]. Pooled sensitivity estimates were similar for T-SPOT (88%; 95% CI, 81%–95%; 4 studies [17, 37, 43, 46]) and QFT-GIT (84%; 95% CI, 78%–91%; 9 studies [10, 33, 3537, 39, 45, 47, 48]) when studies evaluating patients with known active tuberculosis were included in the analysis (Figure 5). Pooled sensitivity estimates showed significant heterogeneity for QFT-GIT (I-squared, 60%; P = .01) but not for T-SPOT (I-squared, 28%; P = .25).

Figure 5.
Sensitivity of QuantiFERON-TB Gold In-Tube and T-SPOT.TB among human immunodeficiency virus (HIV)–uninfected persons with confirmed active tuberculosis in low- and middle-income countries. The forest plots show the sensitivity estimates obtained ...

Comparisons of QFT-GIT and T-SPOT Sensitivity.

Overall, 4 studies (3 involving HIV-infected patients [37, 40, 41] and 1 involving HIV-uninfected persons [37]) reported comparisons of T-SPOT and QFT-GIT sensitivity. T-SPOT sensitivity was higher but not significantly different from QFT-GIT sensitivity (sensitivity difference, 19%; 95% CI, −17% to 56%; P = .3) (Table 2). Results were similar when restricted to HIV-infected individuals.

Table 2.
Comparison of Sensitivity of T-SPOT.TB Versus QuantiFERON-TB Gold In-Tube Among Persons With Suspected Active Tuberculosis

Comparisons of TST and IGRA Sensitivity.

Overall, 9 studies reported comparisons of TST and IGRA (3 T-SPOT and 6 QFT-GIT) sensitivity. TST sensitivity in the 5 studies [17, 39, 43, 45, 48] involving HIV-uninfected patients was higher (78%; 95% CI, 71%–86%) than that in the 4 studies [15, 38, 45, 48] involving HIV-infected patients (45%; 95% CI, 15%–75%). IGRA sensitivity was not statistically different from TST sensitivity for either T-SPOT (sensitivity difference, 23%; 95% CI, 0%–45%; P = .05) or QFT-GIT (sensitivity difference, 7%; 95% CI, −9% to 23%; P = .37) (Figure 6). There was significant heterogeneity for both estimates (I-squared, >75%; P < .001). Data were insufficient to form HIV-stratified pooled sensitivity difference estimates for either IGRA.

Figure 6.
Sensitivity difference between interferon-γ release assay (IGRA) and tuberculin skin test (TST) results. The forest plots display percent differences (IGRA sensitivity–TST sensitivity) for confirmed active pulmonary tuberculosis in individual ...


All specificity estimates were determined in persons with suspected tuberculosis with use of HSROC and/or bivariate techniques. Overall, pooled specificity was low for both T-SPOT (61%; 95% CI, 40%–79%; 6 studies) and QFT-GIT (52%; 95% CI, 41%–62%; 8 studies). When restricted to HIV-infected persons with suspected active tuberculosis, pooled specificity for T-SPOT (52%; 95% CI, 40%–63%; 4 studies [34, 37, 40, 41]) was similar to that for QFT-GIT (50%; 95% CI, 35%–65%; 5 studies [37, 38, 40, 41, 49]) (Figure 3). An insufficient number of studies were available to estimate pooled specificity for HIV-uninfected patients.

Proportion of Indeterminate IGRA Results

The proportion of indeterminate IGRA results among patients with suspected or confirmed active tuberculosis varied considerably (range of 0%–26% among studies enrolling ≥50 participants). The proportion of indeterminate results was low (4%; 95% CI, 1%–7%) among HIV-uninfected patients, regardless of IGRA platform (Figure 1; online only). However, the proportion of indeterminate results was considerably higher among HIV-infected patients for both QFT-GIT (15%; 95% CI, 9%–21%; 8 studies) and T-SPOT (9%; 95% CI, 0%–17%; 6 studies) (Figure 2; online only). Results were similar for HIV-infected patients when stratified by persons with suspected tuberculosis and persons with known active tuberculosis.


The vast majority of the estimated annual 9.4 million new cases of active tuberculosis and 1.7 million tuberculosis-related deaths occur in low- and middle-income countries [50]. Because of resource constraints, public health policies have appropriately placed limited emphasis on diagnosis and treatment of LTBI in these settings. Clinical use of IGRAs, however, has expanded dramatically in recent years, especially in the private sector [13]. Because of their high burden of disease and emerging economies, these countries (eg, India, South Africa, Brazil, and China) represent a potentially lucrative market for commercial IGRAs. Although IGRAs are intended for LTBI and not active tuberculosis disease, and although these tests cannot distinguish between latent infection and active disease, there is concern about increasing use of IGRAs for active tuberculosis in high-burden countries. In this systematic review focused on individuals living in low- and middle-income countries, the highest-quality evidence from persons with suspected tuberculosis demonstrated sensitivity of 69%–83% and specificity of 52%–61% for IGRAs in the diagnosis of active tuberculosis. Furthermore, there was no consistent evidence that either IGRA was more sensitive than the TST for active tuberculosis diagnosis.

The majority of evidence for the diagnostic accuracy of IGRAs to date has been summarized from high-income settings where active tuberculosis has been used as a surrogate reference standard for LTBI diagnosis [4, 14]. However, diagnostic test performance (eg, sensitivity and specificity) can be expected to vary according to disease prevalence and other population characteristics [51, 52]. Likewise, clinicians have been advised to base their decisions on studies that most closely match their own clinical circumstances [53].

IGRAs were designed as diagnostic tests of LTBI, though the lack of an accepted gold standard for LTBI has been a significant limitation in establishing test performance. In contrast, adequate and commonly used reference standards exist for diagnosing active tuberculosis. Among studies that enrolled persons with suspected active tuberculosis (ie, patients with diagnostic uncertainty), both IGRAs demonstrated suboptimal rule-out value for active tuberculosis. In other words, approximately 1 in 4 patients with culture-confirmed active tuberculosis can be expected to have negative IGRA results in low- and middle-income countries; this has consequences for patients in terms of morbidity and mortality. Although high-quality data were limited, sensitivity of both IGRAs was lower among HIV-infected patients (60%–70%), suggesting that ~1 in 3 HIV-infected patients with active tuberculosis will have negative IGRA results. The few available comparisons between QFT-GIT and T-SPOT revealed higher sensitivity for the T-SPOT platform, although this difference did not reach statistical significance. Lastly, comparisons with pooled estimates of TST sensitivity were difficult to interpret because of substantial heterogeneity. Our results, however, suggest that neither IGRA platform may be more sensitive than the TST for active tuberculosis diagnosis in low- and middle-income countries.

IGRA specificity in diagnosing LTBI, estimated among individuals at low risk for tuberculosis exposure in settings with low tuberculosis incidence (high-income settings), is known to be high (≥98%) [4]. In contrast, specificity for active tuberculosis diagnosis is best estimated only in studies evaluating persons with suspected tuberculosis. As expected, because of the higher background LTBI prevalence and the known inability of IGRAs to differentiate LTBI from active tuberculosis [10], the specificity of both IGRAs for active tuberculosis was low, regardless of HIV status. These data suggest that 1 in 2 patients without active tuberculosis will have positive IGRA results; this has consequences for patients because of unnecessary therapy for tuberculosis and its attendant risks. Studies demonstrating activated T-cell IFN-γ response throughout the entire spectrum of tuberculosis, from latency to active disease [54], lend biologic plausibility to our findings. Even in the spectrum of latent tuberculosis infection [55], activated T-cell IFN-γ responses occur throughout each phase, with the possible exception of the innate immune response (which eliminates M. tuberculosis without priming a T-cell immune response).

The goal of our systematic review was to critically evaluate the diagnostic accuracy of IGRAs for active tuberculosis diagnosis in low- and middle-income settings. However, there are inherent limitations to sensitivity, specificity, and predictive values as measures of test performance. These measures are unable to determine the extent to which a test may improve on readily available clinical information [56] or the degree to which patient-important outcomes are improved by test results [26]. Although limited, available data suggest that IGRAs may add little to the conventional diagnostic investigation for active tuberculosis in settings with low [57] and high tuberculosis incidence [58]. Additional work is necessary to confirm this.

Our meta-analysis has several limitations. First, as with previous systematic reviews [4, 14], heterogeneity was substantial for the primary outcomes of sensitivity and specificity. We used empirical random effects weighting, excluded all studies contributing <10 eligible individuals, and separately synthesized data for currently manufactured IGRAs to minimize heterogeneity. Second, World Bank income classification is an imperfect surrogate for national tuberculosis incidence. Although no standard criteria currently exist for defining countries with high tuberculosis incidence, our results were fundamentally unchanged when restricted to nations with a World Health Organization (WHO)-defined annual tuberculosis incidence of ≥50 cases/100000 population [50]. Third, it is likely that unpublished data and ongoing studies were missed. It is also possible that studies that found poor IGRA performance were less likely to be published. Because of the lack of statistical methods to account for publication bias in diagnostic meta-analyses, it would be prudent to assume some degree of overestimation of our estimates resulting from publication bias. Fourth, our review did not include evidence on use of IGRAs in 2 patient subgroups in which conventional tests for active tuberculosis perform poorly: children and patients with suspected extrapulmonary tuberculosis. Lastly, we did not identify any studies directly measuring the impact of IGRAs on patient-important outcomes.

In conclusion, as in the case of the TST, the data suggest no role for using IGRAs for active tuberculosis diagnosis for adults living in low- and middle-income countries. These data should help inform evidence-based policies on the role of IGRAs in active tuberculosis diagnosis in low- and middle-income settings. Indeed, a WHO Expert Group considering this evidence recently recommended that IGRAs should not be used as a replacement for conventional microbiological diagnosis of pulmonary and extrapulmonary tuberculosis in low-and middle-income countries [59].



We thank the authors of all studies included in the review for kindly responding to our requests for additional information; George Yen, for his help with translation; and UNICEF/UNDP/World Bank/WHO Special Programme for Research and Training in Tropical Diseases, WHO Stop TB Department, and New Diagnostics Working Group, Stop TB Partnership, for supporting this work.

Financial Support.

This work was supported in part by the National Institutes of Health (UCSF-CTSI KL2 RR024130 to J. Z. M., K23HL094141 to A. C., and K24 HL087713 to L. H.) and a New Investigator Award from the Canadian Institutes of Health Research (to M. P.).

Potential conflicts of interest.

K. R. S. serves as Coordinator of the Evidence Synthesis subgroup of Stop TB Partnership's New Diagnostics Working Group; M. P. serves as cochair of the Stop TB Partnership's New Diagnostics Working Group and as consultant to the Bill & Melinda Gates Foundation. All other authors: no conflicts.

All authors have submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest. Conflicts that the editors consider relevant to the content of the manuscript have been disclosed.


1. Pai M, Kalantri S, Dheda K. New tools and emerging technologies for the diagnosis of tuberculosis: part I. Latent tuberculosis. Expert Rev Mol Diagn. 2006;6:413–22. [PubMed]
2. Menzies D, Pai M, Comstock G. Meta-analysis: new tests for the diagnosis of latent tuberculosis infection: areas of uncertainty and recommendations for research. Ann Intern Med. 2007;146:340–54. [PubMed]
3. Pai M, Riley LW, Colford JM., Jr Interferon-gamma assays in the immunodiagnosis of tuberculosis: a systematic review. Lancet Infect Dis. 2004;4:761–76. [PubMed]
4. Pai M, Zwerling A, Menzies D. Systematic review: T cell-based assays for the diagnosis of latent tuberculosis infection: an update. Ann Intern Med. 2008;149:177–84. [PMC free article] [PubMed]
5. Mazurek G, Jereb J, Vernon A, LoBue P, Goldberg S. Updated guidelines for using interferon-gamma release assays to detect Mycobacterium tuberculosis infection, United States. MMWR. In press. [PubMed]
6. CTC. Canada communicable disease report: updated recommendations on interferon gamma release assays for latent tuberculosis infection. 2008. [PubMed]
7. NHS. Health Protection Agency position statement on the use of interferon gamma release assay (IGRA) tests for tuberculosis (TB) HPA Tuberculosis Programme Board; 2008.
8. Lange C, Pai M, Drobniewski F, Migliori GB. Interferon-gamma release assays for the diagnosis of active tuberculosis: sensible or silly? Eur Respir J. 2009;33:1250–3. [PubMed]
9. Menzies D. Using tests for latent tuberculous infection to diagnose active tuberculosis: can we eat our cake and have it too? Ann Intern Med. 2008;148:398–9. [PubMed]
10. Pai M, Menzies D. Interferon-gamma release assays: what is their role in the diagnosis of active tuberculosis? Clin Infect Dis. 2007;44:74–7. [PubMed]
11. Sester M, Sotgiu G, Lange C, et al. Interferon-{gamma} release assays for the diagnosis of active tuberculosis: a systematic review and meta-analysis. Eur Respir J [PubMed]
12. Dheda K, van Zyl Smit R, Badri M, Pai M. T-cell interferon-gamma release assays for the rapid immunodiagnosis of tuberculosis: clinical utility in high-burden vs. low-burden settings. Curr Opin Pulm Med. 2009;15:188–200. [PubMed]
13. Denkinger CM, Dheda K, Pai M. Guidelines on interferon-γ release assays for tuberculosis infection: concordance, discordance or confusion? Clin Microbiol Infect. 2011;6:806–14. [PubMed]
14. Diel R, Loddenkemper R, Nienhaus A. Evidence-based comparison of commercial interferon-gamma release assays for detecting active TB: a metaanalysis. Chest. 2010;137:952–68. [PubMed]
15. Jiang W, Shao L, Zhang Y, et al. High-sensitive and rapid detection of Mycobacterium tuberculosis infection by IFN-gamma release assay among HIV-infected individuals in BCG-vaccinated area. BMC Immunol. 2009;10:31. [PMC free article] [PubMed]
16. Kanunfre KA, Leite OH, Lopes MI, Litvoc M, Ferreira AW. Enhancement of diagnostic efficiency by a gamma interferon release assay for pulmonary tuberculosis. Clin Vaccine Immunol. 2008;15:1028–30. [PMC free article] [PubMed]
17. Soysal A, Torun T, Efe S, Gencer H, Tahaoglu K, Bakir M. Evaluation of cut-off values of interferon-gamma-based assays in the diagnosis of M. tuberculosis infection. Int J Tuberc Lung Dis. 2008;12:50–6. [PubMed]
18. Deville WL, Buntinx F, Bouter LM, et al. Conducting systematic reviews of diagnostic studies: didactic guidelines. BMC Med Res Methodol. 2002;2:9. [PMC free article] [PubMed]
19. Gatsonis C, Paliwal P. Meta-analysis of diagnostic and screening test accuracy evaluations: methodologic primer. AJR Am J Roentgenol. 2006;187:271–81. [PubMed]
20. Leeflang MM, Deeks JJ, Gatsonis C, Bossuyt PM. Systematic reviews of diagnostic test accuracy. Ann Intern Med. 2008;149:889–97. [PMC free article] [PubMed]
21. Pai M, McCulloch M, Enanoria W, Colford JM., Jr Systematic reviews of diagnostic test evaluations: what's behind the scenes? ACP J Club. 2004;141:A11–3. [PubMed]
22. World Bank List of Economies. Accessed 1 June 2010.
23. Whiting P, Rutjes AW, Reitsma JB, Bossuyt PM, Kleijnen J. The development of QUADAS: a tool for the quality assessment of studies of diagnostic accuracy included in systematic reviews. BMC Med Res Methodol. 2003;3:25. [PMC free article] [PubMed]
24. Fontela PS, Pant Pai N, Schiller I, Dendukuri N, Ramsay A, Pai M. Quality and reporting of diagnostic accuracy studies in TB, HIV and malaria: evaluation using QUADAS and STARD standards. PLoS One. 2009;4:e7753. [PMC free article] [PubMed]
25. Pai M, Minion J, Steingart K, Ramsay A. New and improved tuberculosis diagnostics: evidence, policy, practice, and impact. Curr Opin Pulm Med. 2010;16:271–84. [PubMed]
26. Schunemann HJ, Oxman AD, Brozek J, et al. Grading quality of evidence and strength of recommendations for diagnostic tests and strategies. BMJ. 2008;336:1106–10. [PMC free article] [PubMed]
27. Rutjes AW, Reitsma JB, Di Nisio M, Smidt N, van Rijn JC, Bossuyt PM. Evidence of bias and variation in diagnostic accuracy studies. CMAJ. 2006;174:469–76. [PMC free article] [PubMed]
28. Van Deun A. What is the role of mycobacterial culture in diagnosis and case definition? In: Frieden T, editor. Toman’s tuberculosis: case detection, treatment, and monitoring-questions and answers. 2nd ed. World Health Organization; 2004. pp. 35–43.
29. Lijmer JG, Bossuyt PM, Heisterkamp SH. Exploring sources of heterogeneity in systematic reviews of diagnostic tests. Stat Med. 2002;21:1525–37. [PubMed]
30. Rutter CM, Gatsonis CA. A hierarchical regression approach to meta-analysis of diagnostic test accuracy evaluations. Stat Med. 2001;20:2865–84. [PubMed]
31. Harbord R, Whiting P. Metandi: meta-analysis of diagnostic accuracy using hierarchical logistic regression. Stata J. 2009;9:211–29.
32. Harbord RM, Deeks JJ, Egger M, Whiting P, Sterne JA. A unification of models for meta-analysis of diagnostic accuracy studies. Biostatistics. 2007;8:239–51. [PubMed]
33. Aabye MG, Ravn P, PrayGod G, et al. The impact of HIV infection and CD4 cell count on the performance of an interferon gamma release assay in patients with pulmonary tuberculosis. PLoS One. 2009;4:e4220. [PMC free article] [PubMed]
34. Cattamanchi A, Smith R, Steingart KR, et al. Interferon-gamma release assays for the diagnosis of latent tuberculosis infection in HIV-infected individuals: a systematic review and meta-analysis. J Acquir Immune Defic Syndr. 2011;56:230–8. [PMC free article] [PubMed]
35. Chegou NN, Black GF, Kidd M, van Helden PD, Walzl G. Host markers in QuantiFERON supernatants differentiate active TB from latent TB infection: preliminary report. BMC Pulm Med. 2009;9:21. [PMC free article] [PubMed]
36. Chen X, Yang Q, Zhang M, et al. Diagnosis of active tuberculosis in China using an in-house gamma interferon enzyme-linked immunospot assay. Clin Vaccine Immunol. 2009;16:879–84. [PMC free article] [PubMed]
37. Dheda K, van Zyl-Smit RN, Meldau R, et al. Quantitative lung T cell responses aid the rapid diagnosis of pulmonary tuberculosis. Thorax. 2009;64:847–53. [PubMed]
38. Kabeer BSA, Sikhamani R, Swaminathan S, Perumal V, Paramasivam P, Raja A. Role of interferon gamma release assay in active TB diagnosis among HIV infected individuals. PLoS One. 2009;4:e5718. [PMC free article] [PubMed]
39. Katiyar SK, Sampath A, Bihari S, Mamtani M, Kulkarni H. Use of the QuantiFERON-TB Gold In-Tube test to monitor treatment efficacy in active pulmonary tuberculosis. Int J Tuberc Lung Dis. 2008;12:1146–52. [PubMed]
40. Leidl L, Mayanja-Kizza H, Sotgiu G, et al. Relationship of immunodiagnostic assays for tuberculosis and numbers of circulating CD4+ T cells in HIV-infection. Eur Respir J. 2010;35:619–26. [PubMed]
41. Markova R, Todorova Y, Drenska R, Elenkov I, Yankova M, Stefanova D. Usefulness of interferon-gamma release assays in the diagnosis of tuberculosis infection in HIV-infected patients in Bulgaria. Biotechnology & Biotechnological Equipment. 2009;23:1103–8.
42. Oni T, Patel J, Gideon HP, et al. Enhanced diagnosis of HIV-1 associated tuberculosis by relating T-SPOT.TB and CD4 counts. Eur Respir J. 2010;36:594–600. [PMC free article] [PubMed]
43. Ozekinci T, Ozbek E, Celik Y. Comparison of tuberculin skin test and a specific T cell-based test, T-Spot.TB, for the diagnosis of latent tuberculosis infection. J Int Med Res. 2007;35:696–703. [PubMed]
44. Pai M, Joshi R, Bandyopadhyay M, et al. Sensitivity of a whole-blood interferon-gamma assay among patients with pulmonary tuberculosis and variations in T cell responses during anti-tuberculosis treatment. Infection. 2007;35:98–103. [PMC free article] [PubMed]
45. Raby E, Moyo M, Devendra A, et al. The effects of HIV on the sensitivity of a whole blood IFN-gamma release assay in Zambian adults with active tuberculosis. PLoS One. 2008;3:e2489. [PMC free article] [PubMed]
46. Huang S, Lu S, Zhu Z, et al. Enzyme-linked immunospot assay combined with serum latex agglutination test for diagnosis of pulmonary tuberculosis and concomitant pulmonary cryptococcosis. Chin J Infect Chemother. 2009;9:252–5.
47. Tahereh K, Alireza N, Massoud S, Amina K. A validity study of the QuantiFERON-TB Gold (QFT-TB) method for the diagnosis of pulmonary tuberculosis in a high risk population. Swiss Med Wkly. 2010;140:95–6. [PubMed]
48. Tsiouris SJ, Coetzee D, Toro PL, Austin J, Stein Z, El-Sadr W. Sensitivity analysis and potential uses of a novel gamma interferon release assay for diagnosis of tuberculosis. J Clin Microbiol. 2006;44:2844–50. [PMC free article] [PubMed]
49. Veldsman C, Kock MM, Rossouw T, et al. QuantiFERON-TB GOLD ELISA assay for the detection of Mycobacterium tuberculosis-specific antigens in blood specimens of HIV-positive patients in a high-burden country. FEMS Immunol Med Microbiol. 2009;57:269–73. [PubMed]
50. WHO. Global tuberculosis control 2010. Geneva: World Health Organization; 2010.
51. Leeflang MM, Bossuyt PM, Irwig L. Diagnostic test accuracy may vary with prevalence: implications for evidence-based diagnosis. J Clin Epidemiol. 2009;62:5–12. [PubMed]
52. Brenner H, Gefeller O. Variation of sensitivity, specificity, likelihood ratios and predictive values with disease prevalence. Stat Med. 1997;16:981–91. [PubMed]
53. Guyatt GH, Oxman AD, Vist GE, et al. GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ. 2008;336:924–6. [PMC free article] [PubMed]
54. Andersen P, Doherty TM, Pai M, Weldingh K. The prognosis of latent tuberculosis: can disease be predicted? Trends Mol Med. 2007;13:175–82. [PubMed]
55. Barry CE, 3rd, Boshoff HI, Dartois V, et al. The spectrum of latent tuberculosis: rethinking the biology and intervention strategies. Nat Rev Microbiol. 2009;7:845–55. [PubMed]
56. Cook NR. Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation. 2007;115:928–35. [PubMed]
57. Metcalfe JZ, Cattamanchi A, Vittinghoff E, et al. Evaluation of quantitative IFN-gamma response for risk stratification of active tuberculosis suspects. Am J Respir Crit Care Med. 2010;181:87–93. [PMC free article] [PubMed]
58. Rangaka MX, Gideon HP, Wilkinson KA, et al. No discriminatory value of interferon release added to smear negative HIV-tuberculosis algorithms. Eur Respir J. 2011 doi:10.1183.09031936.00058911. [PMC free article] [PubMed]
59. WHO. Strategic and Technical Advisory Group for Tuberculosis (STAG-TB): report of the tenth meeting. Geneva: World Health Organization. 2010. Accessed 1 December 2010.

Articles from The Journal of Infectious Diseases are provided here courtesy of Oxford University Press