A methodological review of how heterogeneity has been examined in systematic reviews of diagnostic test accuracy
- PMID: 15774235
- DOI: 10.3310/hta9120
A methodological review of how heterogeneity has been examined in systematic reviews of diagnostic test accuracy
Abstract
Objectives: To review how heterogeneity has been examined in systematic reviews of diagnostic test accuracy studies.
Data sources: Centre for Reviews and Dissemination's Database of Abstracts of Reviews of Effects (DARE).
Review methods: Systematic reviews that evaluated a diagnostic or screening test by including studies that compared a test with a reference test were identified from DARE. Reviews for which structured abstracts had been written up to December 2002 were screened for inclusion. Data extraction was undertaken using standardised data extraction forms.
Results: A total of 189 systematic reviews met the inclusion criteria. The median number of studies included was 18. Meta-analyses have a higher number with a median of 22 studies compared with 11 for narrative reviews. Graphical plots to demonstrate the spread in study results were provided in 56% of meta-analyses; in 79% these were plots of sensitivity and specificity in the receiver operating characteristic (ROC) space. Statistical tests to identify heterogeneity were used in 32% of reviews: 41% of meta-analyses and 9% of reviews using narrative syntheses. The chi-squared test and Fisher's exact test to assess heterogeneity in individual aspects of test performance were the most common. In contrast, only 16% of meta-analyses used correlation coefficients to test for a threshold effect. A narrative synthesis was used in 30% of reviews. Of the meta-analyses, 52% carried out statistical pooling alone, 18% conducted only summary receiver operator characteristic (SROC) analyses and 30% used both methods of statistical synthesis. For those undertaking SROC analyses, the main differences between the models used were the weights chosen for the regression models, although in 42% of cases the use of, or choice of, weight was not provided. The proportion of reviews using statistical pooling alone has declined from 67% in 1995 to 42% in 2001, with a corresponding increase in the use of SROC methods, from 33% to 58%. However, two-thirds of those using SROC methods also carried out statistical pooling rather than presenting only SROC models. Reviews using SROC analyses also tended to present their results as some combination of sensitivity and specificity rather than using alternative, perhaps less clinically meaningful, means of data presentation such as diagnostic odds ratios. Three-quarters of meta-analyses attempted to investigate statistically possible sources of variation, using subgroup analysis or regression analysis. The impact of clinical or socio-demographic variables was investigated in 74% of these reviews and test- or threshold-related variables in 79%. At least one quality-related variable was investigated in 63% of reviews. Within this subset, the most commonly considered variables were the use of blinding, sample size, the reference test used and the avoidance of verification bias.
Conclusions: The emphasis on pooling individual aspects of diagnostic test performance and the under-use of statistical tests and graphical approaches to identify heterogeneity perhaps reflect the uncertainty in the most appropriate methods to use and also greater familiarity with more traditional indices of test accuracy. This indicates the difficulty and complexity of carrying out such reviews. In these cases it is strongly suggested that meta-analyses are carried out with the involvement of a statistician familiar with the field. Further methodological work on the statistical methods available for combining diagnostic test accuracy studies is needed, as are sufficiently large, prospectively designed primary studies of diagnostic test accuracy comparing two or more tests for the same target disorder. Use of individual patient data meta-analysis in diagnostic test accuracy reviews should be explored to allow heterogeneity to be considered in more detail.
Similar articles
-
Graphical enhancements to summary receiver operating characteristic plots to facilitate the analysis and reporting of meta-analysis of diagnostic test accuracy data.Res Synth Methods. 2021 Jan;12(1):34-44. doi: 10.1002/jrsm.1439. Epub 2020 Aug 12. Res Synth Methods. 2021. PMID: 32706182
-
Systematic reviews and meta-analyses of diagnostic test accuracy.Clin Microbiol Infect. 2014 Feb;20(2):105-13. doi: 10.1111/1469-0691.12474. Clin Microbiol Infect. 2014. PMID: 24274632 Review.
-
Low-Dose Aspirin for the Prevention of Morbidity and Mortality From Preeclampsia: A Systematic Evidence Review for the U.S. Preventive Services Task Force [Internet].Rockville (MD): Agency for Healthcare Research and Quality (US); 2014 Apr. Report No.: 14-05207-EF-1. Rockville (MD): Agency for Healthcare Research and Quality (US); 2014 Apr. Report No.: 14-05207-EF-1. PMID: 24783270 Free Books & Documents. Review.
-
Response to letter to the editor from Dr Rahman Shiri: The challenging topic of suicide across occupational groups.Scand J Work Environ Health. 2018 Jan 1;44(1):108-110. doi: 10.5271/sjweh.3698. Epub 2017 Dec 8. Scand J Work Environ Health. 2018. PMID: 29218357
-
Meta-DiSc 2.0: a web application for meta-analysis of diagnostic test accuracy data.BMC Med Res Methodol. 2022 Nov 28;22(1):306. doi: 10.1186/s12874-022-01788-2. BMC Med Res Methodol. 2022. PMID: 36443653 Free PMC article.
Cited by
-
Cerebral Small-Vessel Disease and Risk of Incidence of Depression: A Meta-Analysis of Longitudinal Cohort Studies.J Am Heart Assoc. 2020 Aug 4;9(15):e016512. doi: 10.1161/JAHA.120.016512. Epub 2020 Jul 25. J Am Heart Assoc. 2020. PMID: 32715831 Free PMC article.
-
The salivary microbiome as a diagnostic biomarker of periodontitis: a 16S multi-batch study before and after the removal of batch effects.Front Cell Infect Microbiol. 2024 Jul 12;14:1405699. doi: 10.3389/fcimb.2024.1405699. eCollection 2024. Front Cell Infect Microbiol. 2024. PMID: 39071165 Free PMC article.
-
Is three-dimensional ultrasonography a valuable diagnostic tool for patients with ovarian cancer? Systematic review and meta-analysis.Front Oncol. 2024 Jul 8;14:1404426. doi: 10.3389/fonc.2024.1404426. eCollection 2024. Front Oncol. 2024. PMID: 39040447 Free PMC article.
-
Diagnostic Performance of Intravascular Ultrasound-Derived Minimal Lumen Area to Predict Functionally Significant Non-Left Main Coronary Artery Disease: a Meta-Analysis.Korean Circ J. 2016 Sep;46(5):622-631. doi: 10.4070/kcj.2016.46.5.622. Epub 2016 Sep 28. Korean Circ J. 2016. PMID: 27721852 Free PMC article.
-
The Diagnostic Performance of Linked Color Imaging Compared to White Light Imaging in Endoscopic Diagnosis of Helicobacter pylori Infection: A Systematic Review and Meta-Analysis.Gut Liver. 2024 May 15;18(3):444-456. doi: 10.5009/gnl230244. Epub 2023 Oct 6. Gut Liver. 2024. PMID: 37800315 Free PMC article. Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous