Training the ACRIN 6666 Investigators and Effects of Feedback on Breast Ultrasound Interpretive Performance and Agreement in BI-RADS Ultrasound Feature Analysis

1 July 2012

journal article
Published by American Roentgen Ray Society in American Journal of Roentgenology

Vol. 199 (1), 224-235
https://doi.org/10.2214/ajr.11.7324

Abstract

OBJECTIVE. Qualification tasks in mammography and breast ultrasound were developed for the American College of Radiology Imaging Network (ACRIN) 6666 Investigators. We sought to assess the effects of feedback on breast ultrasound interpretive performance and agreement in BI-RADS feature analysis among a subset of these experienced observers. MATERIALS AND METHODS. After a 1-hour didactic session on BI-RADS: Ultrasound, an interpretive skills quiz set of 70 orthogonal sets of breast ultrasound images including 25 (36%) malignancies was presented to 100 experienced breast imaging observers. Thirty-five observers reviewed the quiz set twice: first without and then with immediate feedback of consensus feature analysis, management recommendations, and pathologic truth. Observer performance (sensitivity, specificity, area under the curve [AUC]) was calculated without feedback and with feedback. Kappas were determined for agreement on feature analysis and assessments. RESULTS. For 35 observers without feedback, the mean sensitivity was 89% (range, 68–100%); specificity, 62% (range, 42–82%); and AUC, 82% (range, 73–89%). With feedback, the mean sensitivity was 93% (range, 80–100%; mean increase, 4%; range of increase, 0–12%; p < 0.0001), the mean specificity was 61% (range, 45–73%; mean decrease, 1%; range of change, –18% to 11%; p = 0.19), and the mean AUC was 84% (range, 78–90%; mean increase, 2%; range of change, –3% to 9%; p < 0.0001). Three breast imagers in the lowest quartile of initial performance showed the greatest improvement in sensitivity with no change or improvement in AUC. The kappa values for feature analysis did not change, but there was improved agreement about final assessments, with the kappa value increasing from 0.53 (SE, 0.02) without feedback to 0.59 (SE, 0.02) with feedback (p < 0.0001). CONCLUSION. Most experienced breast imagers showed excellent breast ultrasound interpretive skills. Immediate feedback of consensus BI-RADS: Ultrasound features and histopathologic results improved performance in ultrasound interpretation across all experience variables.

Keywords

This publication has 51 references indexed in Scilit:

Shear-wave Elastography Improves the Specificity of Breast US: The BE1 Multinational Study of 939 Masses
Radiology, 2012
Mammographic Interpretive Volume and Diagnostic Mammogram Interpretation Performance in Community Practice
Radiology, 2012
Cystic Breast Masses and the ACRIN 6666 Experience
Radiologic Clinics of North America, 2010
Multicenter Study of Ultrasound Real-Time Tissue Elastography in 779 Cases for the Assessment of Breast Lesions: Improved Diagnostic Performance by Combining the BI-RADS®-US Classification System with Sonoelastography
Ultraschall in der Medizin - European Journal of Ultrasound, 2010
Breast cancer detection using automated whole breast ultrasound and mammography in radiographically dense breasts
European Radiology, 2009
The “Laboratory” Effect: Comparing Radiologists' Performance and Variability during Prospective Clinical and Laboratory Mammography Interpretations
Radiology, 2008
Combined Screening With Ultrasound and Mammography vs Mammography Alone in Women at Elevated Risk of Breast Cancer
JAMA, 2008
Radiologist Characteristics Associated With Interpretive Performance of Diagnostic Mammography
JNCI Journal of the National Cancer Institute, 2007
Interexamination variation of whole breast ultrasound
The British Journal of Radiology, 2003
Sonography of solid breast lesions: observer variability of lesion description and assessment.
American Journal of Roentgenology, 1999

Cited by 52 articles