Training the ACRIN 6666 Investigators and Effects of Feedback on Breast Ultrasound Interpretive Performance and Agreement in BI-RADS Ultrasound Feature Analysis
- 1 July 2012
- journal article
- Published by American Roentgen Ray Society in American Journal of Roentgenology
- Vol. 199 (1), 224-235
- https://doi.org/10.2214/ajr.11.7324
Abstract
OBJECTIVE. Qualification tasks in mammography and breast ultrasound were developed for the American College of Radiology Imaging Network (ACRIN) 6666 Investigators. We sought to assess the effects of feedback on breast ultrasound interpretive performance and agreement in BI-RADS feature analysis among a subset of these experienced observers. MATERIALS AND METHODS. After a 1-hour didactic session on BI-RADS: Ultrasound, an interpretive skills quiz set of 70 orthogonal sets of breast ultrasound images including 25 (36%) malignancies was presented to 100 experienced breast imaging observers. Thirty-five observers reviewed the quiz set twice: first without and then with immediate feedback of consensus feature analysis, management recommendations, and pathologic truth. Observer performance (sensitivity, specificity, area under the curve [AUC]) was calculated without feedback and with feedback. Kappas were determined for agreement on feature analysis and assessments. RESULTS. For 35 observers without feedback, the mean sensitivity was 89% (range, 68–100%); specificity, 62% (range, 42–82%); and AUC, 82% (range, 73–89%). With feedback, the mean sensitivity was 93% (range, 80–100%; mean increase, 4%; range of increase, 0–12%; p < 0.0001), the mean specificity was 61% (range, 45–73%; mean decrease, 1%; range of change, –18% to 11%; p = 0.19), and the mean AUC was 84% (range, 78–90%; mean increase, 2%; range of change, –3% to 9%; p < 0.0001). Three breast imagers in the lowest quartile of initial performance showed the greatest improvement in sensitivity with no change or improvement in AUC. The kappa values for feature analysis did not change, but there was improved agreement about final assessments, with the kappa value increasing from 0.53 (SE, 0.02) without feedback to 0.59 (SE, 0.02) with feedback (p < 0.0001). CONCLUSION. Most experienced breast imagers showed excellent breast ultrasound interpretive skills. Immediate feedback of consensus BI-RADS: Ultrasound features and histopathologic results improved performance in ultrasound interpretation across all experience variables.Keywords
This publication has 51 references indexed in Scilit:
- Shear-wave Elastography Improves the Specificity of Breast US: The BE1 Multinational Study of 939 MassesRadiology, 2012
- Mammographic Interpretive Volume and Diagnostic Mammogram Interpretation Performance in Community PracticeRadiology, 2012
- Cystic Breast Masses and the ACRIN 6666 ExperienceRadiologic Clinics of North America, 2010
- Multicenter Study of Ultrasound Real-Time Tissue Elastography in 779 Cases for the Assessment of Breast Lesions: Improved Diagnostic Performance by Combining the BI-RADS®-US Classification System with SonoelastographyUltraschall in der Medizin - European Journal of Ultrasound, 2010
- Breast cancer detection using automated whole breast ultrasound and mammography in radiographically dense breastsEuropean Radiology, 2009
- The “Laboratory” Effect: Comparing Radiologists' Performance and Variability during Prospective Clinical and Laboratory Mammography InterpretationsRadiology, 2008
- Combined Screening With Ultrasound and Mammography vs Mammography Alone in Women at Elevated Risk of Breast CancerJAMA, 2008
- Radiologist Characteristics Associated With Interpretive Performance of Diagnostic MammographyJNCI Journal of the National Cancer Institute, 2007
- Interexamination variation of whole breast ultrasoundThe British Journal of Radiology, 2003
- Sonography of solid breast lesions: observer variability of lesion description and assessment.American Journal of Roentgenology, 1999