Quantitative evaluation of automated skull‐stripping methods applied to contemporary and legacy images: Effects of diagnosis, bias correction, and slice location

28 June 2005

journal article
research article
Published by Wiley in Human Brain Mapping

Vol. 27 (2), 99-113
https://doi.org/10.1002/hbm.20161

Abstract

Performance of automated methods to isolate brain from nonbrain tissues in magnetic resonance (MR) structural images may be influenced by MR signal inhomogeneities, type of MR image set, regional anatomy, and age and diagnosis of subjects studied. The present study compared the performance of four methods: Brain Extraction Tool (BET; Smith [2002]: Hum Brain Mapp 17:143-155); 3dIntracranial (Ward [1999] Milwaukee: Biophysics Research Institute, Medical College of Wisconsin; in AFNI); a Hybrid Watershed algorithm (HWA, Segonne et al. [2004] Neuroimage 22:1060-1075; in FreeSurfer); and Brain Surface Extractor (BSE, Sandor and Leahy [1997] IEEE Trans Med Imag 16:41-54; Shattuck et al. [2001] Neuroimage 13:856-876) to manually stripped images. The methods were applied to uncorrected and bias-corrected datasets; Legacy and Contemporary T1-weighted image sets; and four diagnostic groups (depressed, Alzheimer's, young and elderly control). To provide a criterion for outcome assessment, two experts manually stripped six sagittal sections for each dataset in locations where brain and nonbrain tissue are difficult to distinguish. Methods were compared on Jaccard similarity coefficients, Hausdorff distances, and an Expectation-Maximization algorithm. Methods tended to perform better on contemporary datasets; bias correction did not significantly improve method performance. Mesial sections were most difficult for all methods. Although AD image sets were most difficult to strip, HWA and BSE were more robust across diagnostic groups compared with 3dIntracranial and BET. With respect to specificity, BSE tended to perform best across all groups, whereas HWA was more sensitive than other methods. The results of this study may direct users towards a method appropriate to their T1-weighted datasets and improve the efficiency of processing for large, multisite neuroimaging studies.

Keywords

This publication has 21 references indexed in Scilit:

Simultaneous Truth and Performance Level Estimation (STAPLE): An Algorithm for the Validation of Image Segmentation
IEEE Transactions on Medical Imaging, 2004
Three validation metrics for automated probabilistic image segmentation of brain tumours
Statistics in Medicine, 2004
Fast robust automated brain extraction
Human Brain Mapping, 2002
Qualitative and Quantitative Evaluation of Six Algorithms for Correcting Intensity Nonuniformity Effects
NeuroImage, 2001
Cortical Surface-Based Analysis
NeuroImage, 1999
Cortical Surface-Based Analysis
NeuroImage, 1999
Surface-based labeling of cortical anatomy using a deformable atlas
IEEE Transactions on Medical Imaging, 1997
AFNI: Software for Analysis and Visualization of Functional Magnetic Resonance Neuroimages
Computers and Biomedical Research, 1996
Comparing images using the Hausdorff distance
IEEE Transactions on Pattern Analysis and Machine Intelligence, 1993
Method for Quantification of Brain, Ventricular, and Subarachnoid CSF Volumes from MR Images
Journal of Computer Assisted Tomography, 1992

Cited by 148 articles