TY - JOUR
T1 - Robust statistical fusion of image labels
AU - Landman, Bennett A.
AU - Asman, Andrew J.
AU - Scoggins, Andrew G.
AU - Bogovic, John A.
AU - Xing, Fangxu
AU - Prince, Jerry L.
N1 - Funding Information:
Manuscript received June 16, 2011; accepted August 05, 2011. Date of publication October 14, 2011; date of current version February 03, 2012. This work was supported in part by the NIH/NINDS 1R01NS056307 and NIH/NINDS 1R21NS064534. Asterisk indicates corresponding author.
PY - 2012/2
Y1 - 2012/2
N2 - Image labeling and parcellation (i.e., assigning structure to a collection of voxels) are critical tasks for the assessment of volumetric and morphometric features in medical imaging data. The process of image labeling is inherently error prone as images are corrupted by noise and artifacts. Even expert interpretations are subject to subjectivity and the precision of the individual raters. Hence, all labels must be considered imperfect with some degree of inherent variability. One may seek multiple independent assessments to both reduce this variability and quantify the degree of uncertainty. Existing techniques have exploited maximum a posteriori statistics to combine data from multiple raters and simultaneously estimate rater reliabilities. Although quite successful, wide-scale application has been hampered by unstable estimation with practical datasets, for example, with label sets with small or thin objects to be labeled or with partial or limited datasets. As well, these approaches have required each rater to generate a complete dataset, which is often impossible given both human foibles and the typical turnover rate of raters in a research or clinical environment. Herein, we propose a robust approach to improve estimation performance with small anatomical structures, allow for missing data, account for repeated label sets, and utilize training/catch trial data. With this approach, numerous raters can label small, overlapping portions of a large dataset, and rater heterogeneity can be robustly controlled while simultaneously estimating a single, reliable label set and characterizing uncertainty. The proposed approach enables many individuals to collaborate in the construction of large datasets for labeling tasks (e.g., human parallel processing) and reduces the otherwise detrimental impact of rater unavailability
AB - Image labeling and parcellation (i.e., assigning structure to a collection of voxels) are critical tasks for the assessment of volumetric and morphometric features in medical imaging data. The process of image labeling is inherently error prone as images are corrupted by noise and artifacts. Even expert interpretations are subject to subjectivity and the precision of the individual raters. Hence, all labels must be considered imperfect with some degree of inherent variability. One may seek multiple independent assessments to both reduce this variability and quantify the degree of uncertainty. Existing techniques have exploited maximum a posteriori statistics to combine data from multiple raters and simultaneously estimate rater reliabilities. Although quite successful, wide-scale application has been hampered by unstable estimation with practical datasets, for example, with label sets with small or thin objects to be labeled or with partial or limited datasets. As well, these approaches have required each rater to generate a complete dataset, which is often impossible given both human foibles and the typical turnover rate of raters in a research or clinical environment. Herein, we propose a robust approach to improve estimation performance with small anatomical structures, allow for missing data, account for repeated label sets, and utilize training/catch trial data. With this approach, numerous raters can label small, overlapping portions of a large dataset, and rater heterogeneity can be robustly controlled while simultaneously estimating a single, reliable label set and characterizing uncertainty. The proposed approach enables many individuals to collaborate in the construction of large datasets for labeling tasks (e.g., human parallel processing) and reduces the otherwise detrimental impact of rater unavailability
KW - Data fusion
KW - delineation
KW - labeling
KW - parcellation
KW - simultaneous truth and performance level estimation (STAPLE)
KW - statistical analysis
UR - http://www.scopus.com/inward/record.url?scp=84856697459&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84856697459&partnerID=8YFLogxK
U2 - 10.1109/TMI.2011.2172215
DO - 10.1109/TMI.2011.2172215
M3 - Article
C2 - 22010145
AN - SCOPUS:84856697459
SN - 0278-0062
VL - 31
SP - 512
EP - 522
JO - IEEE transactions on medical imaging
JF - IEEE transactions on medical imaging
IS - 2
M1 - 6046134
ER -