TY - JOUR
T1 - Control-group feature normalization for multivariate pattern analysis of structural MRI data using the support vector machine
AU - Linn, Kristin A.
AU - Gaonkar, Bilwaj
AU - Satterthwaite, Theodore D.
AU - Doshi, Jimit
AU - Davatzikos, Christos
AU - Shinohara, Russell T.
N1 - Funding Information:
The authors would like to acknowledge funding by NIH grants R01 NS085211 , R01 AG014971 , R01 MH107703 , and K23 MH098130 , as well as a seed grant from the Center for Biomedical Image Computing and Analytics at the University of Pennsylvania. This work represents the opinions of the researchers and not necessarily that of the granting institutions.
Publisher Copyright:
© 2016 Elsevier Inc.
PY - 2016/5/15
Y1 - 2016/5/15
N2 - Normalization of feature vector values is a common practice in machine learning. Generally, each feature value is standardized to the unit hypercube or by normalizing to zero mean and unit variance. Classification decisions based on support vector machines (SVMs) or by other methods are sensitive to the specific normalization used on the features. In the context of multivariate pattern analysis using neuroimaging data, standardization effectively up- and down-weights features based on their individual variability. Since the standard approach uses the entire data set to guide the normalization, it utilizes the total variability of these features. This total variation is inevitably dependent on the amount of marginal separation between groups. Thus, such a normalization may attenuate the separability of the data in high dimensional space. In this work we propose an alternate approach that uses an estimate of the control-group standard deviation to normalize features before training. We study our proposed approach in the context of group classification using structural MRI data. We show that control-based normalization leads to better reproducibility of estimated multivariate disease patterns and improves the classifier performance in many cases.
AB - Normalization of feature vector values is a common practice in machine learning. Generally, each feature value is standardized to the unit hypercube or by normalizing to zero mean and unit variance. Classification decisions based on support vector machines (SVMs) or by other methods are sensitive to the specific normalization used on the features. In the context of multivariate pattern analysis using neuroimaging data, standardization effectively up- and down-weights features based on their individual variability. Since the standard approach uses the entire data set to guide the normalization, it utilizes the total variability of these features. This total variation is inevitably dependent on the amount of marginal separation between groups. Thus, such a normalization may attenuate the separability of the data in high dimensional space. In this work we propose an alternate approach that uses an estimate of the control-group standard deviation to normalize features before training. We study our proposed approach in the context of group classification using structural MRI data. We show that control-based normalization leads to better reproducibility of estimated multivariate disease patterns and improves the classifier performance in many cases.
KW - Feature normalization
KW - Multivariate pattern analysis
KW - Structural MRI
KW - Support vector machine
UR - http://www.scopus.com/inward/record.url?scp=84959316755&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84959316755&partnerID=8YFLogxK
U2 - 10.1016/j.neuroimage.2016.02.044
DO - 10.1016/j.neuroimage.2016.02.044
M3 - Article
C2 - 26915498
AN - SCOPUS:84959316755
SN - 1053-8119
VL - 132
SP - 157
EP - 166
JO - NeuroImage
JF - NeuroImage
ER -