TY - JOUR
T1 - Linear feature selection in texture analysis - A PLS based method
AU - Marques, Joselene
AU - Igel, Christian
AU - Lillholm, Martin
AU - Dam, Erik
PY - 2013/10
Y1 - 2013/10
N2 - We present a texture analysis methodology that combines uncommitted machine-learning techniques and partial least squares (PLS) in a fully automatic framework. Our approach introduces a robust PLS-based dimensionality reduction (DR) step to specifically address outliers and high-dimensional feature sets. The texture analysis framework was applied to the diagnosis of knee osteoarthritis (OA). To classify between healthy subjects and OA patients, a generic bank of texture features was extracted from magnetic resonance images of tibial knee bone. The features were used as input to the DR algorithm, which first applied a PLS regression to rank the features and then determined the best number of features to retain in the model through an iterative learning phase. Outliers in the dataset, which could inflate the number of selected features, were eliminated in a pre-processing step. To cope with the limited number of samples, the data were evaluated using Monte Carlo cross-validation (CV). The developed DR method demonstrated consistency in selecting a relatively homogeneous set of features across the CV iterations. For each CV group, a median of 19 % of the original features was selected, and across all CV groups, the method selected 36 % of the original features available. The diagnosis evaluation reached a generalization area under the ROC curve of 0.92, which was higher than that of established cartilage-based markers known to relate to OA diagnosis.
AB - We present a texture analysis methodology that combines uncommitted machine-learning techniques and partial least squares (PLS) in a fully automatic framework. Our approach introduces a robust PLS-based dimensionality reduction (DR) step to specifically address outliers and high-dimensional feature sets. The texture analysis framework was applied to the diagnosis of knee osteoarthritis (OA). To classify between healthy subjects and OA patients, a generic bank of texture features was extracted from magnetic resonance images of tibial knee bone. The features were used as input to the DR algorithm, which first applied a PLS regression to rank the features and then determined the best number of features to retain in the model through an iterative learning phase. Outliers in the dataset, which could inflate the number of selected features, were eliminated in a pre-processing step. To cope with the limited number of samples, the data were evaluated using Monte Carlo cross-validation (CV). The developed DR method demonstrated consistency in selecting a relatively homogeneous set of features across the CV iterations. For each CV group, a median of 19 % of the original features was selected, and across all CV groups, the method selected 36 % of the original features available. The diagnosis evaluation reached a generalization area under the ROC curve of 0.92, which was higher than that of established cartilage-based markers known to relate to OA diagnosis.
U2 - 10.1007/s00138-012-0461-1
DO - 10.1007/s00138-012-0461-1
M3 - Journal article
SN - 0932-8092
VL - 24
SP - 1435
EP - 1444
JO - Machine Vision and Applications
JF - Machine Vision and Applications
IS - 7
ER -