TY - GEN
T1 - Studying the relevance of breast imaging features
AU - Ferreira, Pedro
AU - Dutra, Inés
AU - Fonseca, Nuno A.
AU - Woods, Ryan
AU - Burnside, Elizabeth
PY - 2011
Y1 - 2011
N2 - Breast screening is the regular examination of a woman's breasts to find breast cancer in an initial stage. The sole exam approved for this purpose is mammography that, despite the existence of more advanced technologies, is considered the cheapest and most efficient method to detect cancer in a preclinical stage. We investigate, using machine learning techniques, how attributes obtained from mammographies can relate to malignancy. In particular, this study focus is on how mass density can influence malignancy from a data set of 348 patients containing, among other information, results of biopsies. To this end, we applied different learning algorithms on the data set using the WEKA tools, and performed significance tests on the results. The conclusions are threefold: (1) automatic classification of a mammography can reach equal or better results than the ones annotated by specialists, which can help doctors to quickly concentrate on some specific mammogram for a more thorough study; (2) mass density seems to be a good indicator of malignancy, as previous studies suggested; (3) we can obtain classifiers that can predict mass density with a quality as good as the specialist blind to biopsy.
AB - Breast screening is the regular examination of a woman's breasts to find breast cancer in an initial stage. The sole exam approved for this purpose is mammography that, despite the existence of more advanced technologies, is considered the cheapest and most efficient method to detect cancer in a preclinical stage. We investigate, using machine learning techniques, how attributes obtained from mammographies can relate to malignancy. In particular, this study focus is on how mass density can influence malignancy from a data set of 348 patients containing, among other information, results of biopsies. To this end, we applied different learning algorithms on the data set using the WEKA tools, and performed significance tests on the results. The conclusions are threefold: (1) automatic classification of a mammography can reach equal or better results than the ones annotated by specialists, which can help doctors to quickly concentrate on some specific mammogram for a more thorough study; (2) mass density seems to be a good indicator of malignancy, as previous studies suggested; (3) we can obtain classifiers that can predict mass density with a quality as good as the specialist blind to biopsy.
KW - Breast cancer
KW - Classification methods
KW - Data mining
KW - Machine learning
KW - Mammograms
KW - Mass density
UR - http://www.scopus.com/inward/record.url?scp=79960190863&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79960190863&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:79960190863
SN - 9789898425348
T3 - HEALTHINF 2011 - Proceedings of the International Conference on Health Informatics
SP - 337
EP - 342
BT - HEALTHINF 2011 - Proceedings of the International Conference on Health Informatics
T2 - International Conference on Health Informatics, HEALTHINF 2011
Y2 - 26 January 2011 through 29 January 2011
ER -