TY - GEN
T1 - Rhabdomyosarcoma Histology Classification using Ensemble of Deep Learning Networks
AU - Agarwal, Saloni
AU - Eltigani, Mohamedelfatih
AU - Abaker, Osman
AU - Zhang, Xinyi
AU - Daescu, Ovidiu
AU - Barkauskas, Donald A.
AU - Rudzinski, Erin R.
AU - Leavey, Patrick
N1 - Funding Information:
This work was supported by Children’s Cancer Fund. The material from COG was supported through NCTN Operations Center Grant U10CA180886, NCTN SDC Grant U10CA180899 and COG Biospecimen Bank Grant U24CA196173.
Publisher Copyright:
© 2020 Owner/Author.
PY - 2020/9/21
Y1 - 2020/9/21
N2 - A significant number of machine learning methods have been developed to identify major tumor types in histology images, yet much less is known about automatic classification of tumor subtypes. Rhabdomyosarcoma (RMS), the most common type of soft tissue cancer in children, has several subtypes, the most common being Embryonal, Alveolar, and Spindle Cell. Classifying RMS to the right subtype is critical, since subtypes are known to respond to different treatment protocols. Manual classification requires high expertise and is time consuming due to subtle variance in appearance of histopathology images. In this paper, we introduce and compare machine learning based architectures for automatic classification of Rhabdomyosarcoma into the three major subtypes, from whole slide images (WSI). For training purpose, we only know the class assigned to a WSI, having no manual annotations on the image, while most related work on tumor classification requires manual region or nuclei annotations on WSIs. To predict the class of a new WSI we first divide it into tiles, predict the class of each tile, then use thresholding with soft voting to convert tile level predictions to WSI level prediction. We obtain 94.87% WSI tumor subtype classification accuracy on a large and diverse test dataset. We achieve such accurate classification at 5X magnification level of WSIs, departing from related work, that uses 20X or 10X for best results. A direct advantage of our method is that both training and testing can be performed much faster computationally due to the lower image resolution.
AB - A significant number of machine learning methods have been developed to identify major tumor types in histology images, yet much less is known about automatic classification of tumor subtypes. Rhabdomyosarcoma (RMS), the most common type of soft tissue cancer in children, has several subtypes, the most common being Embryonal, Alveolar, and Spindle Cell. Classifying RMS to the right subtype is critical, since subtypes are known to respond to different treatment protocols. Manual classification requires high expertise and is time consuming due to subtle variance in appearance of histopathology images. In this paper, we introduce and compare machine learning based architectures for automatic classification of Rhabdomyosarcoma into the three major subtypes, from whole slide images (WSI). For training purpose, we only know the class assigned to a WSI, having no manual annotations on the image, while most related work on tumor classification requires manual region or nuclei annotations on WSIs. To predict the class of a new WSI we first divide it into tiles, predict the class of each tile, then use thresholding with soft voting to convert tile level predictions to WSI level prediction. We obtain 94.87% WSI tumor subtype classification accuracy on a large and diverse test dataset. We achieve such accurate classification at 5X magnification level of WSIs, departing from related work, that uses 20X or 10X for best results. A direct advantage of our method is that both training and testing can be performed much faster computationally due to the lower image resolution.
KW - Deep Learning
KW - Histology
KW - Rhabdomyosarcoma
KW - ensemble
UR - http://www.scopus.com/inward/record.url?scp=85096969103&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85096969103&partnerID=8YFLogxK
U2 - 10.1145/3388440.3412486
DO - 10.1145/3388440.3412486
M3 - Conference contribution
AN - SCOPUS:85096969103
T3 - Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2020
BT - Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2020
PB - Association for Computing Machinery, Inc
T2 - 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2020
Y2 - 21 September 2020 through 24 September 2020
ER -