Training Multiscale-CNN for Large Microscopy Image Classification in One Hour

Datta, Kushal; Hossain, Imtiaz; Choi, Sun; Saletore, Vikram; Ambert, Kyle; Godinez, William J.; Zhang, Xian

doi:10.1007/978-3-030-34356-9_35

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11887))

Included in the following conference series:

International Conference on High Performance Computing

6001 Accesses
2 Citations

Abstract

Existing approaches to train neural networks that use large images require to either crop or down-sample data during pre-processing, use small batch sizes, or split the model across devices mainly due to the prohibitively limited memory capacity available on GPUs and emerging accelerators. These techniques often lead to longer time to convergence or time to train (TTT), and in some cases, lower model accuracy. CPUs, on the other hand, can leverage significant amounts of memory. While much work has been done on parallelizing neural network training on multiple CPUs, little attention has been given to tune neural network training with large images on CPUs. In this work, we train a multi-scale convolutional neural network (M-CNN) to classify large biomedical images for high content screening in one hour. The ability to leverage large memory capacity on CPUs enables us to scale to larger batch sizes without having to crop or down-sample the input images. In conjunction with large batch sizes, we find a generalized methodology of linearly scaling of learning rate and train M-CNN to state-of-the-art (SOTA) accuracy of 99% within one hour. We achieve fast time to convergence using 128 two socket Intel\(\circledR \) Xeon\(\circledR \) 6148 processor nodes with 192 GB DDR4 memory connected with 100 Gbps Intel\(\circledR \) Omnipath architecture.

K. Datta and I. Hossain—These authors have made equal contributions to the paper.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Arbabshirani, M.R., et al.: Advanced machine learning in action: identification of intracranial hemorrhage on computed tomography scans of the head with clinical workflow integration. npj Digit. Med. 1, 9 (2018)
Article Google Scholar
Akkus, Z., Galimzianova, A., Hoogi, A., Rubin, D.L., Erickson, B.J.: Deep learning for brain MRI segmentation: state of the art and future directions. J. Digit. Imaging 30, 449–459 (2017)
Article Google Scholar
Cireşan, D.C., Giusti, A., Gambardella, L.M., Schmidhuber, J.: Mitosis detection in breast cancer histology images with deep neural networks. In: Mori, K., Sakuma, I., Sato, Y., Barillot, C., Navab, N. (eds.) MICCAI 2013. LNCS, vol. 8150, pp. 411–418. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-40763-5_51
Chapter Google Scholar
Litjens, G., et al.: Deep learning as a tool for increased accuracy and efficiency of histopathological diagnosis. Sci. Rep. 6, 26286 (2016)
Article Google Scholar
Janowczyk, A., Madabhushi, A.: Deep learning for digital pathology image analysis: a comprehensive tutorial with selected use cases. J. Pathol. Inform. 7, 29 (2016). https://doi.org/10.4103/2153-3539
Kraus, O.Z., et al.: Automated analysis of high-content microscopy data with deep learning. Mol. Syst. Biol. 13(4), 924 (2017). https://doi.org/10.15252/msb.20177551
Sommer, C., Hoefler, R., Samwer, M., Gerlich, D.W., Boone, C.: A deep learning and novelty detection framework for rapid phenotyping in high-content screening. Mol. Biol. Cell 28(23), 3428–3436 (2017)
Article Google Scholar
Ciresan, D.C., Giusti, A., Gambardella, L.M., Schmidhuber, J.: Deep neural networks segment neuronal membranes in electron microscopy images. In: Bartlett, P.L., Pereira, F.C.N., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (eds.) NIPS, pp. 2852–2860 (2012)
Google Scholar
Litjens, G., et al.: A survey on deep learning in medical image analysis. Med. Image Anal. 42, 60–88 (2017)
Article Google Scholar
Usaj, M.M., Styles, E.B., Verster, A.J., Friesen, H., Boone, C., Andrews, B.J.: High-content screening for quantitative cell biology. Trends Cell Biol. 26(8), 598–611 (2016)
Article Google Scholar
Boutros, M., Heigwer, F., Laufer, C.: Microscopy-based high-content screening. Cell 163(6), 1314–1325 (2015)
Article Google Scholar
Singh, S., Carpenter, A.E., Genovesio, A.: Increasing the content of high-content screening: an overview. J. Biomol. Screen. 19, 640–650 (2014)
Article Google Scholar
Scheeder, C., Heigwer, F., Boutros, M.: Machine learning and image-based profiling in drug discovery. Curr. Opin. Syst. Biol. 10, 43–52 (2018). Pharmacology and drug discovery
Article Google Scholar
Zock, J.M.: Applications of high content screening in life science research. Combin. Chem. High Throughput Screen. 12(9), 870–876 (2009)
Article Google Scholar
Buchser, W., et al.: Assay development guidelines for image-based high content screening, high content analysis and high content imaging. Eli Lilly & Company and the National Center for Advancing Translational Sciences (2014)
Google Scholar
Godinez, W.J., Hossain, I., Lazic, S.E., Davies, J.W., Zhang, X.: A multi-scale convolutional neural network for phenotyping high-content cellular images. Bioinformatics 33(13), 2010–2019 (2017)
Article Google Scholar
Godinez, W.J., Hossain, I., Zhang, X.: Unsupervised phenotypic analysis of cellular images with multi-scale convolutional neural networks. bioRxiv (2018). https://www.biorxiv.org/content/early/2018/07/03/361410
Ando, D.M., McLean, C., Berndl, M.: Improving phenotypic measurements in high-content imaging screens. bioRxiv (2017). https://www.biorxiv.org/content/early/2017/07/10/161422
Buyssens, P., Elmoataz, A., Lézoray, O.: Multiscale convolutional neural networks for vision–based classification of cells. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds.) ACCV 2012. LNCS, vol. 7725, pp. 342–352. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-37444-9_27
Chapter Google Scholar
Jouppi, N.P., et al.: In-datacenter performance analysis of a tensor processing unit. In: 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA), pp. 1–12. IEEE (2017)
Google Scholar
Robbins, H., Monro, S.: A stochastic approximation method. Ann. Math. Stat. 22, 400–407 (1951)
Article MathSciNet Google Scholar
You, Y., Zhang, Z., Hsieh, C.-J., Demmel, J., Keutzer, K.: ImageNet training in minutes. In: Proceedings of the 47th International Conference on Parallel Processing, p. 1. ACM (2018)
Google Scholar
Ljosa, V., Sokolnicki, K.L., Carpenter, A.E.: Annotated high-throughput microscopy image sets for validation. Nat. Methods 9(7), 637 (2012)
Article Google Scholar
Caie, P.D., et al.: High-content phenotypic profiling of drug response signatures across distinct cancer cells. Mol. Cancer Ther. 9(6), 1913–1926 (2010)
Google Scholar
Google. TPU benchmarks. https://github.com/tensorflow/tpu.git
Sergeev, A., Del Balso, M.: Horovod: fast and easy distributed deep learning in TensorFlow. arXiv preprint arXiv:1802.05799 (2018)
Saletore, V., Karkada, D., Sripathi, V., Sankaranarayanan, A., Datta, K.: Boosting deep learning training and inference performance on Intel Xeon and Intel Xeon Phi processors. https://software.intel.com/en-us/articles/boosting-deep-learning-training-inference-performance-on-xeon-and-xeon-phi

Download references

Acknowledgements

We would like to acknowledge Wolfgang Zipfel from the Novartis Institutes for Biomedical Research, Basel, Switzerland; Michael Derby, Michael Steeves and Steve Litster from the Novartis Institutes for Biomedical Research, Cambridge, MA, USA; Deepthi Karkada, Vivek Menon, Kristina Kermanshahche, Mike Demshki, Patrick Messmer, Andy Bartley, Bruno Riva and Hema Chamraj from Intel Corporation, USA, for their contributions to this work. The authors also acknowledge the Texas Advanced Computing Center (TACC) at The University of Texas at Austin for providing HPC resources that have contributed to the research results reported within this paper.

Author information

Authors and Affiliations

Artificial Intelligence Products Group, Intel Corporation, Hillsboro, OR, USA
Kushal Datta, Sun Choi, Vikram Saletore & Kyle Ambert
Novartis Institutes for Biomedical Research, Basel, Switzerland
Imtiaz Hossain & Xian Zhang
Novartis Institutes for Biomedical Research, Emeryville, CA, USA
William J. Godinez

Authors

Kushal Datta
View author publications
You can also search for this author in PubMed Google Scholar
Imtiaz Hossain
View author publications
You can also search for this author in PubMed Google Scholar
Sun Choi
View author publications
You can also search for this author in PubMed Google Scholar
Vikram Saletore
View author publications
You can also search for this author in PubMed Google Scholar
Kyle Ambert
View author publications
You can also search for this author in PubMed Google Scholar
William J. Godinez
View author publications
You can also search for this author in PubMed Google Scholar
Xian Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Kushal Datta or Imtiaz Hossain .

Editor information

Editors and Affiliations

University of Edinburgh, Edinburgh, UK
Michèle Weiland
Helmholtz-Zentrum Dresden-Rossendorf, Dresden, Sachsen, Germany
Guido Juckeland
Swiss National Supercomputing Centre, Lugano, Ticino, Switzerland
Sadaf Alam
University of Tennessee at Knoxville, Knoxville, TN, USA
Heike Jagode

Ethics declarations

Intel® Xeon® Gold 6148 processor, Intel® OPA and Intel® SSD storage drive are registered products of Intel Corporation. The authors declare no other conflicts of interest.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Datta, K. et al. (2019). Training Multiscale-CNN for Large Microscopy Image Classification in One Hour. In: Weiland, M., Juckeland, G., Alam, S., Jagode, H. (eds) High Performance Computing. ISC High Performance 2019. Lecture Notes in Computer Science(), vol 11887. Springer, Cham. https://doi.org/10.1007/978-3-030-34356-9_35

Download citation

DOI: https://doi.org/10.1007/978-3-030-34356-9_35
Published: 03 December 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-34355-2
Online ISBN: 978-3-030-34356-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics