Skip to main content

Training Multiscale-CNN for Large Microscopy Image Classification in One Hour

  • Conference paper
  • First Online:
High Performance Computing (ISC High Performance 2019)

Abstract

Existing approaches to train neural networks that use large images require to either crop or down-sample data during pre-processing, use small batch sizes, or split the model across devices mainly due to the prohibitively limited memory capacity available on GPUs and emerging accelerators. These techniques often lead to longer time to convergence or time to train (TTT), and in some cases, lower model accuracy. CPUs, on the other hand, can leverage significant amounts of memory. While much work has been done on parallelizing neural network training on multiple CPUs, little attention has been given to tune neural network training with large images on CPUs. In this work, we train a multi-scale convolutional neural network (M-CNN) to classify large biomedical images for high content screening in one hour. The ability to leverage large memory capacity on CPUs enables us to scale to larger batch sizes without having to crop or down-sample the input images. In conjunction with large batch sizes, we find a generalized methodology of linearly scaling of learning rate and train M-CNN to state-of-the-art (SOTA) accuracy of 99% within one hour. We achieve fast time to convergence using 128 two socket Intel\(\circledR \) Xeon\(\circledR \) 6148 processor nodes with 192 GB DDR4 memory connected with 100 Gbps Intel\(\circledR \) Omnipath architecture.

K. Datta and I. Hossain—These authors have made equal contributions to the paper.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Arbabshirani, M.R., et al.: Advanced machine learning in action: identification of intracranial hemorrhage on computed tomography scans of the head with clinical workflow integration. npj Digit. Med. 1, 9 (2018)

    Article  Google Scholar 

  2. Akkus, Z., Galimzianova, A., Hoogi, A., Rubin, D.L., Erickson, B.J.: Deep learning for brain MRI segmentation: state of the art and future directions. J. Digit. Imaging 30, 449–459 (2017)

    Article  Google Scholar 

  3. Cireşan, D.C., Giusti, A., Gambardella, L.M., Schmidhuber, J.: Mitosis detection in breast cancer histology images with deep neural networks. In: Mori, K., Sakuma, I., Sato, Y., Barillot, C., Navab, N. (eds.) MICCAI 2013. LNCS, vol. 8150, pp. 411–418. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-40763-5_51

    Chapter  Google Scholar 

  4. Litjens, G., et al.: Deep learning as a tool for increased accuracy and efficiency of histopathological diagnosis. Sci. Rep. 6, 26286 (2016)

    Article  Google Scholar 

  5. Janowczyk, A., Madabhushi, A.: Deep learning for digital pathology image analysis: a comprehensive tutorial with selected use cases. J. Pathol. Inform. 7, 29 (2016). https://doi.org/10.4103/2153-3539

  6. Kraus, O.Z., et al.: Automated analysis of high-content microscopy data with deep learning. Mol. Syst. Biol. 13(4), 924 (2017). https://doi.org/10.15252/msb.20177551

  7. Sommer, C., Hoefler, R., Samwer, M., Gerlich, D.W., Boone, C.: A deep learning and novelty detection framework for rapid phenotyping in high-content screening. Mol. Biol. Cell 28(23), 3428–3436 (2017)

    Article  Google Scholar 

  8. Ciresan, D.C., Giusti, A., Gambardella, L.M., Schmidhuber, J.: Deep neural networks segment neuronal membranes in electron microscopy images. In: Bartlett, P.L., Pereira, F.C.N., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (eds.) NIPS, pp. 2852–2860 (2012)

    Google Scholar 

  9. Litjens, G., et al.: A survey on deep learning in medical image analysis. Med. Image Anal. 42, 60–88 (2017)

    Article  Google Scholar 

  10. Usaj, M.M., Styles, E.B., Verster, A.J., Friesen, H., Boone, C., Andrews, B.J.: High-content screening for quantitative cell biology. Trends Cell Biol. 26(8), 598–611 (2016)

    Article  Google Scholar 

  11. Boutros, M., Heigwer, F., Laufer, C.: Microscopy-based high-content screening. Cell 163(6), 1314–1325 (2015)

    Article  Google Scholar 

  12. Singh, S., Carpenter, A.E., Genovesio, A.: Increasing the content of high-content screening: an overview. J. Biomol. Screen. 19, 640–650 (2014)

    Article  Google Scholar 

  13. Scheeder, C., Heigwer, F., Boutros, M.: Machine learning and image-based profiling in drug discovery. Curr. Opin. Syst. Biol. 10, 43–52 (2018). Pharmacology and drug discovery

    Article  Google Scholar 

  14. Zock, J.M.: Applications of high content screening in life science research. Combin. Chem. High Throughput Screen. 12(9), 870–876 (2009)

    Article  Google Scholar 

  15. Buchser, W., et al.: Assay development guidelines for image-based high content screening, high content analysis and high content imaging. Eli Lilly & Company and the National Center for Advancing Translational Sciences (2014)

    Google Scholar 

  16. Godinez, W.J., Hossain, I., Lazic, S.E., Davies, J.W., Zhang, X.: A multi-scale convolutional neural network for phenotyping high-content cellular images. Bioinformatics 33(13), 2010–2019 (2017)

    Article  Google Scholar 

  17. Godinez, W.J., Hossain, I., Zhang, X.: Unsupervised phenotypic analysis of cellular images with multi-scale convolutional neural networks. bioRxiv (2018). https://www.biorxiv.org/content/early/2018/07/03/361410

  18. Ando, D.M., McLean, C., Berndl, M.: Improving phenotypic measurements in high-content imaging screens. bioRxiv (2017). https://www.biorxiv.org/content/early/2017/07/10/161422

  19. Buyssens, P., Elmoataz, A., Lézoray, O.: Multiscale convolutional neural networks for vision–based classification of cells. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds.) ACCV 2012. LNCS, vol. 7725, pp. 342–352. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-37444-9_27

    Chapter  Google Scholar 

  20. Jouppi, N.P., et al.: In-datacenter performance analysis of a tensor processing unit. In: 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA), pp. 1–12. IEEE (2017)

    Google Scholar 

  21. Robbins, H., Monro, S.: A stochastic approximation method. Ann. Math. Stat. 22, 400–407 (1951)

    Article  MathSciNet  Google Scholar 

  22. You, Y., Zhang, Z., Hsieh, C.-J., Demmel, J., Keutzer, K.: ImageNet training in minutes. In: Proceedings of the 47th International Conference on Parallel Processing, p. 1. ACM (2018)

    Google Scholar 

  23. Ljosa, V., Sokolnicki, K.L., Carpenter, A.E.: Annotated high-throughput microscopy image sets for validation. Nat. Methods 9(7), 637 (2012)

    Article  Google Scholar 

  24. Caie, P.D., et al.: High-content phenotypic profiling of drug response signatures across distinct cancer cells. Mol. Cancer Ther. 9(6), 1913–1926 (2010)

    Google Scholar 

  25. Google. TPU benchmarks. https://github.com/tensorflow/tpu.git

  26. Sergeev, A., Del Balso, M.: Horovod: fast and easy distributed deep learning in TensorFlow. arXiv preprint arXiv:1802.05799 (2018)

  27. Saletore, V., Karkada, D., Sripathi, V., Sankaranarayanan, A., Datta, K.: Boosting deep learning training and inference performance on Intel Xeon and Intel Xeon Phi processors. https://software.intel.com/en-us/articles/boosting-deep-learning-training-inference-performance-on-xeon-and-xeon-phi

Download references

Acknowledgements

We would like to acknowledge Wolfgang Zipfel from the Novartis Institutes for Biomedical Research, Basel, Switzerland; Michael Derby, Michael Steeves and Steve Litster from the Novartis Institutes for Biomedical Research, Cambridge, MA, USA; Deepthi Karkada, Vivek Menon, Kristina Kermanshahche, Mike Demshki, Patrick Messmer, Andy Bartley, Bruno Riva and Hema Chamraj from Intel Corporation, USA, for their contributions to this work. The authors also acknowledge the Texas Advanced Computing Center (TACC) at The University of Texas at Austin for providing HPC resources that have contributed to the research results reported within this paper.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Kushal Datta or Imtiaz Hossain .

Editor information

Editors and Affiliations

Ethics declarations

Intel® Xeon® Gold 6148 processor, Intel® OPA and Intel® SSD storage drive are registered products of Intel Corporation. The authors declare no other conflicts of interest.

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Datta, K. et al. (2019). Training Multiscale-CNN for Large Microscopy Image Classification in One Hour. In: Weiland, M., Juckeland, G., Alam, S., Jagode, H. (eds) High Performance Computing. ISC High Performance 2019. Lecture Notes in Computer Science(), vol 11887. Springer, Cham. https://doi.org/10.1007/978-3-030-34356-9_35

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-34356-9_35

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-34355-2

  • Online ISBN: 978-3-030-34356-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics