Comprehensive Word-Level Classification of Screening Mammography Reports Using a Neural Network Sequence Labeling Approach

Short, Ryan G.; Bralich, John; Bogaty, Dave; Befera, Nicholas T.

doi:10.1007/s10278-018-0141-4

Comprehensive Word-Level Classification of Screening Mammography Reports Using a Neural Network Sequence Labeling Approach

Published: 18 October 2018

Volume 32, pages 685–692, (2019)
Cite this article

Journal of Digital Imaging Aims and scope Submit manuscript

Ryan G. Short ORCID: orcid.org/0000-0003-3483-2895¹,
John Bralich²,
Dave Bogaty² &
…
Nicholas T. Befera¹

794 Accesses
13 Citations
2 Altmetric
Explore all metrics

Abstract

Radiology reports contain a large amount of potentially valuable unstructured data. Recently, neural networks have been employed to perform classification of radiology reports over a few classes at the document level. The success of neural networks in sequence-labeling problems such as named entity recognition and part of speech tagging suggests that they could be used to classify radiology report text with greater granularity. We employed a neural network architecture to comprehensively classify mammography report text at the word level using a sequence labeling approach. Two radiologists devised a comprehensive classification system for screening mammography reports. Each word in each report was manually categorized by a radiologist into one of 33 categories according to the classification system. Tagged words referencing the same finding were grouped into unique sets. We pre-labeled reports with a rule-based algorithm and then manually edited these annotations for 6705 screening mammography reports (25.1%, 66.8%, and 8.1% BI-RADS 0, 1, and 2, respectively). A combined convolutional and recurrent neural network model was used to label words in each sentence of the individual reports. A siamese recurrent neural network was then used to group findings into sets. Performance of the neural network-based method was compared to a rule-based algorithm and a conditional random field (CRF) model. Global accuracy (percentage of documents where all word tags were predicted correctly) and keyword accuracy (percentage of all words that were labeled correctly, excluding words tagged as unimportant) were calculated on an unseen 519 report test set. Two-tailed t tests were used to assess differences between algorithm performance, and p < 0.05 was used to determine statistical significance. The neural network-based approach showed significantly higher global accuracy compared to both the rule-based algorithm (88.3 vs 57.0%, p < 0.001) and the CRF model (88.3% vs. 75.8%, p < 0.001). The neural network also showed significantly higher keyword level accuracy compared to the rule-based algorithm (95.5% vs. 80.9% p < 0.001) and CRF model (95.5% vs. 76.9%, p < 0.001). We demonstrate the potential of neural networks to accurately perform word-level multilabel classification of free text radiology reports across 33 classes, thus showing the utility of a sequence labeling approach to NLP of radiology reports. We found that a neural network classifier outperforms a rule-based algorithm and a CRF classifier for comprehensive multilabel classification of free text screening mammography reports at the word level. By approaching radiology report classification as a sequence-labeling problem, we demonstrate the ability of neural networks to extract data from free text radiology reports at a level of granularity not previously reported.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automatic Fully-Contextualized Recommendation Extraction from Radiology Reports

Article 10 February 2021

Selective Information Extraction Strategies for Cancer Pathology Reports with Convolutional Neural Networks

Multi-label annotation of text reports from computed tomography of the chest, abdomen, and pelvis using deep learning

Article Open access 15 April 2022

References

Friedman PJ: Radiologic reporting: structure. AJR Am J Roentgenol 140:171–172, 1983
Article CAS PubMed Google Scholar
Bruno MA, Petscavage-Thomas JM, Mohr MJ, Bell SK, Brown SD: The “Open Letter”: Radiologists’ Reports in the Era of Patient Web Portals. J Am Coll Radiol 11:863–867, 2014
Article PubMed Google Scholar
Cai T, Giannopoulos AA, Yu S, Kelil T, Ripley B, Kumamaru KK, Rybicki FJ, Mitsouras D: Natural Language Processing Technologies in Radiology Research and Clinical Applications. Radiographics 36:176–191, 2016
Article PubMed PubMed Central Google Scholar
Pons E, Braun LMM, Hunink MGM, Kors JA: Natural Language Processing in Radiology: A Systematic Review. Radiology 279:329–343, 2016
Article PubMed Google Scholar
Hassanpour S, Bay G, Langlotz CP: Characterization of Change and Significance for Clinical Findings in Radiology Reports Through Natural Language Processing. J Digit Imaging 30:314–322, 2017
Article PubMed PubMed Central Google Scholar
Masino AJ, Grundmeier RW, Pennington JW, Germiller JA, Crenshaw, 3rd. EB: Temporal bone radiology report classification using open source machine learning and natural langue processing libraries. BMC Med Inform Decis Mak 16:65, 2016
Article PubMed PubMed Central Google Scholar
Chen P-H, Zafar H, Galperin-Aizenberg M, Cook T: Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports. J Digit Imaging 31:178–184, 2017. https://doi.org/10.1007/s10278-017-0027-x
Article PubMed Central Google Scholar
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J: Distributed representations of words and phrases and their compositionality. Adv Neural Inf Process Syst:3111–3119, 2013
Shin B, Chokshi FH, Lee T, Choi JD: Classification of radiology reports using neural attention models. 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, 2017, pp 4363–4370. https://doi.org/10.1109/ijcnn.2017.7966408 .
Chokshi F, Shin B, Lee T, Lemmon A, Necessary S, Choi J. Natural Langeuage Processing for Classification of Acute, Communicable Findings on Unstructured Head CT Reports: Comparison of Neural Network and Non-Neural Machine Learning Techniques. bioRxiv 2017:173310. https://doi.org/10.1101/173310 .
Chen MC, Ball RL, Yang L, Moradzadeh N, Chapman BE, Larson DB et al.: Deep Learning to Classify Radiology Free-Text Reports. Radiology 171115, 2017
Banerjee I, Chen MC, Lungren MP, Rubin DL: Radiology report annotation using intelligent word embeddings: Applied to multi-institutional chest CT cohort. J Biomed Inform 77:11–20, 2018
Article PubMed Google Scholar
Wang P, Qian Y, Soong FK, He L, Zhao H. Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Recurrent Neural Network. arXiv [csCL] 2015.
Dernoncourt F, Lee JY, Szolovits P. NeuroNER: an easy-to-use program for named-entity recognition based on neural networks. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2017. https://doi.org/10.18653/v1/d17-2017 .
Cornegruta S, Bakewell R, Withey S, Montana G. Modelling Radiological Language with Bidirectional Long Short-Term Memory Networks. arXiv [csCL] 2016.
Short RG, Middleton D, Befera NT, Gondalia R, Tailor TD: Patient-Centered Radiology Reporting: Using Online Crowdsourcing to Assess the Effectiveness of a Web-Based Interactive Radiology Report. J Am Coll Radiol 14:1489–1497, 2017
Article PubMed Google Scholar
Gale W, Oakden-Rayner L, Carneiro G, Bradley AP, Palmer LJ. Detecting hip fractures with radiologist-level performance using deep neural networks. arXiv [csCV] 2017.
Bird S, Klein E, Loper E. Natural Language Processing with Python. “O’Reilly Media, Inc.”; 2009.
Welcome to Python.org. Python.org n.d. https://www.python.org/ (accessed March 12, 2018).
6.2. re — Regular expression operations — Python 3.6.4 documentation n.d. https://docs.python.org/3/library/re.html#module-re (accessed March 12, 2018).
American College of Radiology. ACR BI-RADS atlas: Breast Imaging Reporting and Data System ; Mammography, Ultrasound, Magnetic Resonance Imaging, Follow-up and Outcome Monitoring, Data Dictionary. 2013.
Lafferty JD, Mc Callum A, Pereira FCN. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. Proceedings of the Eighteenth International Conference on Machine Learning, Morgan Kaufmann Publishers Inc.; 2001, p. 282–9.
Finkel JR, Grenager T, Manning C: Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2005, pp. 363–370
Google Scholar
Silfverberg M, Ruokolainen T, Linden K, Kurimo M: Others. Part-of-speech tagging using conditional random fields: Exploiting sub-label dependencies for improved accuracy, 2014
python-crfsuite — python-crfsuite 0.9.5 documentation n.d. https://python-crfsuite.readthedocs.io/en/latest/index.html (accessed March 31, 2018).
Andrew G, Gao J. Scalable training ofL1-regularized log-linear models. Proceedings of the 24th international conference on Machine learning - ICML ‘07, 2007. https://doi.org/10.1145/1273496.1273501.
Huang Z, Xu W, Yu K. Bidirectional LSTM-CRF Models for Sequence Tagging. arXiv [csCL] 2015.
Řehůřek R, Sojka P: Software Framework for Topic Modelling with Large Corpora. Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. Valletta, ELRA, 2010, pp. 45–50
Chollet F. Keras 2015.
Cho K, van Merrienboer B, Bahdanau D, Bengio Y. On the Properties of Neural Machine Translation: Encoder-Decoder Approaches. arXiv [csCL] 2014.
Mueller J, Thyagarajan A. Siamese Recurrent Architectures for Learning Sentence Similarity. AAAI 2016.
Kingma DP, Ba J. Adam: A Method for Stochastic Optimization. arXiv [csLG] 2014.
Liu F, Baldwin T, Cohn T. Capturing Long-range Contextual Dependencies with Memory-enhanced Conditional Random Fields. arXiv [csCL] 2017.

Download references

Author information

Authors and Affiliations

Department of Radiology, Duke University Medical Center, 2301 Erwin Road, Box 3808, Durham, NC, 27710, USA
Ryan G. Short & Nicholas T. Befera
Scanslated, Inc., Durham, NC, USA
John Bralich & Dave Bogaty

Authors

Ryan G. Short
View author publications
You can also search for this author in PubMed Google Scholar
John Bralich
View author publications
You can also search for this author in PubMed Google Scholar
Dave Bogaty
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas T. Befera
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ryan G. Short.

Ethics declarations

Conflicts of Interest

Ryan G. Short is co-founder and CMO of Scanslated, Inc.

John Bralich is an employee of Scanslated, Inc.

Dave Bogaty is an employee of Scanslated, Inc.

Nicholas T. Befera is co-founder and CEO of Scanslated, Inc.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Short, R.G., Bralich, J., Bogaty, D. et al. Comprehensive Word-Level Classification of Screening Mammography Reports Using a Neural Network Sequence Labeling Approach. J Digit Imaging 32, 685–692 (2019). https://doi.org/10.1007/s10278-018-0141-4

Download citation

Published: 18 October 2018
Issue Date: 15 October 2019
DOI: https://doi.org/10.1007/s10278-018-0141-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Comprehensive Word-Level Classification of Screening Mammography Reports Using a Neural Network Sequence Labeling Approach

Abstract

Access this article

Similar content being viewed by others

Automatic Fully-Contextualized Recommendation Extraction from Radiology Reports

Selective Information Extraction Strategies for Cancer Pathology Reports with Convolutional Neural Networks

Multi-label annotation of text reports from computed tomography of the chest, abdomen, and pelvis using deep learning

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of Interest

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Comprehensive Word-Level Classification of Screening Mammography Reports Using a Neural Network Sequence Labeling Approach

Abstract

Access this article

Similar content being viewed by others

Automatic Fully-Contextualized Recommendation Extraction from Radiology Reports

Selective Information Extraction Strategies for Cancer Pathology Reports with Convolutional Neural Networks

Multi-label annotation of text reports from computed tomography of the chest, abdomen, and pelvis using deep learning

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of Interest

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation