Automated Classification of Issue Reports from a Software Issue Tracker

  • Nitish PandeyEmail author
  • Abir Hudait
  • Debarshi Kumar Sanyal
  • Amitava Sen
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 518)


Software issue trackers are used by software users and developers to submit bug reports and various other change requests and track them till they are finally closed. However, it is common for submitters to misclassify an improvement request as a bug and vice versa. Hence, it is extremely useful to have an automated classification mechanism for the submitted reports. In this paper we explore how different classifiers might perform this task. We use datasets from the open-source projects HttpClient and Lucene. We apply naïve Bayes (NB), support vector machine (SVM), logistic regression (LR) and linear discriminant analysis (LDA) separately for classification and evaluate their relative performance in terms of precision, recall, F-measure and accuracy.


Bug classification Naïve Bayes Support vector machine Precision Recall F-measure 


  1. 1.
  2. 2.
  3. 3.
  4. 4.
  5. 5.
  6. 6.
    F. Sebastiani. Machine learning in automated text categorization. ACM Computing Surveys (CSUR) 34.1:1–47, 2002.Google Scholar
  7. 7.
    D. Čubranić. Automatic bug triage using text categorization. In Proceedings of the 16th International Conference on Software Engineering & Knowledge Engineering (SEKE’2004), 2004.Google Scholar
  8. 8.
    K. Herzig, S. Just, and A. Zeller. It’s not a bug, it’s a feature: How Misclassification Impacts Bug Prediction. In Proceedings of the 35th IEEE/ACM International Conference on Software Engineering, 2013.Google Scholar
  9. 9.
    G. Antoniol, et al. Is it a bug or an enhancement? A text-based approach to classify change requests. In Proceedings of the 2008 Conference of the Center for Advanced Studies on Collaborative Research: Meeting of Minds (CASCON’2008), ACM, 2008.Google Scholar
  10. 10.
    M. Ohira, et al. A dataset of high impact bugs: manually-classified issue reports. In Proceedings of the IEEE/ACM 12th Working Conference on Mining Software Repositories (MSR’2015), 2015.Google Scholar
  11. 11.
    N. Pingclasai, H. Hata, K. Matsumoto. Classifying bug reports to bugs and other requests using topic modeling. In Proceedings of 20th Asia-Pacific Software Engineering Conference (APSEC’2013), IEEE, 2013.Google Scholar
  12. 12.
    I. Chawla, S. K. Singh. An automated approach for bug classification using fuzzy logic. In Proceedings of the 8th ACM India Software Engineering Conference (ISEC’2015), 2015.Google Scholar
  13. 13.
    L. L. Wu, B. Xie, G. E. Kaiser, R. Passonneau. BugMiner: software reliability analysis via Data Mining of Bug Reports. In Proceedings of the 23rd International Conference on Software Engineering & Knowledge Engineering (SEKE’2011), 2011.Google Scholar
  14. 14.
    Y. Zhou, Y. Tong, R. Gu, H. Gall. Combining text mining and data Mining for bug report classification. In Proceedings of 30th IEEE International Conference on Software Maintenance and Evolution (ICSME’2014), 2014.Google Scholar
  15. 15.

Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  • Nitish Pandey
    • 1
    Email author
  • Abir Hudait
    • 1
  • Debarshi Kumar Sanyal
    • 1
  • Amitava Sen
    • 2
  1. 1.School of Computer EngineeringKIIT UniversityBhubaneswarIndia
  2. 2.Dr. Sudhir Chandra Sur Degree Engineering CollegeKolkataIndia

Personalised recommendations