Accelerating Airline Delay Prediction-Based P-CUDA Computing Environment

  • Dharavath Ramesh
  • Neeraj Patidar
  • Teja Vunnam
  • Gaurav Kumar
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 705)

Abstract

Machine learning techniques have enabled machines to achieve human-like thinking and learning abilities. The sudden surge in the rate of data production has enabled enormous research opportunities in the field of machine learning to introduce new and improved techniques that deal with the challenging tasks of higher level. However, this rise in size of data quality has introduced a new challenge in this field, regarding the processing of such huge chunks of the dataset in limited available time. To deal such problems, in this paper, we present a parallel method of solving and interpreting the ML problems to achieve the required efficiency in the available time period. To solve this problem, we use CUDA, a GPU-based approach, to modify and accelerate the training and testing phases of machine learning problems. We also emphasize to demonstrate the efficiency achieved via predicting airline delay through both the sequential as well as CUDA-based parallel approach. Experimental results show that the proposed parallel CUDA approach outperforms in terms of its execution time.

Keywords

Machine learning (ML) Naïve Bayes GPU CUDA Tree reduction 

Notes

Acknowledgements

This work is partially supported by Indian Institute of Technology (ISM), Government of India. The authors wish to express their gratitude and thanks to the Department of Computer Science and Engineering, Indian Institute of Technology (ISM), Dhanbad, India, for providing their support in arranging necessary computing facilities.

References

  1. 1.
    Liao, S.H., Chu, P.H., Hsiao, P.Y.: Data mining techniques and application—a decade review from 2000 to 2011. Expert Syst. Appl. 39(12), 11303–11311 (2012)CrossRefGoogle Scholar
  2. 2.
    Carbonell, J.G., Michalski, R.S., Mitchell, T.M.: An overview of machine learning. In: Machine Learning, pp. 3–23. Springer, Berlin, Heidelberg (1983)Google Scholar
  3. 3.
    Che, S., Boyer, M., Meng, J., Tarjan, D., Sheaffer, J.W., Skadron, K.: A performance study of general-purpose applications on graphics processors using CUDA. J. Parallel Distrib. Comput. 68(10), 1370–1380 (2008)CrossRefGoogle Scholar
  4. 4.
    Wu, R., Zhang, B., Hsu, M.: GPU-accelerated large scale analytics. IACM UCHPC (2009)Google Scholar
  5. 5.
    Ghorpade, J., Parande, J., Kulkarni, M., Bawaskar, A.: GPGPU processing in CUDA architecture (2012). arXiv:1202.4347MathSciNetGoogle Scholar
  6. 6.
    Farber, R.: CUDA Application Design and Development. Elsevier (2011)Google Scholar
  7. 7.
    Yang, C.T., Huang, C.L., Lin, C.F.: Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters. Comput. Phys. Commun. 182(1), 266–269 (2011)CrossRefGoogle Scholar
  8. 8.
    Harris, M.: Optimizing CUDA. In: SC07: High Performance Computing with CUDA (2007)Google Scholar
  9. 9.
    Data for experimentation, American Statistical Association, Data Expo (2009). http://stat-computing.org/dataexpo/2009/the-data.html
  10. 10.
    Murphy, K.P.: Naive Bayes Classifiers. University of British Columbia (2006)Google Scholar
  11. 11.
    Rish, I.: An empirical study of the naive Bayes classifier. In: IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, vol. 3, No. 22, pp. 41–46. IBM, New York (Aug 2001)Google Scholar
  12. 12.
    Jia, P.T., He, H.C., Lin, W.: Decision by maximum of posterior probability average with weights: a method of multiple classifiers combination. In: Proceedings of 2005 International Conference on Machine Learning and Cybernetics, vol. 4, pp. 1949–1954. IEEE (Aug 2005)Google Scholar
  13. 13.
    Jian, L., Wang, C., Liu, Y., Liang, S., Yi, W., Shi, Y.: Parallel data mining techniques on graphics processing unit with compute unified device architecture (CUDA). J. Supercomput. 64(3), 942–967 (2013)CrossRefGoogle Scholar
  14. 14.
    Zhou, L., Wang, H., Wang, W.: Parallel implementation of classification algorithms based on cloud computing environment. TELKOMNIKA Indones. J. Electr. Eng. 10(5), 1087–1092 (2012)Google Scholar
  15. 15.
    Fang, W., Lau, K.K., Lu, M., Xiao, X., Lam, C.K., Yang, P.Y., Yang, K. et al.: Parallel data mining on graphics processors. In: Technical Report HKUST-CS08-07. Hong Kong University Science and Technology, Hong Kong, China (2008)Google Scholar
  16. 16.
    Chengpeng, Y., Zhanchun, G., Yanjun, J.A.: GPU-based Native Bayesian algorithm for document classification. http://www.paper.edu.cn/lwzx/en_releasepaper/content/4570429. Accessed 26 Nov 2013
  17. 17.
    Viegas, F., Andrade, G., Almeida, J., Ferreira, R., Gonçalves, M., Ramos, G., Rocha, L.: GPU-NB: a fast CUDA-based implementation of naive bayes. In: 2013 25th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), pp. 168–175. IEEE (Oct 2013)Google Scholar
  18. 18.
    Zhou, L., Yu, Z., Lin, J., Zhu, S., Shi, W., Zhou, H., Zeng, X. et al.: Acceleration of Naive-Bayes algorithm on multicore processor for massive text classification. In: 2014 14th International Symposium on Integrated Circuits (ISIC), pp. 344–347. IEEE (Dec 2014)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  • Dharavath Ramesh
    • 1
  • Neeraj Patidar
    • 1
  • Teja Vunnam
    • 1
  • Gaurav Kumar
    • 1
  1. 1.Department of Computer Science and EngineeringIndian Institute of Technology (ISM)DhanbadIndia

Personalised recommendations