Abstract
Machine learning techniques have enabled machines to achieve human-like thinking and learning abilities. The sudden surge in the rate of data production has enabled enormous research opportunities in the field of machine learning to introduce new and improved techniques that deal with the challenging tasks of higher level. However, this rise in size of data quality has introduced a new challenge in this field, regarding the processing of such huge chunks of the dataset in limited available time. To deal such problems, in this paper, we present a parallel method of solving and interpreting the ML problems to achieve the required efficiency in the available time period. To solve this problem, we use CUDA, a GPU-based approach, to modify and accelerate the training and testing phases of machine learning problems. We also emphasize to demonstrate the efficiency achieved via predicting airline delay through both the sequential as well as CUDA-based parallel approach. Experimental results show that the proposed parallel CUDA approach outperforms in terms of its execution time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Liao, S.H., Chu, P.H., Hsiao, P.Y.: Data mining techniques and application—a decade review from 2000 to 2011. Expert Syst. Appl. 39(12), 11303–11311 (2012)
Carbonell, J.G., Michalski, R.S., Mitchell, T.M.: An overview of machine learning. In: Machine Learning, pp. 3–23. Springer, Berlin, Heidelberg (1983)
Che, S., Boyer, M., Meng, J., Tarjan, D., Sheaffer, J.W., Skadron, K.: A performance study of general-purpose applications on graphics processors using CUDA. J. Parallel Distrib. Comput. 68(10), 1370–1380 (2008)
Wu, R., Zhang, B., Hsu, M.: GPU-accelerated large scale analytics. IACM UCHPC (2009)
Ghorpade, J., Parande, J., Kulkarni, M., Bawaskar, A.: GPGPU processing in CUDA architecture (2012). arXiv:1202.4347
Farber, R.: CUDA Application Design and Development. Elsevier (2011)
Yang, C.T., Huang, C.L., Lin, C.F.: Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters. Comput. Phys. Commun. 182(1), 266–269 (2011)
Harris, M.: Optimizing CUDA. In: SC07: High Performance Computing with CUDA (2007)
Data for experimentation, American Statistical Association, Data Expo (2009). http://stat-computing.org/dataexpo/2009/the-data.html
Murphy, K.P.: Naive Bayes Classifiers. University of British Columbia (2006)
Rish, I.: An empirical study of the naive Bayes classifier. In: IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, vol. 3, No. 22, pp. 41–46. IBM, New York (Aug 2001)
Jia, P.T., He, H.C., Lin, W.: Decision by maximum of posterior probability average with weights: a method of multiple classifiers combination. In: Proceedings of 2005 International Conference on Machine Learning and Cybernetics, vol. 4, pp. 1949–1954. IEEE (Aug 2005)
Jian, L., Wang, C., Liu, Y., Liang, S., Yi, W., Shi, Y.: Parallel data mining techniques on graphics processing unit with compute unified device architecture (CUDA). J. Supercomput. 64(3), 942–967 (2013)
Zhou, L., Wang, H., Wang, W.: Parallel implementation of classification algorithms based on cloud computing environment. TELKOMNIKA Indones. J. Electr. Eng. 10(5), 1087–1092 (2012)
Fang, W., Lau, K.K., Lu, M., Xiao, X., Lam, C.K., Yang, P.Y., Yang, K. et al.: Parallel data mining on graphics processors. In: Technical Report HKUST-CS08-07. Hong Kong University Science and Technology, Hong Kong, China (2008)
Chengpeng, Y., Zhanchun, G., Yanjun, J.A.: GPU-based Native Bayesian algorithm for document classification. http://www.paper.edu.cn/lwzx/en_releasepaper/content/4570429. Accessed 26 Nov 2013
Viegas, F., Andrade, G., Almeida, J., Ferreira, R., Gonçalves, M., Ramos, G., Rocha, L.: GPU-NB: a fast CUDA-based implementation of naive bayes. In: 2013 25th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), pp. 168–175. IEEE (Oct 2013)
Zhou, L., Yu, Z., Lin, J., Zhu, S., Shi, W., Zhou, H., Zeng, X. et al.: Acceleration of Naive-Bayes algorithm on multicore processor for massive text classification. In: 2014 14th International Symposium on Integrated Circuits (ISIC), pp. 344–347. IEEE (Dec 2014)
Acknowledgements
This work is partially supported by Indian Institute of Technology (ISM), Government of India. The authors wish to express their gratitude and thanks to the Department of Computer Science and Engineering, Indian Institute of Technology (ISM), Dhanbad, India, for providing their support in arranging necessary computing facilities.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Ramesh, D., Patidar, N., Vunnam, T., Kumar, G. (2018). Accelerating Airline Delay Prediction-Based P-CUDA Computing Environment. In: Reddy Edla, D., Lingras, P., Venkatanareshbabu K. (eds) Advances in Machine Learning and Data Science. Advances in Intelligent Systems and Computing, vol 705. Springer, Singapore. https://doi.org/10.1007/978-981-10-8569-7_2
Download citation
DOI: https://doi.org/10.1007/978-981-10-8569-7_2
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8568-0
Online ISBN: 978-981-10-8569-7
eBook Packages: EngineeringEngineering (R0)