Abstract
The growing demand for an efficient approach to classify time series data is bringing forth numerous research efforts in data mining field. Popularly known applications like business, medical and meteorology and so on, typically involves majority of data type in the form of time series. Hence, it is crucial to identify and scope out the potential of time series data owing to its importance on understanding the past trend as well as predicting about what would occur in future. To efficiently analyze the time series data, a system design based on Sliding Window Technique-Improved Association Rule Mining (SWT-IARM) with Enhanced Support Vector Machine (ESVM) has been largely adopted in the recent past. However, it does not provide a high accuracy for larger size of the dataset along with huge number of attributes. To solve this problem the proposed system designed a Sequence Mining algorithm-based Support Vector Machine with Decision Tree algorithm (SM-SVM with DT) for efficient time series analysis. In this proposed work, the larger size of the dataset is considered along with huge number of attributes. The preprocessing is performed using Kalman filtering. The hybrid segmentation method is proposed by combining a clustering technique and Particle Swarm Optimization (PSO) algorithm. Based on the sequence mining algorithm, the rule discovery is performed to reduce the computational complexity prominently by extracting the most frequent and important rules. In order to provide better time series classification results, the Support Vector Machine with Decision Tree (SVM-DT) method is utilized. Finally, the Pattern matching-based modified Spearmen’s rank correlation coefficient technique is introduced to provide more similarity and classification results for the given larger time series dataset accurately. The experimental results shows that the proposed system achieves better accuracy, time complexity and rule discovery compared with the existing system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Wei, W. W. (2006) Time series analysis. In The Oxford handbook of quantitative methods in psychology
Lu, C. J., Lee, T. S., & Chiu, C. C. (2009). Financial time series forecasting using independent component analysis and support vector regression. Decision Support Systems, 47(2), 115–125.
Fu, T. C. (2011). A review on time series data mining. Engineering Applications of Artificial Intelligence, 24(1), 164–181.
Esling, P., & Agon, C. (2012). Time-series data mining. ACM Computing Surveys (CSUR), 45(1), 1–32.
Senthil, D., & Suseendran, G. (2018). Efficient time series data classification using sliding window technique based improved association rule mining with enhanced support vector machine. Submitted to International Journal of Engineering & Technology, 7(2), 218–223.
Povinelli, R. J., Johnson, M. T., Lindgren, A. C., & Ye, J. (2004). Time series classification using Gaussian mixture models of reconstructed phase spaces. IEEE Transactions on Knowledge and Data Engineering, 16(6), 779–783.
Keogh, E., Chu, S., Hart, D., & Pazzani, M. (2001). An online algorithm for segmenting time series. In Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference, pp. 289–296.
Schmidt, R., & Gierl, L. (2005). A prognostic model for temporal courses that combines temporal abstraction and case–based reasoning. International Journal of Medical Informatics, 74(2–4), 307–315.
Ghalwash, M. F., & Obradovic, Z. (2012). Early classification of multivariate temporal observations by extraction of interpretable shapelets. BMC Bioinformatics, 13(1), 1–12.
Thiyagaraj, M., & Suseendran, G. (2017) Review of chronic kidney disease based on data mining proceedings of the 11th INDIACom; INDIACom–2017; IEEE Conference ID: 40353 2017 4th International Conference on “Computing for Sustainable Global Development”, 01st–03rd March, 2017 Bharati Vidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA), pp. 2873–2878.
Thiyagaraj, M., & Suseendran, G. (2017). Survey on heart disease prediction system based on data mining techniques. Indian Journal of Innovations and Developments, 6(1), pp. 1–9.
Rohini, K., & Suseendran, G. (2016). Aggregated K means clustering and decision tree algorithm for spirometry data. Indian Journal of Science and Technology, 9(44), 1–6.
Thiyagaraj, M., & Suseendran, G. (2018). An efficient heart disease prediction system using modified firefly algorithm based radial basis function with support vector machine. International Journal of Engineering & Technology, 7(2), 1040–1045.
Senthil, D., & Suseendran, G. (2017). Data mining techniques using time series analysis. In Proceedings of the 11th INDIACom; INDIACom–2017; IEEE Conference ID: 40353 2017 4th International Conference on “Computing for Sustainable Global Development”, 01st–03rd March, 2017 Bharati Vidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA), pp. 2864–2872.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Senthil, D., Suseendran, G. (2020). Sequence Mining-Based Support Vector Machine with Decision Tree Approach for Efficient Time Series Data Classification. In: Sharma, N., Chakrabarti, A., Balas, V. (eds) Data Management, Analytics and Innovation. Advances in Intelligent Systems and Computing, vol 1016. Springer, Singapore. https://doi.org/10.1007/978-981-13-9364-8_1
Download citation
DOI: https://doi.org/10.1007/978-981-13-9364-8_1
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9363-1
Online ISBN: 978-981-13-9364-8
eBook Packages: EngineeringEngineering (R0)