Skip to main content

Sequence Mining-Based Support Vector Machine with Decision Tree Approach for Efficient Time Series Data Classification

  • Conference paper
  • First Online:
Data Management, Analytics and Innovation

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1016))

  • 1322 Accesses

Abstract

The growing demand for an efficient approach to classify time series data is bringing forth numerous research efforts in data mining field. Popularly known applications like business, medical and meteorology and so on, typically involves majority of data type in the form of time series. Hence, it is crucial to identify and scope out the potential of time series data owing to its importance on understanding the past trend as well as predicting about what would occur in future. To efficiently analyze the time series data, a system design based on Sliding Window Technique-Improved Association Rule Mining (SWT-IARM) with Enhanced Support Vector Machine (ESVM) has been largely adopted in the recent past. However, it does not provide a high accuracy for larger size of the dataset along with huge number of attributes. To solve this problem the proposed system designed a Sequence Mining algorithm-based Support Vector Machine with Decision Tree algorithm (SM-SVM with DT) for efficient time series analysis. In this proposed work, the larger size of the dataset is considered along with huge number of attributes. The preprocessing is performed using Kalman filtering. The hybrid segmentation method is proposed by combining a clustering technique and Particle Swarm Optimization (PSO) algorithm. Based on the sequence mining algorithm, the rule discovery is performed to reduce the computational complexity prominently by extracting the most frequent and important rules. In order to provide better time series classification results, the Support Vector Machine with Decision Tree (SVM-DT) method is utilized. Finally, the Pattern matching-based modified Spearmen’s rank correlation coefficient technique is introduced to provide more similarity and classification results for the given larger time series dataset accurately. The experimental results shows that the proposed system achieves better accuracy, time complexity and rule discovery compared with the existing system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Wei, W. W. (2006) Time series analysis. In The Oxford handbook of quantitative methods in psychology

    Google Scholar 

  2. Lu, C. J., Lee, T. S., & Chiu, C. C. (2009). Financial time series forecasting using independent component analysis and support vector regression. Decision Support Systems, 47(2), 115–125.

    Article  Google Scholar 

  3. Fu, T. C. (2011). A review on time series data mining. Engineering Applications of Artificial Intelligence, 24(1), 164–181.

    Article  Google Scholar 

  4. Esling, P., & Agon, C. (2012). Time-series data mining. ACM Computing Surveys (CSUR), 45(1), 1–32.

    Article  Google Scholar 

  5. Senthil, D., & Suseendran, G. (2018). Efficient time series data classification using sliding window technique based improved association rule mining with enhanced support vector machine. Submitted to International Journal of Engineering & Technology, 7(2), 218–223.

    Google Scholar 

  6. Povinelli, R. J., Johnson, M. T., Lindgren, A. C., & Ye, J. (2004). Time series classification using Gaussian mixture models of reconstructed phase spaces. IEEE Transactions on Knowledge and Data Engineering, 16(6), 779–783.

    Article  Google Scholar 

  7. Keogh, E., Chu, S., Hart, D., & Pazzani, M. (2001). An online algorithm for segmenting time series. In Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference, pp. 289–296.

    Google Scholar 

  8. Schmidt, R., & Gierl, L. (2005). A prognostic model for temporal courses that combines temporal abstraction and case–based reasoning. International Journal of Medical Informatics, 74(2–4), 307–315.

    Article  Google Scholar 

  9. Ghalwash, M. F., & Obradovic, Z. (2012). Early classification of multivariate temporal observations by extraction of interpretable shapelets. BMC Bioinformatics, 13(1), 1–12.

    Article  Google Scholar 

  10. Thiyagaraj, M., & Suseendran, G. (2017) Review of chronic kidney disease based on data mining proceedings of the 11th INDIACom; INDIACom–2017; IEEE Conference ID: 40353 2017 4th International Conference on “Computing for Sustainable Global Development”, 01st–03rd March, 2017 Bharati Vidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA), pp. 2873–2878.

    Google Scholar 

  11. Thiyagaraj, M., & Suseendran, G. (2017). Survey on heart disease prediction system based on data mining techniques. Indian Journal of Innovations and Developments, 6(1), pp. 1–9.

    Google Scholar 

  12. Rohini, K., & Suseendran, G. (2016). Aggregated K means clustering and decision tree algorithm for spirometry data. Indian Journal of Science and Technology, 9(44), 1–6.

    Article  Google Scholar 

  13. Thiyagaraj, M., & Suseendran, G. (2018). An efficient heart disease prediction system using modified firefly algorithm based radial basis function with support vector machine. International Journal of Engineering & Technology, 7(2), 1040–1045.

    Google Scholar 

  14. Senthil, D., & Suseendran, G. (2017). Data mining techniques using time series analysis. In Proceedings of the 11th INDIACom; INDIACom–2017; IEEE Conference ID: 40353 2017 4th International Conference on “Computing for Sustainable Global Development”, 01st–03rd March, 2017 Bharati Vidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA), pp. 2864–2872.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to D. Senthil .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Senthil, D., Suseendran, G. (2020). Sequence Mining-Based Support Vector Machine with Decision Tree Approach for Efficient Time Series Data Classification. In: Sharma, N., Chakrabarti, A., Balas, V. (eds) Data Management, Analytics and Innovation. Advances in Intelligent Systems and Computing, vol 1016. Springer, Singapore. https://doi.org/10.1007/978-981-13-9364-8_1

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-9364-8_1

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-9363-1

  • Online ISBN: 978-981-13-9364-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics