Bag of recurrence patterns representation for time-series classification
Abstract
Time-series classification (TSC) has attracted a lot of attention in pattern recognition, because wide range of applications from different domains such as finance and health informatics deal with time-series signals. Bag-of-features (BoF) model has achieved a great success in TSC task by summarizing signals according to the frequencies of “feature words” of a data-learned dictionary. This paper proposes embedding the recurrence plots (RP), a visualization technique for analysis of dynamic systems, in the BoF model for TSC. While the traditional BoF approach extracts features from 1D signal segments, this paper uses the RP to transform time-series into 2D texture images and then applies the BoF on them. Image representation of time-series enables us to explore different visual descriptors that are not available for 1D signals and to treat TSC task as a texture recognition problem. Experimental results on the UCI time-series classification archive demonstrates a significant accuracy boost by the proposed bag of recurrence patterns, compared not only to the existing BoF models, but also to the state-of-the art algorithms.
Keywords
Time-series classification (TSC) Bag of features (BoF) representation Visual words Dictionary Recurrence plots (RP)Notes
Acknowledgements
This research is partially supported by the French national research agency (ANR) under the PANDORE Grant with reference number ANR-14-CE28-0027.
N. Hatami also acknowledges support of the LABEX PRIMES (ANR-11-LABX-0063) of Université de Lyon, within the program “Investissements d’Avenir” (ANR-11-IDEX-0007) operated by the ANR.
References
- 1.Acharya U, Sree S, Chattopadhyay S, Yu W, Ang P (2011) Application of recurrence quantification analysis for the automated identification of epileptic EEG signals. Int J Neural Syst 21(03):199–211CrossRefGoogle Scholar
- 2.Allili M (2012) Wavelet modeling using finite mixtures of generalized Gaussian distributions: application to texture discrimination and retrieval. IEEE Trans Image Process 21(4):1452–1464MathSciNetCrossRefMATHGoogle Scholar
- 3.Armano G, Chira C, Hatami N (2012) Error-correcting output codes for multi-label text categorization. In: 3rd Italian information retrieval workshop (IIR), pp 26–37Google Scholar
- 4.Bailly A, Malinowski S, Tavenard R, Guyet T, Chapel L (2015) Bag-of-temporal-sift-words for time series classification. In: ECML/PKDD workshop on advanced analytics and learning on temporal dataGoogle Scholar
- 5.Bailly A, Malinowski S, Tavenard R, Guyet T, Chapel L (2016) Dense bag-of-temporal-sift-words for time series classification. Lecture Notes in Artificial IntelligenceGoogle Scholar
- 6.Baydogan M, Runger G (2016) Time series representation and similarity based on local auto patterns. Data Min Knowl Discov 30(2):476–509MathSciNetCrossRefGoogle Scholar
- 7.Baydogan M, Runger G, Tuv E (2013) A bag-of-features framework to classify time series. IEEE Trans PAMI 35(11):2796–2802CrossRefGoogle Scholar
- 8.Bromuri S, Zufferey D, Hennebert J, Schumacher M (2014) Multi-label classification of chronically ill patients with bag of words and supervised dimensionality reduction algorithms. J Biomed Inform 51:165–175CrossRefGoogle Scholar
- 9.Chang C, Lin C (2011) Libsvm: a library for support vector machines. ACM Trans Intell Syst Technol 2(3):27CrossRefGoogle Scholar
- 10.Chen Y, Keogh E, Hu B, Begum N, Bagnall A, Mueen A, Batista G (2015) The ucr time series classification archiveGoogle Scholar
- 11.Chen Y, Yang H (2012) Multiscale recurrence analysis of long-term nonlinear and nonstationary time series. Chaos Solitons Fractals 45(7):978–987CrossRefGoogle Scholar
- 12.Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. CVPR, San DiegoCrossRefGoogle Scholar
- 13.Dong Y, Feng J, Liang L, Zheng L (2017) Multiscale Sampling Based Texture Image Classification. IEEE Signal Process Lett 24(5):614–618CrossRefGoogle Scholar
- 14.Dong Y, Tao D, Li X, Ma J, Pu J (2015) Texture classification and retrieval using shearlets and linear regression. IEEE Trans Cybern 45(3):358–369CrossRefGoogle Scholar
- 15.Eads D, Glocer K, Perkins S, Theiler J (2005) Grammar-guided feature extraction for time series classification. NIPSGoogle Scholar
- 16.Eckmann JP, Kamphorst SO, Ruelle D (1987) Recurrence plots of dynamical systems. Europhys Lett 4(9):973–977CrossRefGoogle Scholar
- 17.Fu Z, Lu G, Ting K, Zhang D (2011) Music classification via the bag-of-features approach. Pattern Recognit Lett 32(14):1768–1777CrossRefGoogle Scholar
- 18.Geurts P (2009) Pattern extraction for time series classification. In: 5th EU conference principles of data mining and knowledge discovery, pp 115–127Google Scholar
- 19.Grabocka J, Wistuba M, Schmidt-Thieme L (2015) Scalable classification of repetitive time series through frequencies of local polynomials. IEEE Trans Knowl Data Eng 27(6):1683–1695CrossRefGoogle Scholar
- 20.Hatami N, Gavet Y, Debayle J (2017) Classification of time-series images using deep convolutional neural networks. In: International conference on machine vision (ICMV)Google Scholar
- 21.Hatami N (2008) Thinned ecoc decomposition for gene expression based cancer classification. In: 8th IEEE Conference on intelligent systems design and applications, pp 216–221Google Scholar
- 22.Hatami N (2012a) Some proposals for combining ensemble classifiers. Ph.D. thesis, University of CagliariGoogle Scholar
- 23.Hatami N (2012b) Thinned-ecoc ensemble based on sequential code shrinking. Expert Syst Appl 39(1):936–947CrossRefGoogle Scholar
- 24.Hatami N, Chira C (2013) Classifiers with a reject option for early time-series classification. In: IEEE symposium on computational intelligence and ensemble learning (CIEL), pp 9–16Google Scholar
- 25.Hatami N, Ebrahimpour R, Ghaderi R (2008) Ecoc-based training of neural networks for face recognition. In: IEEE conference on cybernetics and intelligent systems, pp 450–454Google Scholar
- 26.Jeong Y, Jeong M, Omitaomu O (2011) Weighted dynamic time warping for time series classification. Pattern Recognit 44(9):2231–2240CrossRefGoogle Scholar
- 27.Keogh E, Kasetty S (2003) On the need for time series data mining benchmarks: a survey and empirical demonstration. Data Min Knowl Discov 7(4):349–371MathSciNetCrossRefGoogle Scholar
- 28.Kumar N, Lolla V, Keogh E, Lonardi S, Ratanamahatana C, Wei L (2005) Time-series bitmaps: a practical visualization tool for working with large time series databases. In SIAM international conference on data mining, pp 531–535Google Scholar
- 29.Lategahn H, Gross S, Stehle T, Aach T (2010) Texture classification by modeling joint distributions of local patterns with Gaussian mixtures. IEEE Trans image Process 19(6):1548–1557MathSciNetCrossRefMATHGoogle Scholar
- 30.Lin J, Khade R, Li Y (2012) Rotation-invariant similarity in time series using bag-of-patterns representation. J Intell Inf Syst 39(2):287–315CrossRefGoogle Scholar
- 31.Lowe D (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110CrossRefGoogle Scholar
- 32.Marwan N, Romano MC, Thiel M, Kurths J (2007) Recurrence plots for the analysis of complex systems. Phys Rep 438(5–6):237–329MathSciNetCrossRefGoogle Scholar
- 33.Marwan N, Wessel N, Meyerfeldt U, Schirdewan A, Kurths J (2002) Recurrence plot based measures of complexity and its application to heart rate variability data. Phys Rev E 66(2):026702CrossRefMATHGoogle Scholar
- 34.Marwan N, Kurths J (2009) Comment on stochastic analysis of recurrence plots with applications to the detection of deterministic signals by rohde et al.[physica d 237 (2008) 619629]</CHECK>. Physica D 238:1711–1715CrossRefMATHGoogle Scholar
- 35.Mehta R, Eguiazarian K (2016) Texture classification using dense micro-block difference. IEEE Trans Image Process 25(4):1604–1616MathSciNetCrossRefGoogle Scholar
- 36.Nanopoulos A, Alcock R, Manolopoulos Y (2001) Feature-based classification of time-series data. Int J Comput Res 10:49–61Google Scholar
- 37.Ojala T, Pietikainen M, Maenpaa T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. TPAMI 7(24):971–987CrossRefMATHGoogle Scholar
- 38.Ratanamahatana C, Keogh E (2004) Making time-series classification more accurate using learned constraints. In: Proceedings of SIAM International Conference on Data Mining, pp 11–22Google Scholar
- 39.Rodriguez J, Alonso C (2004) Interval and dynamic time warping-based decision trees. In: ACM symposium on applied computing, pp 548–552Google Scholar
- 40.Rodriguez J, Alonso C, Maestro J (2005) Support vector machines of interval-based features for time series classification. Knowl Based Syst 18:171–178CrossRefGoogle Scholar
- 41.Rohde G, Nichols J, Dissinger B, Bucholtz F (2008) Stochastic analysis of recurrence plots with applications to the detection of deterministic signals. Phys D Nonlinear Phenom 237(5):619–629MathSciNetCrossRefMATHGoogle Scholar
- 42.Roma G, Nogueira W, Herrera P, de Boronat R (2013) Recurrence quantification analysis features for auditory scene classification. In: IEEE AASP challenge on detection and classification of acoustic scenes and eventsGoogle Scholar
- 43.Schafer P (2015) The boss is concerned with time series classification in the presence of noise. Data Min Knowl Discov 29(6):1505–1530MathSciNetCrossRefGoogle Scholar
- 44.Senin P, Malinchik S (2013) Sax-vsm: Interpretable time series classification using sax and vector space model. In: IEEE 13th international conference on data mining (ICDM), pp 1175–1180Google Scholar
- 45.Souza V, Silva D, Batista G (2013) Time series classification using compression distance of recurrence plots. In: IEEE 13th international conference on data mining, pp 687–696Google Scholar
- 46.Souza V, Silva D, Batista G (2014) Extracting texture features for time series classification. In: 22nd international conference on pattern recognition (ICPR), pp 1425–1430Google Scholar
- 47.Thiel M, Romano MC, Kurths J (2003) Analytical description of recurrence plots of white noise and chaotic processes,.arXiv:nlin/0301027Google Scholar
- 48.Ueno K, Xi X, Keogh E, Lee D (2007) Anytime classification using the nearest neighbour algorithm with applications to stream mining. In: IEEE international conference on data mining, pp 623–632Google Scholar
- 49.Wang J, Liu P, She M, Nahavandi S, Kouzani A (2013) Bag-of-words representation for biomedical time series classification. Biomed Signal Process Control 8(6):634–644CrossRefGoogle Scholar
- 50.Wang J, Yang J, Yu K, Lv F, Huang T, Gong Y (2010) Locality-constrained linear coding for image classification. In: CVPR, pp 3360–3367Google Scholar
- 51.Wang Z, Oates T (2014) Time warping symbolic aggregation approximation with bag-of-patterns representation for time series classification. In: 13th international conference on machine learning and applications (ICMLA), pp 270–275Google Scholar
- 52.Wang Z, Oates T (2015) Pooling sax-bop approaches with boosting to classify multivariate synchronous physiological time series data. In: FLAIRS conference, pp 335–341Google Scholar
- 53.Xing Z, Pei J, Yu P (2011) Early prediction on time series: a nearest neighbor approach. Int Joint Conf Artif Intell 2168:1297–1302Google Scholar
- 54.Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. In: CVPR, pp 1794–1801Google Scholar
- 55.Yang H (2011) Multiscale recurrence quantification analysis of spatial cardiac vectorcardiogram signals. IEEE Trans Biomed Eng 58(2):339–347CrossRefGoogle Scholar
- 56.Zbilut JP, Webber CL Jr (1992) Embeddings and delays as derived from quantification of recurrence plots. Phys Lett A 171(3–4):199–203CrossRefGoogle Scholar
- 57.Zhang M, Sawchuk A (2012) Motion primitive-based human activity recognition using a bag-of-features approach. In: ACM SIGHIT symposium on international health informatics (IHI), pp 631–640Google Scholar
- 58.Zhao J, Itti L (2016) Classifying time series using local descriptors with hybrid sampling. IEEE Trans Knowl Data Eng 28:623–637CrossRefGoogle Scholar