A novel logistic-NARX model as a classifier for dynamic binary classification

Ayala Solares, Jose Roberto; Wei, Hua-Liang; Billings, Stephen A.

doi:10.1007/s00521-017-2976-x

A novel logistic-NARX model as a classifier for dynamic binary classification

Original Article
Published: 27 April 2017

Volume 31, pages 11–25, (2019)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Jose Roberto Ayala Solares¹,
Hua-Liang Wei ORCID: orcid.org/0000-0002-4704-7346¹ &
Stephen A. Billings¹

969 Accesses
20 Citations
1 Altmetric
Explore all metrics

Abstract

System identification and data-driven modeling techniques have seen ubiquitous applications in the past decades. In particular, parametric modeling methodologies such as linear and nonlinear autoregressive with exogenous input models (ARX and NARX) and other similar and related model types have been preferably applied to handle diverse data-driven modeling problems due to their easy-to-compute linear-in-the-parameter structure, which allows the resultant models to be easily interpreted. In recent years, several variations of the NARX methodology have been proposed that improve the performance of the original algorithm. Nevertheless, in most cases, NARX models are applied to regression problems where all output variables involve continuous or discrete-time sequences sampled from a continuous process, and little attention has been paid to classification problems where the output signal is a binary sequence. Therefore, we developed a novel classification algorithm that combines the NARX methodology with logistic regression and the proposed method is referred to as logistic-NARX model. Such a combination is advantageous since the NARX methodology helps to deal with the multicollinearity problem while the logistic regression produces a model that predicts categorical outcomes. Furthermore, the NARX approach allows for the inclusion of lagged terms and interactions between them in a straight forward manner resulting in interpretable models where users can identify which input variables play an important role individually and/or interactively in the classification process, something that is not achievable using other classification techniques like random forests, support vector machines, and k-nearest neighbors. The efficiency of the proposed method is tested with five case studies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Classification Models

Automatic piecewise linear regression

Article Open access 01 March 2024

Mathias von Ottenbreit & Riccardo De Bin

Regression

References

Billings SA (2013) Nonlinear system identification: NARMAX methods in the time, frequency, and spatio-temporal domains. Wiley
Söderström T, Stoica P (1989) System identification. Prentice Hall
Pope KJ, Rayner PJW (1994) In: 1994 IEEE international conference on acoustics, speech, and signal processing, 1994. ICASSP-94, vol IV, pp 457–460
Billings SA, Chen S, Backhouse RJ (1989) The identification of linear and non-linear models of a turbocharged automotive diesel engine. Mech Syst Signal Process 3(2):123
Article Google Scholar
Billings SA, Wei HL (2007) Sparse model identification using a forward orthogonal regression algorithm aided by mutual information. IEEE Trans Neural Netw 18(1):306
Article Google Scholar
Wei HL, Zhu DQ, Billings S, Balikhin MA (2007) Forecasting the geomagnetic activity of the Dst index using multiscale radial basis function networks. Adv Space Res 40(12):1863. http://www.sciencedirect.com/science/article/pii/S0273117707002086
Article Google Scholar
Billings SA, Wei HL (2008) An adaptive orthogonal search algorithm for model subset selection and non-linear system identification. Int J Control 81(5):714
Article MathSciNet MATH Google Scholar
Wei HL, Billings SA (2008) Model structure selection using an integrated forward orthogonal search algorithm assisted by squared correlation and mutual information. Int J Model Ident Control 3(4):341
Article Google Scholar
Alexandridis AK, Zapranis AD (2013) Wavelet neural networks: A practical guide. Neural Netw 42(0):1. doi:10.1016/j.neunet.2013.01.008. http://www.sciencedirect.com/science/article/pii/S0893608013000129
Article MATH Google Scholar
Billings SA, Wei HL (2005) The wavelet-NARMAX representation: a hybrid model structure combining polynomial models with multiresolution wavelet decompositions. Int J Syst Sci 36(3): 137
Article MathSciNet MATH Google Scholar
Billings SA, Wei HL (2005) A new class of wavelet networks for nonlinear system identification. IEEE Trans Neural Netw 16(4):862
Article Google Scholar
Wei HL, Billings SA, Zhao Y, Guo L (2009) Lattice dynamical wavelet neural networks implemented using particle swarm optimization for spatio temporal system identification. IEEE Trans Neural Netw 20(1):181
Article Google Scholar
Billings S, Wei HL, Balikhin MA (2007) Generalized multiscale radial basis function networks. Neural Netw 20(10): 1081. http://www.sciencedirect.com/science/article/pii/S0893608007001876
Article MATH Google Scholar
Koller D, Sahami M (1996) Toward optimal feature selection. In: 13th international conference on machine learning. Bari, Italy, pp 284–292
Wang S, Wei HL, Coca D, Billings SA (2013) Model term selection for spatio-temporal system identification using mutual information. Int J Syst Sci 44(2):223
Article MathSciNet MATH Google Scholar
Speed T (2011) A correlation for the 21st century. Science 334(6062):1502. doi:10.1126/science.1215894. http://www.sciencemag.org/content/334/6062/1502.short
Article Google Scholar
Reshef DN, Reshef YA, Finucane HK, Grossman SR, McVean G, Turnbaugh PJ, Lander ES, Mitzenmacher M, Sabeti PC (2011) Detecting novel associations in large data sets. Science 334(6062):1518. doi:10.1126/science.1205438. http://www.sciencemag.org/content/334/6062/1518.abstract
Article MATH Google Scholar
Székely GJ, Rizzo ML, Bakirov NK (2007) Measuring and testing dependence by correlation of distances. Ann Stat 35(6): 2769
Article MathSciNet MATH Google Scholar
Székely GJ, Rizzo ML (2013) Energy statistics: A class of statistics based on distances. J Stat Plan Infer 143(8):1249
Article MathSciNet MATH Google Scholar
Piroddi L, Spinelli W (2003) An identification algorithm for polynomial NARX models based on simulation error minimization. Int J Control 76(17):1767. doi:10.1080/00207170310001635419
Article MathSciNet MATH Google Scholar
Ayala Solares J, Wei HL (2015) Nonlinear model structure detection and parameter estimation using a novel bagging method based on distance correlation metric. Nonlinear Dynamics, pp 1–15. doi:10.1007/s11071-015-2149-3
Wei HL, Lang Z, Billings SA (2008) Constructing an overall dynamical model for a system with changing design parameter properties. Int J Model Ident Control 5(2):93
Article Google Scholar
Li P, Wei HL, Billings SA, Balikhin MA, Boynton R (2013) Nonlinear model identification from multiple data sets using an orthogonal forward search algorithm. J Comput Nonlinear Dyn 8(4):10
Google Scholar
Li Y, Wei HL, Billings S, Sarrigiannis P (2015) Identification of nonlinear time-varying systems using an online sliding-window and common model structure selection (CMSS) approach with applications to EEG. International Journal of Systems Science, pp 1–11. doi:10.1080/00207721.2015.1014448 10.1080/00207721.2015.1014448
Guo Y, Guo L, Billings S, Wei HL (2015) An iterative orthogonal forward regression algorithm. Int J Syst Sci 46(5):776. doi:10.1080/00207721.2014.981237
Article MathSciNet MATH Google Scholar
Guo Y, Guo LZ, Billings S, Wei HL (2015) Ultra-orthogonal forward regression algorithms for the identification of non-linear dynamic systems. Neurocomputing 173:715–723. http://www.sciencedirect.com/science/article/pii/S0925231215011741 http://www.sciencedirect.com/science/article/pii/S0925231215011741
James G, Witten D, Hastie T, Tibshirani R (2013) An introduction to statistical learning with application in r, Springer Texts in Statistics, vol 103. Springer
Harrell F (2015) Regression modeling strategies: with applications to linear models, logistic and ordinal regression and survival analysis. Springer
Pallant J (2013) SPSS survival manual. McGraw-Hill Education, UK
Google Scholar
Breiman L (2001) Random forests. Mach Learn 45(1):5. doi:10.1023/A%3A1010933404324
Article MATH Google Scholar
Vapnik VN (1998) Statistical learning theory. Wiley
Kuhn M, Johnson K (2013) Applied predictive modeling. Springer
Wei HL, Billings SA, Liu J (2004) Term and variable selection for non-linear system identification. Int J Control 77(1):86
Article MathSciNet MATH Google Scholar
Rashid MT, Frasca M, Ali AA, Ali RS, Fortuna L, Xibilia MG (2012) Nonlinear model identification for Artemia population motion. Nonlinear Dyn 69(4):2237. doi:10.1007/s11071-012-0422-2
Article MathSciNet Google Scholar
Wickham H (2016) R for Data Science. Hadley Wickham, Garrett Grolemund, O’Reilly, Canada
Aguirre LA, Jácôme C (1998) Cluster analysis of NARMAX models for signal-dependent systems IEEE proceedings of the control theory and applications, vol 145. IET, pp 409–414
Feil B, Abonyi J, Szeifert F (2004) Model order selection of nonlinear input–output models—a clustering based approach. J Process Control 14(6):593
Article Google Scholar
Kukreja SL, Lofberg J, Brenner MJ (2006) A least absolute shrinkage and selection operator (LASSO) for nonlinear system identification. In: IFAC proceedings volumes, vol 39, no 1, pp 814–819
Qin P, Nishii R, Yang ZJ (2012) Selection of NARX models estimated using weighted least squares method via GIC-based method and L ₁-norm regularization methods. Nonlinear Dyn 70(3):1831. doi:10.1007/s11071-012-0576-y
Article Google Scholar
Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc Ser B (Stat Methodol) 67(2):301
Article MathSciNet MATH Google Scholar
Hong X, Chen S (2012) An elastic net orthogonal forward regression algorithm 16th IFAC symposium on system identification, pp 1814–1819
Google Scholar
Sette S, Boullart L (2001) Genetic programming: principles and applications. Eng Appl Artif Intell 14 (6):727
Article Google Scholar
Madár J, Abonyi J, Szeifert F (2005) Genetic programming for the identification of nonlinear input–output models. Ind Eng Chem Res 44(9):3178
Article Google Scholar
Baldacchino T, Anderson SR, Kadirkamanathan V (2012) Structure detection and parameter estimation for NARX models in a unified EM framework. Automatica 48(5):857
Article MathSciNet MATH Google Scholar
Teixeira BO, Aguirre LA (2011) Using uncertain prior knowledge to improve identified nonlinear dynamic models. J Process Control 21(1):82
Article Google Scholar
Billings SA, Voon WSF (1986) A prediction-error and stepwise-regression estimation algorithm for non-linear systems. Int J Control 44(1):235
Article MATH Google Scholar
Dietterich TG (2002) Machine learning for sequential data: a review structural, syntactic, and statistical pattern recognition Structural, syntactic, and statistical pattern recognition. Springer, pp 15–30
Aguirre LA, Letellier C (2009) Modeling nonlinear dynamics and chaos: a review. Math Probl Eng 2009:35
Article MathSciNet MATH Google Scholar
Wei HL, Balikhin MA, Walker SN (2015) A new ridge basis function neural network for data-driven modeling and prediction 2015 10th international conference on computer science & education (ICCSE). IEEE, pp 125–130
Billings S, Mao K (1998) Model identification and assessment based on model predicted output. Tech. rep., Department of Automatic Control and Systems Engineering. The University of Sheffield, UK
Google Scholar
Nepomuceno EG, Martins SAM (2016) A lower bound error for free-run simulation of the polynomial NARMAX. Syst Sci Control Eng 4(1):50. doi:10.1080/21642583.2016.1163296
Article Google Scholar
Chen S, Billings S, Luo W (1989) Orthogonal least squares methods and their application to non-linear system identification. Int J Control 50(5):1873
Article MATH Google Scholar
Komarek P (2004) Logistic regression for data mining and high-dimensional classification. Master’s thesis, Robotics Institute - School of Computer Science. Carnegie Mellon University , USA
Google Scholar
Senawi A, Wei HL, Billings S (2017) A new maximum relevance-minimum multicollinearity (MRmMC) method for feature selection and ranking. Pattern Recognition. Accepted
Bennett KP, Mangasarian OL (1992) Robust linear programming discrimination of two linearly inseparable sets. Opt Methods Softw 1(1):23
Article Google Scholar
Mangasarian OL, Street WN, Wolberg WH (1995) Breast cancer diagnosis and prognosis via linear programming. Oper Res 43(4):570
Article MathSciNet MATH Google Scholar
Lichman M (2013) Breast cancer diagnosis and prognosis via linear programming. UCI machine learning repository. http://archive.ics.uci.edu/ml
WHO Breast cancer: prevention and control. http://www.who.int/cancer/detection/breastcancer/en/
Wang T, Guan SU, Man KL, Ting TO (2014) EEG eye state identification using incremental attribute learning with time-series classification. Mathematical Problems in Engineering 2014
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321
Article MATH Google Scholar

Download references

Acknowledgements

The authors acknowledge the financial support to J. R. Ayala Solares from the University of Sheffield and the Mexican National Council of Science and Technology (CONACYT). The authors gratefully acknowledge that part of this work was supported by the Engineering and Physical Sciences Research Council (EPSRC) under Grant EP/I011056/1 and Platform Grant EP/H00453X/1, and ERC Horizon 2020 Research and Innovation Action Framework Programme under Grant No 637302 (PROGRESS).

Author information

Authors and Affiliations

Department of Automatic Control and Systems Engineering, Faculty of Engineering, The University of Sheffield, Sheffield, UK
Jose Roberto Ayala Solares, Hua-Liang Wei & Stephen A. Billings

Authors

Jose Roberto Ayala Solares
View author publications
You can also search for this author in PubMed Google Scholar
Hua-Liang Wei
View author publications
You can also search for this author in PubMed Google Scholar
Stephen A. Billings
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hua-Liang Wei.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ayala Solares, J.R., Wei, HL. & Billings, S.A. A novel logistic-NARX model as a classifier for dynamic binary classification. Neural Comput & Applic 31, 11–25 (2019). https://doi.org/10.1007/s00521-017-2976-x

Download citation

Received: 12 October 2016
Accepted: 24 March 2017
Published: 27 April 2017
Issue Date: 18 January 2019
DOI: https://doi.org/10.1007/s00521-017-2976-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

A novel logistic-NARX model as a classifier for dynamic binary classification

Abstract

Access this article

Similar content being viewed by others

Classification Models

Automatic piecewise linear regression

Regression

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A novel logistic-NARX model as a classifier for dynamic binary classification

Abstract

Access this article

Similar content being viewed by others

Classification Models

Automatic piecewise linear regression

Regression

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation