Input Value Skewness and Class Label Confusion in the NEFCLASS Neuro-Fuzzy System

Yousefi, Jamileh; Hamilton-Wright, Andrew

doi:10.1007/978-3-319-99283-9_9

Jamileh Yousefi⁹ &
Andrew Hamilton-Wright^9,10

Part of the book series: Studies in Computational Intelligence ((SCI,volume 792))

Included in the following conference series:

International Joint Conference on Computational Intelligence

244 Accesses

Abstract

Nefclass is a common example of the construction of a neuro-fuzzy system. The popular Nefclass classifier exhibits surprising behaviour when the feature values of the training and testing data sets exhibit significant skew. As skewed feature values are commonly observed in biological data sets, this is a topic that is of interest in terms of the applicability of such a classifier to these types of problems. This paper presents an approach to improve the classification accuracy of the Nefclass classifier, when data distribution exhibits positive skewness. The Nefclass classifier is extended to provide improved classification accuracy over the original Nefclass classifier when trained on skewed data. The proposed model uses two alternative discretization methods, MME and CAIM, to initialize fuzzy sets. From this study it is found that using the MME and CAIM discretization methods results in greater improvements in the classification accuracy of Nefclass as compared to using the original Equal-Width technique Nefclass uses by default.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 119.00; Price excludes VAT (USA)

Softcover Book: USD 159.99; Price excludes VAT (USA)

Hardcover Book: USD 159.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Gao, J., Hu, W., Li, W., Zhang, Z., Wu, O.: Local outlier detection based on kernel regression. In: Proceedings of the 10th International Conference on Pattern Recognition, Washington, DC, USA, pp. 585–588. EEE Computer Society (2010)
Google Scholar
Ben-Gal, I.: Outlier detection. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, pp. 131–146. Springer Science & Business Media, Berlin (2010)
Google Scholar
Mansoori, E., Zolghadri, M., Katebi, S.: A weighting function for improving fuzzy classification systems performance. Fuzzy Sets Syst. 158, 588–591 (2007)
Article MathSciNet Google Scholar
Liu, Y., Liu, X., Su, Z.: A new fuzzy approach for handling class labels in canonical correlation analysis. Neurocomputing 71, 1785–1740 (2008)
Article Google Scholar
Peker, N.E.S.: Exponential membership function evaluation based on frequency. Asian J. Math. Stat. 4, 8–20 (2011)
MathSciNet Google Scholar
Chittineni, S., Bhogapathi, R.B.: A study on the behavior of a neural network for grouping the data. Int. J. Comput. Sci. 9, 228–234 (2012)
Google Scholar
Changyong, F., Hongyue, W., Naiji, L., Tian, C., Hua, H., Ying, L., Xin, M.: Log-transformation and its implications for data analysis. Shanghai Arch Psychiatry 26, 105–109 (2014)
Google Scholar
Qiang, Q., Guillermo, S.: Learning transformations for clustering and classification. J. Mach. Learn. Res. 16, 187–225 (2015)
MathSciNet Google Scholar
Zadkarami, M.R., Rowhani, M.: Application of skew-normal in classification of satellite image. J. Data Sci. 8, 597–606 (2010)
Article Google Scholar
Hubert, M., Van der Veeken, S.: Robust classification for skewed data. Adv. Data Anal. Classif. 4, 239–254 (2010)
Article MathSciNet Google Scholar
Yousefi, J., Hamilton-Wright, A.: Classification confusion within NEFCLASS caused by feature value skewness in multi-dimensional datasets. In: Proceedings of the \(9^{\text{th}}\) International Joint Conference on Computational Intelligence, Porto, IJCCI-2016 (2016)
Google Scholar
Chemielewski, M.R., Grzymala-Busse, J.W.: Global discretization of continuous attributes as preprocessing for machine learning. Int. J. Approx. Reason. 15, 319–331 (1996)
Article Google Scholar
Monti, S., Cooper, G.: A latent variable model for multivariate discretization. In: The Seventh International Workshop on Artificial Intelligence and Statistics, Fort Lauderdale, FL, pp. 249–254 (1999)
Google Scholar
Chau, T.: Marginal maximum entropy partitioning yields asymptotically consistent probability density functions. IEEE Trans. Pattern Anal. Mach. Intell. 23, 414–417 (2001)
Article Google Scholar
Gokhale, D.V.: On joint and conditional entropies. Entropy 1, 21–24 (1999)
Article MathSciNet Google Scholar
Bertoluzza, C., Forte, B.: Mutual dependence of random variables and maximum discretized entropy. Ann. Probab. 13, 630–637 (1985)
Article MathSciNet Google Scholar
Kerber, R.: ChiMerge discretization of numeric attributes. In: Proceedings of AAAI-92, San Jose Convention Center, San Jose, California, pp. 123–128 (1992)
Google Scholar
Kurgan, L.A., Cios, K.: CAIM discretization algorithm. IEEE Trans. Knowl. Data Eng. 16, 145–153 (2004)
Article Google Scholar
Cano, A., Nguyen, D.T., Ventura, S., Cios, K.J.: ur-CAIM: improved CAIM discretization for unbalanced and balanced data. Soft Comput. 33, 173–188 (2016)
Google Scholar
Nauck, D., Klawonn, F., Kruse, R.: Neuro-Fuzzy Systems. Wiley, New York (1996)
Book Google Scholar
Nauck, D., Kruse, R.: NEFCLASS-X - a soft computing tool to build readable fuzzy classifiers. BT Technol. J. 16, 180–190 (1998)
Article Google Scholar
Mendel, J.M.: Uncertain Rule-Based Fuzzy Logic Systems. Prentice-Hall, Englewood Cliffs (2001)
Google Scholar
Natrella, M.: NIST SEMATECH eHandbook of Statistical Methods. NIST (2003)
Google Scholar
Stashuk, D.W., Brown, W.F.: Quantitative electromyography. In: Brown, W.F., Bolton, C.F., Aminoff, M.J. (eds.) Neuromuscular Function and Disease, vol. 1, pp. 311–348. W.B. Saunders, Philadelphia (2002)
Google Scholar
Enoka, R., Fuglevand, A.: Motor unit physiology: some unresolved issues. Muscle & Nerve 24, 4–17 (2001)
Article Google Scholar
Varga, R., Matheson, S.M., Hamilton-Wright, A.: Aggregate features in multi-sample classification problems. IEEE Trans. Biomed. Health Inf. 99, 1 (2014). (in press)
Google Scholar

Download references

Acknowledgements

The authors gratefully acknowledge the support of NSERC, the National Sciences and Engineering Research Council of Canada, for ongoing grant support.

Author information

Authors and Affiliations

School of Computer Science (SOCS), University of Guelph, Guelph, ON, Canada
Jamileh Yousefi & Andrew Hamilton-Wright
Department of Mathematics and Computer Science, Mount Allison University, Sackville, NB, Canada
Andrew Hamilton-Wright

Authors

Jamileh Yousefi
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Hamilton-Wright
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrew Hamilton-Wright .

Editor information

Editors and Affiliations

Department of Computer Architecture and Computer Technology, Universidad de Granada, Granada, Spain
Juan Julian Merelo
ISEL-Instituto Politécnico de Lisboa, Lisboa, Portugal
Fernando Melício
Facultad de Informática, Department of Information and Communications Engineering, University of Murcia, Murcia, Spain
José M. Cadenas
University of Coimbra, Coimbra, Portugal
António Dourado
Université Paris-Est Créteil (UPEC), Créteil, France
Kurosh Madani
University of Algarve, Faro, Portugal
António Ruano
INSTICC, Setúbal, Portugal
Joaquim Filipe

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yousefi, J., Hamilton-Wright, A. (2019). Input Value Skewness and Class Label Confusion in the NEFCLASS Neuro-Fuzzy System. In: Merelo, J.J., et al. Computational Intelligence. IJCCI 2016. Studies in Computational Intelligence, vol 792. Springer, Cham. https://doi.org/10.1007/978-3-319-99283-9_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-99283-9_9
Published: 04 October 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99282-2
Online ISBN: 978-3-319-99283-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics