Abstract
Nefclass is a common example of the construction of a neuro-fuzzy system. The popular Nefclass classifier exhibits surprising behaviour when the feature values of the training and testing data sets exhibit significant skew. As skewed feature values are commonly observed in biological data sets, this is a topic that is of interest in terms of the applicability of such a classifier to these types of problems. This paper presents an approach to improve the classification accuracy of the Nefclass classifier, when data distribution exhibits positive skewness. The Nefclass classifier is extended to provide improved classification accuracy over the original Nefclass classifier when trained on skewed data. The proposed model uses two alternative discretization methods, MME and CAIM, to initialize fuzzy sets. From this study it is found that using the MME and CAIM discretization methods results in greater improvements in the classification accuracy of Nefclass as compared to using the original Equal-Width technique Nefclass uses by default.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Gao, J., Hu, W., Li, W., Zhang, Z., Wu, O.: Local outlier detection based on kernel regression. In: Proceedings of the 10th International Conference on Pattern Recognition, Washington, DC, USA, pp. 585–588. EEE Computer Society (2010)
Ben-Gal, I.: Outlier detection. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, pp. 131–146. Springer Science & Business Media, Berlin (2010)
Mansoori, E., Zolghadri, M., Katebi, S.: A weighting function for improving fuzzy classification systems performance. Fuzzy Sets Syst. 158, 588–591 (2007)
Liu, Y., Liu, X., Su, Z.: A new fuzzy approach for handling class labels in canonical correlation analysis. Neurocomputing 71, 1785–1740 (2008)
Peker, N.E.S.: Exponential membership function evaluation based on frequency. Asian J. Math. Stat. 4, 8–20 (2011)
Chittineni, S., Bhogapathi, R.B.: A study on the behavior of a neural network for grouping the data. Int. J. Comput. Sci. 9, 228–234 (2012)
Changyong, F., Hongyue, W., Naiji, L., Tian, C., Hua, H., Ying, L., Xin, M.: Log-transformation and its implications for data analysis. Shanghai Arch Psychiatry 26, 105–109 (2014)
Qiang, Q., Guillermo, S.: Learning transformations for clustering and classification. J. Mach. Learn. Res. 16, 187–225 (2015)
Zadkarami, M.R., Rowhani, M.: Application of skew-normal in classification of satellite image. J. Data Sci. 8, 597–606 (2010)
Hubert, M., Van der Veeken, S.: Robust classification for skewed data. Adv. Data Anal. Classif. 4, 239–254 (2010)
Yousefi, J., Hamilton-Wright, A.: Classification confusion within NEFCLASS caused by feature value skewness in multi-dimensional datasets. In: Proceedings of the \(9^{\text{th}}\) International Joint Conference on Computational Intelligence, Porto, IJCCI-2016 (2016)
Chemielewski, M.R., Grzymala-Busse, J.W.: Global discretization of continuous attributes as preprocessing for machine learning. Int. J. Approx. Reason. 15, 319–331 (1996)
Monti, S., Cooper, G.: A latent variable model for multivariate discretization. In: The Seventh International Workshop on Artificial Intelligence and Statistics, Fort Lauderdale, FL, pp. 249–254 (1999)
Chau, T.: Marginal maximum entropy partitioning yields asymptotically consistent probability density functions. IEEE Trans. Pattern Anal. Mach. Intell. 23, 414–417 (2001)
Gokhale, D.V.: On joint and conditional entropies. Entropy 1, 21–24 (1999)
Bertoluzza, C., Forte, B.: Mutual dependence of random variables and maximum discretized entropy. Ann. Probab. 13, 630–637 (1985)
Kerber, R.: ChiMerge discretization of numeric attributes. In: Proceedings of AAAI-92, San Jose Convention Center, San Jose, California, pp. 123–128 (1992)
Kurgan, L.A., Cios, K.: CAIM discretization algorithm. IEEE Trans. Knowl. Data Eng. 16, 145–153 (2004)
Cano, A., Nguyen, D.T., Ventura, S., Cios, K.J.: ur-CAIM: improved CAIM discretization for unbalanced and balanced data. Soft Comput. 33, 173–188 (2016)
Nauck, D., Klawonn, F., Kruse, R.: Neuro-Fuzzy Systems. Wiley, New York (1996)
Nauck, D., Kruse, R.: NEFCLASS-X - a soft computing tool to build readable fuzzy classifiers. BT Technol. J. 16, 180–190 (1998)
Mendel, J.M.: Uncertain Rule-Based Fuzzy Logic Systems. Prentice-Hall, Englewood Cliffs (2001)
Natrella, M.: NIST SEMATECH eHandbook of Statistical Methods. NIST (2003)
Stashuk, D.W., Brown, W.F.: Quantitative electromyography. In: Brown, W.F., Bolton, C.F., Aminoff, M.J. (eds.) Neuromuscular Function and Disease, vol. 1, pp. 311–348. W.B. Saunders, Philadelphia (2002)
Enoka, R., Fuglevand, A.: Motor unit physiology: some unresolved issues. Muscle & Nerve 24, 4–17 (2001)
Varga, R., Matheson, S.M., Hamilton-Wright, A.: Aggregate features in multi-sample classification problems. IEEE Trans. Biomed. Health Inf. 99, 1 (2014). (in press)
Acknowledgements
The authors gratefully acknowledge the support of NSERC, the National Sciences and Engineering Research Council of Canada, for ongoing grant support.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Yousefi, J., Hamilton-Wright, A. (2019). Input Value Skewness and Class Label Confusion in the NEFCLASS Neuro-Fuzzy System. In: Merelo, J.J., et al. Computational Intelligence. IJCCI 2016. Studies in Computational Intelligence, vol 792. Springer, Cham. https://doi.org/10.1007/978-3-319-99283-9_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-99283-9_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99282-2
Online ISBN: 978-3-319-99283-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)