A Non-ordered Rule Induction Algorithm through Multi-Objective Particle Swarm Optimization: Issues and Applications

de Carvalho, André B.; Pozo, Aurora; Vergilio, Silvia

doi:10.1007/978-3-642-05165-4_2

André B. de Carvalho⁴,
Aurora Pozo⁴ &
Silvia Vergilio⁴

Part of the book series: Studies in Computational Intelligence ((SCI,volume 261))

924 Accesses

Abstract

Multi-Objective Metaheuristics permit to conceive a complete novel approach to induce classifiers, where the properties of the rules can be expressed in different objectives, and then the algorithm finds these rules in an unique run by exploring Pareto dominance concepts. Furthermore, these rules can be used as an unordered classifier, in this way, the rules are more intuitive and easier to understand because they can be interpreted independently one of the other. The quality of the learned rules is not affected during the learning process because the dataset is not modified, as in traditional rule induction approaches. With this philosophy, this chapter describes a Multi-Objective Particle Swarm Optimization (MOPSO) algorithm. One reason to choose the Particle Swarm Optimization Meta heuristic is its recognized ability to work in numerical domains. This propriety allows the described algorithm deals with both numerical and discrete attributes. The algorithm is evaluated by using the area under ROC curve and, by comparing the performance of the induced classifiers with other ones obtained with well known rule induction algorithms. The produced Pareto Front coverage of the algorithm is also analyzed following a Multi-Objective methodology. In addition to this, some application results in the Software Engineering domain are described, more specifically in the context of software testing. Software testing is a fundamental Software Engineering activity for quality assurance that is traditionally very expensive. The algorithm is used to induce rules for fault-prediction that can help to reduce testing efforts. The empirical evaluation and the comparison show the effectiveness and scalability of this new approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Alshayeb, M., Li, W.: An empirical validation of object-oriented metrics in two different iterative software processes. IEEE Transaction on Software Engineering 29(11), 1043–1049 (2003)
Article Google Scholar
Asuncion, A., Newman, D.: UCI machine learning repository (2007)
Google Scholar
Baronti, F., Starita, A.: Hypothesis Testing with Classifier Systems for Rule-Based Risk Prediction, pp. 24–34. Springer, Heidelberg (2007), http://dx.doi.org/10.1007/978-3-540-71783-6_3
Google Scholar
Basili, V.R., Briand, L.C., Melo, W.L.: A validation of object-oriented design metrics as quality indicators. IEEE Transaction on Software Engineering 22(10), 751–761 (1996)
Article Google Scholar
Batista, G., Milare, C., Prati, R.C., Monard, M.: A comparison of methods for rule subset selection applied to associative classification. Inteligencia Artificial. Revista Iberoamericana de IA 7(32), 29–35 (2006)
Google Scholar
Bleuler, S., Laumanns, M., Thiele, L., Zitzler, E.: PISA — a platform and programming language independent interface for search algorithms. In: Fonseca, C.M., Fleming, P.J., Zitzler, E., Deb, K., Thiele, L. (eds.) EMO 2003. LNCS, vol. 2632, pp. 494–508. Springer, Heidelberg (2003)
Chapter Google Scholar
Bleuler, S., Laumanns, M., Thiele, L., Zitzler, E.: The PISA homepage (2003), http://www.tik.ee.ethz.ch/pisa/
Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 30(7), 1145–1159 (1997)
Article Google Scholar
Bratton, D., Kennedy, J.: Defining a standard for particle swarm optimization. In: Proceedings of IEEE Swarm Intelligence Symposium (SIS 2007), Honolulu, Hawaii, USA, pp. 120–127. IEEE Computer Society, Los Alamitos (2007)
Chapter Google Scholar
Briand, L.C., Wust, J., Daly, J., Porter, V.: A comprehensive empirical validation of design measures for object-oriented systems. In: METRICS 1998: Proceedings of the 5th International Symposium on Software Metrics, Washington, DC, USA, p. 246. IEEE Computer Society, Los Alamitos (1998)
Google Scholar
Briand, L.C., Wust, J., Daly, J.W., Porter, D.V.: Exploring the relationships between design measures and software quality in object-oriented systems. The Journal of Systems and Software 51(3), 245–273 (2000)
Article Google Scholar
Chidamber, S., Kemerer, C.: A metrics suite for object-oriented design. IEEE Transaction on Software Engineering 20(6), 476–493 (1994)
Article Google Scholar
Clark, P., Niblett, T.: Rule induction with CN2: Some recent improvements. In: ECML: European Conference on Machine Learning. Springer, Heidelberg (1991)
Google Scholar
Cohen, W.W.: Fast effective rule induction. In: Proceedings of the Twelfth International Conference on Machine Learning, pp. 115–123 (1995)
Google Scholar
Conover, W.J.: Practical nonparametric statistics. Wiley, Chichester (1971)
Google Scholar
de Carvalho, A.B., Pozo, A., Vergilio, S., Lenz, A.: Predicting fault proneness of classes trough a multiobjective particle swarm optimization algorithm. In: Poceedings of 20th IEEE International Conference on Tools with Artificial Intelligence (2008)
Google Scholar
de la Iglesia, B., Philpott, M.S., Bagnall, A.J., Rayward-Smith, V.J.: Data mining rules using multi-objective evolutionary algorithms. In: Congress on Evolutionary Computation, pp. 1552–1559. IEEE Computer Society, Los Alamitos (2003)
Chapter Google Scholar
de la Iglesia, B., Reynolds, A., Rayward-Smith, V.J.: Developments on a multi-objective metaheuristic (momh) algorithm for finding interesting sets of classification rules. In: Coello Coello, C.A., Hernández Aguirre, A., Zitzler, E. (eds.) EMO 2005, vol. 3410, pp. 826–840. Springer, Heidelberg (2005)
Google Scholar
Egan, J.: Signal detection theory and ROC analysis. Academic Press, New York (1975)
Google Scholar
Pérez-Miñana, E., Gras, J.-J.: Improving fault prediction using bayesian networks for the development of embedded software applications: Research articles. Softw. Test. Verif. Reliab. 16(3), 157–174 (2006)
Article Google Scholar
Fawcett, T.: Using rule sets to maximize ROC performance. In: IEEE International Conference on Data Mining, pp. 131–138. IEEE Computer Society Press, Los Alamitos (2001)
Chapter Google Scholar
Fenton, N., Neil, M., Marsh, W., Hearty, P., Marquez, D., Krause, P., Mishra, R.: Predicting software defects in varying development lifecycles using bayesian nets. Infromation on Software Technology 49(1), 32–43 (2007)
Article Google Scholar
Ferri, C., Flach, P., Hernandez-Orallo, J.: Learning decision trees using the area under the ROC curve. In: Sammut, C., Hoffmann, A. (eds.) Proceedings of the 19th International Conference on Machine Learning, July 2002, pp. 139–146. Morgan Kaufmann, San Francisco (2002)
Google Scholar
Group, W.M.L.: Weka machine learning project (2007), http://www.cs.waikato.ac.nz/ml/weka
Hanley, J.A., McNeil, B.J.: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143(1), 29–36 (1982)
Google Scholar
Hansen, M.P., Jaszkiewicz, A.: Evaluating the quality of approximations to the non-dominated set. Technical Report IMM-REP-1998-7, Technical University of Denmark (March 1998)
Google Scholar
Ishibuchi, H.: Multiobjective association rule mining. In: PPSN Workshop on Multiobjective Problem Solving from Nature, Reykjavik, Iceland, pp. 39–48 (2006)
Google Scholar
Ishibuchi, H., Nojima, Y.: Accuracy-complexity tradeoff analysis by multiobjective rule selection. In: ICDM, pp. 39–48. IEEE Computer Society, Los Alamitos (2005)
Google Scholar
Ishida, C., de Carvalho, A.B., Pozo, A.: Exploring Multi-objective PSO and GRASP-PR for rule induction. In: van Hemert, J., Cotta, C. (eds.) EvoCOP 2008. LNCS, vol. 4972, pp. 73–84. Springer, Heidelberg (2008)
Chapter Google Scholar
Ishida, C.Y., Pozo, A.T.R.: Optimization of the auc criterion for rule subset selection. In: 7th. International Conference on Intelligent Systems Design and Applications, New York, NY, USA. IEEE Computer Society, Los Alamitos (2007)
Google Scholar
Jin, Y.: Multi-Objective Machine Learning. Springer, Berlin (2006)
Book MATH Google Scholar
Jovanoski, V., Lavrac, N.: Classification rule learning with APRIORI-C. In: Brazdil, P.B., Jorge, A.M. (eds.) EPIA 2001. LNCS, vol. 2258, pp. 44–51. Springer, Heidelberg (2001)
Google Scholar
Kennedy, J., Eberhart, R.: Particle swarm optimization. In: IEEE International Conference on Neural Networks, pp. 1492–1948. IEEE Press, Los Alamitos (1955)
Google Scholar
Kennedy, J., Eberhart, R.C.: Swarm intelligence. Morgan Kaufmann Publishers Inc., San Francisco (2001)
Google Scholar
Knowles, J., Thiele, L., Zitzler, E.: A Tutorial on the Performance Assessment of Stochastic Multiobjective Optimizers. In: Computer Engineering and Networks Laboratory (TIK), ETH Zurich, Switzerland, Febuary 2006, vol. 214 (2006) (revised version)
Google Scholar
Lavrac, N., Flach, P., Zupan, B.: Rule evaluation measures: A unifying view. In: Džeroski, S., Flach, P.A. (eds.) ILP 1999. LNCS, vol. 1634, pp. 174–185. Springer, Heidelberg (1999)
Chapter Google Scholar
Li, W., Han, J., Pei, J.: CMAR: Accurate and efficient classification based on multiple class-association rules. In: Cercone, N., Lin, T.Y., Wu, X. (eds.) ICDM, pp. 369–376. IEEE Computer Society, Los Alamitos (2001)
Google Scholar
Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: Knowledge Discovery and Data Mining, pp. 80–86 (1998)
Google Scholar
Lounis, H., Ait-Mehedine, L.: Machine-learning techniques for software product quality assessment. In: Fourth International Conference QSIC 2004: Proceedings of the Quality Software, Washington, DC, USA, pp. 102–109. IEEE Computer Society, Los Alamitos (2004)
Chapter Google Scholar
Martin, B.: Instance-Based learning: Nearest Neighbor With Generalization. PhD thesis, Department of Computer Science, University of Waikato, New Zealand (1995)
Google Scholar
Mostaghim, S., Teich, J.: Strategies for finding good local guides in multi-objective particle swarm optimization. In: Proceedings of the 2003 IEEE Swarm Intelligence Symposium SIS 2003 Swarm Intelligence Symposium, pp. 26–33. IEEE Computer Society, Los Alamitos (2003)
Chapter Google Scholar
Pai, G.J., Dugan, J.B.: Empirical analysis of software fault content and fault proneness using bayesian methods. IEEE Transaction on Software Engineering 33(10), 675–686 (2007)
Article Google Scholar
Pareto, V.: Manuel d”economie politique (1927)
Google Scholar
Prati, R.C., Flach, P.A.: ROCCER: An algorithm for rule learning based on ROC analysis. In: Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, pp. 823–828 (2005)
Google Scholar
Program, N.I.F.M.D.: Metrics data repository, http://mdp.ivv.nasa.gov/
Provost, F., Fawcett, T.: Robust classification for imprecise environments. Machine Learning 42(3), 203 (2001)
Article MATH Google Scholar
Provost, F., Fawcett, T., Kohavi, R.: The case against accuracy estimation for comparing induction algorithms. In: Proceedings 15th International Conference on Machine Learning, pp. 445–453. Morgan Kaufmann, San Francisco (1998)
Google Scholar
Provost, F.J., Fawcett, T.: Analysis and visualization of classifier performance: Comparison under imprecise class and cost distributions. In: KDD, pp. 43–48 (1997)
Google Scholar
Quinlan, R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Mateo (1993)
Google Scholar
Rakotomamonjy, A.: Optimizing area under roc curve with SVMs. In: Hernández-Orallo, J., Ferri, C., Lachiche, N., Flach, P.A. (eds.) ROCAI, pp. 71–80 (2004)
Google Scholar
Reyes-Sierra, M., Coello, C.A.C.: Multi-objective particle swarm optimizers: A survey of‘the state-of-the-art. International Journal of Computational Intelligence Research 2(3), 287–308 (2006)
MathSciNet Google Scholar
Sebag, M., Aze, J., Lucas, N.: ROC-based evolutionary learning: Application to medical data mining. In: International Conference on Artificial Evolution, Evolution Artificielle. LNCS, vol. 6 (2003)
Google Scholar
Subramanyam, R., Krishnan, M.S.: Empirical analysis of CK metrics for object-oriented design complexity: Implications for software defects. IEEE Transaction on Software Engineering 29(4), 297–310 (2003)
Article Google Scholar
Succi, G., Pedrycz, W., Stefanovic, M., Miller, J.: Practical assessment of the models for identification of defect-prone classes in object-oriented commercial systems using design metrics. The Journal of Systems and Software 65(1), 1–12 (2003)
Article Google Scholar
Thwin, M.M.T., Quah, T.-S.: Application of neural networks for software quality prediction using object-oriented metrics. The Journal of Systems and Software 76(2), 147–156 (2005)
Article Google Scholar
Toracio, A., Pozo, A.: Multiple objective particle swarm for classification-rule discovery. In: Proceedings of CEC 2007, pp. 684–691. IEEE Computer Society, Los Alamitos (2007)
Google Scholar
Yin, X., Han, J.: CPAR: Classification based on predictive association rules. In: Proceedings SIM International Conference on Data Mining (SDM 2003), pp. 331–335 (2003)
Google Scholar
Zhou, Y., Leung, H.: Empirical analysis of object-oriented design metrics for predicting high and low severity faults. IEEE Transaction on Software Engineering 32(10), 771–789 (2006)
Article Google Scholar
Zitzler, E., Thiele, L.: Multiobjective Evolutionary Algorithms: A Comparative Case Study and the Strength Pareto Approach. IEEE Transactions on Evolutionary Computation 3(4), 257–271 (1999)
Article Google Scholar
Zitzler, E., Thiele, L., Laumanns, M., Fonseca, C.M., da Fonseca., V.G.: Performance assessment of multiobjective optimizers: an analysis and review. IEEE Transactions on Evolutionary Computation 7, 117–132 (2003)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Computer Sciences Department, Federal University of Paraná, Curitiba, PR, CEP: 19081, Brazil
André B. de Carvalho, Aurora Pozo & Silvia Vergilio

Authors

André B. de Carvalho
View author publications
You can also search for this author in PubMed Google Scholar
Aurora Pozo
View author publications
You can also search for this author in PubMed Google Scholar
Silvia Vergilio
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculdade de Engenharia, Universidade do Estado do Rio de Janeiro, sala 5022-D Rua São Francisco Xavier 524, 20550-900, MARACANÃ, RJ, Brazil
Nadia Nedjah & Luiza de Macedo Mourelle &
Universidade Federal do Paraná Departamento de Engenharia Elétrica Pós-Graduação em Engenharia Elétrica Centro Politécnico 81531-980, Curitiba-PR, Brazil and Pontifícia Universidade Católica do Paraná Centro, Brazil
Leandro dos Santos Coelho

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

de Carvalho, A.B., Pozo, A., Vergilio, S. (2010). A Non-ordered Rule Induction Algorithm through Multi-Objective Particle Swarm Optimization: Issues and Applications. In: Nedjah, N., dos Santos Coelho, L., de Macedo Mourelle, L. (eds) Multi-Objective Swarm Intelligent Systems. Studies in Computational Intelligence, vol 261. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05165-4_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-05165-4_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-05164-7
Online ISBN: 978-3-642-05165-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics