Advertisement

Genetic Algorithms for Rule Discovery

  • Alex A. Freitas
Part of the Natural Computing Series book series (NCS)

Abstract

In this chapter we discuss several issues related to developing genetic algorithms (GAs) for prediction-rule discovery. The development of a GA for rule discovery involves a number of nontrivial design decisions. In this chapter we categorize these decisions into five groups, each of them discussed in a separate section, as follows.

Keywords

Genetic Algorithm Fitness Function Data Instance Rule Condition Classification Rule 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [Anglano et al. 1997]
    C. Anglano, A. Giordana, G. Lo Bello and L. Saitta. A network genetic algorithm for concept learning. Proceedings of the 7th International Conference Genetic Algorithms (ICGA ‘87), 434–441. 1997.Google Scholar
  2. [Araujo et al. 1999]
    D.L.A. Araujo, H.S. Lopes and A.A. Freitas. A parallel genetic algorithm for rule discovery in large databases. Proceedings of the 1999 IEEE Systems, Man and Cybernetics Conference, v. III, 940–945. Tokyo, 1999.Google Scholar
  3. [Augier et al. 1995]
    S. Augier, G. Venturini and Y. Kodratoff. Learning first order logic rules with a genetic algorithm. Proceedings of the 1st International Conference on Knowledge Discovery and Data Mining (KDD ‘85), 21–26. AAAI Press, 1995.Google Scholar
  4. [Back 2000]
    T. Back. Binary strings. In: T. Back, D.B. Fogel and T. Michalewicz (Eds.) Evolutionary Computation 1, 132–135. Institute of Physics Publishing, 2000.Google Scholar
  5. [Banzhaf 2000]
    W. Banzhaf. Interactive evolution. In: T. Back, D.B. Fogel and T. Michalewicz (Eds.) Evolutionary Computation 1, 228–236. Institute of Physics Publishing, 2000.Google Scholar
  6. [Bhattacharyya 2000a]
    S. Bhattacharyya. Evolutionary algorithms in data mining: multi-objective performance modeling for direct marketing. Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD ‘2000), 465–473. ACM, 2000.Google Scholar
  7. [Bhattacharyya 2000b]
    S. Bhattacharyya. Multi-objective data mining using genetic algorithms. In: A.S. Wu (Ed.) Proceedings of the 2000 Genetic and Evolutionary Computation Conference — Workshop on Data Mining with Evolutionary Algorithms, 76–79. 2000.Google Scholar
  8. [Bohanec 1994]
    M. Bohanec and I. Bratko. Trading accuracy for simplicity in decision trees. Machine Learning, 15, 223–250, 1994.zbMATHGoogle Scholar
  9. [Carvalho and Freitas 2000a]
    D.R. Carvalho and A.A. Freitas. A hybrid decision tree/genetic algorithm for coping with the problem of small disjuncts in data mining. Proceedings of the Genetic and Evolutionary Computation Conference (GECCO ‘2000), 1061–1068. Morgan Kaufmann, 2000.Google Scholar
  10. [Carvalho and Freitas 2000b]
    D.R. Carvalho and A.A. Freitas. A genetic algorithm-based solution for the problem of small disjuncts. Principles of Data Mining and Knowledge Discovery (Proceedings of the 4th European Conference, PKDD ‘2000). Lecture Notes in Artificial Intelligence 1910, 345–352. Springer, 2000.Google Scholar
  11. [Carvalho and Freitas 2002]
    D.R. Carvalho and A.A. Freitas. A genetic algorithm with sequential niching for discovering small-disjunct rules. To appear in Proceedings of the Genetic and Evolutionary Computation Conference (GECCO ‘2002). New York, 2002.Google Scholar
  12. Catlett 1991] J.Catlett. Overpruning large decision trees. Proceedings of the 1991 International Joint Conference on Artificial Intelligence (IJCAI ‘81). Sidney, Australia. 1991.Google Scholar
  13. [Deb 2001]
    K. Deb. Multi-Objective Optimization Using Evolutionary Algorithms. Wiley, 2001.Google Scholar
  14. [Deb and Spears 2000]
    K. Deb and W.M. Spears. In: T. Back, D.B. Fogel and T. Michalewicz (Eds.) Evolutionary Computation 2, 93–100. Institute of Physics Publishing, 2000.Google Scholar
  15. [De Jong et al. 1993]
    K. De Jong, W.M. Spears and D.F. Gordon. Using genetic algorithms for concept learning. Machine Learning 13, 161–188, 1993.CrossRefGoogle Scholar
  16. [Dhar et al. 2000]
    V. Dhar, D. Chou and F. Provost. Discovering interesting patterns for investment decision making with GLOWER — a genetic learner overlaid with entropy reduction. Data Mining and Knowledge Discovery Journal 4(4), 251–280, 2000.Google Scholar
  17. [Emmanouilidis et al. 2000]
    C. Emmanouilidis, A. Hunter and J. MacIntyre. A multiobjective evolutionary setting for feature selection and a commonality-based crossover operator. Proceedings of the 2000 Congress on Evolutionary Computation (CEC ‘2000), 309–316. IEEE, 2000.Google Scholar
  18. Fidelis et al. 2000] M.V. Fidelis, H.S. Lopes and A.A. Freitas. Discovering comprehensible classification rules with a genetic algorithm. Proceedings of the Congress on Evolutionary Computation — 2000 (CEC ‘2000),805–810. La Jolla, CA, USA, IEEE, 2000.Google Scholar
  19. [Flockhart and Radcliffe 1995]
    I.W. Flockhart and N.J. Radcliffe. GA-MINER: parallel data mining with hierarchical genetic algorithms - final report. EPCCAIKMS-GA-MINER-Report 1.0. University of Edinburgh, UK, 1995.Google Scholar
  20. [Fonseca and Fleming 2000]
    C.M. Fonseca and P.J. Fleming. Multiobjective optimization. In: T. Back, D.B. Fogel and T. Michalewicz (Eds.) Evolutionary Computation 2, 25–37. Institute of Physics Publishing, 2000.Google Scholar
  21. [Freitas 2002]
    A.A. Freitas. A survey of evolutionary algorithms for data mining and knowledge discovery. To appear in: A. Ghosh and S. Tsutsui (Eds.) Advances in Evolutionary Computation. Springer, 2002.Google Scholar
  22. [Giordana et al. 1994]
    A. Giordana, L. Saitta and F. Zini. Learning disjunctive concepts by means of genetic algorithms. Proceedings of the 10th International Conference Machine Learning (ML ‘84), 96–104. Morgan Kaufmann, 1994.Google Scholar
  23. [Goldberg 1989]
    D.E. Goldberg. Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley, 1989.Google Scholar
  24. [Greene and Smith 1993]
    D.P. Greene and S.F. Smith. Competition-based induction of decision models from examples. Machine Learning 13, 229–257, 1993.CrossRefGoogle Scholar
  25. [Hand 1997]
    D.J. Hand. Construction and Assessment of Classification Rules. Wiley, 1997.Google Scholar
  26. Hekanaho 1995] J. Hekanaho. Symbiosis in multimodal concept learning. Proceedings of the 1995 International Conference on Machine Learning (ML ‘85),278–285.Google Scholar
  27. [Hekanaho 1996a]
    J. Hekanaho. Background knowledge in GA-based concept learning. Proceedings of the 13th International Conference on Machine Learning (ICML ‘86), 234–242. 1996.Google Scholar
  28. [Hekanaho 1996b]
    J. Hekanaho. Testing different sharing methods in concept learning. TUCS Technical Report No. 71. Turku Centre for Computer Science, Finland. 1996.Google Scholar
  29. [Iglesia et al. 1996]
    B. Iglesia, J.C.W. Debuse and V.J. Rayward-Smith. Discovering knowledge in commercial databases using modern heuristic techniques. Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD ‘86), 44–49. AAAI Press, 1996.Google Scholar
  30. [Janikow 1993]
    C.Z. Janikow. A knowledge-intensive genetic algorithm for supervised learning. Machine Learning 13, 189–228, 1993.CrossRefGoogle Scholar
  31. [Kwedlo and Kretowski 1998]
    W. Kwedlo and M. Kretowski. Discovery of decision rules from databases: an evolutionary approach. Proceedings of the 2nd European Symposium on Principles of Data Mining and Knowledge Discovery (PKDD ‘88) - Lecture Notes in Computer Science 1510, 370–378. Springer, 1998.Google Scholar
  32. [Kwedlo and Kretowski 1999]
    W. Kwedlo and M. Kretowski. An evolutionary algorithm using multivariate discretization for decision rule induction. Proceedings of the 3rd European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD ‘89) Lecture Notes in Computer Science 1704, 392–397. Springer, 1999.Google Scholar
  33. [Liu and Kwok 2000]
    J.J. Liu and J.T.-Y. Kwok. An extended genetic rule induction algorithm. Proceedings of the 2000 Congress on Evolutionary Computation (CEC ‘2000). IEEE, 2000.Google Scholar
  34. [Mansilla et al. 1999]
    E.B. Mansilla, A. Mekaouche and J.M.G. Guiu. A study of a genetic classifier system based on the Pittsburgh approach on a medical domain. Proceedings of the 12th International Conference Industrial and Engineering Applications of Artificial Intelligence and Exp. Syst. (IEA/AIE ‘89) - Lecture Notes in Artificial Intelligence 1611, 175–184. Springer, 1999.Google Scholar
  35. [Neri and Giordana 1995]
    A.Giordana and F. Neri. Search-intensive concept induction. Evolutionary Computation 3(4), 375–416, 1995.Google Scholar
  36. [Noda et al. 1999]
    E. Noda, A.A. Freitas and H.S. Lopes. Discovering interesting prediction rules with a genetic algorithm. Proceedings of the Conference on Evolutionary Computation — 1999 (CEC ‘89), 1322–1329. Washington D.C., USA, 1999.Google Scholar
  37. [Parpinelli et al. 2001]
    R.S. Parpinelli, H.S. Lopes and A.A. Freitas. An ant colony based system for data mining: applications to medical data. Proceedings of the Genetic and Evolutionary Computation Conference (GECCO ‘2001), 791–798. Morgan Kaufmann, 2001.Google Scholar
  38. [Parpinelli et al. 2002]
    R.S. Parpinelli, H.S. Lopes and A.A. Freitas. In: H.A. Abbass, R. Sarker, C. Newton (Eds.) Data Mining: A Heuristic Approach, 191–208. Idea Group Publishing, London, 2002.Google Scholar
  39. [Pei et al. 1997]
    M. Pei, E.D. Goodman and W.F. Punch. Pattern discovery from data using genetic algorithms. Proceedings of the 1st Pacific Asia Conference on Knowledge Discovery and Data Mining. Feb. 1997.Google Scholar
  40. [Poon and Prasher 2001]
    J. Poon and S. Prasher. A hybrid approach to support interactive data mining. Proceedings of the Genetic and Evolutionary Computation Conference (GECCO ‘2001), 527–534. Morgan Kaufmann, 2001.Google Scholar
  41. [Quinlan 1993]
    J.R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, 1993.Google Scholar
  42. [Romao et al. 2002]
    W. Romao, A.A. Freitas and R.C.S. Pacheco. A genetic algorithm for discovering interesting fuzzy prediction rules: applications to science and technology data. To appear in Proceedings of the 2002 Genetic and Evolutionary Computation Conference (GECCO ‘2002). New York, 2002.Google Scholar
  43. [Thomas and Sycara 1999]
    J.D. Thomas and K. Sycara. In: A.A. Freitas (Ed.) Data Mining with Evolutionary Algorithms: Research Directions — Papers from the AAAI ‘89/GECCO ‘89 Workshop. Technical Report WS-99–06, 7-11. AAAI Press, 1999.Google Scholar
  44. [Thomas and Sycara 2000]
    J.D. Thomas and K. Sycara. In: A.S. Wu (Ed.) Proceedings of the 2000 Genetic and Evolutionary Computation Conference Workshop Program — Worshop on Data Mining with Evolutionary Algorithms, 72–75. 2000.Google Scholar
  45. [Van Veldhuizen and Lamont 2000]
    D.A. Van Veldhuizen and G.B. Lamont. Multiobjective evolutionary algorithms: analyzing the state-of-the-art. Evolutionary Computation 8 (2), 125–147, 2000.CrossRefGoogle Scholar
  46. [Venturini 1993]
    G. Venturini. SIA: a supervised inductive algorithm with genetic search for learning attributes based concepts. Machine Learning: Proceedings of the 1993 European Conference (ECML ‘83) - Lecture Notes in Artificial Intelligence 667, 280–296. Springer, 1993.Google Scholar
  47. [Walter and Mohan 2000]
    D. Walter and C.K. Mohan. ClaDia: a fuzzy classifier system for disease diagnosis. Proceedings of the 2000 Congress on Evolutionary Computation (CEC ‘2000). IEEE, 2000.Google Scholar
  48. [Weiss 1999]
    G.M. Weiss. Timeweaver: a genetic algorithm for identifying predictive patterns in sequences of events. Proceedings of the Genetic and Evolutionary Computation Conference (GECCO ‘89), 718–725. Morgan Kaufmann, 1999.Google Scholar
  49. [Weiss and Kulikowski 1991]
    S.M. Weiss and C.A. Kulikowski. Computer Systems that Learn. Morgan Kaufmann, San Mateo, 1991.Google Scholar
  50. [Williams 1999]
    G.J. Williams. Evolutionary hot spots data mining: an architecture for exploring for interesting discoveries. Proceedings of the 3rd Asia-Pacific Conference on Knowledge Discovery and Data Mining (PAKDD ‘89), 184–193. Springer, 1999.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Alex A. Freitas
    • 1
  1. 1.Computing LaboratoryUniversity of KentCanterburyUK

Personalised recommendations