Knowledge discovery in databases (KDD) is increasingly being accepted as a viable tool for collecting, analyzing, and making decision from massive data sets. Though many sophisticated techniques are developed by various interdisciplinary fields, only few of them are well equipped to handle multi-criteria issues of KDD. It seems to provide a new frontier of research directions. The KDD issues like feature selection, instance selection, rule mining and clustering involves simultaneous optimization of several (possibly conflicting) objectives. Further, considering a single criterion, as with the existing soft computing techniques like evolutionary algorithms (EA), neural network (NN), and particle swarm optimization (PSO) are not up to the mark. Therefore, the KDD issues have attracted considerable attention of the well established multi-objective genetic algorithms to optimize the identifiable objectives in KDD.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Deb K (2001) Multi-objective optimisation using evolutionary algorithms. Wiley, New York
Goldberg D E (1989) Genetic algorithms in search, optimisation and machine learning. Addison-Wesley
Rechenberg I (1973) Evolutionsstrategie: optimierung technischer systeme, nach prinzipien der biologischen evolution. Frammann-Holzboog Verlag, Stuttgart
Durham W (1994) Co-evolution:genes, culture, and human diversity. Stanford University Press
Koza J R (1992) Genetic programming: on the programming of computers by means of natural selection. MIT Press, Cambridge, MA
Back T (1996) Evolutionary algorithms in theory and practice. Oxford University Press, New York
Freitas A A (2002) Data mining and knowledge discovery with evolutionary algorithms. Springer-Verlag, New York
Cover J M, Thomas J A (1991) Elements of information theory. John Wiley
Liu H, Motoda H (1998) Feature selection for knowledge discovery and data mining. Kluwer Academic Publishers
Quinlan J R (1993) C4.5: programs for machine learning. Morgan Kaufmann, San Mateo, CA
Fukunaga K (1972) Introduction to statistical pattern recognition. Academic Press, New York
Jain A K, Dubes R C (1988) Algorithm for clustering data. Pretince Hall, Englewood Cliffs, New Jersey
Adamo J M (2001) Data mining for association rules and sequential patterns. Springer-Verlag, New York
Fogel D B (2000) Evolutionary computation: Toward a new philosophy of machine intelligence. IEEE Press, Piscataway, New Jersey
Fogel D B (1998) Evolutionary computation: The fossil record. IEEE Press, Piscataway, New Jersey
Fayyad U M, Piatetsky-Shapiro G, Smyth P (1996) From data mining to knowledge discovery: an Overview. In: Fayyad U M, Piatetsky-Shapiro G, Smyth P, Uthurusamy R (eds) Advances in Knowledge Discovery and Data Mining. MIT Press, MA 1–34
Brachman R J, Anand T (1996) The process of knowledge discovery in databases: a human centred approach. In: Fayyad U M, Piatetsky-Shapiro G, Smyth P, Uthurusamy R (eds) Advances in Knowledge Discovery and Data Mining. MIT Press, MA 37–57
Frawley W J, Piatetsky-Shapiro G, and Matheus C J (1991) Knowledge discovery in databases: an overview. In: Piatetsky-Shapiro G, Frawley B (eds) Knowledge Discovery in Databases. MIT Press, MA 1–27
Reeves C R, Bush D R (2001) Using genetic algorithms for training data selection in RBF networks. In: Liu H, Motoda H (eds) Instance Selection and Construction for Data Mining, Norwell, MA: Kluwer 339–356
Cano J R, Herrera F, Lozano M (2003) Using evolutionary algorithms as instance selection for data reduction in KDD: an experimental study. IEEE Transactions on Evolutionary Computation 7(6):561–575
Ghosh A, Nath B (2004) Multi-objective rule mining using genetic algorithms. Information Sciences 163:123–133
Babcock C (1994) Parallel processing mines retail data. Computer World, 6
Fashman K H, Cuticchia A J, Kingsbury D T (1994) The GDB (TM) human genome database. Anna. Nucl. Acid. R. 22(17):3462–3469
Weir N, Fayyad, U M, Djorgovski S G (1995) Automated star/galaxy classification for digitized POSS-II. Astron. Journal 109(6): 2401–2412
Weir N, Djorgovski S G, Fayyad U M (1995) Initial galaxy counts from digitized POSS-II. Astron. Journal 110(1):1–20
Back T, Schwefel H-P (1993) An overview of evolutionary algorithms for parameter optimisation. Evolutionary Computation 1(1):1–23
Ghosh A, Dehuri S (2004) Evolutionary algorithms for multi-criterion optimization: a survey. International Journal on Computing and Information Science 2(1):38–57
Laumanns M, Rudolph G, Schwefel H-P (1998) A spatial predator-prey approach to multi-objective optimization. Parallel Problem Solving from Nature 5:241–249
Deb K, Pratap A, Agrawal S, Meyarivan T (2002) A Fast and elitist multi-objective genetic algorithm: NSGA-II. IEEE Transactions on Evolutionary Computation 6:182–197
Ziztler E, Thiele L (1999) Multi-objective evolutionary algorithms: a comparative case study and strength pareto approach. IEEE Transactions on Evolutionary Computation 3:257–271
Reinartz T (2002) A unifying view on instance selection. Data Mining and Knowledge Discovery 6:191–210
Kohavi R, John G H (1997) Wrappers for feature subset selection. Artificial Intelligence 97:273–324
Siedlecki W, Sklansky J (1989) A note on genetic algorithms for large scale feature selection. Pattern Recognition Letters 10:335–47
Jain A, Zongker D (1997) Feature selection: evaluation, application and small sample performance. IEEE Transactions on Pattern Analysis and Machine Intelligence 19:153–158
Lim T-S, Loh W-Y, Shih Y-S (2000) A Comparison of prediction accuracy complexity and training time of thirty-three old and new classification algorithms. Machine Learning Journal 40:203–228
Krovi R (1991) Genetic algorithm for clustering: a preliminary investigation. IEEE Press 504–544
Krishna K, Murty M N (1999) Genetic K-means algorithms. IEEE Transactions on Systems, Man and Cybernetics- Part-B 29:433–439
Sarafis I, Zalzala AMS, Trinder P W (2001) A genetic rule based data clustering toolkit
Dehuri S, Mall R (2006) Predictive and comprehensible rule discovery using a multi-objective genetic algorithm. Knowledge Based Systems 19(6):413–421
Dehuri S, Patnaik S, Ghosh A, Mall R (2007) Application of elitist multi-objective genetic algorithm for classification rule generation. Applied Soft Computing (in press)
Ishibuchi H, Nakashima T (2000) Multiobjective pattern and feature selection by a genetic algorithm. Proceedings of Genetic and Evolutionary Computation Conference 1069–1076
Fayyad U M, Uthurusamy, R (1995) (eds) Proceedings 1st Internationl Conference Knowledge Discovery and Data Mining (KDD-95). AAAI Press, Montreal, Canada
Simoudis E, Han J, Fayyad, U M (1996) Proceedings 2nd International Conference Knowledge Discovery & Data Mining, Portland, Oregon
Holland J H (1990) ECHO: Explorations of evolution in a miniature world. In Famer J D, Doyne J (eds) Proceedings of the Second Conference on Artificial Life, Addison Wesley
Noda E, Freitas A A, Lopes H S (1999) Discovering interesting prediction rules with a genetic algorithm. Proceeding Conference on Evolutionary Computation (CEC-99), Washington D.C., USA 1322–1329
Emmanouilidis C, Hunter A, MacIntyre J (2000) A multiobjective evolutionary setting for feature selection and a commonality based crossover operator. Proceedings CEC-2000 La Jolla CA, USA 309–316
Dehuri S, Mall R (2004) Mining predictive and comprehensible classification rules using multi-objective genetic algorithm. Proceeding of ADCOM, India
Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. Proceedings of the 20th International Conference on Very Large Databases
Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. Proceedings of ACM SIGMOD Conference on Management of Data 207–216
Jie L, Xinbo G, Li-Chang J (2003) A GA based clustering algorithm for large datasets with mixed numeric and categorical values. Proceedings of the 5th International Conference on Computational Intelligence and Multi. Application
Schewefel H -P (1975) Evolutionsstrategie and numerische optimierung. Ph. D. Thesis, Technical University, Berlin
Handl J, Knowles J (2004) Multi-objective clustering with automatic determination of the number of clusters. Tech. Report TR-COMPSYSBIO-2004-02 UMIST, Manchester
Ayad A M (2000) A new algorithm for incremental mining of constrained association rules. Master Thesis, Department of Computer Sciences and Automatic Control, Alexandria University
Schaffer J D (1984) Multiple objective optimization with vector evaluated genetic algorithms. PhD Thesis, Vanderbilt University
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Dehuri, S., Ghosh, S., Ghosh, A. (2008). Genetic Algorithm for Optimization of Multiple Objectives in Knowledge Discovery from Large Databases. In: Ghosh, A., Dehuri, S., Ghosh, S. (eds) Multi-Objective Evolutionary Algorithms for Knowledge Discovery from Databases. Studies in Computational Intelligence, vol 98. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77467-9_1
Download citation
DOI: https://doi.org/10.1007/978-3-540-77467-9_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77466-2
Online ISBN: 978-3-540-77467-9
eBook Packages: EngineeringEngineering (R0)