Abstract
In the present study, an ant colony optimization (ACO) algorithm is adopted as part of a system that classifies documents written in Greek into thematic categories. The main purpose of the ACO module is to create a word map that assists in representing the documents in the pattern space. The proposed word map creation algorithm involves additional deterministic subroutines and aims to cluster thematically related words into groups. The performance of the proposed system is compared with that of an alternative implementation based on the established self-organizing map (SOM) neural network.
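To illustrate how a word map can help represent documents in a pattern space, the following is a minimal sketch, not the chapter's actual method. It assumes the word map is available as a plain dictionary from words to group ids and that a document is represented by the normalized frequency of each group. The names `WORD_MAP`, `NUM_GROUPS`, and `document_vector`, the hand-made English word groups, and the normalization are all hypothetical; in the chapter the groups are produced by the ACO-based algorithm for Greek text.

```python
from collections import Counter

# Hypothetical word map: each word is assigned to a group (cluster) id.
# These groups are hand-made for illustration; they are not the output
# of the ACO-based algorithm described in the chapter.
WORD_MAP = {
    "goal": 0, "match": 0, "team": 0,
    "bank": 1, "market": 1, "tax": 1,
    "election": 2, "parliament": 2,
}
NUM_GROUPS = 3


def document_vector(tokens, word_map=WORD_MAP, num_groups=NUM_GROUPS):
    """Return the normalized frequency of each word group in a tokenized document.

    Words missing from the word map are ignored (an assumption of this sketch).
    """
    counts = Counter(word_map[t] for t in tokens if t in word_map)
    total = sum(counts.values())
    if total == 0:
        # No known words: return an all-zero vector of the right length.
        return [0.0] * num_groups
    return [counts.get(g, 0) / total for g in range(num_groups)]


doc = ["team", "goal", "market", "match", "weather"]
print(document_vector(doc))  # [0.75, 0.25, 0.0]
```

Under these assumptions every document maps to a vector of fixed length `NUM_GROUPS`, regardless of its vocabulary, and such a vector could then be passed to any standard classifier.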
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Tsimboukakis, N., Tambouratzis, G. (2009). ACO Hybrid Algorithm for Document Classification System. In: Lim, C.P., Jain, L.C., Dehuri, S. (eds) Innovations in Swarm Intelligence. Studies in Computational Intelligence, vol 248. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04225-6_11
DOI: https://doi.org/10.1007/978-3-642-04225-6_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04224-9
Online ISBN: 978-3-642-04225-6
eBook Packages: Engineering, Engineering (R0)