An Immune Network for Contextual Text Data Clustering

Ciesielski, Krzysztof; Wierzchoń, Sławomir T.; Kłopotek, Mieczysław A.

doi:10.1007/11823940_33

Krzysztof Ciesielski¹⁸,
Sławomir T. Wierzchoń¹⁸ &
Mieczysław A. Kłopotek¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4163))

Included in the following conference series:

International Conference on Artificial Immune Systems

638 Accesses
8 Citations

Abstract

We present a novel approach to incremental document maps creation, which relies upon partition of a given collection of documents into a hierarchy of homogeneous groups of documents represented by different sets of terms. Further each group (defining in fact separate context) is explored by a modified version of the aiNet immune algorithm to extract its inner structure. The immune cells produced by the algorithm become reference vectors used in preparation of the final document map. Such an approach proves to be robust in terms of time and space requirements as well as the quality of the resulting clustering model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Baraldi, A., Blonda, P.: A survey of fuzzy clustering algorithms for pattern recognition. IEEE Trans. on Systems, Man and Cybernetics 29B, 786–801 (1999)
Google Scholar
Becks, A.: Visual Knowledge Management with Adaptable Document Maps. GMD research series 15 (2001) ISBN 3-88457-398-5
Google Scholar
Berry, M.W., Drmač, Z., Jessup, E.R.: Matrices, vector spaces and information retrieval. SIAM Review 41(2), 335–362
Google Scholar
Bezdek, J.C., Pal, S.K.: Fuzzy Models for Pattern Recognition: Methods that Search for Structures in Data. IEEE, New York (1992)
Google Scholar
Boulis, C., Ostendorf, M.: Combining multiple clustering systems. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS, vol. 3202, pp. 63–74. Springer, Heidelberg (2004)
Chapter Google Scholar
Ciesielski, K., et al.: Adaptive document maps. In: Proceedings of the Intelligent Information Processing and Web Mining (IIS:IIPWM 2006), Ustron (2006)
Google Scholar
de Castro, L.N., von Zuben, F.J.: An evolutionary immune network for data clustering. In: SBRN 2000. IEEE Computer Society Press, Los Alamitos (2000)
Google Scholar
de Castro, L.N., Timmis, J.: Artificial Immune Systems: A New Computational Intelligence Approach. Springer, Heidelberg (2002)
MATH Google Scholar
Fritzke, B.: Some competitive learning methods. Draft available from, http://www.neuroinformatik.ruhr-uni-bochum.de/ini/VDM/research/gsn/JavaPaper
Gilchrist, M.: Taxonomies for business: Description of a research project. In: 11 Nordic Conference on Information and Documentation, Reykjavik, Iceland, May 30 – June 1 (2001), http://www.bokis.is/iod2001/papers/Gilchrist_paper.doc
Hung, C., Wermter, S.: A constructive and hierarchical self-organising model in a non-stationary environment. In: Int. Joint Conference in Neural Networks (2005)
Google Scholar
Kłopotek, M., Dramiński, M., Ciesielski, K., Kujawiak, M., Wierzchoń, S.T.: Mining document maps. In: Gori, M., Celi, M., Nanni, M. (eds.) Proceedings of Statistical Approaches to Web Mining Workshop (SAWM) at PKDD 2004, Pisa, pp. 87–98 (2004)
Google Scholar
Kłopotek, M., Wierzchoń, S., Ciesielski, K., Dramiński, M., Czerski, D.: Conceptual Maps and Intelligent Navigation in Document Space (in Polish). Akademicka Oficyna Wydawnicza EXIT Publishing, Warszawa (to appear, 2006)
Google Scholar
Kohonen, T.: Self-Organizing Maps. Springer Series in Information Sciences, vol. 30. Springer, Heidelberg (2001)
MATH Google Scholar
Lagus, K., Kaski, S., Kohonen, T.: Mining massive document collections by the WEBSOM method Information Sciences, vol. 163(1-3), pp. 135–156 (2004)
Google Scholar
van Rijsbergen, C.J.: Information Retrieval. Butterworths, London (1979), http://www.dcs.gla.ac.uk/Keith/Preface.html
Google Scholar
Wilson, D.R., Martinez, T.R.: Reduction techniques for instance-based learning algorithms. Machine Learning 38, 257–286 (2000)
Article MATH Google Scholar
Zhang, T., Ramakrishan, R., Livny, M.: BIRCH: Efficient data clustering method for large databases. In: Proc. ACM SIGMOD Int. Conf. on Data Management (1997)
Google Scholar
Zhao, Y., Karypis, G.: Criterion functions for document clustering: Experiments and analysis, http://www-users.cs.umn.edu/~karypis/publications/ir.html

Download references

Author information

Authors and Affiliations

Institute of Computer Science, Polish Academy of Sciences, ul. Ordona 21, 01-237, Warszawa, Poland
Krzysztof Ciesielski, Sławomir T. Wierzchoń & Mieczysław A. Kłopotek

Authors

Krzysztof Ciesielski
View author publications
You can also search for this author in PubMed Google Scholar
Sławomir T. Wierzchoń
View author publications
You can also search for this author in PubMed Google Scholar
Mieczysław A. Kłopotek
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

IRIDIA, CoDE, Université Libre de Bruxelles, Brussels, Belgium
Hugues Bersini
Instituto Gulbenkian de Ciencia, Apartado 14, 2781-901, Oeiras, Portugal
Jorge Carneiro

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ciesielski, K., Wierzchoń, S.T., Kłopotek, M.A. (2006). An Immune Network for Contextual Text Data Clustering. In: Bersini, H., Carneiro, J. (eds) Artificial Immune Systems. ICARIS 2006. Lecture Notes in Computer Science, vol 4163. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11823940_33

Download citation

DOI: https://doi.org/10.1007/11823940_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37749-8
Online ISBN: 978-3-540-37751-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics