Abstract
The paper presents results of empirical evaluation of a Bayesian multinet classifier based on a new method of learning very large tree-like Bayesian networks. The study suggests that tree-like Bayesian networks are able to handle a classification task in one hundred thousand variables with sufficient speed and accuracy.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
L. D. Baker, A. K. McCallum: Distributional clustering of words for text classification, Proc. SIGIR-98, ACM Press, New York, US, pp. 96–103, 1998
C.K. Chow, C.N. Liu: Approximating discrete probability distributions with dependence trees, IEEE Trans. on Information Theory, Vol. IT-14, No. 3, (1968), 462–467
T. Joachims: A probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization, Proc. ICML-97, Morgan Kaufmann, 143–151, 1997
M.A. Kłopotek: A New Bayesian Tree Learning Method with Reduced Time and Space Complexity. Fundamenta Informaticae, 49(4)2002, IOS Press, pp. 349–367.
M.A. Kłopotek: Minig Bayesian Networks Structure for Large Sets of Variables. In Foundations of Intelligent Systems. LNAI 2366, Springer-Verlag, pp. 114–122
M.A. Kłopotek On the Distance Hypothesis in Tree-like Bayesian Networks. ICS PAS Report 952, Warszawa, January 2003.
A. McCallum, K. Nigam: A comparison of event models for Naive Bayes text classification. In AAAI-98 Workshop on Learning for Text Categorization, 1998.
A.K. McCallum, K. Nigam: Employing EM in pool-based active learning for text classification, In: Proc.of ICML-98, Morgan Kaufmann, San Francisco, USA, pp. 350–358, 1998
A.K. McCallum, R. Rosenfeld, T.M. Mitchell, A.Y. Ng: Improving text classification by shrinkage in a hierarchy of classes, In: Proc.of ICML-98,, Morgan Kaufmann, San Francisco, USA, pp. 359–367, 1998
Y. Yang: An Evaluation of Statistical Approaches to Text Categorization, J. Information Retrieval, vol, no 1, pp. 69–90, 1999
Y. Yang, X. Liu: A re-examination of text categorization methods, in: 22nd Annual International SIGIR, pp. 42–49, Berkley, 1999
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kłopotek, M.A., Woch, M. (2003). Very Large Bayesian Networks in Text Classification. In: Sloot, P.M.A., Abramson, D., Bogdanov, A.V., Dongarra, J.J., Zomaya, A.Y., Gorbachev, Y.E. (eds) Computational Science — ICCS 2003. ICCS 2003. Lecture Notes in Computer Science, vol 2657. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44860-8_41
Download citation
DOI: https://doi.org/10.1007/3-540-44860-8_41
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40194-0
Online ISBN: 978-3-540-44860-0
eBook Packages: Springer Book Archive