Abstract
This paper proposes a Dirichlet Process Mixture Model (DPMM) considering relevant topical information for adaptive topic tracking. The method has two characters: 1) It uses DPMM to implement topic tracking. Prior knowledge of known topics is combined in Gibbs sampling for model inference, and correlation between a story and each known topics can be estimated. 2) To alleviate topic excursion problem and topic deviation problem brought by existing adaptive tracking methods, the paper presents a new adaptive learning mechanism, the basic idea of which is to introduce tracking feedback with a reliability metric into the topic tracking procedure and make tracking feedback influence tracing computation under the condition of the reliability metric. The empirical results on TDT3 evaluation data show that the model, without a large scale of in-domain data, can solve topic excursion problem of topic tracking task and topic deviation problem brought by existing adaptive learning mechanisms significantly even with a few on-topic stories.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Allan, J., Carbonell, J., Doddington, G., et al.: Topic detection and tracking pilot study: final report. In: Proceedings of DARPA BNTU Workshop, pp. 194–218. DARPA, Lansdowne (1998)
Makkonen, J., Anonen-Myka, H., Salmenkivi, M.: Simple semantics in topic detection and tracking. Information Retrieval 7(3/4), 347–368 (2004)
Chen, F., Farahat, A., Brants, T.: Multiple similarity measures and source-pair information in story link detection. In: HLT-NAACL, Boston, pp. 313–320 (2004)
Yamron, J.P., Knecht, S., van Mulbregt, P.: Dragon’s Tracking and Detection Systems for the TDT 2000 Evaluation. In: The Topic Detection and Tracking Workshop (2000)
Lo, Y., Gauvain, J.: The limsi topic tracking system for TDT 2001. In: The TDT Workshop. DARPA, Gaithersburg (2001)
Spitters, M., Kraaij, W.: Using language models for tracking events of interest over time. In: Proceedings of LMIR 2001, Pittsburgh, pp. 60–65 (2001)
Qiu, J., Liao, L.: Add Temporal Information to Dependency Structure Language Model for Topic Detection and Tracking. In: Proceedings of the International Conference on Machine Learning and Cybernetics, pp. 1575–1580. IEEE Press, Kunming (2008)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. Journal of Machine Learning Research 3(5), 993–1022 (2003)
Hong, Y., Zhang, Y., Liu, T., et al.: Topic detection and tracking review. Journal of Chinese Information Processing 21(6), 71–87 (2007)
Ferguson, T.S.: A Bayesian analysis of some nonparametric problems. Annals of Statistics 1(2), 209–230 (1973)
Neal, R.M.: Markov chain sampling methods for dirichlet process mixture models. Journal of Computational and Graphical Statistics 9(2), 249–265 (2000)
Luo, W., Liu, Q.: Development and Analysis of Technology of Topic Detection and Tracking. In: JSCL, Beijing, pp. 560–566 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, C., Wang, X., Yuan, C. (2012). Adaptive Topic Tracking Based on Dirichlet Process Mixture Model. In: Zhou, M., Zhou, G., Zhao, D., Liu, Q., Zou, L. (eds) Natural Language Processing and Chinese Computing. NLPCC 2012. Communications in Computer and Information Science, vol 333. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34456-5_22
Download citation
DOI: https://doi.org/10.1007/978-3-642-34456-5_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34455-8
Online ISBN: 978-3-642-34456-5
eBook Packages: Computer ScienceComputer Science (R0)