Abstract
An automatic news tracking and analysis system that records world events over long time periods is described. The system makes it possible to track country-specific news and the activities of individual persons and groups, to derive trends, and to provide data for further analysis and research. The data source is the Europe Media Monitor (EMM), which monitors news from around the world in real time via the Internet and from various news agencies. EMM’s main purpose is to provide rapid feedback on press coverage and breaking news to European policy makers. Increasingly, however, it is being used for security applications and for foreign policy monitoring. This paper describes how language technologies and clustering techniques have been applied to the 30,000 daily news reports to derive the top stories in each of 13 languages, to locate events geospatially, and to extract and record the entities involved. Related stories have been linked across time and across languages, allowing national comparisons and the derivation of name variants. Results and future plans are described.
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Best, C. et al. (2006). Towards Automatic Event Tracking. In: Mehrotra, S., Zeng, D.D., Chen, H., Thuraisingham, B., Wang, FY. (eds) Intelligence and Security Informatics. ISI 2006. Lecture Notes in Computer Science, vol 3975. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11760146_3
DOI: https://doi.org/10.1007/11760146_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34478-0
Online ISBN: 978-3-540-34479-7
eBook Packages: Computer Science (R0)