NowOnWeb: News Search and Summarization
Providing agile access to the huge amount of information published by the thousands of news sites available online calls for the application of Information Retrieval techniques. This paper presents NowOnWeb, a news retrieval system that collects articles from different online sources and provides news searching and browsing. The main problems addressed during the development of NowOnWeb were article recognition and extraction, redundancy detection, and text summarization. For each of these we provide effective solutions which, put together, give rise to a system that reasonably satisfies the daily information needs of the user.
Keywords: User Query, Vector Space Model, Summary Generation, Text Summarization, Tree Edit Distance
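As an illustration of the redundancy detection problem named above, the following is a minimal sketch of one common approach: representing each article as a term-frequency vector in a vector space model and discarding articles whose cosine similarity to an already kept article exceeds a threshold. This is not necessarily the method used in NowOnWeb; the function names (`term_vector`, `cosine_similarity`, `drop_redundant`), the tokenization, and the threshold of 0.8 are all assumptions made for the example.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Bag-of-words term-frequency vector (lowercased alphanumeric tokens)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as similar to nothing
    return dot / (norm_a * norm_b)


def drop_redundant(articles, threshold=0.8):
    """Keep an article only if it is below the (assumed) similarity
    threshold against every article kept so far."""
    kept = []
    kept_vectors = []
    for text in articles:
        vec = term_vector(text)
        if all(cosine_similarity(vec, kv) < threshold for kv in kept_vectors):
            kept.append(text)
            kept_vectors.append(vec)
    return kept


if __name__ == "__main__":
    articles = [
        "The council approved the new budget on Monday",
        "On Monday the council approved the new budget",
        "Heavy rain flooded the river valley overnight",
    ]
    for article in drop_redundant(articles):
        print(article)
```

A real system would likely weight terms (for example with TF-IDF) and remove stop words before comparing articles; raw term frequencies are used here only to keep the sketch short.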