Abstract
Thousand of news stories are reported each day. How to extract the useful information from the large web news is the important technology today. However, information technology advances have partially automated to processing documents, reducing the amount of text which must be read. In this paper we present a Web News Search System, called WNSS. WNSS can discover automatically phrase extraction from large corpora of web news stories. In addition, we give concrete examples of how to preprocess texts based on the intended use of the discovered results. We also evaluate the extracted phrases can be used for important tasks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Feldman, R., Kloesgen, W., Zilberstein, A.: Document explorer: Discovering knowledge in document collections. In: Raś, Z.W., Skowron, A. (eds.) ISMIS 1997. Lecture Notes in Computer Science, LNAI, vol. 1325, pp. 137–146. Springer, Heidelberg (1997)
Brown, C.M., Danzig, B.B., Hardy, D., Manber, U., Schwartz, M.F.: The harvest information discovery and access system. In: Proc. 2nd International World Wide Web Conference (1994)
Konopnicki, D., Shmueli, O.: W3QS: A query system for the World Wide Web. In: Proc. of the 21th VLDB Conference, pp. 54–65 (1995)
Feldman, R., Dagan, I.: Kdt -knowledge discovery in texts. In: Proc. of the First Int. Conf. on Knowledge Discovery (KDD), pp. 112–117 (1995)
Nahm, U., Mooney, R.: Text mining with information extraction. In: i Proceedings of the AAAI 2002 Spring Symposium on Mining Answers from Texts and Knowledge Bases (2002)
Gaizauskas, R.: An information extraction perspective on text mining: Tasks, technologies and prototype applications (2003), http://www.itri.bton.ac.uk/projects/euromap/TextMiningEvent/Rob_Gaizauskas.pdf
Crispdm and CRISP,: Cross industry standard process for data mining (1999), http://www.crisp-dm.org/
Hearst, M.: Untangling text data mining. In: Proc. of ACL 1999 the 37th Annual Meeting of the Association for Computational Linguistics (1999)
Kodratoff, Y.: Knowledge discovery in texts: A definition and applications. In: Raś, Z.W., Skowron, A. (eds.) ISMIS 1999. LNCS, vol. 1609, pp. 16–29. Springer, Heidelberg (1999)
Hidalgo, J.: Tutorial on text mining and internet content filtering. Tutorial Notes Online (2002), http://ecmlpkdd.cs.helsinki.fi/pdf/hidalgo.pdf
Daille, B., Gaussier, E., Lange, J.M.: Towards Automatic Extraction of Monolingual and Bilingual Terminology. In: Proceedings of International Conference on Computational Linguistics, COLING, pp. 515–521 (1994)
Justeson, J.S., Katz, S.M.: Technical Terminology: Some linguistic properties and an algorithm for identification in text. Natural Language Engineering 1(1), 9–27 (1995)
Frantzi, T.K.: Incorporating Context Information for the Extraction of Terms. In: Proceedings of ACLEACL 1997 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hsu, LF. (2010). Mining on Terms Extraction from Web News. In: Pan, JS., Chen, SM., Nguyen, N.T. (eds) Computational Collective Intelligence. Technologies and Applications. ICCCI 2010. Lecture Notes in Computer Science(), vol 6421. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16693-8_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-16693-8_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16692-1
Online ISBN: 978-3-642-16693-8
eBook Packages: Computer ScienceComputer Science (R0)