Mining on Terms Extraction from Web News

Hsu, Li-Fu

doi:10.1007/978-3-642-16693-8_21

Li-Fu Hsu²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6421))

Included in the following conference series:

International Conference on Computational Collective Intelligence

1191 Accesses
2 Citations

Abstract

Thousand of news stories are reported each day. How to extract the useful information from the large web news is the important technology today. However, information technology advances have partially automated to processing documents, reducing the amount of text which must be read. In this paper we present a Web News Search System, called WNSS. WNSS can discover automatically phrase extraction from large corpora of web news stories. In addition, we give concrete examples of how to preprocess texts based on the intended use of the discovered results. We also evaluate the extracted phrases can be used for important tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Feldman, R., Kloesgen, W., Zilberstein, A.: Document explorer: Discovering knowledge in document collections. In: Raś, Z.W., Skowron, A. (eds.) ISMIS 1997. Lecture Notes in Computer Science, LNAI, vol. 1325, pp. 137–146. Springer, Heidelberg (1997)
Chapter Google Scholar
Brown, C.M., Danzig, B.B., Hardy, D., Manber, U., Schwartz, M.F.: The harvest information discovery and access system. In: Proc. 2^nd International World Wide Web Conference (1994)
Google Scholar
Konopnicki, D., Shmueli, O.: W3QS: A query system for the World Wide Web. In: Proc. of the 21th VLDB Conference, pp. 54–65 (1995)
Google Scholar
Feldman, R., Dagan, I.: Kdt -knowledge discovery in texts. In: Proc. of the First Int. Conf. on Knowledge Discovery (KDD), pp. 112–117 (1995)
Google Scholar
Nahm, U., Mooney, R.: Text mining with information extraction. In: i Proceedings of the AAAI 2002 Spring Symposium on Mining Answers from Texts and Knowledge Bases (2002)
Google Scholar
Gaizauskas, R.: An information extraction perspective on text mining: Tasks, technologies and prototype applications (2003), http://www.itri.bton.ac.uk/projects/euromap/TextMiningEvent/Rob_Gaizauskas.pdf
Crispdm and CRISP,: Cross industry standard process for data mining (1999), http://www.crisp-dm.org/
Hearst, M.: Untangling text data mining. In: Proc. of ACL 1999 the 37th Annual Meeting of the Association for Computational Linguistics (1999)
Google Scholar
Kodratoff, Y.: Knowledge discovery in texts: A definition and applications. In: Raś, Z.W., Skowron, A. (eds.) ISMIS 1999. LNCS, vol. 1609, pp. 16–29. Springer, Heidelberg (1999)
Chapter Google Scholar
Hidalgo, J.: Tutorial on text mining and internet content filtering. Tutorial Notes Online (2002), http://ecmlpkdd.cs.helsinki.fi/pdf/hidalgo.pdf
Daille, B., Gaussier, E., Lange, J.M.: Towards Automatic Extraction of Monolingual and Bilingual Terminology. In: Proceedings of International Conference on Computational Linguistics, COLING, pp. 515–521 (1994)
Google Scholar
Justeson, J.S., Katz, S.M.: Technical Terminology: Some linguistic properties and an algorithm for identification in text. Natural Language Engineering 1(1), 9–27 (1995)
Article Google Scholar
Frantzi, T.K.: Incorporating Context Information for the Extraction of Terms. In: Proceedings of ACLEACL 1997 (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Management, Hwh Hsia Institute of Technology, 235, Taipei, Taiwan
Li-Fu Hsu

Authors

Li-Fu Hsu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electronic Engineering, National Kaohsiung University of Applied Sciences, 415 Chien-Kung Road, 807, Kaohsiung, Taiwan
Jeng-Shyang Pan
Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, #43, Sec.4, Keelung Rd., 106, Taipei, Taiwan
Shyi-Ming Chen
Institute of Informatics, Wroclaw University of Technology, Str. Wyb. Wyspianskiego 27, 50-370, Wroclaw, Poland
Ngoc Thanh Nguyen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hsu, LF. (2010). Mining on Terms Extraction from Web News. In: Pan, JS., Chen, SM., Nguyen, N.T. (eds) Computational Collective Intelligence. Technologies and Applications. ICCCI 2010. Lecture Notes in Computer Science(), vol 6421. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16693-8_21

Download citation

DOI: https://doi.org/10.1007/978-3-642-16693-8_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16692-1
Online ISBN: 978-3-642-16693-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics