Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6421))

Included in the following conference series:

Abstract

Thousand of news stories are reported each day. How to extract the useful information from the large web news is the important technology today. However, information technology advances have partially automated to processing documents, reducing the amount of text which must be read. In this paper we present a Web News Search System, called WNSS. WNSS can discover automatically phrase extraction from large corpora of web news stories. In addition, we give concrete examples of how to preprocess texts based on the intended use of the discovered results. We also evaluate the extracted phrases can be used for important tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Feldman, R., Kloesgen, W., Zilberstein, A.: Document explorer: Discovering knowledge in document collections. In: Raś, Z.W., Skowron, A. (eds.) ISMIS 1997. Lecture Notes in Computer Science, LNAI, vol. 1325, pp. 137–146. Springer, Heidelberg (1997)

    Chapter  Google Scholar 

  2. Brown, C.M., Danzig, B.B., Hardy, D., Manber, U., Schwartz, M.F.: The harvest information discovery and access system. In: Proc. 2nd International World Wide Web Conference (1994)

    Google Scholar 

  3. Konopnicki, D., Shmueli, O.: W3QS: A query system for the World Wide Web. In: Proc. of the 21th VLDB Conference, pp. 54–65 (1995)

    Google Scholar 

  4. Feldman, R., Dagan, I.: Kdt -knowledge discovery in texts. In: Proc. of the First Int. Conf. on Knowledge Discovery (KDD), pp. 112–117 (1995)

    Google Scholar 

  5. Nahm, U., Mooney, R.: Text mining with information extraction. In: i Proceedings of the AAAI 2002 Spring Symposium on Mining Answers from Texts and Knowledge Bases (2002)

    Google Scholar 

  6. Gaizauskas, R.: An information extraction perspective on text mining: Tasks, technologies and prototype applications (2003), http://www.itri.bton.ac.uk/projects/euromap/TextMiningEvent/Rob_Gaizauskas.pdf

  7. Crispdm and CRISP,: Cross industry standard process for data mining (1999), http://www.crisp-dm.org/

  8. Hearst, M.: Untangling text data mining. In: Proc. of ACL 1999 the 37th Annual Meeting of the Association for Computational Linguistics (1999)

    Google Scholar 

  9. Kodratoff, Y.: Knowledge discovery in texts: A definition and applications. In: Raś, Z.W., Skowron, A. (eds.) ISMIS 1999. LNCS, vol. 1609, pp. 16–29. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  10. Hidalgo, J.: Tutorial on text mining and internet content filtering. Tutorial Notes Online (2002), http://ecmlpkdd.cs.helsinki.fi/pdf/hidalgo.pdf

  11. Daille, B., Gaussier, E., Lange, J.M.: Towards Automatic Extraction of Monolingual and Bilingual Terminology. In: Proceedings of International Conference on Computational Linguistics, COLING, pp. 515–521 (1994)

    Google Scholar 

  12. Justeson, J.S., Katz, S.M.: Technical Terminology: Some linguistic properties and an algorithm for identification in text. Natural Language Engineering 1(1), 9–27 (1995)

    Article  Google Scholar 

  13. Frantzi, T.K.: Incorporating Context Information for the Extraction of Terms. In: Proceedings of ACLEACL 1997 (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hsu, LF. (2010). Mining on Terms Extraction from Web News. In: Pan, JS., Chen, SM., Nguyen, N.T. (eds) Computational Collective Intelligence. Technologies and Applications. ICCCI 2010. Lecture Notes in Computer Science(), vol 6421. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16693-8_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-16693-8_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16692-1

  • Online ISBN: 978-3-642-16693-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics