Skip to main content

Text Mining of Internet Content: The Bridge Connecting Product Research with Customers in the Digital Era

  • Conference paper
  • First Online:
Product Research

Abstract

Primary and secondary market research usually deal with analysis of available data on existing products and customers’ preferences for features in possible new products. This analysis helps a manufacturer to identify nuggets of opportunity in defining and positioning of new products in global markets. Considering the fact that the number of Internet users and quantum of textual data available on the Internet are increasing exponentially, we can say that Internet is probably the largest data repository that manufacturer’s cannot ignore, in order to better understand customers’ opinions about products. This emphasizes the importance of web mining to locate and process relevant information from billions of documents available online. Its nature of being unstructured and dynamic, an online document adds more challenges to web mining. This paper focuses on application of web content analysis, a type of web mining in business intelligence for product review. We provide an overview of techniques used to solve the problem and challenges involved in the same.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Brin, S. and Page, L, “The Anatomy of a Large-Scale Hypertextual Web Search Engine”, 1998.

    Google Scholar 

  • Cooley, R. Mobasher, B. and Srivastave, J., “Web Mining: Information and Pattern Discovery on the World Wide Web”, In Proceedings of the 9th IEEE International Conference on Tool with Artificial Intelligence, 1997.

    Google Scholar 

  • Jiawei Han and Micheline Kamber, “Data Mining: Concepts and Techniques”, The Morgan Kaufmann Series in Data Management Systems, 2000.

    Google Scholar 

  • http://www.blogpulse.com/

  • http://en.wikipedia.org/

  • http://wordnet.princeton.edu/

  • http://www.nielsen-online.com/products.jsp?section = pro_buzz

  • http://nlp.stanford.edu/ner/index.shtml

  • http://alias-i.com/lingpipe/

  • Minqing Hu and Bing Liu, “Mining and summarizing customer reviews”, Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD-2004, full paper), Seattle, Washington, USA, Aug 22–25, 2004.

    Google Scholar 

  • Nitin Jindal and Bing Liu, “Identifying Comparative Sentences in Text Documents”, Proceedings of the 29th Annual International ACM SIGIR Conference on Research & Development on Information Retrieval (SIGIR-06), Seattle 2006a.

    Google Scholar 

  • Nitin Jindal and Bing Liu, “Mining Comprative Sentences and Relations”, Proceedings of 21st National Conference on Artificial Intelligence (AAAI-2006), July 16.20, 2006b, Boston, Massachusetts, USA.

    Google Scholar 

  • Jure Leskovec and Eric Horvitz, “Worldwide Buzz: Planetary-Scale Views on an Instant-Messaging Network”, Microsoft Research Technical Report MSR-TR-2006–186, June 2007.

    Google Scholar 

  • Bing Liu, “Web Data Mining Exploring Hyperlinks, Contents and Usage Data”, Springer, December 2006.

    Google Scholar 

  • S.K. Madria, S.S. Rhowmich, W.K. Ng, and F.P. Lim, “Research issues in Web data mining”, In Proceedings of Data Warehousing and Knowledge Discovery, 1999.

    Google Scholar 

  • Chris Manning and Hinrich Schutze, “Foundations of Statistical Natural Language Processing”, MIT Press, Cambridge, MA: May 1999.

    Google Scholar 

  • David Nadeau and Satoshi Sekine, “A survey of named entity recognition and classification”, Journal of Linguisticae Investigationes, 2007.

    Google Scholar 

  • Whitelaw, C., Garg, N., and Argamon, S, “Using appraisal groups for sentiment analysis”, In Proceedings of the 14th ACM international Conference on information and Knowledge Management (Bremen, Germany, October 31 – November 05, 2005). CIKM ‘05. ACM, New York, NY, 625–631. DOI = http://doi.acm.org/10.1145/1099554.1099714.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to B. Ravindran .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer Science+Business Media B.V.

About this paper

Cite this paper

Shivashankar, S., Ravindran, B., Raghavan, N.R.S. (2009). Text Mining of Internet Content: The Bridge Connecting Product Research with Customers in the Digital Era. In: Raghavan, N., Cafeo, J. (eds) Product Research. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-2860-0_12

Download citation

  • DOI: https://doi.org/10.1007/978-90-481-2860-0_12

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-90-481-2859-4

  • Online ISBN: 978-90-481-2860-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics