Text Mining of Internet Content: The Bridge Connecting Product Research with Customers in the Digital Era

Shivashankar, S.; Ravindran, B.; Raghavan, N. R. Srinivasa

doi:10.1007/978-90-481-2860-0_12

Abstract

Primary and secondary market research usually deal with the analysis of available data on existing products and of customers’ preferences for features in possible new products. This analysis helps a manufacturer identify nuggets of opportunity in defining and positioning new products in global markets. Because the number of Internet users and the quantity of textual data available on the Internet are increasing exponentially, the Internet is probably the largest data repository, and one that manufacturers cannot ignore if they want to better understand customers’ opinions about products. This underscores the importance of web mining for locating and processing relevant information from the billions of documents available online. The unstructured and dynamic nature of online documents adds further challenges to web mining. This paper focuses on the application of web content analysis, a type of web mining, to business intelligence for product reviews. We provide an overview of the techniques used to solve the problem and the challenges involved.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Indian Institute of Technology, Chennai, 600036, India
S. Shivashankar & B. Ravindran
General Motors, R&D, India Science Lab, Bangalore, 560066, India
N. R. Srinivasa Raghavan

Corresponding author

Correspondence to B. Ravindran.

Editor information

Editors and Affiliations

General Motors India Pvt. Ltd, Whitefield Rd., Bangalore, 560066, India
N. R. Srinivasa Raghavan
Manufacturing Systems Research Lab., General Motors R & D Center, Mound Road 30500, Warren, 48090, U.S.A.
John A. Cafeo

About this paper

Cite this paper

Shivashankar, S., Ravindran, B., Raghavan, N.R.S. (2009). Text Mining of Internet Content: The Bridge Connecting Product Research with Customers in the Digital Era. In: Raghavan, N., Cafeo, J. (eds) Product Research. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-2860-0_12

DOI: https://doi.org/10.1007/978-90-481-2860-0_12
Published: 23 September 2009
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-2859-4
Online ISBN: 978-90-481-2860-0
eBook Packages: Engineering, Engineering (R0)
