Abstract
The crowd can be an incredible source of information. In particular, this is true for reviews about products of any kind, freely provided by customers through specialized web sites. In other words, they are social knowledge, that can be exploited by other customers.
The Hints From the Crowd (HFC) prototype, presented in this paper, is a NoSQL database system for large collections of product reviews; the database is queried by expressing a natural language sentence; the result is a list of products ranked based on the relevance of reviews w.r.t. the natural language sentence. The best ranked products in the result list can be seen as the best hints for the user based on crowd opinions (the reviews).
In this paper, we mainly describe the query engine, and we show that our prototype obtains good performance in terms of execution time, demonstrating that our approach is feasible. The IMDb dataset, that includes more than 2 million reviews for more than 100,000 movies, is used to evaluate performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cattell, R.: Scalable sql and nosql data stores. SIGMOD Record 39(4), 12–27 (2011)
Chaovalit, P., Zhou, L.: Movie review mining: a comparison between supervised and unsupervised classification approaches. In: HICSS. IEEE Computer Society (2005)
Dave, K., Lawrence, S., Pennock, D.M.: Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In: WWW, pp. 519–528 (2003)
Fosci, P., Psaila, G.: Toward a product search engine based on user reviews. In: Int. Conf. on Data Technologies and Applications, DATA 2012, Rome (Italy) (July 2012)
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Kim, W., Kohavi, R., Gehrke, J., DuMouchel, W. (eds.) KDD, pp. 168–177. ACM (2004)
Hu, M., Liu, B.: Mining opinion features in customer reviews. In: McGuinness, D.L., Ferguson, G. (eds.) AAAI, pp. 755–760. AAAI Press / The MIT Press (2004)
Lee, D., Jeong, O.-R., Goo Lee, S.: Opinion mining of customer feedback data on the web. In: Kim, W., Choi, H.-J. (eds.) ICUIMC, pp. 230–235. ACM (2008)
Liu, B., Hu, M., Cheng, J.: Opinion observer: analyzing and comparing opinions on the web. In: Ellis, A., Hagino, T. (eds.) WWW, pp. 342–351. ACM (2005)
Ly, D.K., Sugiyama, K., Lin, Z., Kan, M.-Y.: Product review summarization based on facet identification and sentence clustering. CoRR, abs/1110.1428 (2011)
Resnick, P., Varian, H.: Recommender systems. Communications of the ACM 40(3), 56–58 (1997)
Robin, H., Jablonski, S.: Nosql evaluation: A use case oriented survey. In: International Conference on Cloud and Service Computing, CSC 2011, Hong Kong, China, pp. 336–341 (December 2011)
Sandvig, J.J., Mobasher, B., Burke, R.D.: Robustness of collaborative recommendation based on association rule mining. In: Konstan, J.A., Riedl, J., Smyth, B. (eds.) RecSys, pp. 105–112. ACM (2007)
Strauch, C.: Nosql databases (2011), http://www.christof-strauch.de/nosqldbs.pdf
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fosci, P., Psaila, G., Di Stefano, M. (2013). Hints from the Crowd: A Novel NoSQL Database. In: Cuzzocrea, A., Maabout, S. (eds) Model and Data Engineering. MEDI 2013. Lecture Notes in Computer Science, vol 8216. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41366-7_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-41366-7_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41365-0
Online ISBN: 978-3-642-41366-7
eBook Packages: Computer ScienceComputer Science (R0)