Abstract
Crowdsourced data enumeration, in which the Web crowd is requested to enumerate data items within a specified range, is important in many Web applications such as hotel reviews. This paper presents a processing method for crowdsourced data enumeration on microtask-based crowdsourcing platforms. A general approach to achieving a high recall in data enumeration is to apply the divide-and-conquer principle. However, how to apply the principle to data enumeration on microtask-based crowdsourcing platforms is not trivial. The proposed method is unique in that the workers join the process of generating smaller tasks in a divide-and-conquer fashion, and the programmer does not need to provide many microtasks in advance. This paper explains the method, provides theoretical results to show the method works well with microtask-based platforms, and explains our experimental results that suggest the proposed method can achieve higher recalls and produces appropriate tasks for microtask-based crowdsourcing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Franklin, M.J., Kossmann, D., Kraska, T., Ramesh, S., Xin, R.: CrowdDB: answering queries with crowdsourcing. In: SIGMOD 2011, pp. 61–72 (2011)
Jain, S., Parkes, D.C.: A Game-Theoretic Analysis of Games with a Purpose. In: Papadimitriou, C., Zhang, S. (eds.) WINE 2008. LNCS, vol. 5385, pp. 342–350. Springer, Heidelberg (2008)
Kulkarni, A.P., Can, M., Hartmann, B.: Collaboratively crowdsourcing workflows with turkomatic. In: CSCW 2012, pp. 1003–1012 (2012)
Morishima, A., Shinagawa, N., Mochizuki, S.: The Power of Integrated Abstraction for Data-Centric Human/Machine Computations. In: VLDS 2011, pp. 7–10 (2011)
Morishima, A., Shinagawa, N., Mitsuishi, T., Aoki, H., Fukusumi, S.: CyLog/Crowd4U: A Declarative Platform for Complex Data-centric Crowdsourcing. PVLDB 5(12), 1918–1921 (2012)
Marcus, A., Wu, E., Karger, D.R., Madden, S., Miller, R.C.: Human-powered Sorts and Joins. PVLDB 5(1), 13–24 (2011)
Marcus, A., Wu, E., Madden, S., Miller, R.C.: Crowdsourced Databases: Query Processing with People. In: CIDR 2011, pp. 211–214 (2011)
Parameswaran, A.G., Polyzotis, N.: Answering Queries using Humans, Algorithms and Databases. In: CIDR 2011, pp. 160–166 (2011)
Tripadvisor, http://www.tripadvisor.com/
Trushkowsky, B., Kraska, T., Franklin, M.J., Sarkar, P.: Crowdsourced enumeration queries. In: ICDE 2013, pp. 673–684 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Aoki, H., Morishima, A. (2013). A Divide-and-Conquer Approach for Crowdsourced Data Enumeration. In: Jatowt, A., et al. Social Informatics. SocInfo 2013. Lecture Notes in Computer Science, vol 8238. Springer, Cham. https://doi.org/10.1007/978-3-319-03260-3_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-03260-3_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03259-7
Online ISBN: 978-3-319-03260-3
eBook Packages: Computer ScienceComputer Science (R0)