Abstract
To facilitate (k)-Nearest Neighbor queries, the concept of Voronoi decomposition is widely used. In this work, we propose solutions to extend the concept of Voronoi-cells to uncertain data. Due to data uncertainty, the location, the shape and the extent of a Voronoi cell are random variables. To facilitate reliable query processing despite the presence of uncertainty, we employ the concept of possible-Voronoi cells and introduce the novel concept of guaranteed-Voronoi cells: The possible-Voronoi cell of an object U consists of all points in space that have a non-zero probability of having U as their nearest-neighbor; and the guaranteed-Voronoi cell, which consists of all points in space which must have U as their nearest-neighbor. Since exact computation of both types of Voronoi cells is computationally hard, we propose approximate solutions. Therefore, we employ hierarchical access methods for both data and object space. Our proposed algorithm descends both index structures simultaneously, constantly trying to prune branches in both trees by employing the concept of spatial domination. To support (k)-Nearest Neighbor queries having k > 1, this work further pioneers solutions towards the computation of higher-order possible and higher-order guaranteed Voronoi cells, which consist of all points in space which may (respectively must) have U as one of their k-nearest neighbors. For this purpose, we develop three algorithms to explore our index structures and show that the approach that descends both index structures in parallel yields the fastest query processing times. Our experiments show that we are able to approximate uncertain Voronoi cells of any order much more effectively than the state-of-the-art while improving run-time performance. Since our approach is the first to compute guaranteed-Voronoi cells and higher order (possible and guaranteed) Voronoi cells, we extend the existing state-of-the-art solutions to these concepts, in order to allow a fair experimental evaluation.
Similar content being viewed by others
Notes
1 The later case can not be guaranteed by the approach of [6] due to the numeric nature of their approach.
2 We use Euclidean distance in all examples and illustrations, but any L p norm can be applied.
recall that if \(e_{max}^{{\mathcal {D}}}\) corresponds to case 3, then there exists no R ∗-entry such that case 4 holds
References
Chow CY, Mokbel MF, Aref WG (2009) Casper*: query processing for location services without compromising privacy. ACM TODS 34(4):24
Beskales G, Soliman MA, IIyas IF (2008) Efficient search for the top-k probable nearest neighbors in uncertain databases. Proc VLDB Endowment 1(1):326–339
Cheng R, Xie X, Yiu ML, Chen J, Sun L (2010) Uv-diagram: a voronoi diagram for uncertain data. In: ICDE, IEEE, pp 796–807
Ali ME, Tanin E, Zhang R, Kotagiri R (2012) Probabilistic voronoi diagrams for probabilistic moving nearest neighbor queries. DKE 75:1–33
Bernecker T, Emrich T, Kriegel HP, Mamoulis N, Renz M, Züfle A (2011) A novel probabilistic pruning approach to speed up similarity queries in uncertain databases. In: Proceedings of the 27th international conference on data engineering (ICDE). Hannover, pp 339–350
Zhang P, Cheng R, Mamoulis N, Renz M, Züfle A, Tang Y, Emrich T (2013) Voronoi-based nearest neighbor search for multi-dimensional uncertain databases. In: Proceedings of the 29th International conference on data engineering (ICDE). Brisbane, pp 158–169
Yuan J, Zheng Y, Zhang C, Xie W, Xie X, Sun G, Huang Y (2010) T-drive: Driving directions based on taxi trajectories. In: SIGSPATIAL, pp 99–108
Yuan J, Zheng Y, Xie X, Sun G (2011) Driving with knowledge from the physical world. In: SIGKDD, pp 316–324
Niedermayer J, Züfle A, Emrich T, Renz M, Mamoulis N, Chen L, Kriegel HP (2013) Probabilistic nearest neighbor queries on uncertain moving object trajectories. Proc VLDB Endowment 7(3):205–216
Emrich T, Kriegel HP, Kröger P, Renz M, Züfle A (2009) Incremental reverse nearest neighbor ranking in vector spaces. In: Proceedings of the 11th International symposium on spatial and temporal databases (SSTD). Aalborg, pp 265–282
Beckmann N, Kriegel HP, Schneider R, Seeger B (1990) The R*-Tree: An efficient and robust access method for points and rectangles. In: Proceedings of the ACM International conference on management of data (SIGMOD). Atlantic City, pp 322–331
Orenstein JA, Merrett TH (1984) A class of data structures for associative searching. In: ACM SIGACT-SIGMOD. ACM, pp 181–190
Schmid KA, Emrich T, Züfle A, Renz M, Cheng R (2015) Approximate uv computation based on space decomposition. In: Proceedings of the 14th International symposium on spatial and temporal databases (SSTD). Hong Kong
Cheng R, Kalashnikov DV, Prabhakar S (2004) Querying imprecise data in moving object environments. In: IEEE Transactions on knowledge and data engineering
Li J, Saha B, Deshpande A (2009) A unified approach to ranking in probabilistic databases. Proc VLDB Endowment 2(1):502–513
Bernecker T, Kriegel HP, Mamoulis N, Renz M, Züfle A (2010) Scalable probabilistic similarity ranking in uncertain databases. IEEE Trans Knowl Data Eng 22(9):1234–1246
Cheema MA, Lin X, Wang W, Zhang W, Pei J (2010) Probabilistic reverse nearest neighbor queries on uncertain data. IEEE Trans Knowl Data Eng 22 (4):550–564
Aurenhammer F (1991) Voronoi diagrams-a survey of a fundamental geometric data structure. ACM CSUR 23(3):345–405
Sharifzadeh M, Shahabi C (2010) Vor-tree: R-trees with voronoi diagrams for efficient processing of spatial nearest neighbor queries. VLDB Endowment 3 (1–2):1231–1242
Zheng B, Xu J, Lee WC, Lee L (2006) Grid-partition index: a hybrid method for nearest-neighbor queries in wireless location-based services. VLDB J 15 (1):21–39
Nutanong S, Zhang R, Tanin E, Kulik L (2008) The v*-diagram: a query-dependent approach to moving knn queries. VLDB Endowment 1(1):1095–1106
Sharifzadeh M, Shahabi C (2009) Approximate voronoi cell computation on spatial data streams. VLDB J 18(1):57–75
Akdogan A, Demiryurek U, Banaei-Kashani F, Shahabi C (2010) Voronoi-based geospatial query processing with mapreduce. In: IEEE CloudCom. IEEE, pp 9–16
Kolahdouzan M, Shahabi C (2004) Voronoi-based k nearest neighbor search for spatial network databases. In: VLDB Endowment, VLDB Endowment, pp 840–851
Roussopoulos N, Kelley S, Vincent F (1995) Nearest neighbor queries. In: ACM SIGMOD, vol 24. ACM, pp 71–79
Emrich T, Kriegel HP, Kröger P, Renz M, Züfle A (2010) Boosting spatial pruning: on optimal pruning of MBRs. In: Proceedings of the ACM international conference on management of data (SIGMOD). Indianapolis, pp 39–50
Hjaltason GR, Samet H (1995) Ranking in spatial databases. In: Proceedings of the 4th international symposium on large spatial databases (SSD). Portland, pp 83–95
Ṡaltenis S, Jensen CS, Leutenegger ST, Lopez MA (2000) Indexing the positions of continuously moving objects, vol 29. ACM
Zhang M, Chen S, Jensen CS, Ooi BC, Zhang Z (2009) Effectively indexing uncertain moving objects for predictive queries. Proc VLDB Endowment 2 (1):1198–1209
Emrich T, Kriegel HP, Mamoulis N, Renz M, Züfle A (2012) Indexing uncertain spatio-temporal data. In: Proceedings of the 21st ACM conference on information and knowledge management (CIKM). Maui, pp 395–404
Lian X, Chen L (2009) Efficient processing of probabilistic reverse nearest neighbor queries over uncertain data. VLDB J Int J Very Large Data Bases 18(3):787–808
Emrich T, Graf F, Kriegel HP, Schubert M, Thoma M (2010) Optimizing all-nearest-neighbor queries with trigonometric pruning. In: Proceedings of the 22nd international conference on scientific and statistical database management (SSDBM). Heidelberg, pp 501–518
Achtert E, Kriegel HP, Schubert E, Zimek A (2013) Interactive data mining with 3D-Parallel-Coordinate-Trees. In: Proceedings of the ACM international conference on management of data (SIGMOD). New York City, pp 1009–1012
Dalvi N, Suciu D (2007) Efficient query evaluation on probabilistic databases. VLDB J 16(4):523–544
Acknowledgments
Part of the research leading to these results has received funding from the Deutsche Forschungsgemeinschaft (DFG) under grant number RE 266/5-1 and from the DAAD supported by BMBF under grant number 57055388. Reynold Cheng was supported by the Research Grants Council of Hong Kong (RGC Project (HKU 711110)).
Andreas Zufle has been supported by National Science Foundation AitF grant CCF-1637541.
Reynold Cheng was supported by the Research Grants Council of HK (Project HKU 17205115) and HKU (Projects 102009508 and 104004129). We would like to thank the reviewers for their insightful comments.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Schmid, K.A., Zufle, A., Emrich, T. et al. Uncertain Voronoi cell computation based on space decomposition. Geoinformatica 21, 797–827 (2017). https://doi.org/10.1007/s10707-017-0293-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10707-017-0293-2