Abstract
The notion of skyline query is to find a set of objects that is not dominated by any other objects. Regrettably, existing works lack on how to conduct skyline queries on high dimensional uncertain data with objects represented as continuous ranges and exact values, which in this paper is referred to as uncertain dimensions. Hence, in this paper we define skyline queries over data with uncertain dimensions and propose an algorithm, SkyQUD, to efficiently answer skyline queries. The SkyQUD algorithm determines skyline objects through three methods that guaranteed the probability of each object being in the final skyline results: exact domination, range domination, and uncertain domination. The algorithm has been validated through extensive experiments employing real and synthetic datasets. Results exhibit our proposed algorithm is efficient and scalable in answering skyline query on high dimensional and large datasets with uncertain dimensions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Atallah, M., Qi, Y.: Computing all skyline probabilities for uncertain data. In: Proceedings of the ACM SIGMOD-SIGACT-SIGART Symposium of the Principles of Database Systems (PODS), pp. 279–287 (2009)
Berchtold, S., Keim, D.A., Kriegel, H.P.: The X-tree: an index structure for high-dimensional data. In: Proceedings of the 22nd International Conference on Very Large Data Bases (VLDB), pp. 28–39 (1996)
Böhm, C., Fiedler, F., Oswald, A., Plant, C., Wackersreuther, B.: Probabilistic skyline queries. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM), pp. 651–660 (2009)
Börzsönyi, S., Kossmann, D., Stocker, K.: The skyline operator. In: Proceedings of the 17th International Conference on Data Engineering (ICDE), pp. 421–430 (2001)
Chan, C.-Y., Jagadish, H.V., Tan, K.-L., Tung, A.K.H., Zhang, Z.: On high dimensional skylines. In: Ioannidis, Y., et al. (eds.) EDBT 2006. LNCS, vol. 3896, pp. 478–495. Springer, Heidelberg (2006). https://doi.org/10.1007/11687238_30
Chan, C.-Y., Jagadish, H.V., Tan, K.-L., Tung, A.K.H., Zhang, Z.: Finding k-dominant skylines in high dimensional space. In: Proceedings of International Conference on Management of Data (SIGMOD), pp. 503–514 (2006)
Chomicki, J., Godfrey, P., Gryz, J., Liang, D.: Skyline with presorting. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 717–816 (2003)
Godfrey, P., Shipley, R., Gryz, J.: Maximal vector computation in large data sets. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 229–240 (2005)
Khalefa, M.E., Mokbel, M.F., Levandoski, J.J.: Skyline query processing for uncertain data. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management (CIKM), pp. 1293–1296 (2010)
Kossmann, D., Ramsak, F., Rost, S.: Shooting stars in the sky: an online algorithm for skyline queries. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 275–286 (2002)
Li, X., Wang, Y., Li, X., Wang, G.: Skyline query processing on interval uncertain data. In: IEEE 15th International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing Workshops, pp. 87–92 (2012)
Mokbel, M.F., Levandoski, J.J.: Toward context and preference-aware location-based services. In: Proceedings of the International Workshop on Data Engineering for Wireless and Mobile Access, pp. 25–35 (2009)
Papadias, D., Tao, Y., Fu, G., Seeger, B.: Progressive skyline computation in database systems. ACM Trans. Database Syst. 30(1), 41–82 (2005)
Pei, J., Jiang, B., Lin, X., Yuan, Y.: Probabilistic skylines on uncertain data. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 15–26 (2007)
Ross, S.M.: Introduction to Probability Models, 8th edn. American Press, San Diego (2003)
Tan, K.L., Eng, P.K., Ooi, B.C.: Efficient progressive skyline computation. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 301–310 (2001)
Yong, H., Kim, J.-H., Hwang. S.-W.: Skyline ranking for uncertain data with maybe confidence. In: Proceedings of the 2008 IEEE 24th International Conference on Data Engineering Workshop (ICDEW), pp. 572–579 (2008)
Acknowledgements
This research was supported by Ministry of Science, Technology, and Innovation under the Fundamental Research Grant Scheme (Grant no. 08-01-16-1853FR). All opinions, findings, conclusions and recommendations in this paper are those of the authors and do not necessarily reflect the views of the funding agencies. We thank the anonymous reviewers for their comments.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Mohd Saad, N.H., Ibrahim, H., Sidi, F., Yaakob, R. (2018). Non-index Based Skyline Analysis on High Dimensional Data with Uncertain Dimensions. In: Lupeikiene, A., Vasilecas, O., Dzemyda, G. (eds) Databases and Information Systems. DB&IS 2018. Communications in Computer and Information Science, vol 838. Springer, Cham. https://doi.org/10.1007/978-3-319-97571-9_22
Download citation
DOI: https://doi.org/10.1007/978-3-319-97571-9_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-97570-2
Online ISBN: 978-3-319-97571-9
eBook Packages: Computer ScienceComputer Science (R0)