Abstract
A co-location pattern represents a subset of spatial features whose events tend to locate together in spatial proximity. The certain case of the co-location pattern has been investigated. However, location information of spatial features is often imprecise, aggregated, or error prone. Because of the continuity nature of space, over-counting is a major problem. In the uncertain case, the problem becomes more challenging. In this paper, we propose a probabilistic participation index to measure co-location patterns based on the well-known possible world model. To avoid the exponential cost of calculating participation index from all possible worlds, we prove a lemma that allows for instance centric counting, avoids over-counting, and produces the same results as using possible world based counting. We use this property to develop efficient mining algorithms. We observed through both algebraic analysis and extensive experiments that the feature tree based algorithm outperforms uncertain Apriori algorithm by an order of magnitude not only for co-locations of large sizes but also for datasets with high level of uncertainty. This is an important insight in mining uncertainty co-locations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aggarwal, C.C., Li, Y., Wang, J., Wang, J.: Frequent pattern mining with uncertain data. In: Proceedings of the 15th ACM SIGKDD (2009)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: Proc. VLDB (1994)
Bernecker, T., Emrich, T., Kriegel, H.-P., Renz, M., Zankl, S., Züfle, A.: Efficient probabilistic reverse nearest neighbor query processing on uncertain data. In: Proc. VLDB Endow. (2011)
Bron, C., Kerbosch, J.: Algorithm 457: finding all cliques of an undirected graph. Commun. ACM (1973)
Burdick, D., Calimlim, M., Flannick, J., Gehrke, J., Yiu, T.: Mafia: a maximal frequent itemset algorithm. In: TKDE (2005)
Celik, M., Kang, J.M., Shekhar, S.: Zonal co-location pattern discovery with dynamic parameters. In: Proceedings of the Seventh IEEE ICDM (2007)
Corneil, D.G., Mulligan, G.D.: Corrections to bierstone’s algorithm for generating cliques. Commun. ACM (1972)
Gouda, K., Zaki, M.J.: Efficiently mining maximal frequent itemsets. In: ICDM (2001)
Huang, Y., Pei, J., Xiong, H.: Mining co-location patterns with rare events from spatial data sets. Geoinformatica (2006)
Huang, Y., Shekhar, S., Xiong, H.: Discovering colocation patterns from spatial data sets: A general approach. In: TKDE (2004)
Munro, R., Chawla, S., Sun, P.: Complex spatial relationships. In: Proceedings of the Third IEEE ICDM (2003)
Shekhar, S., Huang, Y.: Discovering spatial co-location patterns: A summary of results. In: Jensen, C.S., Schneider, M., Seeger, B., Tsotras, V.J. (eds.) SSTD 2001. LNCS, vol. 2121, pp. 236–256. Springer, Heidelberg (2001)
Wang, L., Wu, P., Chen, H.: Finding probabilistic prevalent colocations in spatially uncertain data sets. In: IEEE TKDE (2013)
Zaki, M.J.: Scalable algorithms for association mining. In: TKDE (2000)
Zhang, X., Mamoulis, N., Cheung, D.W., Shou, Y.: Fast mining of spatial collocations. In: ACM SIGKDD (2004)
Zou, Q., Chu, W.W., Lu, B.: Smartminer: a depth first algorithm guided by tail information for mining maximal frequent itemsets. In: ICDM (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, Z., Huang, Y. (2013). Mining Co-locations under Uncertainty. In: Nascimento, M.A., et al. Advances in Spatial and Temporal Databases. SSTD 2013. Lecture Notes in Computer Science, vol 8098. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40235-7_25
Download citation
DOI: https://doi.org/10.1007/978-3-642-40235-7_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40234-0
Online ISBN: 978-3-642-40235-7
eBook Packages: Computer ScienceComputer Science (R0)