Abstract
In this paper we propose an end to end framework that allows efficient analysis for trajectory streams. In particular, our approach consists of several steps. First, we perform a partitioning strategy for incoming streams of trajectories in order to reduce the trajectory size and represent trajectories using a suitable data structure. After the encoding step we build specialized cuboids for trajectories in order to make the querying step quite effective. This problem revealed really challenging as we deal with data (trajectories) for which the order of elements is relevant thus making the analysis quite harder than for classical transactional data. We performed several tests on real world datasets that confirmed the efficiency and effectiveness of the proposed techniques.
Similar content being viewed by others
Notes
The choice of the alphabet is irrelevant, the only constraint is that symbols in Σ must be unique
The algorithm takes as input a matrix of points, each row in the matrix is composed by the sequence of points belonging to each input trajectory that can be made equally sized w.l.o.g
We use a really efficient implementation for long integers in “C++ Big Integer Library” written and maintained by Matt McCutchen available at https://mattmccutchen.net/bigint/
References
Arge, L., Procopiuc, O., Ramaswamy, S., Suel, T., Vitter, J.S. (1998). Scalable sweeping-based spatial join. In VLDB (pp. 570–581).
Bakalov, P., Hadjieleftheriou, M., Tsotras, V. J. (2005). Time relaxed spatiotemporal trajectory joins. In ACM GIS (pp. 182–191).
Bédard, Y., Merrett, T., Han, J. (2001). Fundamentals of spatial data warehousing for geographic knowledge discovery. In GDMKD (pp. 53–73).
Cao, H., Wolfson, O., Trajcevski, G. (2006). Spatio-temporal data reduction with deterministic error bounds. The VLDB Journal, 15 (3), 211–228.
Ester, M., Kriegel, H.P., Sander, J., Xu, X. (1996). A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD (pp. 226–231).
Frentzos, E., Gratsias, K., Theodoridis, Y. (2007). Index-based most similar trajectory search. In Proceedings of the ICDE conference (pp. 816–825).
Giannotti, F., Nanni, M., Pinelli, F., Pedreschi, D. (2007). Trajectory pattern mining. In KDD (pp. 330–339).
Gonzalez, H., Han, J., Li, X., Klabjan, D. (2006). Warehousing and analyzing massive rfid data sets. In ICDE (p. 83).
Gorawski, M., & Gorawski, M. (2007). Modified r-mvb tree and btv algorithm used in a distributed spatio-temporal data warehouse. In PPAM (pp. 199–208).
Gudmundsson, J., Katajainen, J., Merrick, D., Ong, C., Wolle, T. (2009). Compressing spatio-temporal trajectories. Computational Geometry - Theory and Applications, 42(9), 825–841.
Han, J., Stefanovic, N., Koperski, K. (1998). Selective materialization: an efficient method for spatial data cube construction. In PAKDD.
Jensen, C.S., Kligys, A., Pedersen, T.B., Dyreson, C.E., Timko, I. (2004). Multidimensional data modeling for location-based services. In The VLDB Journal 13.
Jeung, H., Yiu, M., Zhou, X., Jensen, C.S., Shen, H.T. (2008). Discovery of convoys in trajectory databases. PVLDB, 1(1), 1068–1080.
Jolliffe, I.T. (2002). Principal component analysis. Springer Series in Statistics.
Lee, C., & Chung, C. (2008). Efficient storage scheme and query processing for supply chain management using rfid. In SIGMOD (pp. 291–302).
Lee, M.-L., Wu, X., Hsu, W. (2004). A prime number labeling scheme for dynamic ordered xml trees. In ICDE.
Lee, J., Han, J., Whang, K. (2007). Trajectory clustering: a partition-and-group framework. In SIGMOD (pp. 593–604).
Lee, J., Han, J., Li, X., Gonzalez, H. (2008a). TraClass: trajectory classification using hierarchical region-based and trajectory-based clustering. PVLDB, 1(1), 1081–1094.
Lee, J., Han, J., Li, X. (2008b). Trajectory outlier detection: a partition-and-detect framework. In ICDE - international conference on data engineering (pp. 140–149).
Leonardi, L., Marketos, G., Frentzos, E., Giatrakos, N., Orlando, S., Pelekis, N., Raffaetà, A., Roncato, A., Silvestri, C., Theodoridis, Y. (2010). T-warehouse: visual olap analysis on trajectory data. In ICDE (pp. 1141–1144).
Li, Y., Han, J., Yang, J. (2004). Clustering moving objects. In KDD (pp. 617–622).
Madey, G. R., Szabo, G., Barabasi, A. (2006). Wiper: the integrated wireless phone based emergency response system. In International conference on computational science (3) (pp. 417–424).
Malinowski, E., & Zimányi, E. (2004). Representing spatiality in a conceptual multi- dimensional model. In ACM-GIS.
Mamoulis, N., & Papadias, D. (2001). Multiway spatial joins. ACM Transactions on Database Systems, 26, 424–475.
Marketos, G.D. (2009). Data warehousing and mining techniques for moving object databases. In PhD. Thesis, University of Piraeus.
Marketos, G., Frentzos, E., Giatrakos, N., Ntoutsi, I., Pelekis, N., Raffaeta, A., Theodoridis, Y. (2008). A framework for trajectory data warehousing. In HDMS.
Masciari, E. (2011a). Efficient and effective query answering for trajectory cuboids. In FQAS (pp. 270–281).
Masciari, E. (2011b). Query answering on trajectory cuboids using prime numbers encodings. In IDEAS (pp. 214–218).
Pelekis, N., Theodoridis, Y., Vosinakis, S., Panayiotopoulos, T. (2006). Hermes—a framework for location-based data management. In EDBT.
Pfoser, D., Jensen, C.S., Theodoridis, Y. (2000). Novel approaches in query processing for moving object trajectories. In Proc. of the 26th international conference on very large data bases, VLDB ’00 (pp. 395–406). San Francisco: Morgan Kaufmann Publishers Inc.
Trajcevski, G., Wolfson, O., Hinrichs, K., Chamberlain, S. (2004). Managing uncertainty in moving objects databases. ACM Transactions on Database Systems, 29(3), 463–507.
Vaisman, A., & Zimanyi, E. (2009). What is spatio-temporal data warehousing? In Proceedings of the 11th international conference on data warehousing and knowledge discovery, DaWaK ’09 (pp. 9–23). Berlin: Springer.
Vlachos, M., Kollios, G., Gunopulos, D. (2002). Discovering similar multidimensional trajectories. In ICDE (pp. 673–684).
Yang, J., & Hu, M. (2006). Trajpattern: mining sequential patterns from imprecise trajectories of mobile objects. In EDBT—extending database technology (pp. 664–681).
Zhao, H., Yuen, P.C., Kwok, J.T. (2006). A novel incremental principal component analysis and its application for face recognition. IEEE Transaction on Systems, Man, and Cybernetics, 36, 873–886.
Zheng, Y., Li, Q., Chen, Y., Xie, X. (2008). Understanding mobility based on gps data. In UbiComp (pp. 312–321).
Zheng, Y., Zhang, L., Xie, X., Ma, W. (2009). Mining interesting locations and travel sequences from gps trajectories. In World Wide Web (pp. 791–800).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Masciari, E. An end to end framework for building data cubes over trajectory data streams. J Intell Inf Syst 45, 131–164 (2015). https://doi.org/10.1007/s10844-014-0343-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10844-014-0343-2