Skip to main content
Log in

An end to end framework for building data cubes over trajectory data streams

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

In this paper we propose an end to end framework that allows efficient analysis for trajectory streams. In particular, our approach consists of several steps. First, we perform a partitioning strategy for incoming streams of trajectories in order to reduce the trajectory size and represent trajectories using a suitable data structure. After the encoding step we build specialized cuboids for trajectories in order to make the querying step quite effective. This problem revealed really challenging as we deal with data (trajectories) for which the order of elements is relevant thus making the analysis quite harder than for classical transactional data. We performed several tests on real world datasets that confirmed the efficiency and effectiveness of the proposed techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

Notes

  1. The choice of the alphabet is irrelevant, the only constraint is that symbols in Σ must be unique

  2. The algorithm takes as input a matrix of points, each row in the matrix is composed by the sequence of points belonging to each input trajectory that can be made equally sized w.l.o.g

  3. We use a really efficient implementation for long integers in “C++ Big Integer Library” written and maintained by Matt McCutchen available at https://mattmccutchen.net/bigint/

References

  • Arge, L., Procopiuc, O., Ramaswamy, S., Suel, T., Vitter, J.S. (1998). Scalable sweeping-based spatial join. In VLDB (pp. 570–581).

  • Bakalov, P., Hadjieleftheriou, M., Tsotras, V. J. (2005). Time relaxed spatiotemporal trajectory joins. In ACM GIS (pp. 182–191).

  • Bédard, Y., Merrett, T., Han, J. (2001). Fundamentals of spatial data warehousing for geographic knowledge discovery. In GDMKD (pp. 53–73).

  • Cao, H., Wolfson, O., Trajcevski, G. (2006). Spatio-temporal data reduction with deterministic error bounds. The VLDB Journal, 15 (3), 211–228.

    Article  Google Scholar 

  • Ester, M., Kriegel, H.P., Sander, J., Xu, X. (1996). A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD (pp. 226–231).

  • Frentzos, E., Gratsias, K., Theodoridis, Y. (2007). Index-based most similar trajectory search. In Proceedings of the ICDE conference (pp. 816–825).

  • Giannotti, F., Nanni, M., Pinelli, F., Pedreschi, D. (2007). Trajectory pattern mining. In KDD (pp. 330–339).

  • Gonzalez, H., Han, J., Li, X., Klabjan, D. (2006). Warehousing and analyzing massive rfid data sets. In ICDE (p. 83).

  • Gorawski, M., & Gorawski, M. (2007). Modified r-mvb tree and btv algorithm used in a distributed spatio-temporal data warehouse. In PPAM (pp. 199–208).

  • Gudmundsson, J., Katajainen, J., Merrick, D., Ong, C., Wolle, T. (2009). Compressing spatio-temporal trajectories. Computational Geometry - Theory and Applications, 42(9), 825–841.

    Article  MATH  MathSciNet  Google Scholar 

  • Han, J., Stefanovic, N., Koperski, K. (1998). Selective materialization: an efficient method for spatial data cube construction. In PAKDD.

  • Jensen, C.S., Kligys, A., Pedersen, T.B., Dyreson, C.E., Timko, I. (2004). Multidimensional data modeling for location-based services. In The VLDB Journal 13.

  • Jeung, H., Yiu, M., Zhou, X., Jensen, C.S., Shen, H.T. (2008). Discovery of convoys in trajectory databases. PVLDB, 1(1), 1068–1080.

    Google Scholar 

  • Jolliffe, I.T. (2002). Principal component analysis. Springer Series in Statistics.

  • Lee, C., & Chung, C. (2008). Efficient storage scheme and query processing for supply chain management using rfid. In SIGMOD (pp. 291–302).

  • Lee, M.-L., Wu, X., Hsu, W. (2004). A prime number labeling scheme for dynamic ordered xml trees. In ICDE.

  • Lee, J., Han, J., Whang, K. (2007). Trajectory clustering: a partition-and-group framework. In SIGMOD (pp. 593–604).

  • Lee, J., Han, J., Li, X., Gonzalez, H. (2008a). TraClass: trajectory classification using hierarchical region-based and trajectory-based clustering. PVLDB, 1(1), 1081–1094.

    Google Scholar 

  • Lee, J., Han, J., Li, X. (2008b). Trajectory outlier detection: a partition-and-detect framework. In ICDE - international conference on data engineering (pp. 140–149).

  • Leonardi, L., Marketos, G., Frentzos, E., Giatrakos, N., Orlando, S., Pelekis, N., Raffaetà, A., Roncato, A., Silvestri, C., Theodoridis, Y. (2010). T-warehouse: visual olap analysis on trajectory data. In ICDE (pp. 1141–1144).

  • Li, Y., Han, J., Yang, J. (2004). Clustering moving objects. In KDD (pp. 617–622).

  • Madey, G. R., Szabo, G., Barabasi, A. (2006). Wiper: the integrated wireless phone based emergency response system. In International conference on computational science (3) (pp. 417–424).

  • Malinowski, E., & Zimányi, E. (2004). Representing spatiality in a conceptual multi- dimensional model. In ACM-GIS.

  • Mamoulis, N., & Papadias, D. (2001). Multiway spatial joins. ACM Transactions on Database Systems, 26, 424–475.

    Article  MATH  Google Scholar 

  • Marketos, G.D. (2009). Data warehousing and mining techniques for moving object databases. In PhD. Thesis, University of Piraeus.

  • Marketos, G., Frentzos, E., Giatrakos, N., Ntoutsi, I., Pelekis, N., Raffaeta, A., Theodoridis, Y. (2008). A framework for trajectory data warehousing. In HDMS.

  • Masciari, E. (2011a). Efficient and effective query answering for trajectory cuboids. In FQAS (pp. 270–281).

  • Masciari, E. (2011b). Query answering on trajectory cuboids using prime numbers encodings. In IDEAS (pp. 214–218).

  • Pelekis, N., Theodoridis, Y., Vosinakis, S., Panayiotopoulos, T. (2006). Hermes—a framework for location-based data management. In EDBT.

  • Pfoser, D., Jensen, C.S., Theodoridis, Y. (2000). Novel approaches in query processing for moving object trajectories. In Proc. of the 26th international conference on very large data bases, VLDB ’00 (pp. 395–406). San Francisco: Morgan Kaufmann Publishers Inc.

  • Trajcevski, G., Wolfson, O., Hinrichs, K., Chamberlain, S. (2004). Managing uncertainty in moving objects databases. ACM Transactions on Database Systems, 29(3), 463–507.

    Article  Google Scholar 

  • Vaisman, A., & Zimanyi, E. (2009). What is spatio-temporal data warehousing? In Proceedings of the 11th international conference on data warehousing and knowledge discovery, DaWaK ’09 (pp. 9–23). Berlin: Springer.

  • Vlachos, M., Kollios, G., Gunopulos, D. (2002). Discovering similar multidimensional trajectories. In ICDE (pp. 673–684).

  • Yang, J., & Hu, M. (2006). Trajpattern: mining sequential patterns from imprecise trajectories of mobile objects. In EDBT—extending database technology (pp. 664–681).

  • Zhao, H., Yuen, P.C., Kwok, J.T. (2006). A novel incremental principal component analysis and its application for face recognition. IEEE Transaction on Systems, Man, and Cybernetics, 36, 873–886.

    Article  Google Scholar 

  • Zheng, Y., Li, Q., Chen, Y., Xie, X. (2008). Understanding mobility based on gps data. In UbiComp (pp. 312–321).

  • Zheng, Y., Zhang, L., Xie, X., Ma, W. (2009). Mining interesting locations and travel sequences from gps trajectories. In World Wide Web (pp. 791–800).

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Elio Masciari.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Masciari, E. An end to end framework for building data cubes over trajectory data streams. J Intell Inf Syst 45, 131–164 (2015). https://doi.org/10.1007/s10844-014-0343-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-014-0343-2

Keywords

Navigation