Abstract
Many kinds of data can be regarded as time series, so time series prediction is applied in a wide range of domains, such as investment and traffic prediction. Traffic status prediction can be used for congestion avoidance and travel planning. We address traffic status prediction as a time series prediction problem: given time series data and a query time, predict the data value at the query time. The query time is usually in the future. In this paper, we propose a hybrid prediction algorithm that combines regression-based and clustering-based prediction methods. Regression-based prediction is accurate when the query time is not too far from the current time. Time series data, however, may contain similar shapes or trends. To capture these similar shapes, we cluster the data and then discover the sequential relationships among the resulting clusters. When the query time is far from the current time, we use these sequential relationships to predict which cluster is likely to apply at the query time, and we obtain the data value at the query time from that cluster. The hybrid algorithm chooses between the two methods using a single prediction length threshold: if the time difference between the query time and the current time is smaller than the threshold, it uses regression-based prediction; otherwise, it uses clustering-based prediction. To evaluate the proposed methods, we carried out a set of experiments on real data sets and compared the accuracy of the methods. The experimental results show that the proposed methods are both accurate and practical.
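The threshold rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not specify the regression model, the clustering method, or how cluster sequences are modelled, so the sketch assumes an ordinary least-squares line for the regression part, fixed-length segments matched to precomputed centroids by Euclidean distance for the clustering part, and first-order transition counts between consecutive cluster labels for the sequential relationships. All function and parameter names (`regression_predict`, `cluster_predict`, `hybrid_predict`, `seg_len`, `length_threshold`) are hypothetical.

```python
from collections import Counter, defaultdict
from typing import Sequence


def regression_predict(history: Sequence[float], steps_ahead: int) -> float:
    """Fit y = a + b*t by least squares and extrapolate (assumed model)."""
    n = len(history)
    if n == 0:
        raise ValueError("history must not be empty")
    t_mean = (n - 1) / 2
    y_mean = sum(history) / n
    sxx = sum((t - t_mean) ** 2 for t in range(n))
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(history))
    slope = sxy / sxx if sxx else 0.0
    return y_mean + slope * ((n - 1 + steps_ahead) - t_mean)


def nearest_centroid(segment: Sequence[float],
                     centroids: Sequence[Sequence[float]]) -> int:
    """Index of the centroid closest to the segment (squared Euclidean)."""
    return min(
        range(len(centroids)),
        key=lambda k: sum((a - b) ** 2 for a, b in zip(segment, centroids[k])),
    )


def cluster_predict(history: Sequence[float], steps_ahead: int,
                    centroids: Sequence[Sequence[float]], seg_len: int) -> float:
    """Follow the most frequent cluster-to-cluster transitions to the query
    time (steps_ahead >= 1) and read the value from that cluster's centroid."""
    # Drop leading points so the current time ends a complete segment.
    trimmed = history[len(history) % seg_len:]
    n_segs = len(trimmed) // seg_len
    if n_segs == 0:
        raise ValueError("history is shorter than one segment")
    labels = [
        nearest_centroid(trimmed[i * seg_len:(i + 1) * seg_len], centroids)
        for i in range(n_segs)
    ]
    # Count first-order transitions between consecutive segment labels.
    transitions = defaultdict(Counter)
    for prev, nxt in zip(labels, labels[1:]):
        transitions[prev][nxt] += 1
    # Locate the query point: which future segment, and where inside it.
    query_index = len(trimmed) - 1 + steps_ahead
    hops = query_index // seg_len - (n_segs - 1)
    offset = query_index % seg_len
    label = labels[-1]
    for _ in range(hops):
        if transitions[label]:  # stay in place if no successor was observed
            label = transitions[label].most_common(1)[0][0]
    return centroids[label][offset]


def hybrid_predict(history: Sequence[float], steps_ahead: int,
                   length_threshold: int,
                   centroids: Sequence[Sequence[float]], seg_len: int) -> float:
    """Regression for near queries, clustering-based prediction otherwise."""
    if steps_ahead < length_threshold:
        return regression_predict(history, steps_ahead)
    return cluster_predict(history, steps_ahead, centroids, seg_len)


# Toy data (made up for illustration): rising and falling 3-point shapes.
centroids = [[10, 20, 30], [30, 20, 10]]
history = [10, 20, 30, 30, 20, 10] * 2
print(hybrid_predict(history, 4, 3, centroids, 3))       # far query: 30
print(hybrid_predict([1, 2, 3, 4], 2, 3, centroids, 3))  # near query: 6.0
```

In this sketch the centroids are passed in rather than learned. In practice they would come from whatever clustering step is chosen, and the threshold would be tuned on held-out data.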
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Huang, BW., Wang, KW., Wei, LY., Peng, WC. (2013). A Hybrid Prediction Algorithm for Traffic Speed Prediction. In: Chang, RS., Jain, L., Peng, SL. (eds) Advances in Intelligent Systems and Applications - Volume 1. Smart Innovation, Systems and Technologies, vol 20. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35452-6_30
DOI: https://doi.org/10.1007/978-3-642-35452-6_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35451-9
Online ISBN: 978-3-642-35452-6
eBook Packages: Engineering, Engineering (R0)