Abstract
In this paper we introduce a modification of the real discrete Fourier transform and its inverse transform to filter noise and perform reduction on the data whilst preserving the trend of global moving of time series. The transformed data is still in the same time domain as the original data, and can therefore be directly used by any other mining algorithms.
We also present a classification algorithm MinCov in this paper. Given a new data tuple, it provides values for each class that measures the likelihood of the tuple belonging to that class. The experimental results show that the MinCov algorithm is comparable to C4.5, and using MinCov as a mining algorithm the average hit rate of predicting the sign of stock return is 23.92% higher than that on the original data. This means that the predicting accuracy has been remarkably improved by means of the proposed data reduction and noise filtering method.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Faloutsos, C., and Swami, A. Efficient Similarity Search in Sequence Databases. In Proc. of the Fourth International Conference on Foundations of data Organization and Algorithms, 1993.
Agrawal, R., Lin, K., Sawhney, H. S., and Shim, K. “Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time Series Database”, In Proc. of VLDB’95.
Das, G., Gunopulos, D., and Mannila, H. Finding Similar Time Series, In Proc. KDD97, p. 88–100.
Das, G., Kin, K., Mannia, H., Renganathan., and G., Smyth, P. Rule Discovery from Time Series, In Proc. KDD98, p. 16–22.
Faloutsos, C., Ranganathan, M., & Manolopoulos, Y. (1994). Fast Subsequence Matching in Time-Series Databases. In Proc. ACM SIGMOD Conf., Minneapolis.
Hellstrom, T., and Holmstrom, K. The Relevance of Trends for Predictions of Stock Returns. Internation Jounnal of Intelligent Systems in Accounting, Finance &Management
Hellstrom, T. Data Snooping in the Stock Market. Theory of Stochastic Processes Vol.5 (21), no.1–2, 1999, p. 33–50..
Kin-pong, C., and Ada, W. F. Efficient Time Series Matching by Wavelets.
Keogh, E., Chakrabarti, K., Pazzani, M., and Mehrotea, S. Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases.
Oppenheim, A.V. and Schafer, R.W. Digital Signal Processing, Prentice-Hall, Englewood Cliffs, N.J.,1975.
Refiei, D. (1999) On Similarity-Based Queries for Time Series Data. In Proc. of the 15 th IEEE International Conference on Data Engineering. Sydney, Australia.
Wang, H., Duntsch, I., and Bell, D. Data Reduction Based on Hyper Relations. In Proc. of KDD98, New York, p. 349–353, 1998.
Yi, B.K., Jagadish, H., & Faloutsos, C. (1998). Efficient Retreval of Similar Time Sequences Under Time Warping. IEEE International Conference on Data Engineering, p. 201–208.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Guo, G., Wang, H., Bell, D. (2002). Data Reduction and Noise Filtering for Predicting Times Series. In: Meng, X., Su, J., Wang, Y. (eds) Advances in Web-Age Information Management. WAIM 2002. Lecture Notes in Computer Science, vol 2419. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45703-8_39
Download citation
DOI: https://doi.org/10.1007/3-540-45703-8_39
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44045-1
Online ISBN: 978-3-540-45703-9
eBook Packages: Springer Book Archive