Abstract
In most networking applications such as IoT (Internet of Things), data are being generated at a high rate so that the long–term storage cost outweighs its benefits. Such streams of data are stored temporarily, and should be mined fast before they are lost forever. In a previous work, we have presented Time–decaying Bloom Filters (TBF) for maintaining time–varying frequency statistics in data streams. TBF extends the standard Bloom Filters (for approximate membership queries) by replacing the bit-vector with an array of small counters, whose values decay periodically with time. In this paper, we consider hot spot tracking problem for data streams. To this problem, we propose a novel scheme by integrating TBF and online sampling technology. Data streams are sampled in an online manner using a reservoir. Items newly sampled are passed to a TBF where frequency statistics are maintained for hot spot reporting.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bloom, B.H.: Space/time trade-offs in hash coding with allowable errors. Commun. ACM 13(7), 422–426 (1970)
Charikar, M., Chen, K., Farach-Colton, M.: Finding frequent items in data streams. In: Proceedings of the International Colloquium on Automata, Languages and Programming (ICALP), pp. 693–703 (2002)
Cheng, K., Xiang, L., Iwaihara, M.: Time-decaying bloom filters for data streams with skewed distributions. In: 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications (RIDE-SDMA 2005), pp. 63–69, April 2005
Cohen, E., Strauss, M.: Maintaining time-decaying stream aggregates. In: PODS 2003, pp. 223–233 (2003)
Cohen, S., Matias, Y.: Spectral bloom filters. In: SIGMOD Conference, pp. 241–252 (2003)
Deng, F., Rafiei, D.: Approximately detecting duplicates for streaming data using stable bloom filters. In: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, SIGMOD 2006, pp. 25–36. ACM, New York, NY, USA, (2006)
Estan, C., Varghese, G.: New directions in traffic measurement and accounting: focusing on the elephants, ignoring the mice. ACM Trans. Comput. Syst. (TOCS) 21(3), 270–313 (2003)
Fang, M., Shivakumar, N., Garcia-Molina, H., Motwani, R., Ullman, J.D.: Computing iceberg queries efficiently. In: Proceedings of the Twenty-fourth International Conference on Very Large Databases, pp. 299–310 (1998)
Luo, L., Guo, D., Ma, R.T.B., Rottenstreich, O., Luo, X.: Optimizing bloom filter: challenges, solutions, and comparisons. CoRR, abs/1804.04777 (2018)
Manku, G., Motwani, R.: Approximate frequency counts over data streams. In: Proceedings of 28th International Conference on Very Large Data Bases, VLDB 2002, pp. 346–357 (2002)
Tarkoma, S., Rothenberg, C.E., Lagerspetz, E.: Theory and practice of bloom filters for distributed systems. IEEE Commun. Surv. Tutorials 14(1), 131–155 (2012)
Yoon, M.K.: Aging bloom filter with two active buffers for dynamic sets. IEEE Trans. Knowl. Data Eng. 22(1), 134–138 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Cheng, K. (2020). Hot Spot Tracking by Time-Decaying Bloom Filters and Reservoir Sampling. In: Barolli, L., Takizawa, M., Xhafa, F., Enokido, T. (eds) Advanced Information Networking and Applications. AINA 2019. Advances in Intelligent Systems and Computing, vol 926. Springer, Cham. https://doi.org/10.1007/978-3-030-15032-7_96
Download citation
DOI: https://doi.org/10.1007/978-3-030-15032-7_96
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-15031-0
Online ISBN: 978-3-030-15032-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)