Skip to main content

Hot Spot Tracking by Time-Decaying Bloom Filters and Reservoir Sampling

  • Conference paper
  • First Online:
Advanced Information Networking and Applications (AINA 2019)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 926))

  • 1812 Accesses

Abstract

In most networking applications such as IoT (Internet of Things), data are being generated at a high rate so that the long–term storage cost outweighs its benefits. Such streams of data are stored temporarily, and should be mined fast before they are lost forever. In a previous work, we have presented Time–decaying Bloom Filters (TBF) for maintaining time–varying frequency statistics in data streams. TBF extends the standard Bloom Filters (for approximate membership queries) by replacing the bit-vector with an array of small counters, whose values decay periodically with time. In this paper, we consider hot spot tracking problem for data streams. To this problem, we propose a novel scheme by integrating TBF and online sampling technology. Data streams are sampled in an online manner using a reservoir. Items newly sampled are passed to a TBF where frequency statistics are maintained for hot spot reporting.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bloom, B.H.: Space/time trade-offs in hash coding with allowable errors. Commun. ACM 13(7), 422–426 (1970)

    Article  Google Scholar 

  2. Charikar, M., Chen, K., Farach-Colton, M.: Finding frequent items in data streams. In: Proceedings of the International Colloquium on Automata, Languages and Programming (ICALP), pp. 693–703 (2002)

    Google Scholar 

  3. Cheng, K., Xiang, L., Iwaihara, M.: Time-decaying bloom filters for data streams with skewed distributions. In: 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications (RIDE-SDMA 2005), pp. 63–69, April 2005

    Google Scholar 

  4. Cohen, E., Strauss, M.: Maintaining time-decaying stream aggregates. In: PODS 2003, pp. 223–233 (2003)

    Google Scholar 

  5. Cohen, S., Matias, Y.: Spectral bloom filters. In: SIGMOD Conference, pp. 241–252 (2003)

    Google Scholar 

  6. Deng, F., Rafiei, D.: Approximately detecting duplicates for streaming data using stable bloom filters. In: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, SIGMOD 2006, pp. 25–36. ACM, New York, NY, USA, (2006)

    Google Scholar 

  7. Estan, C., Varghese, G.: New directions in traffic measurement and accounting: focusing on the elephants, ignoring the mice. ACM Trans. Comput. Syst. (TOCS) 21(3), 270–313 (2003)

    Article  Google Scholar 

  8. Fang, M., Shivakumar, N., Garcia-Molina, H., Motwani, R., Ullman, J.D.: Computing iceberg queries efficiently. In: Proceedings of the Twenty-fourth International Conference on Very Large Databases, pp. 299–310 (1998)

    Google Scholar 

  9. Luo, L., Guo, D., Ma, R.T.B., Rottenstreich, O., Luo, X.: Optimizing bloom filter: challenges, solutions, and comparisons. CoRR, abs/1804.04777 (2018)

    Google Scholar 

  10. Manku, G., Motwani, R.: Approximate frequency counts over data streams. In: Proceedings of 28th International Conference on Very Large Data Bases, VLDB 2002, pp. 346–357 (2002)

    Google Scholar 

  11. Tarkoma, S., Rothenberg, C.E., Lagerspetz, E.: Theory and practice of bloom filters for distributed systems. IEEE Commun. Surv. Tutorials 14(1), 131–155 (2012)

    Article  Google Scholar 

  12. Yoon, M.K.: Aging bloom filter with two active buffers for dynamic sets. IEEE Trans. Knowl. Data Eng. 22(1), 134–138 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kai Cheng .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Cheng, K. (2020). Hot Spot Tracking by Time-Decaying Bloom Filters and Reservoir Sampling. In: Barolli, L., Takizawa, M., Xhafa, F., Enokido, T. (eds) Advanced Information Networking and Applications. AINA 2019. Advances in Intelligent Systems and Computing, vol 926. Springer, Cham. https://doi.org/10.1007/978-3-030-15032-7_96

Download citation

Publish with us

Policies and ethics