Skip to main content

Continuous Summarization of Co-evolving Data in Large Water Distribution Network

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6184))

Abstract

While traditional water data stream analysis focuses mainly on single sensor node or monitoring station, having an accurate picture of the overall data patterns is more meaningful in understanding large water distribution network’s behavior and characteristics, tracking important trends, and also making informed judgments about measurement or utilization operations. In this paper, we propose a continuous summarization scheme that aims to continuously provide Representative Patterns of the complete data in large water distribution network. Our core contributions are to propose to summarize Representative Pattern for describing the spatial-temporal pattern in water distribution network and employ a parameter-free algorithm based on the Minimum Description Length (MDL) Principle to automatically split data streams into episodes for generating the series of representative patterns. Moreover, we evaluate our approaches on a real water distribution network from the Battle of the Water Sensor Network (BWSN). Experiment results show that our online summarization methods are effective, scalable and interpretable; What’s more, we discover interesting periodic time-evolving patterns on the chlorine data.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ostfeld, A., Uber, J.G., Salomons, E.: Battle of water sensor networks: A design challenge for engineers and algorithms. In: WDSA (2006)

    Google Scholar 

  2. Rossman, L.A.: EPANET2 user’s manual. National Risk Management Research Laboratory: U.S. Environmental Protection Agency (2000)

    Google Scholar 

  3. Hart, D.B., Klise, K.A., McKenna, S.A., Wilson, M.P.: CANARY User’s Manual Version 4.1. Sandia National Laboratories. U.S. Environmental Protection Agency (2009)

    Google Scholar 

  4. Gionis, A., Mannila, H., Tsaparas, P.: Clustering aggregation. TKDD 1(1) (2007)

    Google Scholar 

  5. Ester, M., Peter Kriegel, H., Sander, J.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD 1996 (1996)

    Google Scholar 

  6. Rissanen, J.: Modeling by the shortest data description. Automatica 14, 465–471 (1978)

    Article  MATH  Google Scholar 

  7. Sun, J., Papadimitriou, S., Yu, P.S., Faloutsos, C.: Graphscope: Parameter-free mining of large time-evolving graphs. In: KDD (2007)

    Google Scholar 

  8. Papadimitriou, S., Sun, J.: Streaming Pattern Discovery in Multiple Time-Series. In: VLDB 2005 (2005)

    Google Scholar 

  9. Sun, J., Tao, D.: Beyond streams and graphs: dynamic tensor analysis. In: KDD (2006)

    Google Scholar 

  10. Silja, T., Helena, R., Didia, C.: Water Supply System Performance for Different Pipe Materials Part I: Water Quality Analysis. Water Resour. Manage. (2008)

    Google Scholar 

  11. Zhu, Y., Shasha, D.: Statstream: Statistical monitoring of thousands of data streams in real time. In: VLDB (2002)

    Google Scholar 

  12. Sakurai, Y., Papadimitriou, S., Faloutsos, C.: Braid: Stream mining through group lag correlations. In: SIGMOD (2005)

    Google Scholar 

  13. Lin, J., Vlachos, M., Keogh, E., Gunopulos, D.: Iterative incremental clustering of time series. In: Bertino, E., Christodoulakis, S., Plexousakis, D., Christophides, V., Koubarakis, M., Böhm, K., Ferrari, E. (eds.) EDBT 2004. LNCS, vol. 2992, pp. 106–122. Springer, Heidelberg (2004)

    Google Scholar 

  14. Li, L., McCann, J., Pollard, N., Faloutsos, C.: DynaMMo: Mining and Summarization of

    Google Scholar 

  15. Johnson, D.S., Krishnan, S., Chhugani, J., Kumar, S., Venkatasubramanian, S.: Compressing large Boolean matrices using reordering techniques. In: VLDB (2004)

    Google Scholar 

  16. Muthukrishnan, S.: Data Streams: Algorithms and Applications (Foundations and Trends in Theoretical Computer Science), vol. 1 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xiao, H., Ma, X., Tang, S., Tian, C. (2010). Continuous Summarization of Co-evolving Data in Large Water Distribution Network. In: Chen, L., Tang, C., Yang, J., Gao, Y. (eds) Web-Age Information Management. WAIM 2010. Lecture Notes in Computer Science, vol 6184. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14246-8_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-14246-8_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-14245-1

  • Online ISBN: 978-3-642-14246-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics