Skip to main content

Sequence-to-Sequence Imputation of Missing Sensor Data

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11919))

Abstract

Although the sequence-to-sequence (encoder-decoder) model is considered the state-of-the-art in deep learning sequence models, there is little research into using this model for recovering missing sensor data. The key challenge is that the missing sensor data problem typically comprises three sequences (a sequence of observed samples, followed by a sequence of missing samples, followed by another sequence of observed samples) whereas, the sequence-to-sequence model only considers two sequences (an input sequence and an output sequence). We address this problem by formulating a sequence-to-sequence in a novel way. A forward RNN encodes the data observed before the missing sequence and a backward RNN encodes the data observed after the missing sequence. A decoder decodes the two encoders in a novel way to predict the missing data. We demonstrate that this model produces the lowest errors in 12% more cases than the current state-of-the-art.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://archive.ics.uci.edu/ml/index.php.

References

  1. Arasteh, H., et al.: IoT-based smart cities: a survey. In: 2016 IEEE 16th International Conference on Environment and Electrical Engineering (EEEIC), pp. 1–6, June 2016. https://doi.org/10.1109/EEEIC.2016.7555867

  2. Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. In: Proceedings International Conference on Learning Representations (2015). http://arxiv.org/abs/1409.0473

  3. Buuren, S.v., Groothuis-Oudshoorn, K.: Multivariate imputation by chained equations: Mice v1.0 user’s manual. Technical report. PG/VGZ/00.038, TNO Prevention and Health, Leiden (2000). http://www.stefvanbuuren.nl/publications/MICE

  4. Cao, W., Wang, D., Li, J., Zhou, H., Li, L., Li, Y.: BRITS: bidirectional recurrent imputation for time series. In: Advances in Neural Information Processing Systems, pp. 6775–6785 (2018)

    Google Scholar 

  5. Che, Z., Purushotham, S., Cho, K., Sontag, D., Liu, Y.: Recurrent neural networks for multivariate time series with missing values. Sci. Rep. 8(1), 6085 (2018). https://doi.org/10.1038/s41598-018-24271-9

    Article  Google Scholar 

  6. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997). https://doi.org/10.1162/neco.1997.9.8.1735

    Article  Google Scholar 

  7. Kingma, D., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

  8. Liang, X., et al.: Assessing Beijing’s PM\(_2.5\) pollution: severity, weather impact, APEC and winter heating. Proc. Roy. Soc. A: Math. Phys. Eng. Sci. 471(2182), 20150257 (2015). https://doi.org/10.1098/rspa.2015.0257. https://royalsocietypublishing.org/doi/abs/10.1098/rspa.2015.0257

    Article  Google Scholar 

  9. Poon, C.C.Y., Lo, B.P.L., Yuce, M.R., Alomainy, A., Hao, Y.: Body sensor networks: in the era of big data and beyond. IEEE Rev. Biomed. Eng. 8, 4–16 (2015). https://doi.org/10.1109/RBME.2015.2427254

    Article  Google Scholar 

  10. Shen, L., Ma, Q., Li, S.: End-to-end time series imputation via residual short paths. In: Zhu, J., Takeuchi, I. (eds.) Proceedings of The 10th Asian Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 95, PMLR, pp. 248–263, 14–16 November 2018. http://proceedings.mlr.press/v95/shen18a.html

  11. Stolfi, D.H., Alba, E., Yao, X.: Predicting car park occupancy rates in smart cities. In: Alba, E., Chicano, F., Luque, G. (eds.) International Conference on Smart Cities, vol. 10268, pp. 107–117. Springer, Heidelberg (2017). https://doi.org/10.1007/978-3-319-59513-9_11

    Chapter  Google Scholar 

  12. Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 27, pp. 3104–3112. Curran Associates, Inc. (2014). http://papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural-networks.pdf

  13. Yi, X., Zheng, Y., Zhang, J., Li, T.: ST-MVL: filling missing values in geo-sensory time series data. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI’16, pp. 2704–2710. AAAI Press (2016). http://dl.acm.org/citation.cfm?id=3060832.3060999

  14. Yick, J., Mukherjee, B., Ghosal, D.: Wireless sensor network survey. Comput. Netw. 52(12), 2292–2330 (2008). https://doi.org/10.1016/j.comnet.2008.04.002. http://www.sciencedirect.com/science/article/pii/S1389128608001254

    Article  Google Scholar 

  15. Yoon, J., Zame, W.R., van der Schaar, M.: Deep sensing: active sensing using multi-directional recurrent neural networks. In: Sixth International Conference on Learning Representations (2018)

    Google Scholar 

  16. Zhang, Y., Thorburn, P.J., Xiang, W., Fitch, P.: SSIM-a deep learning approach for recovering missing time series sensor data. IEEE Internet Things J. 6(4), 6618–6628 (2019). https://doi.org/10.1109/JIOT.2019.2909038

    Article  Google Scholar 

  17. Zhou, J., Huang, Z.: Recover missing sensor data with iterative imputing network. In: Workshops at the Thirty-Second AAAI Conference on Artificial Intelligence (2018)

    Google Scholar 

Download references

Acknowledgments

The authors thank YiFan Zhang from CSIRO for the discussions around the topic of this study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Joel Janek Dabrowski .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dabrowski, J.J., Rahman, A. (2019). Sequence-to-Sequence Imputation of Missing Sensor Data. In: Liu, J., Bailey, J. (eds) AI 2019: Advances in Artificial Intelligence. AI 2019. Lecture Notes in Computer Science(), vol 11919. Springer, Cham. https://doi.org/10.1007/978-3-030-35288-2_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-35288-2_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-35287-5

  • Online ISBN: 978-3-030-35288-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics