Abstract
Although the sequence-to-sequence (encoder-decoder) model is considered the state-of-the-art in deep learning sequence models, there is little research into using this model for recovering missing sensor data. The key challenge is that the missing sensor data problem typically comprises three sequences (a sequence of observed samples, followed by a sequence of missing samples, followed by another sequence of observed samples) whereas, the sequence-to-sequence model only considers two sequences (an input sequence and an output sequence). We address this problem by formulating a sequence-to-sequence in a novel way. A forward RNN encodes the data observed before the missing sequence and a backward RNN encodes the data observed after the missing sequence. A decoder decodes the two encoders in a novel way to predict the missing data. We demonstrate that this model produces the lowest errors in 12% more cases than the current state-of-the-art.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Arasteh, H., et al.: IoT-based smart cities: a survey. In: 2016 IEEE 16th International Conference on Environment and Electrical Engineering (EEEIC), pp. 1–6, June 2016. https://doi.org/10.1109/EEEIC.2016.7555867
Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. In: Proceedings International Conference on Learning Representations (2015). http://arxiv.org/abs/1409.0473
Buuren, S.v., Groothuis-Oudshoorn, K.: Multivariate imputation by chained equations: Mice v1.0 user’s manual. Technical report. PG/VGZ/00.038, TNO Prevention and Health, Leiden (2000). http://www.stefvanbuuren.nl/publications/MICE
Cao, W., Wang, D., Li, J., Zhou, H., Li, L., Li, Y.: BRITS: bidirectional recurrent imputation for time series. In: Advances in Neural Information Processing Systems, pp. 6775–6785 (2018)
Che, Z., Purushotham, S., Cho, K., Sontag, D., Liu, Y.: Recurrent neural networks for multivariate time series with missing values. Sci. Rep. 8(1), 6085 (2018). https://doi.org/10.1038/s41598-018-24271-9
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997). https://doi.org/10.1162/neco.1997.9.8.1735
Kingma, D., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Liang, X., et al.: Assessing Beijing’s PM\(_2.5\) pollution: severity, weather impact, APEC and winter heating. Proc. Roy. Soc. A: Math. Phys. Eng. Sci. 471(2182), 20150257 (2015). https://doi.org/10.1098/rspa.2015.0257. https://royalsocietypublishing.org/doi/abs/10.1098/rspa.2015.0257
Poon, C.C.Y., Lo, B.P.L., Yuce, M.R., Alomainy, A., Hao, Y.: Body sensor networks: in the era of big data and beyond. IEEE Rev. Biomed. Eng. 8, 4–16 (2015). https://doi.org/10.1109/RBME.2015.2427254
Shen, L., Ma, Q., Li, S.: End-to-end time series imputation via residual short paths. In: Zhu, J., Takeuchi, I. (eds.) Proceedings of The 10th Asian Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 95, PMLR, pp. 248–263, 14–16 November 2018. http://proceedings.mlr.press/v95/shen18a.html
Stolfi, D.H., Alba, E., Yao, X.: Predicting car park occupancy rates in smart cities. In: Alba, E., Chicano, F., Luque, G. (eds.) International Conference on Smart Cities, vol. 10268, pp. 107–117. Springer, Heidelberg (2017). https://doi.org/10.1007/978-3-319-59513-9_11
Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 27, pp. 3104–3112. Curran Associates, Inc. (2014). http://papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural-networks.pdf
Yi, X., Zheng, Y., Zhang, J., Li, T.: ST-MVL: filling missing values in geo-sensory time series data. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI’16, pp. 2704–2710. AAAI Press (2016). http://dl.acm.org/citation.cfm?id=3060832.3060999
Yick, J., Mukherjee, B., Ghosal, D.: Wireless sensor network survey. Comput. Netw. 52(12), 2292–2330 (2008). https://doi.org/10.1016/j.comnet.2008.04.002. http://www.sciencedirect.com/science/article/pii/S1389128608001254
Yoon, J., Zame, W.R., van der Schaar, M.: Deep sensing: active sensing using multi-directional recurrent neural networks. In: Sixth International Conference on Learning Representations (2018)
Zhang, Y., Thorburn, P.J., Xiang, W., Fitch, P.: SSIM-a deep learning approach for recovering missing time series sensor data. IEEE Internet Things J. 6(4), 6618–6628 (2019). https://doi.org/10.1109/JIOT.2019.2909038
Zhou, J., Huang, Z.: Recover missing sensor data with iterative imputing network. In: Workshops at the Thirty-Second AAAI Conference on Artificial Intelligence (2018)
Acknowledgments
The authors thank YiFan Zhang from CSIRO for the discussions around the topic of this study.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Dabrowski, J.J., Rahman, A. (2019). Sequence-to-Sequence Imputation of Missing Sensor Data. In: Liu, J., Bailey, J. (eds) AI 2019: Advances in Artificial Intelligence. AI 2019. Lecture Notes in Computer Science(), vol 11919. Springer, Cham. https://doi.org/10.1007/978-3-030-35288-2_22
Download citation
DOI: https://doi.org/10.1007/978-3-030-35288-2_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-35287-5
Online ISBN: 978-3-030-35288-2
eBook Packages: Computer ScienceComputer Science (R0)