Abstract
In this chapter, we describe deep learning techniques that are proposed for roadside video data analysis. We firstly present an introduction to deep learning concepts, and a short review of several typical types of CNN.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
L. Zheng, Y. Zhao, S. Wang, J. Wang, Q. Tian, Good practice in CNN feature transfer. arXiv preprint arXiv:1604.00133 (2016)
S.D. Learning, CS231n: convolutional neural networks for visual recognition (2016). http://cs231n.github.io/convolutional-networks/
J. Ba, V. Mnih, K. Kavukcuoglu, Multiple object recognition with visual attention, arXiv preprint arXiv:1412.7755 (2014)
J. Donahue, L. Anne Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, et al., Long-term recurrent convolutional networks for visual recognition and description, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2015), pp. 2625–2634
A. Dundar, J. Jin, E. Culurciello, Convolutional clustering for unsupervised learning. arXiv preprint arXiv:1511.06241 (2015)
D.V. Nguyen, L. Kuhnert, K.D. Kuhnert, Structure overview of vegetation detection. A novel approach for efficient vegetation detection using an active lighting system. Robot. Auton. Syst. 60, 498–508 (2012)
I. Lenz, H. Lee, A. Saxena, Deep learning for detecting robotic grasps. Int. J. Robot. Res. 34, 705–724 (2015)
L. Romaszko, A deep learning approach with an ensemble-based neural network classifier for black box ICML 2013 contest, in Workshop on Challenges in Representation Learning, International Conference on Machine Learning (ICML) (2013), pp. 1–3
S. Ahmad Radzi, K.-H. Mohamad, S.S. Liew, R. Bakhteri, Convolutional neural network for face recognition with pose and illumination variation. Int. J. Eng. Technol. (IJET) 6, 44–57 (2014)
F. Shaheen, B. Verma, M. Asafuddoula, Impact of automatic feature extraction in deep learning architecture, in Digital Image Computing: Techniques and Applications (DICTA), International Conference on (2016), pp. 1–8
C. Cortes, Y. LeCun, C.J.C. Burges, The MNIST database of handwritten digits. http://yann.lecun.com/exdb/mnist/
K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385 (2015)
Y. Lecun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998)
D.V. Nguyen, L. Kuhnert, K.D. Kuhnert, Spreading algorithm for efficient vegetation detection in cluttered outdoor environments. Robot. Auton. Syst. 60, 1498–1507 (2012)
D.V. Nguyen, L. Kuhnert, T. Jiang, S. Thamke, K.D. Kuhnert, Vegetation detection for outdoor automobile guidance, in Industrial Technology (ICIT), IEEE International Conference on (2011), pp. 358–364
A. Bosch, X. Muñoz, J. Freixenet, Segmentation and description of natural outdoor scenes. Image Vis. Comput. 25, 727–740 (2007)
W. Guo, U.K. Rage, S. Ninomiya, Illumination invariant segmentation of vegetation for time series wheat images based on decision tree model. Comput. Electron. Agri. 96, 58–66 (2013)
F. Shaheen, B. Verma, An ensemble of deep learning architectures for automatic feature extraction, in Computational Intelligence (ISSCI), IEEE Symposium Series on (2016) (in Press)
D.-X. Liu, T. Wu, B. Dai, Fusing ladar and color image for detection grass off-road scenario, in Vehicular Electronics and Safety (ICVES), IEEE International Conference on (2007), pp. 1–4
R. Mottaghi, S. Fidler, A. Yuille, R. Urtasun, D. Parikh, Human-machine CRFS for identifying bottlenecks in scene understanding. Pattern Anal. Mach. Intell. IEEE Trans. 38, 74–87 (2016)
J. Shotton, J. Winn, C. Rother, A. Criminisi, Textonboost for image understanding: multi-class object recognition and segmentation by jointly modeling texture, layout, and context. Int. J. Comput. Vis. 81, 2–23 (2009)
S. Gould, J. Rodgers, D. Cohen, G. Elidan, D. Koller, Multi-class segmentation with relative location prior. Int. J. Comput. Vis. 80, 300–316 (2008)
Y. Jimei, B. Price, S. Cohen, Y. Ming-Hsuan, Context driven scene parsing with attention to rare classes, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2014), pp. 3294–3301
A. Singhal, L. Jiebo, Z. Weiyu, Probabilistic spatial context models for scene content understanding, in Computer Vision and Pattern Recognition, (CVPR), IEEE Conference on (2003), pp. 235–241
B. Micusik, J. Kosecka, Semantic segmentation of street scenes by superpixel co-occurrence and 3D geometry, in Computer Vision Workshops (ICCV Workshops), IEEE 12th International Conference on (2009), pp. 625–632
C. Farabet, C. Couprie, L. Najman, Y. LeCun, Learning hierarchical features for scene labeling. Pattern Anal. Mach. Intell. IEEE Trans. 35, 1915–1929 (2013)
M. Seyedhosseini, T. Tasdizen, Semantic image segmentation with contextual hierarchical models. Pattern Anal. Mach. Intell. IEEE Trans. 38(5), 951–964 (2015)
D. Batra, R. Sukthankar, C. Tsuhan, Learning class-specific affinities for image labelling, in Computer Vision and Pattern Recognition, (CVPR), IEEE Conference on (2008), pp. 1–8
Z. Lei, J. Qiang, Image segmentation with a unified graphical model. Pattern Anal. Mach. Intell. IEEE Trans. 32, 1406–1425 (2010)
R. Xiaofeng, B. Liefeng, D. Fox, RGB-(D) scene labeling: features and algorithms, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2012), pp. 2759–2766
A.G. Schwing, R. Urtasun, Fully connected deep structured networks. arXiv preprint arXiv:1503.02351 (2015)
S. Zheng, S. Jayasumana, B. Romera-Paredes, V. Vineet, Z. Su, et al., Conditional random fields as recurrent neural networks. arXiv preprint arXiv:1502.03240 (2015)
P.H. Pinheiro, R. Collobert, Recurrent convolutional neural networks for scene parsing. arXiv preprint arXiv:1306.2795 (2013)
A. Sharma, O. Tuzel, M.-Y. Liu, Recursive context propagation network for semantic scene labeling, in Advances in Neural Information Processing Systems (2014), pp. 2447–2455
A. Sharma, O. Tuzel, D.W. Jacobs, Deep hierarchical parsing for semantic segmentation, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2015), pp. 530–538
S. Ling, L. Li, L. Xuelong, Feature learning for image classification via multiobjective genetic programming. Neural Netw. Learn. Syst. IEEE Trans. 25, 1359–1371 (2014)
L. Zhang, B. Verma, D. Stockwell, S. Chowdhury, Spatially constrained location prior for scene parsing, in Neural Networks (IJCNN), International Joint Conference on (2016), pp. 1480–1486
J. Tighe, S. Lazebnik, Superparsing: scalable nonparametric image parsing with superpixels, in Computer Vision (ECCV), European Conference on (2010), pp. 352–365
P. Hanchuan, L. Fuhui, C. Ding, Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. Pattern Anal. Mach. Intell. IEEE Trans. 27, 1226–1238 (2005)
P. Felzenszwalb, D. Huttenlocher, Efficient graph-based image segmentation. Int. J. Comput. Vis. 59, 167–181 (2004)
S. Gould, R. Fulton, D. Koller, Decomposing a scene into geometric and semantically consistent regions, in Computer Vision (ICCV), IEEE 12th International Conference on (2009), pp. 1–8
L. Ce, J. Yuen, A. Torralba, Nonparametric scene parsing: label transfer via dense scene alignment, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2009), pp. 1972–1979
V. Lempitsky, A. Vedaldi, A. Zisserman, Pylon model for semantic segmentation, in Advances in Neural Information Processing Systems (2011), pp. 1485–1493
D. Munoz, J.A. Bagnell, M. Hebert, Stacked hierarchical labeling, in Computer Vision (ECCV), European Conference on (2010), pp. 57–70
L. Ladicky, C. Russell, P. Kohli, P.H.S. Torr, Associative hierarchical random fields. Pattern Anal. Mach. Intell. IEEE Trans. 36, 1056–1077 (2014)
A. Krizhevsky, I. Sutskever, G.E. Hinton, ImageNet classification with deep convolutional neural networks, in Advances in Neural Information Processing Systems (2012), pp. 1097–1105
M.D. Zeiler, R. Fergus, Visualizing and understanding convolutional networks, in European Conference on Computer Vision (2014), pp. 818–833
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, et al., Going deeper with convolutions, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2015), pp. 1–9
K. Simonyan, A. Zisserman, Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
K. He, X. Zhang, S. Ren, J. Sun, Identity mappings in deep residual networks. arXiv preprint arXiv:1603.05027 (2016)
S. Bing, W. Gang, Z. Zhen, W. Bing, Z. Lifan, Integrating parametric and non-parametric models for scene labeling, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2015), pp. 4249–4258
J. Shotton, J. Winn, C. Rother, A. Criminisi, Textonboost: joint appearance, shape and context modeling for multi-class object recognition and segmentation, in Computer Vision (ECCV), European Conference on (2006), pp. 1–15
Z. Long, C. Yuanhao, L. Yuan, L. Chenxi, A. Yuille, Recursive segmentation and recognition templates for image parsing. Pattern Anal. Mach. Intell. IEEE Trans. 34, 359–371 (2012)
E. Akbas, N. Ahuja, Low-level hierarchical multiscale segmentation statistics of natural images. Pattern Anal. Mach. Intell. IEEE Trans. 36, 1900–1906 (2014)
A. Lucchi, L. Yunpeng, P. Fua, Learning for structured prediction using approximate subgradient descent with working sets, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2013), pp. 1987–1994
C. Gatta, F. Ciompi, Stacked sequential scale-spacetaylor context. Pattern Anal. Mach. Intell. IEEE Trans. 36, 1694–1700 (2014)
M. Najafi, S.T. Namin, M. Salzmann, L. Petersson, Sample and filter: nonparametric scene parsing via efficient filtering. arXiv preprint arXiv:1511.04960 (2015)
T.V. Nguyen, L. Canyi, J. Sepulveda, Y. Shuicheng, Adaptive nonparametric image parsing. Circ. Syst. Video Technol. IEEE Trans. 25, 1565–1575 (2015)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Verma, B., Zhang, L., Stockwell, D. (2017). Deep Learning Techniques for Roadside Video Data Analysis. In: Roadside Video Data Analysis. Studies in Computational Intelligence, vol 711. Springer, Singapore. https://doi.org/10.1007/978-981-10-4539-4_4
Download citation
DOI: https://doi.org/10.1007/978-981-10-4539-4_4
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-4538-7
Online ISBN: 978-981-10-4539-4
eBook Packages: EngineeringEngineering (R0)