Abstract
Closed-circuit television (CCTV) cameras are used extensively to monitor streets for public security. Whether passively recording day-to-day life or actively monitoring a developing situation such as public disorder, the recorded videos have proven invaluable to police forces worldwide for tracing suspects and victims alike. The volume of video produced by the array of cameras covering even a small area is large and growing, and post-event analysis of the collected video is an increasingly time-consuming problem for police forces. Automated computer vision analysis is desirable, but current systems are unable to process videos from CCTV cameras reliably: the video quality is low, and computer vision algorithms do not perform well enough to achieve usable results. In this chapter, we describe some of the reasons for the failure of contemporary algorithms and focus on the fundamental task of feature correspondence between frames of video, a well-studied problem that is often considered solved for high-quality video but remains a challenge in low-quality imagery. We present solutions to some of the problems that we identify, and provide a comprehensive analysis in which we demonstrate feature matching using a 138-dimensional descriptor that improves on the matching performance of a state-of-the-art 384-dimensional colour descriptor with just \(36\,\%\) of the storage requirements.
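As a quick check of the storage figure, and assuming (this is not stated in the abstract) that both descriptors store every dimension at the same numeric precision, the ratio of the two descriptor lengths is

\[ \frac{138}{384} = 0.359375 \approx 36\,\% \]

so under that assumption the quoted storage requirement follows directly from the dimension counts.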
Acknowledgments
This work is funded by the European Union’s Seventh Framework Programme, under the specific topic "framework and tools for (semi-) automated exploitation of massive amounts of digital data for forensic purposes", grant agreement number 607480 (LASIE IP project). The authors also extend their thanks to the Metropolitan Police at Scotland Yard, London, UK, for the supply of and permission to use CCTV images.
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Henderson, C., Izquierdo, E. (2016). Feature Correspondence in Low Quality CCTV Videos. In: Chen, L., Kapoor, S., Bhatia, R. (eds) Emerging Trends and Advanced Technologies for Computational Intelligence. Studies in Computational Intelligence, vol 647. Springer, Cham. https://doi.org/10.1007/978-3-319-33353-3_14
DOI: https://doi.org/10.1007/978-3-319-33353-3_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-33351-9
Online ISBN: 978-3-319-33353-3
eBook Packages: Engineering, Engineering (R0)