Learning Object Detectors in Stationary Environments

Roth, Peter M.; Sternig, Sabine; Bischof, Horst

doi:10.1007/978-1-4471-5520-1_13

Peter M. Roth⁶,
Sabine Sternig⁶ &
Horst Bischof⁶

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

3023 Accesses

Abstract

The most successful approach for object detection is still applying a sliding window technique, where a pre-trained classifier is evaluated on different locations and scales. In this chapter, we interrogate this strategy in the context of stationary environments. In particular, having a fixed camera position observing the same scene a lot of prior (spatio-temporal) information is available. Exploiting this specific scene information allows for (a) improving the detection performance and (b) for reducing the model complexity; both on reduced computational costs! These benefits are demonstrated for two different real-world tasks (i.e., person and car detection). In particular, we apply two different evaluation/update strategies (holistic, grid-based), where any suited online learner can be applied. In our case we demonstrate the proposed approaches for different applications and scenarios, clearly showing their benefits compared to generic methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
We refer a classifier to as an oracle, if it has a high precision, even at a low recall, and can thus be used to generate new training samples.
2.
http://www.cvg.rdg.ac.uk/PETS2006/ (November 30, 2012).
3.
http://www.eecs.qmul.ac.uk/~andrea/avss2007_d.html (November 30, 2012).
4.
This particular task was chosen as implementations of existing approaches as well as a number of benchmark datasets are publicly available.
5.
http://homepages.inf.ed.ac.uk/rbf/CAVIARDATA1 (November 30, 2012).

References

Abney S (2002) Bootstrapping. In: Proc annual meeting of the association for computational linguistics, pp 360–367
Google Scholar
Agarwal S, Awan A, Roth D (2004) Learning to detect objects in images via a sparse, part-based representation. IEEE Trans Pattern Anal Mach Intell 26(11):1475–1490
Article Google Scholar
Andrews S, Tsochantaridis I, Hofmann T (2003) Support vector machines for multiple-instance learning. In: Advances in neural information processing systems, pp 561–568
Google Scholar
Babenko B, Yang M-H, Belongie S (2009) Visual tracking with online mulitple instance learning. In: Proc IEEE conf on computer vision and pattern recognition
Google Scholar
Balcan M-F, Blum A, Yang K (2004) Co-training and expansion: towards bridging theory and practice. In: Advances in neural information processing systems, pp 89–96
Google Scholar
Blum A, Mitchell T (1998) Combining labeled and unlabeled data with co-training
Google Scholar
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proc IEEE conf on computer vision and pattern recognition
Google Scholar
Dietterich TG, Lathrop RH, Lozano-Pérez T (1997) Solving the multiple instance problem with axis-parallel rectangles. Artif Intell 89(1–2):31–71
Article MATH Google Scholar
Dollár P, Wojek C, Schiele B, Perona P (2011) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Pattern Anal Mach Intell 34(4):743–761
Article Google Scholar
Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In: Proc IEEE conf on computer vision and pattern recognition
Google Scholar
Freund Y, Schapire RE (1995) A decision-theoretic generalization of on-line learning and an application to boosting. In: Proc European conf on computational learning theory, pp 23–37
Chapter Google Scholar
Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55:119–139
Article MathSciNet MATH Google Scholar
Friedman J (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 29(5):1189–1232
Article MATH Google Scholar
Friedman J, Hastie T, Tibshirani R (2000) Additive logistic regression: a statistical view of boosting. Ann Stat 28(2):337–374
Article MathSciNet MATH Google Scholar
Goldberg AB, Li M, Zhu X (2008) Online manifold regularization: a new learning setting and empirical study. In: Proc European conf on machine learning and knowledge discovery in databases, vol I, pp 393–407
Chapter Google Scholar
Grabner H, Bischof H (2006) On-line boosting and vision. In: Proc IEEE conf on computer vision and pattern recognition
Google Scholar
Grabner H, Roth PM, Bischof H (2007) Is pedestrian detection really a hard task? In: Proc IEEE workshop on performance evaluation of tracking and surveillance
Google Scholar
Hoiem D, Efros AA, Hebert M (2006) Putting objects in perspective. In: Proc IEEE conf on computer vision and pattern recognition
Google Scholar
Javed O, Ali S, Shah M (2005) Online detection and classification of moving objects using progressively improving detectors. In: Proc IEEE conf on computer vision and pattern recognition
Google Scholar
Leibe B, Leonardis A, Schiele B (2008) Robust object detection with interleaved categorization and segmentation. Int J Comput Vis 77(1–3):259–289
Article Google Scholar
Leistner C, Amir R, Saffari AA, Roth PM, Bischof H (2009) On robustness of on-line boosting—a competitive study. In: Proc IEEE on-line learning for computer vision workshop
Google Scholar
Levin A, Viola P, Freund Y (2003) Unsupervised improvement of visual detectors using co-training. In: Proc IEEE int’l conf on computer vision
Google Scholar
Li M, Sethi IK (2006) Confidence-based active learning. IEEE Trans Pattern Anal Mach Intell 28(8):1251–1261
Article Google Scholar
Li L-J, Wang G, Fei-Fei L (2007) Optimol: automatic online picture collection via incremental model learning. In: Proc IEEE conf on computer vision and pattern recognition
Google Scholar
Liu R, Cheng J, Lu H (2009) A robust boosting tracker with minimum error bound in a co-training framework. In: Proc IEEE int’l conf on computer vision
Google Scholar
Mason L, Baxter J, Bartlett P, Frean M (1999) Functional gradient techniques for combining hypotheses. In: Advances in large margin classifiers. MIT Press, Cambridge, pp 221–247
Google Scholar
McFarlane NJB, Schofield CP (1995) Segmentation and tracking of piglets. Mach Vis Appl 8(3):187–193
Article Google Scholar
Nair V, Clark JJ (2004) An unsupervised, online learning framework for moving object detection. In: Proc IEEE conf on computer vision and pattern recognition
Google Scholar
Park J-H, Choi Y-K (1996) On-line learning for active pattern recognition. IEEE Signal Process Lett 3(11):301–303
Article Google Scholar
Rosenberg C, Hebert M, Schneiderman H (2005) Semi-supervised self-training of object detection models. In: IEEE workshop on applications of computer vision
Google Scholar
Roth PM, Bischof H (2008) Conservative learning for object detectors. Machine learning techniques for multimedia. Springer, Berlin
Google Scholar
Roth PM, Sternig S, Grabner H, Bischof H (2009) Classifier grids for robust adaptive object detection. In: Proc IEEE conf on computer vision and pattern recognition
Google Scholar
Schapire RE (1990) The strength of weak learnability. Mach Learn 5(2):197–227
Google Scholar
Schapire RE, Singer Y (2000) Boostexter: a boosting-based system for text categorization. Mach Learn 39(2/3):135–168
Article MATH Google Scholar
Skočaj D, Leonardis A (2008) Incremental and robust learning of subspace representations. Image Vis Comput 26(1):27–38
Article Google Scholar
Sternig S, Godec M, Roth PM, Bischof H (2010) TransientBoost: on-line boosting with transient data. In: Proc IEEE online learning for computer vision workshop (in conj CVPR)
Google Scholar
Sternig S, Roth PM, Bischof H (2012) On-line inverse multiple instance boosting for classifier grids. Pattern Recognit Lett 33(1):890–897
Article Google Scholar
Tieu K, Viola P (2000) Boosting image retrieval. In: Proc IEEE conf on computer vision and pattern recognition, vol I, pp 228–235
Google Scholar
Turtinen M, Pietikänien M (2005) Labeling of textured data with co-training and active learning. In: Proc workshop on texture analysis and synthesis, pp 137–142
Google Scholar
Vapnik VN (1995) The nature of statistical learning theory. Springer, New York
Book MATH Google Scholar
Viola P, Jones MJ (2001) Rapid object detection using a boosted cascade of simple features. In: Proc IEEE conf on computer vision and pattern recognition
Google Scholar
Viola P, Jones MJ, Snow D (2003) Detecting pedestrians using patterns of motion and appearance. In: Proc IEEE int’l conf on computer vision
Google Scholar
Viola P, Platt JC, Zhang C (2005) Multiple instance boosting for object detection. In: Advances in neural information processing systems
Google Scholar
Wei W, Zhou Z-H (2007) Analyzing co-training style algorithms. In: Proc European conf on machine learning, pp 454–465
Google Scholar
Wu B, Nevatia R (2005) Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors. In: Proc IEEE int’l conf on computer vision
Google Scholar
Wu B, Nevatia R (2007) Improving part based object detection by unsupervised, online boosting. In: Proc IEEE conf on computer vision and pattern recognition
Google Scholar
Yan R, Yang J, Hauptmann A (2003) Automatically labeling video data using multi-class active learning. In: Proc IEEE int’l conf on computer vision, vol I, pp 516–523
Chapter Google Scholar
Zhu Q, Avidan S, Cheng K-T (2005) Learning a sparse, corner-based representation for background modelling. In: Proc IEEE int’l conf on computer vision, vol I, pp 678–685
Google Scholar

Download references

Acknowledgements

The work was supported by the Austrian Science Foundation (FWF) project Advanced Learning for Tracking and Detection in Medical Workflow Analysis (I535-N23) and by the Austrian Research Promotion Agency (FFG) projects SHARE (831717) in the IV2Splus program and MobiTrick (8258408) in the FIT-IT program.

Author information

Authors and Affiliations

Institute for Computer Graphics and Vision, Graz University of Technology, Graz, Austria
Peter M. Roth, Sabine Sternig & Horst Bischof

Authors

Peter M. Roth
View author publications
You can also search for this author in PubMed Google Scholar
Sabine Sternig
View author publications
You can also search for this author in PubMed Google Scholar
Horst Bischof
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Peter M. Roth .

Editor information

Editors and Affiliations

Dipartimento di Matematica e Informatica, Università di Catania, Catania, Italy
Giovanni Maria Farinella
Dipartimento di Matematica e Informatica, Università di Catania, Catania, Italy
Sebastiano Battiato
Department of Engineering, University of Cambridge, Cambridge, United Kingdom
Roberto Cipolla

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Roth, P.M., Sternig, S., Bischof, H. (2013). Learning Object Detectors in Stationary Environments. In: Farinella, G., Battiato, S., Cipolla, R. (eds) Advanced Topics in Computer Vision. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-5520-1_13

Download citation

DOI: https://doi.org/10.1007/978-1-4471-5520-1_13
Publisher Name: Springer, London
Print ISBN: 978-1-4471-5519-5
Online ISBN: 978-1-4471-5520-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics