A Relational Kernel-Based Framework for Hierarchical Image Understanding

Antanas, Laura; Frasconi, Paolo; Costa, Fabrizio; Tuytelaars, Tinne; De Raedt, Luc

doi:10.1007/978-3-642-34166-3_19

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7626)

Abstract

While relational representations were popular in early work on syntactic and structural pattern recognition, they are rarely used in contemporary approaches to computer vision because of their purely symbolic nature. Recent progress and successes in combining statistical learning principles with relational representations motivate us to reinvestigate the use of such representations. More specifically, we show that statistical relational learning can be successfully used for hierarchical image understanding. We employ kLog, a new logical and relational language for learning with kernels, to detect objects at different levels of the hierarchy. The key advantage of kLog is that both appearance features and rich contextual dependencies between parts of a scene can be integrated in a principled and interpretable way to obtain a qualitative representation of the problem. At each layer, qualitative spatial structures of parts in images are detected, classified, and then employed one layer up the hierarchy to obtain higher-level semantic structures. We apply a four-layer hierarchy to street view images and successfully detect corners, windows, doors, and individual houses.
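The layer-by-layer idea described in the abstract can be illustrated with a small, purely hypothetical Python sketch. It does not use kLog and is not the authors' method: the `Part` type, the `close` relation, the radii, and the hand-written rules `classify_opening` and `classify_house` are all assumptions that stand in for the learned kernel-based classifiers. The sketch only shows how parts detected at one layer might be grouped by a qualitative spatial relation, classified, and passed one layer up.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass(frozen=True)
class Part:
    """A detected part: a symbolic label plus a centre position (hypothetical type)."""
    label: str
    x: float
    y: float


def close(a: Part, b: Part, radius: float) -> bool:
    """Qualitative spatial relation: true if two parts lie within `radius` of each other."""
    return ((a.x - b.x) ** 2 + (a.y - b.y) ** 2) ** 0.5 <= radius


def group_by_closeness(parts: List[Part], radius: float) -> List[List[Part]]:
    """Split parts into connected components of the `close` relation."""
    unvisited = list(parts)
    groups = []
    while unvisited:
        frontier = [unvisited.pop()]
        group = []
        while frontier:
            p = frontier.pop()
            group.append(p)
            neighbours = [q for q in unvisited if close(p, q, radius)]
            for q in neighbours:
                unvisited.remove(q)
            frontier.extend(neighbours)
        groups.append(group)
    return groups


def run_layer(parts: List[Part], radius: float,
              classify: Callable[[List[Part]], Optional[str]]) -> List[Part]:
    """Group lower-level parts, classify each group, and emit higher-level parts."""
    lifted = []
    for group in group_by_closeness(parts, radius):
        label = classify(group)
        if label is not None:
            cx = sum(p.x for p in group) / len(group)
            cy = sum(p.y for p in group) / len(group)
            lifted.append(Part(label, cx, cy))
    return lifted


def classify_opening(group: List[Part]) -> Optional[str]:
    """Stand-in rule (assumption): four corners form a door if tall, else a window."""
    if len(group) != 4 or any(p.label != "corner" for p in group):
        return None
    width = max(p.x for p in group) - min(p.x for p in group)
    height = max(p.y for p in group) - min(p.y for p in group)
    return "door" if height > 1.5 * width else "window"


def classify_house(group: List[Part]) -> Optional[str]:
    """Stand-in rule (assumption): a door together with a window forms a house."""
    labels = {p.label for p in group}
    return "house" if {"door", "window"} <= labels else None


# Toy input: corners of one square opening and one tall opening.
corners = [Part("corner", x, y) for x, y in
           [(0, 0), (10, 0), (0, 10), (10, 10),
            (60, 0), (70, 0), (60, 25), (70, 25)]]

openings = run_layer(corners, radius=30.0, classify=classify_opening)
houses = run_layer(openings, radius=100.0, classify=classify_house)
print(sorted(p.label for p in openings), [p.label for p in houses])
```

In the framework the abstract describes, the classification step at each layer is learned from relational descriptions of the scene rather than written by hand; the fixed rules above are only placeholders that keep the example self-contained.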

Author information

Authors and Affiliations

Katholieke Universiteit Leuven, Belgium
Laura Antanas, Paolo Frasconi, Fabrizio Costa, Tinne Tuytelaars & Luc De Raedt

Editor information

Editors and Affiliations

Department of Computer Science, University of Auckland, Private Bag 92019, 1142, Auckland, New Zealand
Georgy Gimel’farb
Department of Computer Science, University of York, Deramore Lane, YO10 5GH, York, UK
Edwin Hancock
Institute of Media and Information Technology, Chiba University, Yayoi-cho 1-33, 263-8522, Inage-ku, Chiba, Japan
Atsushi Imiya
Technische Universität/Fraunhofer IGD, Fraunhoferstraße 5, 64283, Darmstadt, Germany
Arjan Kuijper
Graduate School of Information Science and Technology, Hokkaido University, 060-0814, Sapporo, Japan
Mineichi Kudo
Graduate School of Engineering, Tohoku University, 6-6-05 Aoba, Aramaki, Aoba-ku, 980-8579, Sendai, Miyagi, Japan
Shinichiro Omachi
Centre for Vision, Speech and Signal Processing, University of Surrey, GU2 7XH, Guildford, Surrey, UK
Terry Windeatt
C&C Innovation Research Laboratories, NEC Corporation, 8916-47 Takayama-cho, Ikoma-Shi, Nara, Japan
Keiji Yamada

About this paper

Cite this paper

Antanas, L., Frasconi, P., Costa, F., Tuytelaars, T., De Raedt, L. (2012). A Relational Kernel-Based Framework for Hierarchical Image Understanding. In: Gimel’farb, G., et al. Structural, Syntactic, and Statistical Pattern Recognition. SSPR/SPR 2012. Lecture Notes in Computer Science, vol 7626. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34166-3_19

DOI: https://doi.org/10.1007/978-3-642-34166-3_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34165-6
Online ISBN: 978-3-642-34166-3
eBook Packages: Computer Science, Computer Science (R0)
