Local Feature Based Multiple Object Instance Identification Using Scale and Rotation Invariant Implicit Shape Model

Bao, Ruihan; Higa, Kyota; Iwamoto, Kota

doi:10.1007/978-3-319-16628-5_43

Ruihan Bao¹⁵,
Kyota Higa¹⁵ &
Kota Iwamoto¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9008))

Included in the following conference series:

Asian Conference on Computer Vision

1869 Accesses
3 Citations

Abstract

In this paper, we propose a Scale and Rotation Invariant Implicit Shape Model (SRIISM), and develop a local feature matching based system using the model to accurately locate and identify large numbers of object instances in an image. Due to repeated instances and cluttered background, conventional methods for multiple object instance identification suffer from poor identification results. In the proposed SRIISM, we model the joint distribution of object centers, scale, and orientation computed from local feature matches in Hough voting, which is not only invariant to scale changes and rotation of objects, but also robust to false feature matches. In the multiple object instance identification system using SRIISM, we apply a fast 4D bin search method in Hough space with complexity \(O(n)\), where \(n\) is the number of feature matches, in order to segment and locate each instance. Furthermore, we apply maximum likelihood estimation (MLE) for accurate object pose detection. In the evaluation, we created datasets simulating various industrial applications such as pick-and-place and inventory management. Experiment results on the datasets show that our method outperforms conventional methods in both accuracy (5 %–30 % gain) and speed (2x speed up).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Zickler, S., Veloso, M.: Detection and localization of multiple objects. In: 2006 6th IEEE-RAS International Conference on Humanoid Robots, pp. 20–25 (2006)
Google Scholar
Collet, A., Martinez, M., Srinivasa, S.S.: The moped framework: Object recognition and pose estimation for manipulation. Int. J. Robot. Res. 30, 1–23 (2001). 0278364911401765
Google Scholar
Piccinini, P., Prati, A., Cucchiara, R.: Real-time object detection and localization with sift-based clustering. Image Vis. Comput. 30, 573–587 (2012)
Article Google Scholar
Lin, F.E., Kuo, Y.H., Hsu, W.H.: Multiple object localization by context-aware adaptive window search and search-based object recognition. In: Proceedings of the 19th ACM International Conference on Multimedia, MM 2011, pp. 1021–1024. ACM, New York (2011)
Google Scholar
Higa, K., Iwamoto, K., Nomura, T.: Multiple object identification using grid voting of object center estimated from keypoint matches. In: 2013 20th IEEE International Conference on Image Processing (ICIP), pp. 2973–2977 (2013)
Google Scholar
Leibe, B., Leonardis, A., Schiele, B.: Robust object detection with interleaved categorization and segmentation. Int. J. Comput. Vis. 77, 259–289 (2008)
Article Google Scholar
Liu, M.Y., Tuzel, O., Veeraraghavan, A., Chellappa, R.: Fast directional chamfer matching. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1696–1703 (2010)
Google Scholar
Barinova, O., Lempitsky, V., Kholi, P.: On detection of multiple object instances using hough transforms. IEEE Trans. Pattern Anal. Mach. Intell. 34, 1773–1784 (2012)
Article Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004)
Article Google Scholar
Bay, H., Ess, A., Tuytelaars, T., Gool, L.V.: Speeded-up robust features (surf). Comput. Vis. Image Underst. 110, 346–359 (2008)
Article Google Scholar
Wu, C.C., Kuo, Y.H., Hsu, W.: Large-scale simultaneous multi-object recognition and localization via bottom up search-based approach. In: Proceedings of the 20th ACM International Conference on Multimedia, MM 2012, pp. 969–972. ACM, New York (2012)
Google Scholar
Maji, S., Malik, J.: Object detection using a max-margin hough transform. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 1038–1045. IEEE (2009)
Google Scholar
Sivic, J., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: Proceedings of the Ninth IEEE International Conference on Computer Vision, pp. 1470–1477. IEEE (2003)
Google Scholar
Arandjelovic, R., Zisserman, A.: Three things everyone should know to improve object retrieval. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2911–2918. IEEE (2012)
Google Scholar
Perona, P.: David lowe’s recognition system (2004)
Google Scholar
Korman, S., Reichman, D., Tsur, G., Avidan, S.: Fast-match: Fast affine template matching. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1940–1947. IEEE (2013)
Google Scholar
Sutherland, I.E., Hodgman, G.W.: Reentrant polygon clipping. Commun. ACM 17, 32–42 (1974)
Article MATH Google Scholar
Iwamoto, K., Mase, R., Nomura, T.: Bright: A scalable and compact binary descriptor for low-latency and high accuracy object identification. In: 2013 20th IEEE International Conference on Image Processing (ICIP), pp. 2915–2919 (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

Information and Media Processing Laboratories, NEC Corporation, Kawasaki, Japan
Ruihan Bao, Kyota Higa & Kota Iwamoto

Authors

Ruihan Bao
View author publications
You can also search for this author in PubMed Google Scholar
Kyota Higa
View author publications
You can also search for this author in PubMed Google Scholar
Kota Iwamoto
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ruihan Bao .

Editor information

Editors and Affiliations

Center for Visual Information Technology, International Institute of Information Technology, Hyderabad, India
C.V. Jawahar
Institue of Computing Technology, Chinese Academy of Sciences, Beijing, China
Shiguang Shan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bao, R., Higa, K., Iwamoto, K. (2015). Local Feature Based Multiple Object Instance Identification Using Scale and Rotation Invariant Implicit Shape Model. In: Jawahar, C., Shan, S. (eds) Computer Vision - ACCV 2014 Workshops. ACCV 2014. Lecture Notes in Computer Science(), vol 9008. Springer, Cham. https://doi.org/10.1007/978-3-319-16628-5_43

Download citation

DOI: https://doi.org/10.1007/978-3-319-16628-5_43
Published: 12 April 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16627-8
Online ISBN: 978-3-319-16628-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics