Local Feature Based Multiple Object Instance Identification Using Scale and Rotation Invariant Implicit Shape Model

  • Conference paper
Computer Vision - ACCV 2014 Workshops (ACCV 2014)

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 9008)

Abstract

In this paper, we propose a Scale and Rotation Invariant Implicit Shape Model (SRIISM) and develop a local-feature-matching system that uses the model to accurately locate and identify large numbers of object instances in an image. Because of repeated instances and cluttered backgrounds, conventional methods for multiple object instance identification often produce poor identification results. In the proposed SRIISM, we model the joint distribution of object center, scale, and orientation computed from local feature matches in Hough voting. The resulting model is not only invariant to scale changes and rotation of objects, but also robust to false feature matches. In the multiple object instance identification system using SRIISM, we apply a fast 4D bin search method in Hough space with complexity \(O(n)\), where \(n\) is the number of feature matches, to segment and locate each instance. Furthermore, we apply maximum likelihood estimation (MLE) for accurate object pose detection. For the evaluation, we created datasets simulating industrial applications such as pick-and-place and inventory management. Experimental results on these datasets show that our method outperforms conventional methods in both accuracy (5%–30% gain) and speed (2x speedup).
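The voting idea in the abstract can be illustrated with a generic sketch. This is not the authors' SRIISM implementation: it is a plain Hough-voting example in which each feature match predicts an object center, scale, and rotation, and the votes are accumulated in a hashed 4D grid in a single pass over the matches. The keypoint layout `(x, y, scale, orientation)`, the function names, the bin sizes, and the `min_votes` threshold are all assumptions made for illustration, and the MLE pose refinement mentioned above is not shown.

```python
import math
from collections import defaultdict

# A keypoint is (x, y, scale, orientation_in_radians); a match pairs a model
# keypoint with a scene keypoint. All names and parameters are illustrative.

def cast_vote(model_kp, scene_kp, model_center):
    """Predict (center_x, center_y, scale, rotation) of the object from one match."""
    mx, my, m_scale, m_theta = model_kp
    sx, sy, s_scale, s_theta = scene_kp
    scale = s_scale / m_scale
    rotation = (s_theta - m_theta) % (2 * math.pi)
    # Offset from the model keypoint to the model center, rotated and scaled
    # into the scene frame.
    dx, dy = model_center[0] - mx, model_center[1] - my
    cos_r, sin_r = math.cos(rotation), math.sin(rotation)
    cx = sx + scale * (dx * cos_r - dy * sin_r)
    cy = sy + scale * (dx * sin_r + dy * cos_r)
    return cx, cy, scale, rotation

def vote_in_4d(matches, model_center, xy_bin=20.0, angle_bins=12, min_votes=3):
    """Accumulate votes in a hashed 4D grid with one pass over the matches."""
    accumulator = defaultdict(list)
    for index, (model_kp, scene_kp) in enumerate(matches):
        cx, cy, scale, rotation = cast_vote(model_kp, scene_kp, model_center)
        key = (
            int(cx // xy_bin),                      # center x bin
            int(cy // xy_bin),                      # center y bin
            round(math.log2(scale)),                # scale bin (octaves)
            int(rotation / (2 * math.pi) * angle_bins) % angle_bins,
        )
        accumulator[key].append(index)
    # Keep only bins with enough agreeing matches; isolated votes are dropped.
    return {k: v for k, v in accumulator.items() if len(v) >= min_votes}

def synthesize_matches(model_kps, model_center, center, scale, rotation):
    """Build ideal matches for one object instance placed in the scene."""
    cos_r, sin_r = math.cos(rotation), math.sin(rotation)
    result = []
    for mx, my, m_scale, m_theta in model_kps:
        dx, dy = mx - model_center[0], my - model_center[1]
        sx = center[0] + scale * (dx * cos_r - dy * sin_r)
        sy = center[1] + scale * (dx * sin_r + dy * cos_r)
        result.append(((mx, my, m_scale, m_theta),
                       (sx, sy, m_scale * scale, m_theta + rotation)))
    return result

MODEL_CENTER = (50.0, 50.0)
MODEL_KPS = [(10.0, 10.0, 2.0, 0.0), (90.0, 20.0, 3.0, 0.5),
             (30.0, 80.0, 1.5, 1.0), (70.0, 60.0, 4.0, 2.0)]

# Two instances of the same object plus one false match.
matches = (
    synthesize_matches(MODEL_KPS, MODEL_CENTER, (110.0, 110.0), 1.0, 0.2)
    + synthesize_matches(MODEL_KPS, MODEL_CENTER, (310.0, 210.0), 2.0, 1.7)
    + [((10.0, 10.0, 2.0, 0.0), (400.0, 50.0, 2.0, 3.0))]
)
instances = vote_in_4d(matches, MODEL_CENTER)
for key, members in instances.items():
    print(key, members)
```

In this synthetic setup the two placed instances fall into separate 4D bins, while the lone false match does not reach `min_votes` and is discarded. A hard quantization like this can split votes that land near bin boundaries, so a practical system would likely also inspect neighboring bins.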


Author information

Corresponding author

Correspondence to Ruihan Bao.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Bao, R., Higa, K., Iwamoto, K. (2015). Local Feature Based Multiple Object Instance Identification Using Scale and Rotation Invariant Implicit Shape Model. In: Jawahar, C., Shan, S. (eds) Computer Vision - ACCV 2014 Workshops. ACCV 2014. Lecture Notes in Computer Science, vol 9008. Springer, Cham. https://doi.org/10.1007/978-3-319-16628-5_43

  • DOI: https://doi.org/10.1007/978-3-319-16628-5_43

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-16627-8

  • Online ISBN: 978-3-319-16628-5

  • eBook Packages: Computer Science, Computer Science (R0)
