Research on Middle-Semantic Manifold Object Annotation

Feng, Wengang; Wu, Shaozhong

doi:10.1007/978-3-642-37835-5_20

Conference paper
First Online: 01 January 2013

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 215)

Abstract

This paper presents a novel bionic, middle-semantic object annotation framework whose model is based on perception as defined by the human visual system. First, the image is represented by super-pixels, and a conditional random field labels each super-pixel, which annotates the different classes of objects. Next, building on that result, an image pyramid is used to represent the image and to obtain sub-regions of objects of the same class. After a descriptor is extracted for each patch, all patches are projected onto a manifold, which annotates the different views of objects from the same class. Experiments show that the framework achieves superior accuracy and indirectly verifies the correctness of WordNet.
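The abstract does not say how the image pyramid is built or how patches are cut from it, so the following is only a minimal sketch under stated assumptions: each level halves the previous one by 2x2 averaging, and patches are taken with a fixed-size sliding window. The function names (`downsample`, `build_pyramid`, `extract_patches`) and the parameters `min_size`, `size`, and `stride` are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

# A grayscale image as a list of rows of pixel intensities (assumed representation).
Image = List[List[float]]


def downsample(img: Image) -> Image:
    """Halve each dimension by averaging non-overlapping 2x2 blocks."""
    h, w = len(img) // 2, len(img[0]) // 2
    return [
        [
            (img[2 * r][2 * c] + img[2 * r][2 * c + 1]
             + img[2 * r + 1][2 * c] + img[2 * r + 1][2 * c + 1]) / 4.0
            for c in range(w)
        ]
        for r in range(h)
    ]


def build_pyramid(img: Image, min_size: int) -> List[Image]:
    """Return levels from full resolution down to the coarsest level whose
    sides are still at least min_size pixels."""
    levels = [img]
    while (len(levels[-1]) // 2 >= min_size
           and len(levels[-1][0]) // 2 >= min_size):
        levels.append(downsample(levels[-1]))
    return levels


def extract_patches(img: Image, size: int, stride: int) -> List[Tuple[int, int, Image]]:
    """Slide a size x size window over img; return (top, left, patch) tuples."""
    patches = []
    for top in range(0, len(img) - size + 1, stride):
        for left in range(0, len(img[0]) - size + 1, stride):
            patch = [row[left:left + size] for row in img[top:top + size]]
            patches.append((top, left, patch))
    return patches


# Toy 8x8 image whose pixel value encodes its position (illustrative data only).
image = [[float(r * 8 + c) for c in range(8)] for r in range(8)]
pyramid = build_pyramid(image, min_size=2)
patches_per_level = [extract_patches(level, size=2, stride=2) for level in pyramid]
```

In a full pipeline, each patch would then be turned into a descriptor and embedded on a manifold; those steps are omitted here because the abstract does not name the descriptor or the embedding method.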

Acknowledgments

This work was financially supported by the Chinese People’s Public Security University Natural Science Foundation (2011LG08).

Author information

Authors and Affiliations

Department of Policing Intelligence, Chinese People’s Public Security University, Beijing, 100038, China
Wengang Feng & Shaozhong Wu
Public Security Intelligence Research Center, Chinese People’s Public Security University, Beijing, 100038, China
Wengang Feng

Corresponding author

Correspondence to Wengang Feng.

Editor information

Editors and Affiliations

Department of Computer Science and Technology, Tsinghua University, Beijing, People's Republic of China
Fuchun Sun
College of Mechatronics and Automation, National University of Defense Technology, Changsha, People's Republic of China
Dewen Hu
Department of Computer Science and Technology, Tsinghua University, Beijing, People's Republic of China
Huaping Liu

About this paper

Cite this paper

Feng, W., Wu, S. (2014). Research on Middle-Semantic Manifold Object Annotation. In: Sun, F., Hu, D., Liu, H. (eds) Foundations and Practical Applications of Cognitive Systems and Information Processing. Advances in Intelligent Systems and Computing, vol 215. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37835-5_20

DOI: https://doi.org/10.1007/978-3-642-37835-5_20
Published: 22 September 2013
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37834-8
Online ISBN: 978-3-642-37835-5
eBook Packages: Engineering, Engineering (R0)
