Abstract
A large portion of news articles contain images of persons whose names appear in the news stories. To provide image search for persons, most search engines construct an index from the textual descriptions of images, such as headlines and captions. The index search approach, although simple and scalable, has one serious drawback: a query for a person's name can match news articles that do not contain images of the target person, so irrelevant images can be returned as search results. Our main goal is to improve the performance of the index search approach through syntactic analysis of person name entities in the news articles. Given sentences containing person names, we construct a set of syntactic rules for identifying persons in news images. The rules are used to filter out images of non-target persons from the results returned by the index search. In our experiments, the approach improved performance over the basic index search by 10% on the F1-measure.
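The two-stage idea described above (index search followed by rule-based filtering, evaluated with the F1-measure) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy captions, the function names, and in particular the single filtering rule (discard a hit when every mention of the name directly follows a preposition such as "of") are hypothetical and are not the syntactic rules constructed in the paper, which the abstract does not list.

```python
import re
from collections import defaultdict

# Hypothetical toy corpus: image id -> caption text (not data from the paper).
CAPTIONS = {
    "img1": "John Smith speaks at a press conference in Bangkok.",
    "img2": "Supporters of John Smith gather outside the court.",
    "img3": "Mary Jones (left) greets John Smith at the airport.",
}


def build_index(captions):
    """Map each lowercase token to the set of image ids whose caption contains it."""
    index = defaultdict(set)
    for image_id, text in captions.items():
        for token in re.findall(r"[a-z]+", text.lower()):
            index[token].add(image_id)
    return index


def index_search(index, name):
    """Baseline: return images whose caption contains every token of the name."""
    tokens = re.findall(r"[a-z]+", name.lower())
    if not tokens:
        return set()
    postings = [index.get(token, set()) for token in tokens]
    return set.intersection(*postings)


def passes_rules(caption, name):
    """Illustrative rule only (an assumption, not the paper's rule set):
    keep the caption if at least one mention of the name is NOT directly
    preceded by a preposition such as 'of', 'for', 'about', or 'against'."""
    pattern = re.compile(
        r"(\b(?:of|for|about|against)\s+)?" + re.escape(name), re.IGNORECASE
    )
    return any(m.group(1) is None for m in pattern.finditer(caption))


def filter_hits(captions, hits, name):
    """Apply the rule filter to the images returned by the index search."""
    return {image_id for image_id in hits if passes_rules(captions[image_id], name)}


def f1_score(returned, relevant):
    """F1-measure: harmonic mean of precision and recall over sets of image ids."""
    true_positives = len(returned & relevant)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(returned)
    recall = true_positives / len(relevant)
    return 2 * precision * recall / (precision + recall)
```

In this toy setting, the baseline search for "John Smith" also returns img2, whose caption mentions the name without suggesting that the person is pictured; the filter removes it, which raises precision without lowering recall. Real captions would need a much richer rule set than this single pattern.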
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Haruechaiyasak, C., Damrongrat, C. (2010). Identifying Persons in News Article Images Based on Textual Analysis. In: Chowdhury, G., Koo, C., Hunter, J. (eds) The Role of Digital Libraries in a Time of Global Change. ICADL 2010. Lecture Notes in Computer Science, vol 6102. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13654-2_27
DOI: https://doi.org/10.1007/978-3-642-13654-2_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13653-5
Online ISBN: 978-3-642-13654-2
eBook Packages: Computer Science (R0)