Skip to main content

Infographics Retrieval: A New Methodology

  • Conference paper
Natural Language Processing and Information Systems (NLDB 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8455))

Abstract

Information graphics, such as bar charts and line graphs, are a rich knowledge source that should be accessible to users. However, techniques that have been effective for document or image retrieval are inadequate for the retrieval of such graphics. We present and evaluate a new methodology that hypothesizes information needs from user queries and retrieves infographics based on how well the inherent structure and intended message of the graphics satisfy the query information needs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Arguello, J., Elsas, J.L., Callan, J., Carbonell, J.G.: Document representation and query expansion models for blog recommendation. In: Proc. ICWSM (2008)

    Google Scholar 

  2. Bendersky, M., Croft, W.B.: Discovering key concepts in verbose queries. In: Proc. of ACM SIGIR Conf. on Res. and Dev. in Information Retrieval, pp. 491–498 (2008)

    Google Scholar 

  3. Bendersky, M., Croft, W.B.: Analysis of long queries in a large scale search log. In: Proc. of the Workshop on Web Search Click Data, pp. 8–14 (2009)

    Google Scholar 

  4. Boston, C., Fang, H., Carberry, S., Wu, H., Liu, X.: Wikimantic: Toward effective disambiguation and expansion of queries. In: Data & Knowledge Engineering (2014)

    Google Scholar 

  5. Carberry, S., Elzer, S., Demir, S.: Information graphics: an untapped resource for digital libraries. In: Proc. of ACM SIGIR Conf. on Res. and Dev. in Information Retrieval, pp. 581–588 (2006)

    Google Scholar 

  6. Chester, D., Elzer, S.: Getting computers to see information graphics so users do not have to. In: Hacid, M.-S., Murray, N.V., Raś, Z.W., Tsumoto, S. (eds.) ISMIS 2005. LNCS (LNAI), vol. 3488, pp. 660–668. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  7. Clark, S., Curran, J.: Wide-coverage efficient statistical parsing with ccg and log-linear models. Computational Linguistics 33(4), 493–552 (2007)

    Article  MATH  Google Scholar 

  8. Damljanovic, D., Agatonovic, M., Cunningham, H.: Freya: An interactive way of querying linked data using natural language. In: García-Castro, R., Fensel, D., Antoniou, G. (eds.) ESWC 2011. LNCS, vol. 7117, pp. 125–138. Springer, Heidelberg (2012)

    Google Scholar 

  9. Demir, S., Carberry, S., Elzer, S.: Effectively realizing the inferred message of an information graphic. In: Proc. of the Int. Conf. on Recent Advances in Natural Language Processing, pp. 150–156 (2007)

    Google Scholar 

  10. Elzer, S., Carberry, S., Zukerman, I.: The automated understanding of simple bar charts. Artificial Intelligence 175(2), 526–555 (2011)

    Article  MathSciNet  Google Scholar 

  11. Escalante, H.J., Hernández, C., López, A., Marín, H., Montes, M., Morales, E., Sucar, L.E., Villaseñor, L.: Towards annotation-based query and document expansion for image retrieval. In: Peters, C., Jijkoun, V., Mandl, T., Müller, H., Oard, D.W., Peñas, A., Petras, V., Santos, D. (eds.) CLEF 2007. LNCS, vol. 5152, pp. 546–553. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  12. Fang, H., Tao, T., Zhai, C.: A formal study of information retrieval heuristics. In: Proc. of ACM Conf. on Res. and Dev. in Information Retrieval, pp. 49–56 (2004)

    Google Scholar 

  13. Futrelle, R.P., Nikolakis, N.: Efficient analysis of complex diagrams using constraint-based parsing. In: Proc. of Int. Conf. on Document Analysis and Recognition, vol. 2, pp. 782–790 (1995)

    Google Scholar 

  14. Gao, Y., Wang, M., Luan, H., Shen, J., Yan, S., Tao, D.: Tag-based social image search with visual-text joint hypergraph learning. In: Proc. of Int. Conf. on Multimedia, pp. 1517–1520 (2011)

    Google Scholar 

  15. Hiemstra, D.: Statistical language models for intelligent xml retrieval. In: Blanken, H.M., Grabs, T., Schek, H.-J., Schenkel, R., Weikum, G. (eds.) Intelligent Search on XML Data. LNCS, vol. 2818, pp. 107–118. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  16. Huang, W., Tan, C.L.: A system for understanding imaged infographics and its applications. In: Proc. of ACM Symposium on Document Engineering, pp. 9–18 (2007)

    Google Scholar 

  17. Järvelin, K., Kekäläinen, J.: Cumulated gain-based evaluation of ir techniques. ACM Trans. Inf. Syst. 20(4), 422–446 (2002)

    Article  Google Scholar 

  18. Larkin, J.H., Simon, H.A.: Why a diagram is (sometimes) worth ten thousand words. Cognitive Science 11(1), 65–100 (1987)

    Article  Google Scholar 

  19. Li, Z., Stagitis, M., Carberry, S., McCoy, K.F.: Towards retrieving relevant information graphics. In: Proc. of ACM SIGIR Conf. on Res. and Dev. in Information Retrieval, pp. 789–792 (2013)

    Google Scholar 

  20. Li, Z., Stagitis, M., McCoy, K., Carberry, S.: Towards finding relevant information graphics: Identifying the independent and dependent axis from user-written queries (2013), http://www.aaai.org/ocs/index.php/FLAIRS/FLAIRS13/paper/view/5939

  21. Liu, J., Pasupat, P., Wang, Y., Cyphers, S., Glass, J.: Query understanding enhanced by hierarchical parsing structures. In: IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 72–77 (2013)

    Google Scholar 

  22. Metzler, D., Cai, C.: Usc/isi at trec 2011: Microblog track. In: TREC (2011)

    Google Scholar 

  23. Metzler, D., Croft, W.B.: A markov random field model for term dependencies. In: Proc. of ACM Conf. on Res. and Dev. in Information Retrieval, pp. 472–479 (2005)

    Google Scholar 

  24. Phan, N., Bailey, P., Wilkinson, R.: Understanding the relationship of information need specificity to search query length. In: Proc. of ACM SIGIR Conf. on Res. and Dev. in Information Retrieval, pp. 709–710 (2007)

    Google Scholar 

  25. Savva, M., Kong, N., Chhajta, A., Fei-Fei, L., Agrawala, M., Heer, J.: Revision: Automated classification, analysis and redesign of chart images. In: Proc. of ACM Symposium on User Interface Software and Technology, pp. 393–402 (2011)

    Google Scholar 

  26. Tan, P., Steinbach, M., Kumar, V.: Introd. to Data Mining. Addison Wesley (2006)

    Google Scholar 

  27. Wu, P., Carberry, S., Elzer, S., Chester, D.: Recognizing the intended message of line graphs. In: Goel, A.K., Jamnik, M., Narayanan, N.H. (eds.) Diagrams 2010. LNCS (LNAI), vol. 6170, pp. 220–234. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Li, Z., Carberry, S., Fang, H., McCoy, K.F., Peterson, K. (2014). Infographics Retrieval: A New Methodology. In: Métais, E., Roche, M., Teisseire, M. (eds) Natural Language Processing and Information Systems. NLDB 2014. Lecture Notes in Computer Science, vol 8455. Springer, Cham. https://doi.org/10.1007/978-3-319-07983-7_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-07983-7_15

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-07982-0

  • Online ISBN: 978-3-319-07983-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics