Skip to main content

A Comparative Study of Answer-Contained Snippets and Traditional Snippets

  • Conference paper
  • First Online:
Information Retrieval Technology (AIRS 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9994))

Included in the following conference series:

  • 846 Accesses

Abstract

Almost every text search engine uses snippets to help users quickly assess the relevance of retrieved items in the ranked list. Although answer-contained snippets can help to improve the effectiveness of search intuitively, quantitative study of such intuition remains untouched. In this paper, we first propose a simple answer-contained snippet method for community-based Question and Answer (cQA) search, and then compare our method with the state-of-the-art traditional snippet algorithms. The experimental results show that the answer-contained snippet method significantly outperforms the state-of-the-art traditional methods, considering relevance judgements and information satisfaction evaluations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.thuir.org/1click/ntcir9/.

  2. 2.

    http://answers.yahoo.com/.

  3. 3.

    http://cran.r-project.org/web/packages/gbm/.

  4. 4.

    In this paper, the user question has the same meaning as the user query.

  5. 5.

    The data presented in Table 2 were acquired by averaging the results for each query over the total number of queries, thus producing the average recall, precision and \(F_1\) values per query.

References

  1. Campos, R., Dias, G., Jorge, A.M., Jatowt, A.: Survey of temporal information retrieval and related applications. ACM Comput. Surv. 47(2), 1–41 (2015)

    Article  Google Scholar 

  2. Jeon, J., Croft, W.B., Lee, J.H.: Finding similar questions in large question and answer archives. In: ACM International Conference on Information and Knowledge Management, pp. 84–90. ACM (2005)

    Google Scholar 

  3. Lee, J.T., Kim, S.B., Song, Y.I., Rim, H.C.: Bridging lexical gaps between queries and questions on large online qa collections with compact translation models. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2008, 25–27 October 2008, Honolulu, A Meeting of Sigdat, A Special Interest Group of the ACL, pp. 410–418 (2008)

    Google Scholar 

  4. Xue, X., Jeon, J., Croft, W.B.: Retrieval models for question and answer archives. In: International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR, Singapore, pp. 475–482, July 2008

    Google Scholar 

  5. Tombros, A., Sanderson, M.: Advantages of query biased summaries in information retrieval. In: Proceedings of ACM SIGIR, pp. 2–10 (1998)

    Google Scholar 

  6. Wang, C., Jing, F., Zhang, L., Zhang, H.J.: Learning query-biased web page summarization. In: Sixteenth ACM Conference on Information and Knowledge Management, CIKM, Lisbon, pp. 555–562, November 2007

    Google Scholar 

  7. Huang, Y., Liu, Z., Chen, Y.: Query biased snippet generation in XML search. In: ACM SIGMOD International Conference on Management of Data, pp. 315–326. ACM (2008)

    Google Scholar 

  8. He, J., Shu, B., Li, X., Yan, H.: Effective time ratio: a measure for web search engines with document snippets. In: Cheng, P.-J., Kan, M.-Y., Lam, W., Nakov, P. (eds.) AIRS 2010. LNCS, vol. 6458, pp. 73–84. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  9. Zhou, G., Zhou, Y., He, T., Wu, W.: Learning semantic representation with neural networks for community question answering retrieval. Knowl. Based Syst. 93, 75–83 (2015)

    Article  Google Scholar 

  10. Bernhard, D., Gurevych, I.: Combining lexical semantic resources with question and answer archives for translation-based answer finding. In: ACL 2009, Proceedings of the, Meeting of the Association for Computational Linguistics and the, International Joint Conference on Natural Language Processing of the AFNLP, 2–7 August 2009, Singapore, pp. 728–736 (2009)

    Google Scholar 

  11. Edmundson, H.P.: New methods in automatic extracting. J. ACM 16(2), 264–285 (1969)

    Article  MATH  Google Scholar 

  12. Gomez-Nieto, E., San, R.F., Pagliosa, P., Casaca, W., Helou, E.S., Oliveira, M.C., et al.: Similarity preserving snippet-based visualization of web search results. IEEE Trans. Vis. Comput. Graph. 20(3), 457–470 (2014)

    Article  Google Scholar 

  13. Silber, H.G., Mccoy, K.F.: Efficiently computed lexical chains as an intermediate representation for automatic text summarization. Comput. Linguist. 28(4), 487–496 (2002)

    Article  Google Scholar 

  14. Turpin, A., Tsegay, Y., Hawking, D., Williams, H.E.: Fast generation of result snippets in web search. In: SIGIR 2007: Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, pp. 127–134, July 2007

    Google Scholar 

  15. Goldstein, J., Kantrowitz, M., Mittal, V., Carbonell, J.: Summarizing text documents: sentence selection and evaluation metrics. In: Research and Development in Information Retrieval, pp. 121–128 (1999)

    Google Scholar 

  16. Joho, H., Hannah, D., Jose, J.M.: Emulating query-biased summaries using document titles. In: International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 709–710. ACM (2008)

    Google Scholar 

  17. Ichikawa, K., Morishita, S.: A simple but powerful heuristic method for accelerating k-means clustering of large-scale data in life science. IEEE/ACM Trans. Comput. Biol. Bioinf. (TCBB) 11(4), 681–692 (2014)

    Article  Google Scholar 

  18. Metzler, D.: Machine learned sentence selection strategies for query-biased summarization. In: SIGIR Learning to Rank Workshop (2008)

    Google Scholar 

  19. Ellkvist, T., Strmbck, L., Lins, L.D., Freire, J.: A first study on strategies for generating workflow snippets. In: International Workshop on Keyword Search on Structured Data, pp. 15–20(2009)

    Google Scholar 

Download references

Acknowledgments

This work was supported by 863 Program (2015AA015404), China National Science Foundation (61402036, 60973083, 61273363), Beijing Technology Project (Z151100001615029), Science and Technology Planning Project of Guangdong Province (2014A010103009, 2015A020217002), Guangzhou Science and Technology Planning Project (201604020179).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xian-Ling Mao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Mao, XL., Wang, D., Hao, YJ., Yuan, W., Huang, H. (2016). A Comparative Study of Answer-Contained Snippets and Traditional Snippets. In: Ma, S., et al. Information Retrieval Technology. AIRS 2016. Lecture Notes in Computer Science(), vol 9994. Springer, Cham. https://doi.org/10.1007/978-3-319-48051-0_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-48051-0_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-48050-3

  • Online ISBN: 978-3-319-48051-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics