A Comparative Study of Answer-Contained Snippets and Traditional Snippets

Mao, Xian-Ling; Wang, Dan; Hao, Yi-Jing; Yuan, Wenqing; Huang, Heyan

doi:10.1007/978-3-319-48051-0_5

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9994)

Included in the following conference series:

Asia Information Retrieval Symposium

Abstract

Almost every text search engine uses snippets to help users quickly assess the relevance of retrieved items in the ranked list. Although it seems intuitive that answer-contained snippets improve search effectiveness, this intuition has not been studied quantitatively. In this paper, we first propose a simple answer-contained snippet method for community-based Question and Answer (cQA) search, and then compare it with state-of-the-art traditional snippet algorithms. The experimental results show that the answer-contained snippet method significantly outperforms the state-of-the-art traditional methods in both relevance judgements and information satisfaction evaluations.

Notes

1. http://www.thuir.org/1click/ntcir9/.
2. http://answers.yahoo.com/.
3. http://cran.r-project.org/web/packages/gbm/.
4. In this paper, the user question has the same meaning as the user query.
5. The data presented in Table 2 were acquired by averaging the results for each query over the total number of queries, thus producing the average recall, precision and \(F_1\) values per query.
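
The per-query averaging described in note 5 can be illustrated with a minimal sketch. The paper's exact definitions of recall, precision and \(F_1\) are not shown in this preview, so the set-based definitions below are an assumption, and the function names `prf1` and `macro_average` are hypothetical.

```python
def prf1(retrieved, relevant):
    """Set-based precision, recall and F1 for a single query.

    Assumption: both arguments are collections of item identifiers;
    the paper's actual matching unit is not stated in this preview.
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def macro_average(per_query):
    """Average each metric over all queries.

    per_query is a list of (retrieved, relevant) pairs, one per query.
    Returns (mean precision, mean recall, mean F1).
    """
    if not per_query:
        raise ValueError("at least one query is required")
    scores = [prf1(retrieved, relevant) for retrieved, relevant in per_query]
    n = len(scores)
    return tuple(sum(score[i] for score in scores) / n for i in range(3))


# Two toy queries with made-up identifiers.
queries = [
    (["a", "b"], ["a"]),       # precision 0.5, recall 1.0
    (["c"], ["c", "d"]),       # precision 1.0, recall 0.5
]
avg_precision, avg_recall, avg_f1 = macro_average(queries)
```

Under this scheme the average \(F_1\) is the mean of the per-query \(F_1\) values, which generally differs from the \(F_1\) computed from the averaged precision and recall.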

Acknowledgments

This work was supported by the 863 Program (2015AA015404), the China National Science Foundation (61402036, 60973083, 61273363), the Beijing Technology Project (Z151100001615029), the Science and Technology Planning Project of Guangdong Province (2014A010103009, 2015A020217002), and the Guangzhou Science and Technology Planning Project (201604020179).

Author information

Authors and Affiliations

Beijing Institute of Technology, Beijing, China
Xian-Ling Mao, Dan Wang, Yi-Jing Hao & Heyan Huang
Beijing Guzhang Mobile Technology Co., Beijing, China
Wenqing Yuan

Corresponding author

Correspondence to Xian-Ling Mao.

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Shaoping Ma
Renmin University of China, Beijing, China
Ji-Rong Wen
Tsinghua University, Beijing, China
Yiqun Liu
Renmin University of China, Beijing, China
Zhicheng Dou
Tsinghua University, Beijing, China
Min Zhang
Yahoo Labs, Sunnyvale, California, USA
Yi Chang
Renmin University of China, Beijing, China
Xin Zhao

About this paper

Cite this paper

Mao, XL., Wang, D., Hao, YJ., Yuan, W., Huang, H. (2016). A Comparative Study of Answer-Contained Snippets and Traditional Snippets. In: Ma, S., et al. Information Retrieval Technology. AIRS 2016. Lecture Notes in Computer Science, vol 9994. Springer, Cham. https://doi.org/10.1007/978-3-319-48051-0_5

DOI: https://doi.org/10.1007/978-3-319-48051-0_5
Published: 15 October 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48050-3
Online ISBN: 978-3-319-48051-0
eBook Packages: Computer Science, Computer Science (R0)
