Linguistically-Based Reranking of Google’s Snippets with GreG

Delmonte, Rodolfo; Tripodi, Rocco

doi:10.1007/978-3-642-21384-7_5

Rodolfo Delmonte⁵ &
Rocco Tripodi⁵

Part of the book series: Studies in Computational Intelligence ((SCI,volume 361))

453 Accesses

Abstract

We present an experiment evaluating the contribution of a system called GReG for reranking the snippets returned by Google’s search engine in the 10 hits presented to the user and captured by the use of Google’s API. The evaluation aims at establishing whether or not the introduction of deep linguistic information may improve the accuracy of Google or rather it is the opposite case as maintained by the majority of people working in Information Retrieval and using a Bag Of Words approach. We used 900 questions and answers taken from TREC 8 and 9 competitions and execute three different types of evaluation: one without any linguistic aid; a second one with tagging and syntactic constituency contribution; another run with what we call Partial Logical Form. Even though GReG is still work in progress, it is possible to draw clear cut conclusions: adding linguistic information to the evaluation process of the best snippet that can answer a question improves enormously the performance. In another experiment we used the actual texts associated to the Q/A pairs distributed by one of TREC’s participant and got even higher accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bresnan, J.: Lexical-Functional Syntax. Blackwell, Malden (2000)
Google Scholar
ComLex, http://nlp.cs.nyu.edu/comlex
Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet project. In: Proceedings of COLING-ACL 1998, Montreal, Canada (1998)
Google Scholar
Ellsworth, M., Erk, K., Kingsbury, P., Pado, S.: PropBank, SALSA, and FrameNet: How design determines product. In: Proceedings of the LREC 2004 Workshop on Building Lexical Resources from Semantically Annotated Corpora, Lisbon (2004)
Google Scholar
Fellbaum, C. (ed.): WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
MATH Google Scholar
Delmonte, R. (ed.): Computational Linguistic Text Processing – Logical Form, Semantic Interpretation, Discourse Relations and Question Answering. Nova Science Publishers, New York (2007)
Google Scholar
Delmonte, R.: Computational Linguistic Text Processing – Lexicon, Grammar, Parsing and Anaphora Resolution. Nova Science Publishers, New York (2009)
Google Scholar
Delmonte, R.: Deep & Shallow Linguistically Based Parsing. In: Delmonte, R., Di Sciullo, A.M. (eds.) UG and External Systems, pp. 335–374. John Benjamins, Amsterdam (2005)
Google Scholar
Delmonte, R., Bristot, A., Piccolino Boniforti, M.A., Tonelli, S.: Entailment and Anaphora Resolution in RTE3. In: ACL Workshop on Text Entailment and Paraphrasing, Prague, ACL Madison, USA, pp. 48–53 (2007)
Google Scholar
Cui, H., Sun, R., Li, K., Kan, M.-Y., Chua, T.-S.: Question Answering Pas-sage Retrieval Using Dependency Relations. In: SIGIR 2005, pp. 400–406. ACM, Salvador (2005)
Chapter Google Scholar
Litkowski, K.C.: Syntactic Clues and Lexical Resources in Question-Answering. In: Voorhees, E.M., Harman, D.K. (eds.) The Ninth Text Retrieval Conference (TREC-9), pp. 157–166. NIST Special Publication, Gaithersburg (2001)
Google Scholar
Wang, M., Smith, N.A., Mitamura, T.: What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Prague, pp. 22–32 (2007)
Google Scholar
Traum, D., Habash, N.: Generation from Lexical Conceptual Structure. In: Workshop on Applied Interlinguas, ANLP 2000, Seattle, WA, pp. 123–134 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Language Science, Università “Ca Foscari”, 30123, Venezia, Italy
Rodolfo Delmonte & Rocco Tripodi

Authors

Rodolfo Delmonte
View author publications
You can also search for this author in PubMed Google Scholar
Rocco Tripodi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

InterAnalytics, Rue des Savoises, 19, 1205, Geneva, Switzerland
Vincenzo Pallotta
CRS4, Center of Advanced Studies Research and Development in Sardinia, Parco Scientifico della Sardegna, Ed. 1, 09010, Loc. Piscinamanna Pula, CA, Italy
Alessandro Soro
Department of Electrical and Electronic Engineering, University of Cagliari, 09123, Piazza d’Armi, Cagliari, Italy
Eloisa Vargiu

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Delmonte, R., Tripodi, R. (2011). Linguistically-Based Reranking of Google’s Snippets with GreG. In: Pallotta, V., Soro, A., Vargiu, E. (eds) Advances in Distributed Agent-Based Retrieval Tools. Studies in Computational Intelligence, vol 361. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21384-7_5

Download citation

DOI: https://doi.org/10.1007/978-3-642-21384-7_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21383-0
Online ISBN: 978-3-642-21384-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics