Time Event Extraction to Boost an Information Retrieval System

Part of the book series: Studies in Computational Intelligence (SCI, volume 668)

Abstract

In this chapter we propose an innovative information retrieval system that manages temporal information. The system lets users add temporal constraints to a classical keyword-based search. Information about temporal events is automatically extracted from text at indexing time and stored in an ad hoc data structure, which the retrieval module exploits to find relevant documents. The system can therefore search for textual information that refers to specific periods of time. We perform an exploratory case study by indexing all Italian Wikipedia articles.
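The idea of combining a keyword search with a temporal constraint can be illustrated with a minimal sketch. This is not the chapter's implementation: the class `ToyTemporalIndex`, its methods, and the four-digit-year regular expression are hypothetical stand-ins chosen for illustration, and a real system would use a proper temporal tagger and a full-text search library rather than these toy components.

```python
import re
from collections import defaultdict
from dataclasses import dataclass, field

# Assumption: a bare four-digit year is the only temporal expression recognised.
# This regex is a toy stand-in for a real temporal tagger.
YEAR_RE = re.compile(r"\b(1[0-9]{3}|20[0-9]{2})\b")


@dataclass
class ToyTemporalIndex:
    """Hypothetical index: keyword postings plus the years each document mentions."""
    postings: dict = field(default_factory=lambda: defaultdict(set))  # term -> doc ids
    years: dict = field(default_factory=lambda: defaultdict(set))     # doc id -> years

    def add(self, doc_id, text):
        # Indexing time: store keywords and extracted years side by side.
        for term in re.findall(r"\w+", text.lower()):
            self.postings[term].add(doc_id)
        for year in YEAR_RE.findall(text):
            self.years[doc_id].add(int(year))

    def search(self, keywords, start=None, end=None):
        # Query time: all keywords must match, then the optional year range filters.
        terms = keywords.lower().split()
        if not terms:
            return []
        hits = set.intersection(*(self.postings.get(t, set()) for t in terms))
        if start is not None or end is not None:
            lo = start if start is not None else float("-inf")
            hi = end if end is not None else float("inf")
            hits = {d for d in hits
                    if any(lo <= y <= hi for y in self.years.get(d, ()))}
        return sorted(hits)
```

For example, after indexing one document that mentions 1265 and another that mentions 1966, a search for a shared keyword with `start=1900, end=2000` would return only the second document, while the same search without a range would return both.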

Notes

  1. https://github.com/pippokill/TAIR.

  2. http://lucene.apache.org/.

  3. https://code.google.com/p/heideltime/.

  4. http://lucene.apache.org/core/4_8_1/queryparser/org/apache/lucene/queryparser/classic/package-summary.html.

Acknowledgments

The computational work was carried out on the IT resources made available by two projects financed by the MIUR (Italian Ministry for Education, University and Research) in the “PON Ricerca e Competitività 2007–2013” Program: ReCaS (Azione I, Interventi di rafforzamento strutturale, PONa3_00052, Avviso 254/Ric) and PRISMA (Asse II, Sostegno all’innovazione, PON04a2_A).

Author information

Corresponding author

Correspondence to Pierpaolo Basile.

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Cite this chapter

Basile, P., Caputo, A., Semeraro, G., Siciliani, L. (2017). Time Event Extraction to Boost an Information Retrieval System. In: Lai, C., Giuliani, A., Semeraro, G. (eds) Information Filtering and Retrieval. Studies in Computational Intelligence, vol 668. Springer, Cham. https://doi.org/10.1007/978-3-319-46135-9_1

  • DOI: https://doi.org/10.1007/978-3-319-46135-9_1

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-46133-5

  • Online ISBN: 978-3-319-46135-9

  • eBook Packages: Engineering, Engineering (R0)
