Advertisement

FRel: A Freshness Language Model for Optimizing Real-Time Web Search

  • Mariem BambiaEmail author
  • Rim Faiz
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 348)

Abstract

An effective information retrieval system must satisfy different users search intentions expecting a variety of queries categories, comprising recency sensitive queries where fresh content is the major user’s requirement. However, using temporal features of documents to measure their freshness remains a hard task since these features may not be accurately represented in recent documents. In this paper, we propose a language model which estimates the topical relevance and freshness of documents with respect to real-time sensitive queries. In order to improve recency ranking, our approach models freshness by exploiting terms extracted from recently posted tweets topically relevant to each real-time sensitive query. In our experiments, we use these fresh terms to re-rank initial search results. Then, we compare our model with two baseline approaches which integrate temporal relevance in their language models. Our results show that there is a clear advantage of using microblogs platforms, such as Twitter, to extract fresh keywords.

Keywords

Social Information Retrieval Real-time sensitive queries Language models Fresh keywords 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Bai, J., Nie, J.Y., Cao, G.: Context-dependent term relations for information retrieval. In: Proc. Empirical Methods in Natural Language Processing, EMNLP 2006, pp. 551–559 (2006)Google Scholar
  2. 2.
    Ben Jabeur, L., Tamine, L., Boughanem, M.: Featured tweet search: Modeling time and social influence for microblog retrieval. In: Proceedings of International Conference on Web Intelligence, China, pp. 166–173 (2012)Google Scholar
  3. 3.
    Dai, N., Shokouhi, M., Davison, B.: Learning to rank for freshness and relevance. In: Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 95–104 (2011)Google Scholar
  4. 4.
    Dakka, W., Gravano, L., Ipeirotis, P.G.: Answering general time-sensitive queries. Proceedings of the IEEE Transactions on Knowledge and Data Engeneering 24, 220–235 (2012)CrossRefGoogle Scholar
  5. 5.
    Dong, A., Chang, Y., Zheng, Z., Mishne, G., Bai, J., Zhang, R., Buchner, K., Liao, C., Diaz, F.: Towards recency ranking in web search. In: Proceedings of the Third ACM International Conference on Web Search and Data Mining, WSDM 2010, pp. 11–20. ACM, New York (2010)Google Scholar
  6. 6.
    Efron, M., Golovchinsky, G.: Estimation methods for ranking recent information. In: Proceedings of the 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 495–504 (2011)Google Scholar
  7. 7.
    Huo, W., Tsotras, V.J.: Temporal top-k search in social tagging sites using multiple social networks. In: Kitagawa, H., Ishikawa, Y., Li, Q., Watanabe, C. (eds.) DASFAA 2010. LNCS, vol. 5981, pp. 498–504. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  8. 8.
    Viera, A.J., Garrett, J.M.: Understanding interobserver agreement: The kappa statistic. Family Medecine Research Series 37(5), 360–363 (2005)Google Scholar
  9. 9.
    Karkali, M., Plachouras, V., Vazirgiannis, M., Stefanatos, C.: Keeping keywords fresh: A bm25 variation for personalized keyword extraction. In: Proceedings of the 2nd Temporal Web Analytics Workshop, pp. 17–24 (2012)Google Scholar
  10. 10.
    Li, X., Croft, W.B.: Time-based language models. In: ACM (ed.) Proceedings of the Twelfth International Conference on Information and Knowledge Management, New York, NY, USA (2003)Google Scholar
  11. 11.
    Massoudi, K., Tsagkias, M., Rijke, M., Weerkamp, M.: Incorporating query expansion and quality indicators in searching microblog posts. In: Proceedings of the 33rd European Conference on IR Research, Dublin, Ireland, pp. 362–367 (2011)Google Scholar
  12. 12.
    Moon, T., Chu, W., Lihong, L., Zheng, Z., Chang, Y.: Online learning for recency search ranking using real-time user feedback. ACM Transactions on Information Systems 30(4), 20 (2010)Google Scholar
  13. 13.
    Ponte, J., Croft, W.: A language modeling approach to information retrieval. In: Proceedings of the 21st annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 275–281 (1998)Google Scholar
  14. 14.
    Wang, H., Dong, A., Li, L., Chang, Y.: Joint relevance and freshness learning from clickthroughs for news search. In: Proceedings of the 21st International Conference on World Wide Web, pp. 579–588 (2012)Google Scholar
  15. 15.
    Wang, H., Dong, A., Li, L., Chang, Y.: Joint relevance and freshness learning from clickthroughs for news search. In: Proceedings of the 21st International World Wide Web Conference Committee, pp. 579–588 (2012)Google Scholar
  16. 16.
    Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to information retrieval. ACM Transactions on Information Systems 2(2), 179–214 (2004)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  1. 1.LARODECISG University of TunisLe BardoTunisia
  2. 2.LARODECIHEC University of CarthageCarthage PresidencyTunisia

Personalised recommendations