Skip to main content

Using Wikipedia to Improve Precision of Contextual Advertising

  • Conference paper
Human Language Technology. Challenges for Computer Science and Linguistics (LTC 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6562))

Included in the following conference series:

Abstract

Contextual advertising is an important part of the Web economy today. Profit is linked to the interest that users find in the ads presented to them. The problem is for contextual advertising platforms to select the most relevant ads. Simple keyword matching techniques for matching ads to page content give poor accuracy. Problems such as homonymy, polysemy, limited intersection between content and selection keywords as well as context mismatch can significantly degrade the precision of ads selection. In this paper, we propose a method for improving the relevance of contextual ads based on “Wikipedia matching”. It is a technique that uses Wikipedia articles as “reference points” for ads selection. In our research, we worked on English language, but it is possible to port the algorithm to other languages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hayter, A.J.: Probability and Statistics for Engineers and Scientists. Duxbury, Belmont (2007)

    Google Scholar 

  2. Broder, A., Fontoura, M., Josifovski, V., Riedel, L.: Semantic approach to contextual advertising. In: Proc. of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, The Netherlands (2007)

    Google Scholar 

  3. Deerwester, S., Dumais, S., Furnas, G., Landauer, T., Harshman, R.: Indexing by latent semantic analysis. Journal of the American Society for Information Science 41(6), 407–491 (1988)

    Google Scholar 

  4. Jun, Z.: Comprehensive Perl Archive Network (2007), http://search.cpan.org/~jzhang/html-contentextractor-0.02/lib/html/contentextractor.pm

  5. Manning, C.D., Schütze, H.: Foundations of statistical natural language processing. MIT Press, Cambridge (1999)

    MATH  Google Scholar 

  6. Murdock, V., Ciaramita, M., Plachouras, V.: A noisy-channel approach to contextual advertising. In: Proc. of the 1st International Workshop on Data Mining and Audience Intelligence for Advertising, San Jose, California, pp. 21–27 (2007)

    Google Scholar 

  7. Porter, M.F.: An algorithm for suffix stripping. Readings in Information Retrieval, 313–316 (1997)

    Google Scholar 

  8. Ribeiro-Neto, B., Cristo, M.: Impedance coupling in content-targeted advertising. In: Proc. of the 28th annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil, pp. 496–503 (2005)

    Google Scholar 

  9. Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Communications of the ACM 18(11), 613–620 (1975)

    Article  MATH  Google Scholar 

  10. Zhang, Y., Vogel, S.: Measuring confidence intervals for the machine translation evaluation metrics. In: Proceedings of the 10th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI 2004), pp. 4–6 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pak, A. (2011). Using Wikipedia to Improve Precision of Contextual Advertising. In: Vetulani, Z. (eds) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2009. Lecture Notes in Computer Science(), vol 6562. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20095-3_49

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-20095-3_49

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20094-6

  • Online ISBN: 978-3-642-20095-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics