Skip to main content

Building Web-Scale Data Mining Infrastructure for Search

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4976))

Abstract

The competition in search has been driving great innovation and investment on next generation Internet services, with the goal of providing the computing platform for Internet economies on a global scale. Different from traditional Internet services, search involves myriad offline computations to analyze data at a very large scale, and an infrastructure for “scale” experiments is often required to evaluate the effectiveness of newly invented algorithms in a simulated “real” environment. In this talk, I will first review a variety of new trends in computational economies on the Internet in which search and online advertising have become the driving forces in building underlying computing infrastructure. Then, I will introduce current efforts at Microsoft Research Asia on building this new infrastructure. I will also discuss how these efforts are influencing the design of next-generation search engines from an architecture stand-point. Some advanced search technologies based on the use of this infrastructure and deeper data mining on the Web will also be demonstrated.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Author information

Authors and Affiliations

Authors

Editor information

Yanchun Zhang Ge Yu Elisa Bertino Guandong Xu

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ma, WY. (2008). Building Web-Scale Data Mining Infrastructure for Search. In: Zhang, Y., Yu, G., Bertino, E., Xu, G. (eds) Progress in WWW Research and Development. APWeb 2008. Lecture Notes in Computer Science, vol 4976. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78849-2_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-78849-2_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-78848-5

  • Online ISBN: 978-3-540-78849-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics