Skip to main content

Improving Search Engines Performance on Multithreading Processors

  • Conference paper
High Performance Computing for Computational Science - VECPAR 2008 (VECPAR 2008)

Abstract

In this paper we present strategies and experiments that show how to take advantage of the multi-threading parallelism available in Chip Multithreading (CMP) processors in the context of efficient query processing for search engines. We show that scalable performance can be achieved by letting the search engine go synchronous so that batches of queries can be processed concurrently in a simple but very efficient manner. Furthermore, our results indicate that the multithreading capabilities of modern CMP systems are not fully exploited when the search engine operates on a conventional asynchronous mode due to the moderate thread level parallelism that can be extracted from a single query.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Arusu, A., Cho, J., Garcia-Molina, H., Paepcke, A., Raghavan, S.: Searching the web. ACM Trans. 1(1), 2–43 (2001)

    Article  Google Scholar 

  2. Asanovic, K., Bodik, R., Catanzaro, B.C., Gebis, J.J., Husbands, P., Keutzer, K., Patterson, D.A., Plishker, W.L., Shalf, J., Williams, S.W., Yelick, K.A.: The landscape of parallel computing research: A view from berkeley. Technical Report UCB/EECS-2006-183, EECS Department, University of California, Berkeley (December 2006)

    Google Scholar 

  3. Badue, C., Baeza-Yates, R., Ribeiro, B., Ziviani, N.: Distributed query processing using partitioned inverted files. In: Eighth Symposium on String Processing and Information Retrieval (SPIRE 2001), pp. 10–20 (2001)

    Google Scholar 

  4. Barroso, A., Dean, J., Olzle, U.H.: Web search for a planet: The google cluster architecture. IEEE Micro 23(2), 22–28 (2002)

    Article  Google Scholar 

  5. Hennessy, J.L., Patterson, D.A.: Computer Architecture, A Quantitative Approach, 4th edn. Morgan Kaufmann Publishers Inc., San Francisco (2006)

    MATH  Google Scholar 

  6. Kongetira, P., Aingaran, K., Olukotun, K.: Niagara: A 32-way multithreaded sparc processor. IEEE Micro 25(2), 21–29 (2005)

    Article  Google Scholar 

  7. Marin, M., Bonacic, C., Gil-Costa, V., Gomez, C.: A search engine accepting on-line updates. In: Kermarrec, A.-M., Bougé, L., Priol, T. (eds.) Euro-Par 2007. LNCS, vol. 4641, pp. 348–357. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  8. Marin, M., Gil-Costa, V.: High-performance distributed inverted files. In: CIKM 2007: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, pp. 935–938. ACM, New York (2007)

    Chapter  Google Scholar 

  9. Marin, M., Gil-Costa, V. (Sync|Async)\(^{\mbox{+}}\) MPI Search Engines. In: Cappello, F., Herault, T., Dongarra, J. (eds.) PVM/MPI 2007. LNCS, vol. 4757, pp. 117–124. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  10. Moffat, W., Webber, J., Zobel, J., Baeza-Yates, R.: A pipelined architecture for distributed text query evaluation. Information Retrieval, published on-line, 5, October (2006)

    Google Scholar 

  11. Olukotun, K., Hammond, L., Laudon, J.: Chip Multiprocessor Architecture: Techniques to Improve Throughput and Latency. In: Synthesis Lectures on Computer Architecture, vol. 3. Morgan and Claypool Publishers, San Francisco (2007)

    Google Scholar 

  12. Persin, M., Zobel, J., Sacks-Davis, R.: Filtered document retrieval with frequency-sorted indexes. Journal of the American Society for Information Science 47(10), 749–764 (1996)

    Article  Google Scholar 

  13. Sheahan, D.: Developing and tuning applications on ultrasparc t1 chip multithreading systems. Technical report, Sun Microsystems. Sun BluePrints Online (October 2007)

    Google Scholar 

  14. Zobel, J., Moffat, A.: Inverted files for text search engines. ACM Computing Surveys 38(2) (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bonacic, C., Garcia, C., Marin, M., Prieto, M., Tirado, F., Vicente, C. (2008). Improving Search Engines Performance on Multithreading Processors. In: Palma, J.M.L.M., Amestoy, P.R., Daydé, M., Mattoso, M., Lopes, J.C. (eds) High Performance Computing for Computational Science - VECPAR 2008. VECPAR 2008. Lecture Notes in Computer Science, vol 5336. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92859-1_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-92859-1_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-92858-4

  • Online ISBN: 978-3-540-92859-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics