A Search Engine Accepting On-Line Updates

Marin, Mauricio; Bonacic, Carolina; Costa, Veronica Gil; Gomez, Carlos

doi:10.1007/978-3-540-74466-5_38

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4641)

Abstract

We describe and evaluate the performance of a parallel search engine that copes efficiently with concurrent read/write operations. Read operations are the usual queries submitted to the search engine; write operations are new documents added to the text collection on-line. That is, insertions are embedded into the main stream of user queries in an unpredictable arrival order, and query results respect causality. The search engine is built upon distributed inverted files, for which we propose generic strategies for load balance and concurrency control.
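
The idea of interleaving insertions with queries while respecting causality can be illustrated with a small sketch. This is not the authors' implementation: it is a single-process toy that assumes a document-partitioned inverted file, a term-frequency score, and strictly sequential processing of the arrival stream, so each query sees exactly the documents inserted before it. The names `Partition`, `SearchEngine`, and `run_stream` are hypothetical.

```python
from collections import defaultdict


class Partition:
    """One slice of a document-partitioned inverted file (illustrative only)."""

    def __init__(self):
        # term -> {doc_id: term frequency}
        self.postings = defaultdict(dict)

    def insert(self, doc_id, text):
        for term in text.lower().split():
            plist = self.postings[term]
            plist[doc_id] = plist.get(doc_id, 0) + 1

    def score(self, terms):
        # Partial scores for the documents held by this partition.
        scores = defaultdict(int)
        for term in terms:
            for doc_id, tf in self.postings.get(term, {}).items():
                scores[doc_id] += tf
        return scores


class SearchEngine:
    """Toy engine: documents are assigned to partitions by doc_id modulo P."""

    def __init__(self, num_partitions=2):
        self.partitions = [Partition() for _ in range(num_partitions)]

    def insert(self, doc_id, text):
        self.partitions[doc_id % len(self.partitions)].insert(doc_id, text)

    def query(self, text):
        terms = text.lower().split()
        merged = {}
        # Broadcast the query to every partition and merge the partial scores.
        for part in self.partitions:
            merged.update(part.score(terms))
        # Highest score first; ties broken by doc_id for determinism.
        return sorted(merged, key=lambda d: (-merged[d], d))


def run_stream(ops, num_partitions=2):
    """Apply ('insert', (doc_id, text)) and ('query', text) in arrival order.

    Processing one operation at a time is the simplest way to guarantee that
    a query never observes a document that arrived after it.
    """
    engine = SearchEngine(num_partitions)
    results = []
    for kind, payload in ops:
        if kind == "insert":
            doc_id, text = payload
            engine.insert(doc_id, text)
        elif kind == "query":
            results.append(engine.query(payload))
        else:
            raise ValueError(f"unknown operation: {kind}")
    return results
```

For example, in the stream insert(1), query, insert(2), query, the first query can only return document 1, while the second can return both. A real parallel engine would need a concurrency-control strategy to obtain the same guarantee without serializing all operations.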

Author information

Authors and Affiliations

Yahoo! Research, Santiago, University of Chile
Mauricio Marin & Carlos Gomez
ARTECS, Complutense University of Madrid, Spain
Carolina Bonacic
DCC, University of San Luis, Argentina
Veronica Gil Costa

Editor information

Anne-Marie Kermarrec, Luc Bougé, Thierry Priol

Cite this paper

Marin, M., Bonacic, C., Costa, V.G., Gomez, C. (2007). A Search Engine Accepting On-Line Updates. In: Kermarrec, AM., Bougé, L., Priol, T. (eds) Euro-Par 2007 Parallel Processing. Euro-Par 2007. Lecture Notes in Computer Science, vol 4641. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74466-5_38

DOI: https://doi.org/10.1007/978-3-540-74466-5_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74465-8
Online ISBN: 978-3-540-74466-5
eBook Packages: Computer Science, Computer Science (R0)
