Maximal Intersection Queries in Randomized Graph Models

Hoffmann, Benjamin; Lifshits, Yury; Nowotka, Dirk

doi:10.1007/978-3-540-74510-5_24

Benjamin Hoffmann¹,
Yury Lifshits² &
Dirk Nowotka¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4649))

Included in the following conference series:

International Computer Science Symposium in Russia

683 Accesses
3 Citations

Abstract

Consider a family of sets and a single set, called query set. How can one quickly find a member of the family which has a maximal intersection with the query set? Strict time constraints on the query and on a possible preprocessing of the set family make this problem challenging. Such maximal intersection queries arise in a wide range of applications, including web search, recommendation systems, and distributing on-line advertisements. In general, maximal intersection queries are computationally expensive. Therefore, one needs to add some assumptions about the input in order to get an efficient solution. We investigate two well-motivated distributions over all families of sets and propose an algorithm for each of them. We show that with very high probability an almost optimal solution is found in time logarithmic in the size of the family. In particular, we point out a threshold phenomenon on the probabilities of intersecting sets in each of our two input models which leads to the efficient algorithms mentioned above.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bruck, J., Naor, M.: The hardness of decoding linear codes with preprocessing. IEEE Transactions on Information Theory 36(2), 381–385 (1990)
Article MATH MathSciNet Google Scholar
Cole, R., Gottlieb, L.-A., Lewenstein, M.: Dictionary matching and indexing with errors and don’t cares. In: STOC 2004, pp. 91–100 (2004)
Google Scholar
Kleinberg, J.M.: Two algorithms for nearest-neighbor search in high dimensions. In: STOC 1997, pp. 599–608 (1997)
Google Scholar
Lifshits, Y.: A Guide to Web Research. Materials of mini-course at Stuttgart University (2007), Available at http://logic.pdmi.ras.ru/~yura/webguide.html
Maaß, M.G., Nowak, J.: A new method for approximate indexing and dictionary lookup with one error. Inf. Process. Lett. 96(5), 185–191 (2005)
Article Google Scholar
Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (2002)
Google Scholar
Newman, M.: The structure and function of complex networks. SIAM Review 45(2), 167–256 (2003)
Article MATH MathSciNet Google Scholar
O’Connor, M., Herlocker, J.: Clustering items for collaborative filtering. In: SIGIR 2001, Workshop on Recommender Systems (2001)
Google Scholar
Yianilos, P.N.: Data structures and algorithms for nearest neighbor search in general metric spaces. In: SODA 1993, pp. 311–321 (1993)
Google Scholar
Zobel, J., Moffat, A.: Inverted files for text search engines. ACM Comput. Surv. 38(2), 6 (2006)
Article Google Scholar

Download references

Author information

Authors and Affiliations

FMI, Universität Stuttgart, Germany
Benjamin Hoffmann & Dirk Nowotka
Steklov Institute of Mathematics at St.Petersburg, Russia
Yury Lifshits

Authors

Benjamin Hoffmann
View author publications
You can also search for this author in PubMed Google Scholar
Yury Lifshits
View author publications
You can also search for this author in PubMed Google Scholar
Dirk Nowotka
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Volker Diekert Mikhail V. Volkov Andrei Voronkov

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hoffmann, B., Lifshits, Y., Nowotka, D. (2007). Maximal Intersection Queries in Randomized Graph Models. In: Diekert, V., Volkov, M.V., Voronkov, A. (eds) Computer Science – Theory and Applications. CSR 2007. Lecture Notes in Computer Science, vol 4649. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74510-5_24

Download citation

DOI: https://doi.org/10.1007/978-3-540-74510-5_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74509-9
Online ISBN: 978-3-540-74510-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics