Abstract
In consequence of the change in the nature of data management systems the requirements for search support have shifted. In the early days of data management systems, efficient access techniques and optimization strategies for exact match queries had been the main focus. Most of the problems in this field are satisfactorily solved today and new types of applications for data management systems have turned the focus of current research to content-based similarity queries and queries on distributed databases. The present contribution addresses these two aspects. In the first part, algorithms and data structures supporting similarity queries are presented together with considerations about their integration in data management systems, whereas search techniques for distributed data management systems and especially for peer-to-peer networks are discussed in the second part. Here, techniques for exact match queries and for similarity queries are addressed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. ACM press. Pearson Education Limited, Harlow, England (1999)
Balakrishnan, H., Kasshoek, M.F., Karger, D., Morris, R., Stoica, I.: Looking Up Data in P2P Systems. Commun. ACM 46(2), 43–48 (2003)
Balko, S., Schmitt, I.: Efficient Nearest Neighbor Retrieval by Using a Local Approximation Technique - the Active Vertice Approach. Technical Report 2, Fakultät für Informatik, Universität Magdeburg (2002)
Buckley, C., Lewit, A.: Optimization of inverted vector searches. In: Proc. 8th International ACM SIGIR Conference on Research and Development in Information Retrieval, New York, USA, pp. 97–105 (1985)
Carey, M.J., Kossmann, D.: On saying Enough already.SQL. In Proc. 1997 ACM SIGMOD Intl. Conf. on Management of Data, Tucson, Arizona, June 13–15, pp. 219-230 (1997)
Chaudhuri, S., Gravano, L.: Evaluating top-k selection queries. In: Proc. 25th Intl. Conf. on Very Large Data Bases, Edinburgh, Scotland, UK, September 1999, pp. 397–410 (1999)
Clarke, I., Sandberg, O., Wiley, B., Hong, T.W.: Freenet: A distributed anonymous information storage and retrieval system. In: Federrath, H. (ed.) Designing Privacy Enhancing Technologies. LNCS, vol. 2009, p. 46. Springer, Heidelberg (2001)
Cuenca-Acuna, F. M., Nguyen, T. D.: Text-Based Content Search and Retrieval in ad hoc P2P Communities. Technical Report DCS-TR-483, Department of Computer Science, Rutgers University (2002)
Fagin, R., Lotem, A., Naor, M.: Optimal aggregation algorithms for middleware. In: Proc. 10th ACM Symposium on Principles of Database Systems: PODS, New York, USA, pp. 102–113 (2001)
Fagin, R., Wimmers, E.L.: A formula for incorporating weights into scoring rules. Theoretical Computer Science 239(2), 309–338 (2000)
Güntzer, U., Balke, W.-T., Kießling, W.: Optimizing multi-feature queries for image databases. In: VLDB 2000, Proc. 26th Intl. Conf. on Very Large Data Bases, Cairo, Egypt, pp. 419–428 (2000)
Härder, T., Rahm, E.: Datenbanksysteme: Konzepte und Techniken der Implementierung, 2nd edn. Springer, Heidelberg (2001)
Henrich, A.: A distance scan algorithm for spatial access structures. In: Proc. of the 2nd ACM Workshop on Advances in Geographic Information Systems, Gaithersburg, Maryland, USA, pp. 136–143. ACM Press, New York (1994)
Henrich, A.: The LSD6h-tree: An access structure for feature vectors. In: Proc. 14th Intl. Conf. on Data Engineering, Orlando, Florida, USA, February 23-27, pp. 362–369. IEEE Computer Society, Los Alamitos (1998)
Henrich, A., Robbert, G.: Combining multimedia retrieval and text retrieval to search structured documents in digital libraries. In: Proc. 1st DELOS Workshop on Information Seeking, Searching and Querying in Digital Libraries, Zürich, Switzerland, December 2000, pp. 35–40 (2000)
Henrich, A., Robbert, G.: Ein Ansatz zur Übertragung von Rangordnungen bei der Suche auf strukturierten Daten. In: Tagungsband der 10. Konferenz Datenbanksysteme für Business, Technologie und Web (BTW 2003),, Leipzig, Deutschland,GI, February 2003, vol. 26, LNI pp. 167–186 (2003)
Henrich, A., Robbert, G.: Comparison and evaluation of fusion algorithms and transfer semantics for structured multimedia data. In: Danilowicz, C. (ed.) 1st International Workshop on Multimedia Information Systems Technology, Multimedia and Network Information Systems, Szklarska Poreba, Poland, September 2004, pp. 181–192 (2004) Oficyna Wydawinicza Politechniki Wroclawskiej
Herstel, T., Schmitt, I.: Optimierung von Ausdrücken der Ähnlichkeitsalgebra SA. In: INFORMATIK 2004 - Informatik verbindet - Beiträge der 34. Jahrestagung der Gesellschaft für Informatik e.V (GI), Band 2, Ulm, Germany, September 2004, vol. P-51 LNI, pp. 49–53 (2004)
Hjaltason, G.R., Samet, H.: Ranking in spatial databases. In: Egenhofer, M.J., Herring, J.R. (eds.) SSD 1995. LNCS, vol. 951, pp. 83–95. Springer, Heidelberg (1995)
Joseph, S.: Adaptive routing in distributed decentralized systems: Neurogrid, Gnutella and Freenet. In: Proc. of workshop on Infrastructure for Agents, MAS, and Scalable MAS, at Autonomous Agents, Montreal, Canada (2001)
Kronfol, A. Z.: A Fault-tolerant, Adaptive, Scalable, Distributed Search Engine (May 2002) Final Thesis, Princeton, http://www.searchlore.org/library/kronfol_final_thesis.pdf
Lomet, D.B., Salzberg, B.: The hb-tree: A multiattribute indexing method with good guaranteed performance. ACM Trans. Database Syst. 15(4), 625–658 (1990)
Müller, H., Squire, D. M., Müller, W., Pun, T.: Efficient access methods for content-based image retrieval with inverted files. Technical Report 99.02, Computer Vision Group, University of Geneva (July 1999)
Müller, W., Eisenhardt, M., Henrich, A.: Scalable summary-based search in P2P networks (2004) (submitted for publication)
Müller, W., Henrich, A.: Faster exact histogram intersection on large data collections using inverted VA-files. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 455–463. Springer, Heidelberg (2004)
Müller, W., Henrich, A.: Reducing I/O cost of similarity queries by processing several at a time. In: Proc. MDDE 2004, 4th International Workshop on Multimedia Data and Document Engineering, Washington DC, USA, July 2004, IEEE Computer Society, Los Alamitos (2004)
Natsev, A., Chang, Y.-C., Smith, J.R., Li, C.-S., Vitter, J.S.: Supporting incremental join queries on ranked inputs. In: VLDB 2001, Proc. of 27th Intl. Conf. on Very Large Data Bases, Roma, Italy, pp. 281–290 9 (2001)
Ng, C.H., Sia, K.C.: Peer clustering and firework query model. In: Poster Proc. of The 11th Interational World Wide Web Conf. Honululu, HI, USA (May 2002)
Pfeifer, U., Pennekamp, S.: Incremental Processing of Vague Queries in Interactive Retrieval Systems. In: Hypertext - Information Retrieval - Multimedia 1997: Theorien, Modelle und Implementierungen integrierter elektronischer Informationssysteme, Dortmund, pp. 223–235. Universitätsverlag Konstanz (1997)
Ratnasamy, S., Francis, P., Handley, M., Karp, R., Schenker, S.: A scalable content-addressable network. In: Proc. 2001 Conf. on applications, technologies, architectures, and protocols for computer communications, San Diego, CA (2001)
Roussopoulos, N., Kelley, S., Vincent, F.: Nearest neighbor queries. In: Proc. 1995 ACM SIGMOD Intl. Conf. on Management of Data, San Jose, California, May 22-25, pp. 71–79 (1995)
Sakurai, Y., Yoshikawa, M., Uemura, S., Kojima, H.: The A-tree: An index structure for high-dimensional spaces using relative approximation. In: Proc. of the 26th Intl. Conf. on Very Large Data Bases, Cairo, pp. 516–526 (2000)
Schmitt, I., Schulz, N.: Similarity relational calculus and its reduction to a similarity algebra. In: Seipel, D., Turull-Torres, J.M.a. (eds.) FoIKS 2004. LNCS, vol. 2942, pp. 252–272. Springer, Heidelberg (2004)
Smith, I., Chang, S.-F.: VisualSEEK: A fully automated content-based image query system. In: Proc. of the 4th ACM Multimedia Conf., New York, USA, November 1996, pp. 87–98 (1996)
Squire, D.M., Müller, W., Müller, H., Raki, J.: Content-based query of image databases, inspirations from text retrieval: inverted files, frequency-based weights and relevance feedback. In: 11th Scandinavian Conf. on Image Analysis, Kangerlussuaq, Greenland (1999)
Stoica, I., Morris, R., Karger, D., Kaashoek, F., Balakrishnan, H.: Chord: A scalable Peer-To-Peer lookup service for Internet applications. In: Proc. ACM SIGCOMM Conf. San Diego, CA, USA (2001)
Tang, C., Xu, Z., Mahalingam, M.: pSearch: Information retrieval in structured overlays. In: First Workshop on Hot Topics in Networks (HotNets-I), Princeton, NJ (2002)
Weber, R., Schek, H.-J., Blott, S.: A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In: Proc. Intl. Conf. on VLDB, New York, USA (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Henrich, A. (2005). Search Support in Data Management Systems. In: Härder, T., Lehner, W. (eds) Data Management in a Connected World. Lecture Notes in Computer Science, vol 3551. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11499923_8
Download citation
DOI: https://doi.org/10.1007/11499923_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26295-4
Online ISBN: 978-3-540-31654-1
eBook Packages: Computer ScienceComputer Science (R0)