Abstract
There are three common design decisions taken by today’s search engines. First, they do not replicate the data found on the Web. Second, they rely on full-text indexes instead. Third, they do not support the querying of document structure. The main reason for the latter is that HTML’s ability to express semantics with syntactic structure is very limited. This is different for XML since it allows for self-describing data. Due to its flexibility by inventing arbitrary new element and attribute names, XML allows to encode semantics within syntax. The consequence is that search engines for XML should support the querying of structure. In our current work on search engines for XML data on the Web, we want to keep the first two design decisions of traditional search engines but modify the last one according to the new requirements implied by the necessity to query structure. Since our search engine accepts queries with structural information, a full-text index does not suffice any longer. What is needed is a scalable index structure that allows to answer queries over the structure of XML documents. One possible index structure called eXtended Access Support Relation (XASR) is introduced. Further, we report on a search engine for XML data called Mumpits. Due to its prototypical character, we intentionally kept the design and implementation of Mumpits very simple. Its design is centered around a single XASR and its implementation heavily builds on a commercial relational database management system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
The BIOpolymer Markup Language (BIOML) Home Page.http://www.bioml.com/BIOML/index.html.
S. Abiteboul, P. Buneman, and D. Suciu. Data on the Web: From Relations to Semistructured Data and XML. Morgan Kaufman, 1999.
E. Bertino and W. Kim. Indexing techniques for queries on nested objects. IEEE Trans. on Knowledge and Data Engineering, 1(2):196–214, Jun 1989.
T. Bray, J. Paoli, and C. M. Sperberg-McQueen. Extensible markup language (xml) 1.0. Technical report, World Wide Web Consortium, 1998. W3C Recommendation 10-Feb-98.
J. Chen, D. DeWitt, F. Tian, and Y. Wang. NiagaraCQ: A scalable continuous query system for internet databases. In Proc. of the ACM SIGMOD Conf. on Management of Data, 2000. to appear.
J. Clark and S. DeRose. XML path language (XPath) version 1.0. Technical report, World Wide Web Consortium, 1999. W3C Recommendation 16 Nov. 1999.
T. Cormen, C. Leiserson, and R. Rivest. Introduction to Algorithms. MIT Press, 1989.
A. Deutsch, M. Fernandez, and D. Suciu. Storing semistructured data with STORED. In Proc. of the ACM SIGMOD Conf. on Management of Data, 1999.
D. Florescu and D. Kossmann. Storing and querying XML data using and RDBMS. IEEE Data Engineering Bulletin, 22(3):27–34, 1999.
R. Goldman, J. McHugh, and J. Widom. From semistructured data to XML: Migrating the Lore data model and query language. In ACM SIGMOD Workshop on the Web and Databases (WebDB), 1999.
A. Kemper and G. Moerkotte. Access support in object bases. In Proc. of the ACM SIGMOD Conf. on Management of Data, pages 364–374, 1990.
A. Kemper and G. Moerkotte. Advanced query processing in object bases using access support relations. In Proc. Int. Conf. on Very Large Data Bases (VLDB), pages 294–305, 1990.
D. Maier and J. Stein. Indexing in an object-oriented DBMS. In Proc. IEEE Intl. Workshop on Object-Oriented Database Systems, Asilomar, Pacific Grove, CA, pages 171–182. IEEE Computer Society Press, 1986.
J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A database management system for semistructured data. ACM SIGMOD Record, 26(3):54–66, 1997.
J. Shanmugasundaram, H. Gang, K. Tufte, C. Yhang, D. J. DeWitt, and J. Naughton. Relational databases for querying xml documents: Limitations and opportunities. In Proc. Int. Conf. on Very Large Data Bases (VLDB), pages 302–314, 1999.
B. Shidlowsky and E. Bertino. A graph-theoretic approach to indexing in object-oriented databases. In Proc. IEEE Conference on Data Engineering, pages 230–237, 1996.
R. Van Zwol, P. Apers, and A. Wilschut. Modeling and querying semistructured data with MOA. In ICDT’99 Workshop on Query Processing for semistructured data, 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fiebig, T., Moerkotte, G. (2001). Evaluating Queries on Structure with eXtended Access Support Relations. In: Goos, G., Hartmanis, J., van Leeuwen, J., Suciu, D., Vossen, G. (eds) The World Wide Web and Databases. WebDB 2000. Lecture Notes in Computer Science, vol 1997. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45271-0_8
Download citation
DOI: https://doi.org/10.1007/3-540-45271-0_8
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41826-9
Online ISBN: 978-3-540-45271-3
eBook Packages: Springer Book Archive