Abstract
This paper proposes an approach for representing and querying semistructured Web data, which is based on nested tables allowing internal nested structural variations. Our motivation is to reduce the complexity found in typical query languages for semistructured data and to provide users with an alternative for quickly querying data obtained from multiple-record Web pages. We show the feasibility of our proposal by developing a prototype for a graphical query interface called QSByE (Querying Semistructured data By Example), which implements a set of QBE-like operations that extends typical nested-relational-algebra operations to handle semistructured data.
This work was partially supported by Project SIAM (MCT/CNPq/PRONEX grant number 76.97.1016.00) and by CNPq (grant number 467775/00-1). The first and second authors are supported by scholarships from CAPES. The fourth author is supported by NSF (grant number IIS-0083127).
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Abiteboul, S., Quass, D., McHugh, J., Widom, J., and Wiener, J. The Lorel Query Language for Semistructured Data. International Journal on Digital Libraries 1, 1 (1997), 68–88.
Colby, L. S. A Recursive Algebra and Query Optimization for Nested Relations. In Proceedings of the 1989 ACM SIGMOD International Conference on Management of Data (Portland, Oregon, 1989), pp. 273–283.
Evangelista-Filha, I. M. R., Laender, A. H. F., and Silva, A. S. Querying Semistructured Data By Example: The QSByE Interface. In Proceedings of the International Workshop on Information Integration on the Web (Rio de Janeiro, Brazil, 2001), pp. 156–163.
Laender, A. H. F., Ribeiro-Neto, B., and da Silva., A. S. DEByE — Data Extraction by Bxample. Data and Knowledge Engineering 40, 2 (2002), 121–154.
Thomas, S. J., and Fischer, P. C. Nested Relational Structures. Advances in Computing Research 3 (1986), 269–307.
Zloof, M. M. Query-by-Example: A Data Base Language. IBM Systems Journal 16, 4 (1977), 324–343.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Filha, I.M.R.E., da Silva, A.S., Laender, A.H.F., Embley, D.W. (2002). Using Nested Tables for Representing and Querying Semistructured Web Data. In: Pidduck, A.B., Ozsu, M.T., Mylopoulos, J., Woo, C.C. (eds) Advanced Information Systems Engineering. CAiSE 2002. Lecture Notes in Computer Science, vol 2348. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47961-9_53
Download citation
DOI: https://doi.org/10.1007/3-540-47961-9_53
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43738-3
Online ISBN: 978-3-540-47961-1
eBook Packages: Springer Book Archive