Abstract
Semi-structured data has become prevalent with the growth of the Internet and other on-line information repositories. Many organizational databases are presented on the web as semi-structured data. Designing a “good” semi-structured database is increasingly crucial to prevent data redundancy, inconsistency and update anomalies. In this paper, we define a semi-structured schema graph and identify the various anomalies that may occur in the graph. A normal form for semi-structured schema graphs, S3-NF, is proposed. We present two approaches to designing S3-NF databases, namely, restructuring by decomposition and the ER approach. The first approach consists of a set of rules to decompose a semi-structured schema graph into S3-NF. The second approach uses the ER model to remove anomalies at the semantic level.
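The kind of redundancy the abstract refers to can be illustrated with a small sketch. It is not the paper's S3-NF decomposition, whose rules the abstract does not state. The data, the field names (`courses`, `students`, `student_refs`) and the function `decompose` are all hypothetical, chosen only to show a repeated nested record being replaced by a reference.

```python
# Hypothetical semi-structured data: each course nests full student
# records, so a student's name is repeated under every course taken.
nested = {
    "courses": [
        {"code": "CS101", "students": [{"id": "s1", "name": "Ann"},
                                       {"id": "s2", "name": "Bob"}]},
        {"code": "CS102", "students": [{"id": "s1", "name": "Ann"}]},
    ]
}


def decompose(db):
    """Store each student once at the top level; courses keep only ids.

    This is an illustrative restructuring, not the S3-NF rule set.
    """
    students = {}
    courses = []
    for course in db["courses"]:
        refs = []
        for s in course["students"]:
            # First sighting stores the record; later copies must agree,
            # otherwise the nested form was already inconsistent.
            existing = students.setdefault(s["id"], {"name": s["name"]})
            if existing["name"] != s["name"]:
                raise ValueError(f"inconsistent copies of student {s['id']}")
            refs.append(s["id"])
        courses.append({"code": course["code"], "student_refs": refs})
    return {"students": students, "courses": courses}


flat = decompose(nested)
```

In the nested form, renaming student `s1` means editing two copies and risks leaving them inconsistent. In the restructured form the name is stored once and each course holds only a reference to it.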
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, S.Y., Lee, M.L., Ling, T.W., Kalinichenko, L.A. (1999). Designing Good Semi-structured Databases. In: Akoka, J., Bouzeghoub, M., Comyn-Wattiau, I., Métais, E. (eds) Conceptual Modeling — ER ’99. ER 1999. Lecture Notes in Computer Science, vol 1728. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47866-3_9
DOI: https://doi.org/10.1007/3-540-47866-3_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66686-8
Online ISBN: 978-3-540-47866-9
eBook Packages: Springer Book Archive