Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Semi-structured Database Design

  • Gillian DobbieEmail author
  • Tok Wang Ling
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_338


XML database design


From a requirements document, a database designer distills the real world constraints and designs a database schema. While the design process for structured data is well defined, the design process for semi-structured data is not as well understood. What is a “good” design for semi-structured databases that captures real world constraints, prevents data redundancy and update anomalies, and allows typical queries to execute quickly?

Historical Background

There was a lot of research into the design of relational databases in the 1970s, and it was found that the design of relational databases involves a trade off between the speed of execution of queries and the updating anomalies caused by maintaining redundant data when updates occur. During logical schema design normalization algorithms are used to reduce redundancy, and during physical design to improve performance some redundancy may be reintroduced, views can be created over the schema, and...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Arenas M, Libkin L. A normal form for XML documents. ACM Trans Database Syst. 2004;29(1):195–232.CrossRefGoogle Scholar
  2. 2.
    Brandin C. Information modeling with XML. In A. Chaudhri, A. Raschid, and R. Zicari (ed.). XML data management. Addison-Wesley: Boston, 2003,p. 3–17.Google Scholar
  3. 3.
    Bray T, Paoli J, Sperberg-McQueen CM. Extensible markup language (XML) 1.0. 2nd edn. 2000.Google Scholar
  4. 4.
    Chen PP. The entity-relationship model – toward a unified view of data. ACM Trans Database Syst. 1976;1(1):9–36.CrossRefGoogle Scholar
  5. 5.
    Ling TW, Goh CH, Lee M-L. Extending classical functional dependencies for physical database design. Inf Soft Tech. 1996;38(9):601–8.CrossRefGoogle Scholar
  6. 6.
    Ling TW, Lee ML, Dobbie G. Semistructured database design. New York: Springer; 2005.zbMATHGoogle Scholar
  7. 7.
    Nestorov S, Abiteboul S, Motwani R. Extracting schema from semistructured data. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1995. p. 295–306.Google Scholar
  8. 8.
    Thompson HS, Beech D, Maloney M, Mendelson N, editors. XML schema part 1: structures. 2001. http://www.w3.org/TR/xmlschema-1.
  9. 9.
    Wang QY, Yu JX, Wong KF. Approximate graph schema extraction for semi-structured data. In: Advances in Database Technology, Proceedings of the 7th International Conference on Extending Database Technology; 2000. p. 302–17.CrossRefGoogle Scholar
  10. 10.
    Yu C, Jagadish HV. Efficient discovery of XML data redundancies. In: Proceedings of the 32nd International Conference on Very Large Data Bases; 2006. p. 103–14.Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.University of AucklandAucklandNew Zealand
  2. 2.National University of SingaporeSingaporeSingapore

Section editors and affiliations

  • Frank Tompa
    • 1
  1. 1.David R. Cheriton School of Computer ScienceUniv. of WaterlooWaterlooCanada