Skip to main content

A data model, knowledge base, and natural language processing for sharing a large statistical database

  • Contributed Papers
  • Chapter
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 339))

Abstract

Most existing statistical databases are mere collections of statistical files gathered for specific purposes. Consequently, as they grow in size, users are faced with difficulties in identifying and finding the data they need.

In order to obtain data descriptions independent of specific purposes, this paper proposes an object-oriented data design, which distinguishes between data conceptually obtainable and data actually stored in a database, and specifies relationships among classifications and categories independent of particular data files.

This is followed by a discussion of the representation of knowledge about data and classifications on a knowledge base, giving clear definitions of hierarchies and relationships among statistical data concepts.

Finally, a natural language query system using the knowledge base is demonstrated, which proves the advantage of the proposed statistical data concepts.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. ANSI/X3/SPARC, "Study Group on Data Base Management Systems: Interim Report," FDT (Bulletin of ACM-SIGMOD), 7(2), 1975.

    Google Scholar 

  2. R.J.Brackman, "What IS-A is and isn't: An Analysis of Taxonomic Links in Semantic Networks," IEEE Computer, Oct. 1983, pp.30–36.

    Google Scholar 

  3. P.Chan and A.Shoshani, "SUBJECT: A Directory Driven System for Organizing and Accessing Large Statistical Databases," VLDB, 1981, pp.553–563.

    Google Scholar 

  4. R.E.Cubitt, "Meta Data: An Experience of its Uses and Management," SSDBM, 1983, pp.167–169.

    Google Scholar 

  5. E.Malmborg, "On the Semantics of Aggregated Data," SSDBM, 1986, pp.152–158.

    Google Scholar 

  6. National Land Agency, Knowledge Management of Land Information, (in Japanese), Publication Bureau of the Ministry of Finance, Japan, 1986.

    Google Scholar 

  7. Z.M.Ozsoyoglu and G.Ozsoyoglu, "An Extension of Relational Algebra for Summary Tables," SSDBM, 1983, pp.202–211.

    Google Scholar 

  8. R.Reiter, "On Closed World Data Bases," in H.Gallaire and J.Minker (eds.), Logic and Data Bases, Plenum Press, 1978, pp.55–76.

    Google Scholar 

  9. H.Sato, T.Nakano, Y.Fukasawa and R.Hotaka, "Conceptual Schema for a Wide-Scope Statistical Database and Its Applications," SSDBM, 1986, pp.165–172.

    Google Scholar 

  10. H.Sato, Design and Development of Statistical Databases: An Application of Data Model and Knowledge Base, (in Japanese), Ohm Co., Japan, 1988, 246 pages.

    Google Scholar 

  11. A.Shoshani, "Statistical Databases: Characteristics, Problems and some Solutions," VLDB, 1982, pp.208–222.

    Google Scholar 

  12. J.M. Smith and D.C.P. Smith, "Database Abstractions: Aggregation and Generalization," TODS, 2(2), June 1977, pp.105–133.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Maurizio Rafanelli John C. Klensin Per Svensson

Rights and permissions

Reprints and permissions

Copyright information

© 1989 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Sato, H. (1989). A data model, knowledge base, and natural language processing for sharing a large statistical database. In: Rafanelli, M., Klensin, J.C., Svensson, P. (eds) Statistical and Scientific Database Management. SSDBM 1988. Lecture Notes in Computer Science, vol 339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0027515

Download citation

  • DOI: https://doi.org/10.1007/BFb0027515

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-50575-4

  • Online ISBN: 978-3-540-46045-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics