An Expressive Functional Data Model and Query Language for Bioinformatics Data Integration

The Functional Approach to Data Management

Summary

We argue the need for a federated information infrastructure that users perceive as functionally integrated while its underlying resources keep their autonomy. There are various ways to achieve this, but the crucial issue is the provision of common semantics and a shared data model. For this purpose we have adapted the Functional Data Model and the Daplex language (originally designed for heterogeneous multi-database work) and developed a graphical user interface for it. We show how the model’s ability to compose functions allows it to combine computed data with stored data. This makes it especially suitable for bioinformatics queries involving derived data, and we give examples from work on antibodies. We also describe a view mechanism that creates derived relationships as functions which adapt objects to a given application program, in order to simplify a shared data model.
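
The idea of composing a stored function with a computed one can be illustrated with a minimal sketch. This is not Daplex or the chapter's own system: the entity `Chain`, the functions `chains_of`, `sequence` and `length`, and the sample data are all hypothetical, chosen only to show how a query can treat stored and derived values uniformly.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, TypeVar

A = TypeVar("A")
B = TypeVar("B")
C = TypeVar("C")


@dataclass(frozen=True)
class Chain:
    """Hypothetical entity: one chain of an antibody."""
    name: str
    sequence: str  # stored attribute


# Hypothetical stored data: antibody identifier -> its chains.
STORE: Dict[str, List[Chain]] = {
    "ab1": [Chain("H", "EVQLVESGG"), Chain("L", "DIQMT")],
}


def chains_of(antibody: str) -> List[Chain]:
    """Stored, multi-valued function: look up the chains of an antibody."""
    return STORE[antibody]


def sequence(chain: Chain) -> str:
    """Stored function: return the recorded sequence of a chain."""
    return chain.sequence


def length(seq: str) -> int:
    """Computed function: derive a value that is not stored anywhere."""
    return len(seq)


def compose(f: Callable[[A], B], g: Callable[[B], C]) -> Callable[[A], C]:
    """Return the function that applies f first and then g."""
    return lambda x: g(f(x))


# A derived function built by composing a stored and a computed function.
chain_length = compose(sequence, length)


def long_chains(antibody: str, min_len: int) -> List[str]:
    """Query: names of chains whose derived length is at least min_len."""
    return [c.name for c in chains_of(antibody) if chain_length(c) >= min_len]
```

Under these assumptions, `long_chains("ab1", 6)` selects only the chain named `"H"`. The query does not need to know whether `chain_length` is looked up or calculated, which is the property the summary attributes to function composition.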

References

  1. P. Brown and M.L. Stonebraker. BigSur: A System for the Management of Earth Science Data. In Proc. 21st International Conference on Very Large Data Bases VLDB’95, pages 720–728, 1995.

  2. R.G.G. Cattell, editor. The Object Database Standard: ODMG 2.0. Morgan Kaufmann, 1997.

  3. D.N. Chorafas and H. Steinmann. Solutions for Networked Databases. Academic Press, 1993.

  4. C. Chothia and A.M. Lesk. Canonical structures for the hypervariable regions of immunoglobulins. J. Mol. Biol., 196:901–917, 1987.

  5. C. Chothia, A.M. Lesk, A. Tramontano, et al. Conformations of immunoglobulin hypervariable regions. Nature, 342:877–883, 1989.

  6. S.M. Embury and P.M.D. Gray. The Declarative Expression of Semantic Integrity in a Database of Protein Structure. In A. Illarramendi and O. Diaz, editors, Data Management Systems: Proceedings of the Basque International Workshop on Information Technology (BIWIT 95), pages 216–224, San Sebastian, Spain, July 1995. IEEE Computer Society Press.

  7. E. Gamma, R. Helm, R. Johnson, and J. Vlissides. Design Patterns. Addison-Wesley, 1994.

  8. I. Gil, P.M.D. Gray, and G.J.L. Kemp. A Visual Interface and Navigator for the P/FDM Object Database. In N.W. Paton and T. Griffiths, editors, Proceedings of User Interfaces to Data Intensive Systems (UIDIS’99), pages 54–63. IEEE Computer Society Press, 1999.

  9. P.M.D. Gray, S.M. Embury, K.Y. Hui, and G.J.L. Kemp. The Evolving Role of Constraints in the Functional Data Model. Journal of Intelligent Information Systems, 12:113–137, 1999.

  10. P.M.D. Gray, G.J.L. Kemp, P. Brunschwig, and S.M. Embury. Collection Views: Dynamically Composed Views which Inherit Behaviour. In B. Lings and K. Jeffery, editors, Advances in Databases, Proc. BNCOD17, pages 102–121. (LNCS 1832). Springer-Verlag, 2000.

  11. P.M.D. Gray, G.J.L. Kemp, C.J. Rawlings, N.P. Brown, C. Sander, J.M. Thornton, C.M. Orengo, S.J. Wodak, and J. Richelle. Macromolecular Structure Information and Databases. Trends in Biochemical Sciences, 21:251–256, 1996.

  12. P.M.D. Gray, K.G. Kulkarni, and N.W. Paton. Object-Oriented Databases: a Semantic Data Model Approach. Prentice Hall Series in Computer Science. Prentice Hall International, 1992.

  13. E.A. Kabat, T.T. Wu, H.M. Perry, K.S. Gottesman, and C. Foeller. Sequences of proteins of immunological interest, 5th edition. Public Health Service, NIH, Washington D.C., 1992.

  14. G.J.L. Kemp. Protein Modelling: a Design Application of an Object-Oriented Database. In J. Gero, editor, Proc. 1st International Conference on Artificial Intelligence in Design, pages 387–406. Butterworth-Heinemann, 1991.

  15. G.J.L. Kemp, J. Dupont, and P.M.D. Gray. Using the Functional Data Model to Integrate Distributed Biological Data Sources. In P. Svensson and J.C. French, editors, Proc. Eighth International Conference on Scientific and Statistical Database Management, pages 176–185. IEEE Computer Society Press, 1996.

  16. G.J.L. Kemp, J.J. Iriarte, and P.M.D. Gray. Efficient Access to FDM Objects Stored in a Relational Database. In D.S. Bowers, editor, Directions in Databases: Proc. Twelfth British National Conference on Databases (BNCOD 12), pages 170–186. Springer-Verlag, 1994.

  17. G.J.L. Kemp, Z. Jiao, P.M.D. Gray, and J.E. Fothergill. Combining Computation with Database Access in Biomolecular Computing. In W. Litwin and T. Risch, editors, Applications of Databases: Proceedings of the First International Conference, pages 317–335. Springer-Verlag, 1994.

  18. G.J.L. Kemp, C.J. Robertson, P.M.D. Gray, and N. Angelopoulos. CORBA and XML: Design Choices for Database Federations. In B. Lings and K. Jeffery, editors, Proceedings of the Seventeenth British National Conference on Databases, pages 191–208. Springer-Verlag, 2000.

  19. T. Landers and R.L. Rosenberg. An Overview of MULTIBASE. In H.-J. Schneider, editor, Distributed Data Bases. North-Holland, 1982.

  20. D.W. Ritchie and G.J.L. Kemp. Modeling Antibody Side Chain Conformations Using Heuristic Database Search. In T. Gaasterland, P. Karp, K. Karplus, C. Ouzounis, C. Sander, and A. Valencia, editors, Proceedings

  21. R.J. Robbins. Bioinformatics: Essential Infrastructure for Global Biology. J. Comp. Biol., 3:465–478, 1996.

  22. J. Rumbaugh, M. Blaha, and W. Premerlani. Object-oriented Modelling and Design. Prentice Hall, 1991.

  23. A.P. Sheth and J.A. Larson. Federated Database Systems for Managing Distributed, Heterogeneous and Autonomous Databases. ACM Computing Surveys, 22:183–236, 1990.

  24. D.W. Shipman. The Functional Data Model and the Data Language DAPLEX. ACM Transactions on Database Systems, 6(1):140–173, 1981.

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Gray, P.M.D., Kemp, G.J.L. (2004). An Expressive Functional Data Model and Query Language for Bioinformatics Data Integration. In: Gray, P.M.D., Kerschberg, L., King, P.J.H., Poulovassilis, A. (eds) The Functional Approach to Data Management. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-05372-0_7

  • DOI: https://doi.org/10.1007/978-3-662-05372-0_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-05575-1

  • Online ISBN: 978-3-662-05372-0

  • eBook Packages: Springer Book Archive
