Advertisement

Containment of Relational Queries with Annotation Propagation

  • Wang-Chiew Tan
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2921)

Abstract

We study the problem of determining whether a query is contained in another when queries can carry along annotations from source data. We say that a query is annotation-contained in another if the annotated output of the former is contained in the latter on every possible annotated input databases. We study the relationship between query containment and annotation-containment and show that annotation-containment is a more refined notion in general. As a consequence, the usual equivalences used by a typical query optimizer may no longer hold when queries can carry along annotations from the source to the output. Despite this, we show that the same annotated result is obtained whether intermediate constructs of a query are evaluated with set or bag semantics. We also give a necessary and sufficient condition, via homomorphisms, that checks whether a query is annotation-contained in another. Even though our characterization suggests that annotation-containment is more complex than query containment, we show that the annotation-containment problem is NP-complete, thus putting it in the same complexity class as query containment. In addition, we show that the annotation placement problem, which was first shown to be NP-hard in [7], is in fact DP-hard and the exact complexity of this problem still remains open.

Keywords

Annotation Propagation Output Location Conjunctive Query Query Engine Relational Query 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Abiteboul, S., Hull, R., Vianu, V.: Foundations of Databases. Addison Wesley Publishing Co., Reading (1995)zbMATHGoogle Scholar
  2. 2.
    Bairoch, A., Apweiler, R.: The SWISS-PROT protein sequence database and its supplement TrEMBL. Nucleic Acids Research 28, 45–48 (2000)CrossRefGoogle Scholar
  3. 3.
    Benson, D., Karsch-Mizrachi, I., Lipman, D., Ostell, J., Rapp, B., Wheeler, D.: GenBank. Nucleic Acids Research 28(1), 15–18 (2000)CrossRefGoogle Scholar
  4. 4.
    Bernstein, P., Bergstraesser, T.: Meta-Data Support for Data Transformations Using Microsoft Repository. IEEE Data Engineering Bulletin 22(1), 9–14 (1999)Google Scholar
  5. 5.
    biodas.org, http://biodas.org
  6. 6.
    Buneman, P., Khanna, S., Tan, W.: Why and Where: A Characterization of Data Provenance. In: Proceedings of the International Conference on Database Theory (ICDT), London, United Kingdom, pp. 316–330 (2001)Google Scholar
  7. 7.
    Buneman, P., Khanna, S., Tan, W.: On Propagation of Deletions and Annotations Through Views. In: Proceedings of the ACM Symposium on Principles of Database Systems (PODS), Wisconsin, Madison, pp. 150–158 (2002)Google Scholar
  8. 8.
    Chandra, K., Merlin, P.M.: Optimal implementation of conjunctive queries in relational data bases. In: Proceedings of the ninth annual ACM symposium on Theory of computing, Boulder, Colorado, pp. 77–90 (1977)Google Scholar
  9. 9.
    Chaudhuri, S., Vardi, M.Y.: Optimization of real conjunctive queries. In: Proceedings of the ACM Symposium on Principles of Database Systems (PODS), Washington, DC, pp. 59–70 (1993)Google Scholar
  10. 10.
    Chekuri, C., Rajaraman, A.: Conjunctive Query Containment Revisited. In: Afrati, F.N., Kolaitis, P.G. (eds.) ICDT 1997. LNCS, vol. 1186, pp. 56–70. Springer, Heidelberg (1996)Google Scholar
  11. 11.
    Cosmadakis, S.S.: The Complexity of Evaluating Relational Queries. Information and Control 58(1-3), 101–112 (1983)zbMATHCrossRefMathSciNetGoogle Scholar
  12. 12.
    Davidson, S.B., Crabtree, J., Bunk, B.P., Schug, J., Tannen, V., Overton, G.C., Stoeckert Jr., C.J.: K2/Kleisli and GUS: Experiments in integrated access to genomic data sources - Author bios. IBM Systems Journal 40(2), 512–531 (2001)CrossRefGoogle Scholar
  13. 13.
    Denning, D.E., Lunt, T.F., Schell, R.R., Shockley, W.R., Heckman, M.: The SeaView Security Model. In: IEEE Symposium on Security and Privacy, Washington, DC, pp. 218–233 (1988)Google Scholar
  14. 14.
    Dowell, R.: A Distributed Annotation System. Technical report, Department of Computer Science, Washington University in St. Louis (2001)Google Scholar
  15. 15.
    Fernández, M., Kadiyska, Y., Morishima, A., Suciu, D., Tan, W.: SilkRoute: A Framework for Publishing Relational Data in XML. ACM Transactions on Database Systems (TODS) 27(4), 438–493 (2002)CrossRefGoogle Scholar
  16. 16.
    Hell, P., Nešetřil, J.: The Core of a Graph. Discrete Mathematics 109, 117–126 (1992)zbMATHCrossRefMathSciNetGoogle Scholar
  17. 17.
    Jajodia, S., Sandhu, R.S.: Polyinstantiation integrity in multilevel relations. In: IEEE Symposium on Security and Privacy, Oakland, California, pp. 104–115 (1990)Google Scholar
  18. 18.
    Kahan, J., Koivunen, M., Prud’Hommeaux, E., Swick, R.: Annotea: An open rdf infrastructure for shared web annotations. In: Proceedings of the International World Wide Web Conference(WWW10), Hong Kong, China, pp. 623–632 (2001)Google Scholar
  19. 19.
    Kent, W.J., Sugnet, C.W., Furey, T.S., Roskin, K.M., Pringle, T.H., Zahler, A.M., Haussler, D.: The Human Genome Browser at UCSC. Genome Research 12(5), 996–1006 (2002)Google Scholar
  20. 20.
    LaLiberte, D., Braverman, A.: A Protocol for Scalable Group and Public Annotations. In: Proceedings of the International World Wide Web Conference(WWW3), Darmstadt, Germany (1995)Google Scholar
  21. 21.
    Lee, T., Bressan, S., Madnick, S.: Source Attribution for Querying Against Semistructured Documents. In: Workshop on Web Information and Data Management (WIDM), Washington, DC (1998)Google Scholar
  22. 22.
    Levy, A.Y., Mendelzon, A.O., Sagiv, Y., Srivastava, D.: Answering queries using views. In: Proceedings of the ACM Symposium on Principles of Database Systems (PODS), San Jose, California, pp. 95–104 (1995)Google Scholar
  23. 23.
    Maier, D., Delcambre, L.: Superimposed Information for the Internet. In: Proceedings of the International Workshop on the Web and Databases (WebDB), Philadelphia, Pennsylvania, pp. 1–9 (1999)Google Scholar
  24. 24.
    Myers, C., Liskov, B.: A decentralized model for information control. In: Proceedings of the ACM Symposium on Operating Systems Principles (SOSP), Saint-Malo, France, pp. 129–142 (1997)Google Scholar
  25. 25.
    Papadimitriou, C.H.: Computational Complexity. Addison-Wesley, Reading (1994)zbMATHGoogle Scholar
  26. 26.
    Phelps, T.A., Wilensky, R.: Multivalent Annotaions. In: Proceedings of the First European Conference on Research and Advanced Technology for Digital Libraries, Pisa, Italy, pp. 287–303 (1997)Google Scholar
  27. 27.
    Phelps, T.A., Wilensky, R.: Multivalent documents. In: Proceedings of the Communications of the Association for Computing Machinery (CACM), vol. 43(6), pp. 82–90 (2000)Google Scholar
  28. 28.
    Phelps, T.A., Wilensky, R.: Robust intra-document locations. In: Proceedings of the International World Wide Web Conference(WWW9), Amsterdam, Netherlands, pp. 105–118 (2000)Google Scholar
  29. 29.
    Schickler, M.A., Mazer, M.S., Brooks, C.: Pan-Browser Support for Annotations and Other Meta-Information on the World Wide Web. In: Proceedings of the International World Wide Web Conference(WWW5), Paris, France (1996)Google Scholar
  30. 30.
    Shanmugasundaram, J., Shekita, E., Barr, R., Carey, M., Lindsay, B., Pirahesh, H., Reinwald, B.: Efficiently Publishing Relational Data as XML Documents. VLDB Journal 10(2-3), 133–154 (2001)zbMATHGoogle Scholar
  31. 31.
    Tan, W.: Containment of Relational Queries with Annotation Propagation. Technical report, Dept. of Computer Science, University of California, Santa Cruz (2003)Google Scholar
  32. 32.
    W3C. Annotea Project, http://www.w3.org/2001/Annotea
  33. 33.
    Wang, Y.R., Madnick, S.E.: A Polygen Model for Heterogeneous Database Systems: The Source Tagging Perspective. In: Proceedings of the International Conference on Very Large Data Bases (VLDB), Brisbane, Queensland, Australia, pp. 519–538 (1990)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Wang-Chiew Tan
    • 1
  1. 1.University of CaliforniaSanta Cruz

Personalised recommendations