Skip to main content

Query Reformulation for Dynamic Information Integration

  • Chapter
Intelligent Integration of Information

Abstract

The standard approach to integrating heterogeneous information sources is to build a global schema that relates all of the information in the different sources, and to pose queries directly against it. The problem is that schema integration is usually difficult, and as soon as any of the information sources change or a new source is added, the process may have to be repeated.

The SIMS system uses an alternative approach. A domain model of the application domain is created, establishing a fixed vocabulary for describing data sets in the domain. Using this language, each available information source is described. Queries to SIMS against the collection of available information sources are posed using terms from the domain model, and reformulation operators are employed to dynamically select an appropriate set of information sources and to determine how to integrate the available information to satisfy a query. This approach results in a system that is more flexible than existing ones, more easily scalable, and able to respond dynamically to newly available or unexpectedly missing information sources.

This paper describes the query reformulation process in SIMS and the operators used in it. We provide precise definitions of the reformulation operators and explain the rationale behind choosing the specific ones SIMS uses. We have demonstrated the feasibility and effectiveness of this approach by applying SIMS in the domains of transportation planning and medical trauma care.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Ahmed, Rafi, Philippe De Smedt, Weimin Du, William Kent, Mohammad A. Ketabchi, Witold A. Litwin, Abbas Rafii, and Ming Chien Shan. The Pegasus heterogenous multidatabase system. IEEE Computer, pages 19–27, 1991.

    Google Scholar 

  • Arens, Yigal, Chin Chee, Chun-Nan Hsu, Hoh In, and Craig A. Knoblock. Query processing in an information mediator. In Proceedings of the ARPA/Rome Laboratory Knowledge-Based Planning and Scheduling Initiative, Tucson, AZ, 1994.

    Google Scholar 

  • Arens, Yigal, Chin Y. Chee, Chun-Nan Hsu, and Craig A. Knoblock. Retrieving and integrating data from multiple information sources. International Journal on Intelligent and Cooperative Information Systems, 2(2): 127–158, 1993.

    Article  Google Scholar 

  • Barrett, Anthony, Keith Golden, Scott Penberthy, and Daniel Weld. UCPOP user’s manual (version 2.0). Technical Report 93–09–06, Department of Computer Science and Engineering, University of Washington, 1993.

    Google Scholar 

  • Brachman,R.J., and J.G. Schmolze. An overview of the kl-one knowledge representation system. Cognitive Science, 9(2):171–216,1985.

    Article  Google Scholar 

  • Cheung, Waiman. The Model-assisted Global Query System. PhD thesis, Computer Science Department, Rensselaer Polytechnic Institute, Troy, NY, 1991.

    Google Scholar 

  • Collet, Christine, Michael N. Huhns, and Wei-Min Shen. Resource integration using a large knowledge base in Carnot. IEEE Computer, pages 55–62, December 1991.

    Google Scholar 

  • Gallier, Jean, Paliath Narendran, David Plaisted, Stan Raatz, and Wayne Synder. An algorithm for finding canonical sets of ground rewrite rules in polynomial time. Journal of the ACM, 40(1). 1–16,1993.

    Article  MATH  Google Scholar 

  • Hsu, Chun-Nan and Craig A. Knoblock. Reformulating query plans for multidatabase systems. In Proceedings of the Second International Conference on Information and Knowledge Management, Washington, D.C., 1993. ACM.

    Google Scholar 

  • Hsu, Chun-Nan and Craig A. Knoblock. Rule induction for semantic query optimization. In Proceedings of the Eleventh International Conference on Machine Learning, New Brunswick, NJ, 1994.

    Google Scholar 

  • Hsu, Chun-Nan and Craig A. Knoblock. Estimating the robustness of discovered knowledge. In Proceedings of the First International Conference on Knowledge Discovery and Data Mining, Montreal, Canada, 1995.

    Google Scholar 

  • Hsu, Chun-Nan and Craig A. Knoblock. Using inductive learning to generate rules for semantic query optimization. In Gregory Piatetsky-Shapiro and Usama Fayyad, editors, Advances in Knowledge Discovery and Data Mining, chapter 17. MIT Press, Boston, MA, 1995.

    Google Scholar 

  • Huhns, M.N N. Jacobs, T. Ksiezyk, WM. Shen, M.P. Singh, and P.E. Cannata. Integrating enterprise information models in Carnot. In Proceedings of 1993 International Conference on Intelligent and Cooperative Information Systems, Rotterdam, Holland, May 1993.

    Google Scholar 

  • InterSystems, Cambridge, MA. Open M/SQL Server User Guide, RDBMS E.3 edition, 1993.

    Google Scholar 

  • Kim, Won and Jungyun Seo. Classifying schematic and data heterogeneity in multidatabase systems. IEEE Computer, pages 12–18,1991.

    Google Scholar 

  • Knoblock, Craig A. Generating parallel execution plans with a partial-order planner. In Proceedings of the Second International Conference on Artificial Intelligence Planning Systems, Chicago, IL, 1994.

    Google Scholar 

  • Knoblock, Craig A. Planning, executing, sensing, and replanning for information gathering. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, Montreal, Canada, 1995.

    Google Scholar 

  • Landers, Terry and Ronni L. Rosenberg. An overview of Multibase. In H.J. Schneider, editor, Distributed Data Bases. North-Holland, 1982.

    Google Scholar 

  • Lenat, D. and R.V. Guha. Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project. Addison-Wesley, Reading, MA, 1990.

    Google Scholar 

  • Levy, Alon Y, Alberto O. Mendelzon, Yehoshua Sagiv, and Divesh Srivastava. Answering queries using views. In Proceedings of the 14th ACM Symposium on Principles of Database Systems, San Jose, CA, 1995.

    Google Scholar 

  • Levy, Alon Y, Divesh Srivastava, and Thomas Kirk. Data model and query evaluation in global information systems. Journal of Intelligent Information Systems, 1995.

    Google Scholar 

  • MacGregor, Robert. The evolving technology of classification-based knowledge representation systems. In John Sowa, editor, Principles of Semantic Networks: Explorations in the Representation of Knowledge. Morgan Kaufmann, 1990.

    Google Scholar 

  • McKay, Donald P., Timothy W. Finin, and Anthony O’Hare. The intelligent database interface: Integrating AI and database systems. In Proceedings of the Eighth National Conference on Artificial Intelligence, Boston, MA, 1990.

    Google Scholar 

  • Pastor, Jon A., Donald P. McKay, and Timothy W. Finin. View-concepts: Knowledge-based access to databases. In Proceedings of the First International Conference on Information and Knowledge Management, pages 84–91, Baltimore, MD, 1992.

    Google Scholar 

  • Reddy, M.P.,B.E. Prasad, and PG. Reddy. Query processing in heterogeneous distributed database management systems. In Amar Gupta, editor, Integration of Information Systems: Bridging Heterogeneous Databases, pages 264–277. IEEE Press, NY, 1989.

    Google Scholar 

  • Griffiths Selinger, P., M.M. Astrahan, D.D. Chamberlin, R.A. Lorie, and T.G. Price. Access path selection in a relational database management system. In Artificial Intelligence and Databases, pages 511–522. Morgan Kaufmann, Los Altos, CA, 1988.

    Google Scholar 

  • Sheth, Amit P., and James A. Larson. Federated database systems for managing distributed, heterogeneous, and autonomous databases. ACM Computing Surveys, 22(3): 183–236,1990.

    Article  Google Scholar 

  • Siegel, Michael. A metadata approach to resolving semantic conflicts. In Proceedings of the 17th International Conference on Very Large Data Bases, pages 133–145, Barcelona, Spain, 1991.

    Google Scholar 

  • Tejada, Sheila and Craig A. Knoblock. Mapping a relational query language into a knowledge representation query language. Technical report, Information Sciences Institute, University of Southern California, 1995.

    Google Scholar 

  • Woelk, D., W.M. Shen, M.N. Huhns, and P. Cannata. Model driven enterprise information management in Carnot. In Enterprise Integration Modeling: Proceedings of the First International Conference, 1992.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1996 Kluwer Academic Publishers, Boston

About this chapter

Cite this chapter

Arens, Y., Knoblock, C.A., Shen, WM. (1996). Query Reformulation for Dynamic Information Integration. In: Wiederhold, G. (eds) Intelligent Integration of Information. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-0467-8_2

Download citation

  • DOI: https://doi.org/10.1007/978-1-4613-0467-8_2

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4613-8062-7

  • Online ISBN: 978-1-4613-0467-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics