Skip to main content
Log in

Query reformulation for dynamic information integration

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

The standard approach to integrating heterogeneous information sources is to build a global schema that relates all of the information in the different sources, and to pose queries directly against it. The problem is that schema integration is usually difficult, and as soon as any of the information sources change or a new source is added, the process may have to be repeated.

The SIMS system uses an alternative approach. A domain model of the application domain is created, establishing a fixed vocabulary for describing data sets in the domain. Using this language, each available information source is described. Queries to SIMS against the collection of available information sources are posed using terms from the domain model, and reformulation operators are employed to dynamically select an appropriate set of information sources and to determine how to integrate the available information to satisfy a query. This approach results in a system that is more flexible than existing ones, more easily scalable, and able to respond dynamically to newly available or unexpectedly missing information sources.

This paper describes the query reformulation process in SIMS and the operators used in it. We provide precise definitions of the reformulation operators and explain the rationale behind choosing the specific ones SIMS uses. We have demonstrated the feasibility and effectiveness of this approach by applying SIMS in the domains of transportation planning and medical trauma care.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ahmed, Rafi, Philippe De Smedt, Weimin Du, William Kent, Mohammad A. Ketabchi, Witold A. Litwin, Abbas Rafii, and Ming Chien Shan. The Pegasus heterogenous multidatabase system. IEEE Computer, pages 19–27, 1991.

  • Arens, Yigal, Chin Chee, Chun-Nan Hsu, Hoh In, and Craig A. Knoblock. Query processing in an information mediator. In Proceedings of the ARPA/Rome Laboratory Knowledge-Based Planning and Scheduling Initiative, Tucson, AZ, 1994.

  • Arens, Yigal, Chin Y., Chee, Chun-Nan, Hsu, and Craig A., Knoblock. Retrieving and integrating data from multiple information sources. International Journal on Intelligent and Cooperative Information Systems, 2(2):127–158, 1993.

    Google Scholar 

  • Barrett, Anthony, Keith Golden, Scott Penberthy, and Daniel Weld. UCPOP user's manual (version 2.0). Technical Report 93-09-06, Department of Computer Science and Engineering, University of Washington, 1993.

  • Brachman, R.J., and J.G., Schmolze. An overview of the kl-one knowledge representation system. Cognitive Science, 9(2):171–216, 1985.

    Google Scholar 

  • Cheung, Waiman. The Model-assisted Global Query System. PhD thesis, Computer Science Department, Rensselaer Polytechnic Institute, Troy, NY, 1991.

  • Collet, Christine, Michael N. Huhns, and Wei-Min Shen. Resource integration using a large knowledge base in Carnot. IEEE Computer, pages 55–62, December 1991.

  • Gallier, Jean, Paliath, Narendran, David, Plaisted, Stan, Raatz, and Wayne, Syner. An algorithm for finding canonical sets of ground rewrite rules in polynomial time. Journal of the ACM, 40(1):1–16, 1993.

    Google Scholar 

  • Hsu, Chun-Nan and Craig A. Knoblock. Reformulating query plans for multidatabase systems. In Proceedings of the Second International Conference on Information and Knowledge Management, Washington, D.C., 1993. ACM.

  • Hsu, Chun-Nan and Craig A. Knoblock. Rule induction for semantic query optimization. In Proceedings of the Eleventh International Conference on Machine Learning, New Brunswick, NJ, 1994.

  • Hsu, Chun-Nan and Craig A. Knoblock. Estimating the robustness of discovered knowledge. In Proceedings of the First International Conference on Knowledge Discovery and Data Mining, Montreal, Canada, 1995.

  • Hsu, Chun-Nan and Craig A., Knoblock. Using inductive learning to generate rules for semantic query optimization. In Gregory, Piatetsky-Shapiro and Usama, Fayyad, editors, Advances in Knowledge Discovery and Data Mining, chapter 17. MIT Press, Boston, MA, 1995.

    Google Scholar 

  • Huhns, M.N.N. Jacobs, T. Ksiezyk, W.M. Shen, M.P. Singh, and P.E. Cannata. Integrating enterprise information models in Carnot. In Proceedings of 1993 International Conference on Intelligent and Cooperative Information Systems, Rotterdam, Holland, May 1993.

  • InterSystems, Cambridge, MA. Open M/SQL Server User Guide, RDBMS E.3 edition, 1993.

  • Kim, Won and Jungyun Seo. Classifying schematic and data heterogeneity in multidatabase systems. IEEE Computer, pages 12–18, 1991.

  • Knoblock, Craig A. Generating parallel execution plans with a partial-order planner. In Proceedings of the Second International Conference on Artificial Intelligence Planning Systems, Chicago, IL, 1994.

  • Knoblock, Craig A. Planning, executing, sensing, and replanning for information gathering. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, Montreal, Canada, 1995.

  • Landers, Terry and Ronni L. Rosenberg. An overview of Multibase. In H.J. Schneider, editor, Distributed Data Bases. North-Holland, 1982.

  • Lenat, D. and R.V., Guha. Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project. Addison-Wesley, Reading, MA, 1990.

    Google Scholar 

  • Levy, Alon Y., Alberto O. Mendelzon, Yehoshua Sagiv, and Divesh Srivastava. Answering queries using views. In Proceedings of the 14th ACM Symposium on Principles of Database Systems, San Jose, CA, 1995.

  • Levy, Alon Y., Divesh Srivastava, and Thomas Kirk. Data model and query evaluation in global information systems. Journal of Intelligent Information Systems, 1995.

  • MacGregor, Robert. The evolving technology of classification-based knowledge representation systems. In John Sowa, editor, Principles of Semantic Networks: Explorations in the Representation of Knowledge. Morgan Kaufmann, 1990.

  • McKay, Donald P., Timothy W. Finin, and Anthony O'Hare. The intelligent database interface: Integrating AI and database systems. In Proceedings of the Eighth National Conference on Artificial Intelligence, Boston, MA, 1990.

  • Pastor, Jon A., Donald P. McKay, and Timothy W. Finin. View-concepts: Knowledge-based access to databases. In Proceedings of the First International Conference on Information and Knowledge Management, pages 84–91, Baltimore, MD, 1992.

  • Reddy, M.P., B.E., Prasad, and P.G., Reddy. Query processing in heterogeneous distributed database management systems. In Amar, Gupta, editor, Integration of Information Systems: Bridging Heterogeneous Databases, pages 264–277. IEEE Press, NY, 1989.

    Google Scholar 

  • Griffiths Selinger, P., M.M., Astrahan, D.D., Chamberlin, R.A., Lorie, and T.G., Price. Access path selection in a relational database management system. In Artificial Intelligence and Databases, pages 511–522. Morgan Kaufmann, Los Altos, CA, 1988.

    Google Scholar 

  • Sheth, Amit P., and James A., Larson. Federated database systems for managing distributed, heterogeneous, and autonomous databases. ACM Computing Surveys, 22(3): 183–236, 1990.

    Google Scholar 

  • Siegel, Michael. A metadata approach to resolving semantic conflicts. In Proceedings of the 17th International Conference on Very Large Data Bases, pages 133–145, Barcelona, Spain, 1991.

  • Tejada, Sheila and Craig A. Knoblock. Mapping a relational query language into a knowledge representation query language. Technical report, Information Sciences Institute, University of Southern California, 1995.

  • Woelk, D., W.M. Shen, M.N. Huhns, and P. Cannata. Model driven enterprise information management in Carnot. In Enterprise Integration Modeling: Proceedings of the First International Conference, 1992.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Arens, Y., Knoblock, C.A. & Shen, WM. Query reformulation for dynamic information integration. J Intell Inf Syst 6, 99–130 (1996). https://doi.org/10.1007/BF00122124

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF00122124

Keywords

Navigation