ABSTRACT
A fundamental task in data integration and data exchange is the design of schema mappings, that is, high-level declarative specifications of the relationship between two database schemas. Several research prototypes and commercial systems have been developed to facilitate schema-mapping design; a common characteristic of these systems is that they produce a schema mapping based on attribute correspondences across schemas solicited from the user via a visual interface. This methodology, however, suffers from certain shortcomings. In the past few years, a fundamentally different methodology to designing and understanding schema mappings has emerged. This new methodology is based on the systematic use of data examples to derive, illustrate, and refine schema mappings.
Example-driven schema-mapping design is currently an active area of research in which several different approaches towards using data examples in schema-mapping design have been explored. After a brief overview of the earlier methodology, this tutorial will provide a comprehensive overview of the different ways in which data examples can be used in schema-mapping design. In particular, it will cover the basic concepts, technical results, and prototype systems that have been developed in the past few years, as well as open problems and directions for further research in this area.
- Bogdan Alexe, Laura Chiticariu, Renée J. Miller, and Wang Chiew Tan. Muse: Mapping Understanding and deSign by Example. In ICDE, pages 10--19, 2008. Google ScholarDigital Library
- Bogdan Alexe, Wang Chiew Tan, and Yannis Velegrakis. STBenchmark: Towards a Benchmark for Mapping Systems. PVLDB, 1(1):230--244, 2008. Google ScholarDigital Library
- Bogdan Alexe, Balder ten Cate, Phokion G. Kolaitis, and Wang Chiew Tan. Characterizing schema mappings via data examples. ACM Trans. Database Syst., 36(4):23, 2011. Google ScholarDigital Library
- Bogdan Alexe, Balder ten Cate, Phokion G. Kolaitis, and Wang Chiew Tan. Designing and refining schema mappings via data examples. In SIGMOD, pages 133--144, 2011. Google ScholarDigital Library
- Bogdan Alexe, Balder ten Cate, Phokion G. Kolaitis, and Wang Chiew Tan. Eirene: Interactive design and refinement of schema mappings via data examples. PVLDB, 4(12):1414--1417, 2011.Google Scholar
- Pablo Barceló. Logical foundations of relational data exchange. SIGMOD Record, 38(1):49--58, 2009. Google ScholarDigital Library
- Philip A. Bernstein, Todd J. Green, Sergey Melnik, and Alan Nash. Implementing Mapping Composition. VLDB Journal, 17(2):333--353, 2008. Google ScholarDigital Library
- Philip A. Bernstein and Laura M. Haas. Information integration in the enterprise. Communications of the Association for Computing Machinery (CACM), 51(9):72--79, 2008. Google ScholarDigital Library
- A. Bonifati, E. Q. Chang, T. Ho, V. S. Lakshmanan, and R. Pottinger. HePToX: Marrying XML and Heterogeneity in Your P2P Databases. In VLDB, pages 1267--1270, 2005. Google ScholarDigital Library
- Balder ten Cate, Víctor Dalmau, and Phokion Kolaitis. Learning schema mappings. In Proceedings of the International Conference on Database Theory (ICDT 2012), pages 22--33. ACM Press, 2012. Google ScholarDigital Library
- George H. L. Fletcher, Marc Gyssens, Jan Paredaens, and Dirk Van Gucht. On the expressive power of the relational algebra on finite sets of relation pairs. TKDE, 21(6):939--942, 2009. Google ScholarDigital Library
- Georg Gottlob and Pierre Senellart. Schema mapping discovery from data instances. JACM, 57(2), 2010. Google ScholarDigital Library
- L. M. Haas, M. A. Hernández, H. Ho, L. Popa, and M. Roth. Clio Grows Up: From Research Prototype to Industrial Tool. In ACM SIGMOD, pages 805--810, 2005. Google ScholarDigital Library
- P. G. Kolaitis. Schema Mappings, Data Exchange, and Metadata Management. In ACM PODS, pages 61--75, 2005. Google ScholarDigital Library
- M. Lenzerini. Data Integration: A Theoretical Perspective. In ACM PODS, pages 233--246, 2002. Google ScholarDigital Library
- Anish Das Sarma, Aditya G. Parameswaran, Hector Garcia-Molina, and Jennifer Widom. Synthesizing view definitions from data. In ICDT, pages 89--103, 2010. Google ScholarDigital Library
- L. Yan, R. J. Miller, L. M. Haas, and R. Fagin. Data-Driven Understanding and Refinement of Schema Mappings. In ACM SIGMOD, pages 485--496, 2001. Google ScholarDigital Library
Index Terms
- Schema mappings and data examples
Recommendations
Designing and refining schema mappings via data examples
SIGMOD '11: Proceedings of the 2011 ACM SIGMOD International Conference on Management of dataA schema mapping is a specification of the relationship between a source schema and a target schema. Schema mappings are fundamental building blocks in data integration and data exchange and, as such, obtaining the right schema mapping constitutes a ...
Characterizing schema mappings via data examples
Schema mappings are high-level specifications that describe the relationship between two database schemas; they are considered to be the essential building blocks in data exchange and data integration, and have been the object of extensive research ...
Characterizing schema mappings via data examples
PODS '10: Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systemsSchema mappings are high-level specifications that describe the relationship between two database schemas; they are considered to be the essential building blocks in data exchange and data integration, and have been the object of extensive research ...
Comments