Abstract
Semantic heterogeneity is one of the key challenges in integrating and sharing data across disparate sources, data exchange and migration, data warehousing, model management, the Semantic Web and peer-to-peer databases. Semantic heterogeneity can arise at the schema level and at the data level. At the schema level, sources can differ in relations, attribute and tag names, data normalization, levels of detail, and the coverage of a particular domain. The problem of reconciling schema-level heterogeneity is often referred to as schema matching or schema mapping. At the data level, we find different representations of the same real-world entities (e.g., people, companies, publications, etc.). Reconciling data-level heterogeneity is referred to as data deduplication, record linkage, and entity/object matching. To exacerbate the heterogeneity challenges, schema elements of one source can be represented as data in another. This special issue presents a set of articles that describe recent work on semantic heterogeneity at the schema level.
- C. Batini, M. Lenzerini, and S. Navathe. A comparative analysis of methodologies for database schema integration. ACM Computing Survey, 18(4):323--364, 1986.]] Google ScholarDigital Library
- J. Berlin and A. Motro. Database schema matching using machine learning with feature selection. In Proceedings of the Conf. on Advanced Information Systems Engineering (CAiSE), 2002.]] Google ScholarDigital Library
- P. Bernstein. Applying model management to classical meta data problems. In Proceedings of the Conf. on Innovative Database Research (CIDR), 2003.]]Google Scholar
- C. Clifton, E. Housman, and A. Rosenthal. Experience with a combined approach to attribute-matching across heterogeneous databases. In Proc. of the IFIP Working Conference on Data Semantics (DS-7), 1997.]]Google Scholar
- H. Do and E. Rahm. Coma: A system for flexible combination of schema matching approaches. In Proceedings of the 28th Conf. on Very Large Databases (VLDB), 2002.]] Google ScholarDigital Library
- A. Doan, P. Domingos, and A. Halevy. Reconciling schemas of disparate data sources: A machine learning approach. In Proceedings of the ACM SIGMOD Conference, 2001.]] Google ScholarDigital Library
- A. Doan and A. Halevy. Semantic integration research in the database community: A brief survey. AI Magazine, Special Issue on Semantic Integration. To appear. Available at http://anhai.cs.uiuc.edu/home, 2005.]] Google ScholarDigital Library
- A. Doan, A. Y. Halevy, and N. F. Noy. Semantic integration workshop at the 2nd int. semantic web conf. (iswc-2003). SIGMOD Record, 33(1), 2004.]]Google Scholar
- D. Embley, D. Jackman, and L. Xu. Multifaceted exploitation of metadata for attribute match discovery in information integration. In Proc. of the WIIW-01, 2001.]]Google Scholar
- A. Halevy. Answering queries using views: A survey. The VLDB Journal, 10(4):270--294, 2001.]] Google ScholarDigital Library
- B. He and K. Chang. Statistical schema matching across web query interfaces. In Proc. of the ACM SIGMOD Conf. (SIGMOD), 2003.]] Google ScholarDigital Library
- J. Kang and J. Naughton. On schema matching with opaque column names and data values. In Proc. of the ACM SIGMOD Int. Conf. on Management of Data (SIGMOD-03), 2003.]] Google ScholarDigital Library
- M. Lenzerini. Data integration; a theoretical perspective. In Proc. of PODS-02, 2002.]] Google ScholarDigital Library
- J. Madhavan, P. Bernstein, A. Doan, and A. Halevy. Corpus-based schema matching. In Proc. of the 18th IEEE Int. Conf. on Data Engineering (ICDE), 2005.]] Google ScholarDigital Library
- R. McCann, A. Doan, A. Kramnik, and V. Varadarajan. Building data integration systems via mass collaboration. In Proc. of the SIGMOD-03 Workshop on the Web and Databases (WebDB-03), 2003.]]Google Scholar
- A. Ouksel and A. P. Seth. Special issue on semantic interoperability in global information systems. SIGMOD Record, 28(1), 1999.]] Google ScholarDigital Library
- E. Rahm and P. Bernstein. On matching schemas automatically. VLDB Journal, 10(4), 2001.]] Google ScholarDigital Library
- W. Wu, C. Yu, A. Doan, and W. Meng. An interactive clustering-based approach to integrating source query interfaces on the Deep Web. In Proc. of the ACM SIGMOD Conf., 2004.]] Google ScholarDigital Library
- L. Yan, R. Miller, L. Haas, and R. Fagin. Data driven understanding and refinement of schema mappings. In Proceedings of the ACM SIGMOD, 2001.]] Google ScholarDigital Library
Recommendations
Ontology alignment for semantic data integration through foundational ontologies
ER'12: Proceedings of the 2012 international conference on Advances in Conceptual ModelingOntology alignment is the process of finding corresponding entities with the same intended meaning in different ontologies. In scenarios where an ontology conceptually describes the contents of a data repository, this provides valuable information for ...
Semantic integration of enterprise information systems using meta-metadata ontology
This paper proposes a non-domain-specific metadata ontology as a core component in a semantic model-based document management system (DMS), a potential contender towards the enterprise information systems of the next generation. What we developed is the ...
Frame-based ontological view for semantic integration
Semantic integration is crucial for successful collaboration between heterogeneous information systems. Traditional ontology-driven approaches rely on the availability of explicit ontologies. However, in most application domains, this prerequisite ...
Comments