Skip to main content

XSLTGen: A System for Automatically Generating XML Transformations Via Semantic Mappings

  • Conference paper
Book cover Journal on Data Semantics V

Part of the book series: Lecture Notes in Computer Science ((JODS,volume 3870))

Abstract

XML is rapidly emerging as a dominant standard for representing and exchanging information. The ability to transform and present data in XML is crucial and XSLT is a relatively recent programming language, specially designed to support this activity. Despite its utility, however, XSLT is widely considered a difficult language to learn.

In this paper, we present a novel system called XSLTGen, an automatic XSLT Generator. This system automatically generates an XSLT stylesheet, given a source XML document and a desired output HTML or XML document. It allows users to become familiar with and learn XSLT stylesheets, based solely on their knowledge of XML or HTML. Our method for automatically generating XSLT transformations is based on the use of semantic mappings between the input and output documents. We show how such mappings can be first discovered and then employed to create XSLT stylesheets. The results of our experiments show that XSLTGen works well with a number of different varieties of XML and HTML documents.

Parts of results of this paper appeared in [30].

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Barnard, D.T., Duncan, N., Clarke, G.: Tree-to-tree Correction for Document Trees. Technical Report 95–372, Department of Computing and Information Science, Queen’s University, Kingston (1995)

    Google Scholar 

  2. Bergamaschi, S., Castano, S., Vimeracati, S.D.C.D., Vincini, M.: An Intelligent Approach to Information Integration. In: Proceedings of the 1st International Conference on Formal Ontology in Information Systems, Trento, Italy, pp. 253–267 (June 1998)

    Google Scholar 

  3. Boukottaya, A., Vanoirbeek, C., Paganelli, F., Khaled, O.A.: Automating XML Documents Transformations: A Conceptual Modelling Based Approach. In: Proceedings of the 1st Asia-Pacific Conference on Conceptual Modelling, Dunedin, New Zealand, pp. 81–90 ( January 2004)

    Google Scholar 

  4. Bray, T., Paoli, J., Sperberg-McQueen, C.M., Maler, E.: Extensible Markup Language (XML) 1.0 Second Edn. W3C Recommendation (October 2000), http://www.w3.org/TR/REC–xml

  5. Castano, S., Antonellis, V.D.: A Schema Analysis and Reconciliation Tool Environment for Heterogeneous Databases. In: Proceedings of the 1999 International Database Engineering and Applications Symposium, Montreal, Canada, pp. 53–62 (1999)

    Google Scholar 

  6. Chawathe, S.S., Rajaraman, A., Garcia-Molina, H., Widom, J.: Change Detection in Hierarchically Structured Information. In: Proceedings of the 1996 International Conference on Management of Data, Montreal, Canada, pp. 493–504 (June 1996)

    Google Scholar 

  7. Clark, J.: XSL Transformation (XSLT) Version 1.0. W3C Recommendation (November 1999), http://www.w3.org/TR/xslt

  8. Clark, J., DeRose, S.: XML Path Language (XPath) Version 1.0. W3C Recommendation (November 1999), http://www.w3.org/TR/xpath

  9. Cobena, G., Abiteboul, S., Marian, A.: Detecting changes in xml documents. In: ICDE, pp. 41–52 (2002)

    Google Scholar 

  10. Doan, A., Madhavan, J., Domingos, P., Halevy, A.: Learning to Map between Ontologies on the Semantic Web. In: Proceedings of the 11th International Conference on World Wide Web, Honolulu, USA, pp. 662–673 (May 2002)

    Google Scholar 

  11. Erwig, M.: Toward the Automatic Derivation of XML Transformations. In: Proceedings of the 1st International Workshop on XML Schema and Data Management, Chicago, USA, pp. 342–354 (October 2003)

    Google Scholar 

  12. Hors, A.L., et al.: Document Object Model (DOM) Level 2 Core Specification Version 1.0. W3C Recommendation (November 2000), http://www.w3.org/TR/DOM-Level-2-Core

  13. Fox, J.: Generating XSLT with a Semantic Hub. In: Proceedings of the 2002 XML Conference, Baltimore, USA (December 2002)

    Google Scholar 

  14. Garofalakis, M., Gionis, A., Rastogi, R., Seshadri, S., Shim, K.: XTRACT: Learning Document Type Descriptors from XML Document Collections. Data Mining and Knowledge Discovery 7(1), 23–56 (2003)

    Article  MathSciNet  Google Scholar 

  15. Haas, L.M., Miller, R.J., Niswonger, B., Roth, M.T., Schwarx, P.M., Wimmers, E.L.: Transforming Heterogeneous Data with Database Middleware: Beyong Integration. Bulleting of the IEEE Computer Society Technical Committee on Data Engineering 22(1), 31–36 (1999)

    Google Scholar 

  16. Kay, M.: XSLT Programmer’s Reference. Wrox Press Ltd. (2000)

    Google Scholar 

  17. Kurgan, L., Swiercz, W., Cios, K.J.: Semantic Mapping of XML Tags using Inductive Machine Learning. In: Proceedings of the 2002 International Conference on Machine Learning and Applications, Las Vegas, USA, pp. 99–109 (June 2002)

    Google Scholar 

  18. Leonardi, E., et al.: Detecting content changes on ordered xml documents using relational databases. In: Galindo, F., Takizawa, M., Traunmüller, R. (eds.) DEXA 2004. LNCS, vol. 3180, pp. 580–590. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  19. Leventhal, M.: XSL Considered Harmful (1999), http://www.xml.com/pub/a/1999/05/xsl/xslconsidered1.html

  20. Lim, S., Ng, Y.: An Automated Change-Detection Algorithm for HTML Documents Based on Semantic Hierarchies. In: Proceedings of the 17th International Conference on Data Engineering, Heidelberg, Germany, pp. 303–312 (April 2001)

    Google Scholar 

  21. Miller, R.J., Haas, L.M., Hernández, M.A.: Schema Mapping as Query Discovery. In: Proceedings of the 26th International Conference on Very Large Data Bases, Cairo, Egypt, pp. 77–88 (September 2000)

    Google Scholar 

  22. Milo, T., Zohar, S.: Using Schema Matching to Simplify Heterogeneous Data Translation. In: Proceedings of 24th International Conference on Very Large Data Bases, New York, USA, pp. 122–133 (August 1998)

    Google Scholar 

  23. Nance, K.L., Hay, B.: IDACT: Automating Data Discovery and Compilation. In: Proceedings of the 2004 Nasa’s Earth Science Technology Conference, Palo Alto, USA (June 2003)

    Google Scholar 

  24. Ono, K., et al.: XSLT Stylesheet Generation by Example with WYSIWYG Editing. In: Proceedings of the 2002 International Symposium on Applications and the Internet, Nara, Japan (March 2002)

    Google Scholar 

  25. Popa, L., Velegrakis, Y., Miller, R.J., Hernández, M.A., Fagin, R.: Translating Web Data. In: Proceedings of the 28th International Conference on Very Large Data Bases, Hong Kong, China, pp. 598–609 (August 2002)

    Google Scholar 

  26. Raggett, D., Hors, A.L., Jacobs, I.: Hypertext Markup Language (HTML) 4.01. W3C Recommendation (December 1999), http://www.w3.org/TR/html4

  27. Rahm, E., Bernstein, P.A.: A Survey of Approaches to Automatic Schema Matching. VLDB Journal 10(4), 334–350 (2001)

    Article  MATH  Google Scholar 

  28. Selkow, S.M.: The Tree-to-Tree Editing Problem. Information Processing Letters 6(6), 184–186 (1977)

    Article  MATH  MathSciNet  Google Scholar 

  29. Tai, K.C.: The Tree-to-Tree Correction Problem. Journal of the ACM 26(3), 422–433 (1979)

    Article  MATH  MathSciNet  Google Scholar 

  30. Waworuntu, S., Bailey, J.: XSLTGen: A system for automatically generating XML transformations via semantic mappings. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) ER 2004. LNCS, vol. 3288, pp. 479–492. Springer, Heidelberg (November 2004)

    Chapter  Google Scholar 

  31. Yan, L.L., Miller, R.J., Haas, L.M., Fagin, R.: Data-Driven Understanding and Refinement of Schema Mappings. In: Proceedings of ACM SIGMOD International Conference on Management of Data, Santa Barbara, USA (May 2001)

    Google Scholar 

  32. Zhang, K., Shasha, D.: Simple Fast Algorithms for the Editing Distance between Trees and Related Problems. SIAM Journal of Computing 18(6), 1245–1262 (1989)

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Waworuntu, S., Bailey, J. (2006). XSLTGen: A System for Automatically Generating XML Transformations Via Semantic Mappings. In: Spaccapietra, S., Atzeni, P., Chu, W.W., Catarci, T., Sycara, K.P. (eds) Journal on Data Semantics V. Lecture Notes in Computer Science, vol 3870. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11617808_4

Download citation

  • DOI: https://doi.org/10.1007/11617808_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-31426-4

  • Online ISBN: 978-3-540-31427-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics