Abstract
XML is rapidly emerging as a dominant standard for representing and exchanging information. The ability to transform and present data in XML is crucial, and XSLT is a relatively recent programming language designed specifically to support this activity. Despite its utility, however, XSLT is widely considered a difficult language to learn.
In this paper, we present a novel system called XSLTGen, an automatic XSLT Generator. This system automatically generates an XSLT stylesheet, given a source XML document and a desired output HTML or XML document. It allows users to become familiar with and learn XSLT stylesheets, based solely on their knowledge of XML or HTML. Our method for automatically generating XSLT transformations is based on the use of semantic mappings between the input and output documents. We show how such mappings can be first discovered and then employed to create XSLT stylesheets. The results of our experiments show that XSLTGen works well with a number of different varieties of XML and HTML documents.
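To make the idea of mapping-driven generation concrete, the following is a minimal sketch and not the XSLTGen algorithm itself. It assumes the simplest possible mapping, exact equality of text values between the source and the desired output, and emits a single-template stylesheet in which each matched value becomes an `xsl:value-of`. The function names (`text_paths`, `build_template_body`, `generate_stylesheet`) and the sample documents are hypothetical and chosen for illustration only.

```python
import xml.etree.ElementTree as ET

XSL_NS = "http://www.w3.org/1999/XSL/Transform"


def text_paths(root):
    """Map each non-empty text value in the source to an absolute path.

    Assumption: the first element carrying a given text wins; repeated
    values and attribute values are ignored in this sketch.
    """
    paths = {}

    def walk(elem, path):
        text = (elem.text or "").strip()
        if text and text not in paths:
            paths[text] = path
        for child in elem:
            walk(child, f"{path}/{child.tag}")

    walk(root, f"/{root.tag}")
    return paths


def build_template_body(out_elem, paths):
    """Copy an output element, replacing mapped text with xsl:value-of.

    Text that has no counterpart in the source is kept as literal text.
    Tail text (text following a child element) is dropped for brevity.
    """
    new = ET.Element(out_elem.tag, dict(out_elem.attrib))
    text = (out_elem.text or "").strip()
    if text in paths:
        ET.SubElement(new, f"{{{XSL_NS}}}value-of", {"select": paths[text]})
    elif text:
        new.text = text
    for child in out_elem:
        new.append(build_template_body(child, paths))
    return new


def generate_stylesheet(source_xml, output_xml):
    """Return a one-template XSLT 1.0 stylesheet as a string."""
    ET.register_namespace("xsl", XSL_NS)
    src = ET.fromstring(source_xml)
    out = ET.fromstring(output_xml)
    sheet = ET.Element(f"{{{XSL_NS}}}stylesheet", {"version": "1.0"})
    template = ET.SubElement(sheet, f"{{{XSL_NS}}}template", {"match": "/"})
    template.append(build_template_body(out, text_paths(src)))
    return ET.tostring(sheet, encoding="unicode")


if __name__ == "__main__":
    source = "<book><title>XSLT Basics</title><author>Kay</author></book>"
    desired = ("<html><body><h1>XSLT Basics</h1><p>Kay</p>"
               "<p>Footer</p></body></html>")
    print(generate_stylesheet(source, desired))
```

Exact text matching is the crudest mapping one could use. It says nothing about structural correspondences, repeated elements, or reordered content, which are the cases where discovering richer semantic mappings matters.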
Parts of the results of this paper appeared in [30].
References
Barnard, D.T., Duncan, N., Clarke, G.: Tree-to-tree Correction for Document Trees. Technical Report 95–372, Department of Computing and Information Science, Queen’s University, Kingston (1995)
Bergamaschi, S., Castano, S., Vimercati, S.D.C.D., Vincini, M.: An Intelligent Approach to Information Integration. In: Proceedings of the 1st International Conference on Formal Ontology in Information Systems, Trento, Italy, pp. 253–267 (June 1998)
Boukottaya, A., Vanoirbeek, C., Paganelli, F., Khaled, O.A.: Automating XML Documents Transformations: A Conceptual Modelling Based Approach. In: Proceedings of the 1st Asia-Pacific Conference on Conceptual Modelling, Dunedin, New Zealand, pp. 81–90 (January 2004)
Bray, T., Paoli, J., Sperberg-McQueen, C.M., Maler, E.: Extensible Markup Language (XML) 1.0 Second Edn. W3C Recommendation (October 2000), http://www.w3.org/TR/REC-xml
Castano, S., Antonellis, V.D.: A Schema Analysis and Reconciliation Tool Environment for Heterogeneous Databases. In: Proceedings of the 1999 International Database Engineering and Applications Symposium, Montreal, Canada, pp. 53–62 (1999)
Chawathe, S.S., Rajaraman, A., Garcia-Molina, H., Widom, J.: Change Detection in Hierarchically Structured Information. In: Proceedings of the 1996 International Conference on Management of Data, Montreal, Canada, pp. 493–504 (June 1996)
Clark, J.: XSL Transformations (XSLT) Version 1.0. W3C Recommendation (November 1999), http://www.w3.org/TR/xslt
Clark, J., DeRose, S.: XML Path Language (XPath) Version 1.0. W3C Recommendation (November 1999), http://www.w3.org/TR/xpath
Cobena, G., Abiteboul, S., Marian, A.: Detecting Changes in XML Documents. In: ICDE, pp. 41–52 (2002)
Doan, A., Madhavan, J., Domingos, P., Halevy, A.: Learning to Map between Ontologies on the Semantic Web. In: Proceedings of the 11th International Conference on World Wide Web, Honolulu, USA, pp. 662–673 (May 2002)
Erwig, M.: Toward the Automatic Derivation of XML Transformations. In: Proceedings of the 1st International Workshop on XML Schema and Data Management, Chicago, USA, pp. 342–354 (October 2003)
Hors, A.L., et al.: Document Object Model (DOM) Level 2 Core Specification Version 1.0. W3C Recommendation (November 2000), http://www.w3.org/TR/DOM-Level-2-Core
Fox, J.: Generating XSLT with a Semantic Hub. In: Proceedings of the 2002 XML Conference, Baltimore, USA (December 2002)
Garofalakis, M., Gionis, A., Rastogi, R., Seshadri, S., Shim, K.: XTRACT: Learning Document Type Descriptors from XML Document Collections. Data Mining and Knowledge Discovery 7(1), 23–56 (2003)
Haas, L.M., Miller, R.J., Niswonger, B., Roth, M.T., Schwarz, P.M., Wimmers, E.L.: Transforming Heterogeneous Data with Database Middleware: Beyond Integration. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering 22(1), 31–36 (1999)
Kay, M.: XSLT Programmer’s Reference. Wrox Press Ltd. (2000)
Kurgan, L., Swiercz, W., Cios, K.J.: Semantic Mapping of XML Tags using Inductive Machine Learning. In: Proceedings of the 2002 International Conference on Machine Learning and Applications, Las Vegas, USA, pp. 99–109 (June 2002)
Leonardi, E., et al.: Detecting Content Changes on Ordered XML Documents using Relational Databases. In: Galindo, F., Takizawa, M., Traunmüller, R. (eds.) DEXA 2004. LNCS, vol. 3180, pp. 580–590. Springer, Heidelberg (2004)
Leventhal, M.: XSL Considered Harmful (1999), http://www.xml.com/pub/a/1999/05/xsl/xslconsidered1.html
Lim, S., Ng, Y.: An Automated Change-Detection Algorithm for HTML Documents Based on Semantic Hierarchies. In: Proceedings of the 17th International Conference on Data Engineering, Heidelberg, Germany, pp. 303–312 (April 2001)
Miller, R.J., Haas, L.M., Hernández, M.A.: Schema Mapping as Query Discovery. In: Proceedings of the 26th International Conference on Very Large Data Bases, Cairo, Egypt, pp. 77–88 (September 2000)
Milo, T., Zohar, S.: Using Schema Matching to Simplify Heterogeneous Data Translation. In: Proceedings of 24th International Conference on Very Large Data Bases, New York, USA, pp. 122–133 (August 1998)
Nance, K.L., Hay, B.: IDACT: Automating Data Discovery and Compilation. In: Proceedings of the 2004 NASA’s Earth Science Technology Conference, Palo Alto, USA (June 2003)
Ono, K., et al.: XSLT Stylesheet Generation by Example with WYSIWYG Editing. In: Proceedings of the 2002 International Symposium on Applications and the Internet, Nara, Japan (March 2002)
Popa, L., Velegrakis, Y., Miller, R.J., Hernández, M.A., Fagin, R.: Translating Web Data. In: Proceedings of the 28th International Conference on Very Large Data Bases, Hong Kong, China, pp. 598–609 (August 2002)
Raggett, D., Hors, A.L., Jacobs, I.: Hypertext Markup Language (HTML) 4.01. W3C Recommendation (December 1999), http://www.w3.org/TR/html4
Rahm, E., Bernstein, P.A.: A Survey of Approaches to Automatic Schema Matching. VLDB Journal 10(4), 334–350 (2001)
Selkow, S.M.: The Tree-to-Tree Editing Problem. Information Processing Letters 6(6), 184–186 (1977)
Tai, K.C.: The Tree-to-Tree Correction Problem. Journal of the ACM 26(3), 422–433 (1979)
Waworuntu, S., Bailey, J.: XSLTGen: A system for automatically generating XML transformations via semantic mappings. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) ER 2004. LNCS, vol. 3288, pp. 479–492. Springer, Heidelberg (November 2004)
Yan, L.L., Miller, R.J., Haas, L.M., Fagin, R.: Data-Driven Understanding and Refinement of Schema Mappings. In: Proceedings of ACM SIGMOD International Conference on Management of Data, Santa Barbara, USA (May 2001)
Zhang, K., Shasha, D.: Simple Fast Algorithms for the Editing Distance between Trees and Related Problems. SIAM Journal on Computing 18(6), 1245–1262 (1989)
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Waworuntu, S., Bailey, J. (2006). XSLTGen: A System for Automatically Generating XML Transformations Via Semantic Mappings. In: Spaccapietra, S., Atzeni, P., Chu, W.W., Catarci, T., Sycara, K.P. (eds) Journal on Data Semantics V. Lecture Notes in Computer Science, vol 3870. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11617808_4
DOI: https://doi.org/10.1007/11617808_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31426-4
Online ISBN: 978-3-540-31427-1
eBook Packages: Computer Science (R0)