Language Support to XML Data Mining: A Case Study

Romei, Andrea; Turini, Franco

doi:10.1007/978-3-642-19890-8_2

Andrea Romei⁴ &
Franco Turini⁴

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 129))

Included in the following conference series:

International Conference on Agents and Artificial Intelligence

542 Accesses

Abstract

There are several reasons that justify the study of a powerful, expressive and efficient XML-based framework for intelligent data analysis. First of all, the proliferation of XML sources offer good opportunities to mine new data. Second, native XML databases appear to be a natural alternative to relational databases when the purpose is querying both data and the extracted models in an uniform manner. This work offers a new query language for XML Data Mining. In presenting the language, we show its versatility, expressiveness and efficiency by proposing a concise, yet comprehensive set of queries which cover the major aspects of the data mining. Queries are designed over the well-known xmark XML database, that is a scalable benchmark dataset modeling an Internet auction site.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Imielinski, T., Mannila, H.: A database perspective on knowledge discovery. Comm. of the ACM 39(11), 58–64 (1996)
Article Google Scholar
The Data Mining Group. The Predictive Model Markup Language (PMML), version 4.0, http://www.dmg.org/v4-0/GeneralStructure.html
W3C World Wide Web Consortium. XQuery 1.0: An XML Query Language (Recommendation January 23, 2007), http://www.w3.org/TR/Query
Romei, A., Turini, F.: XML data mining. Softw., Pract. Exper. 40(2), 101–130 (2010)
Article Google Scholar
Romei, A., Turini, F.: XQuake - An XQuery-like Language for Mining XML Data. In: The Second International Conference on Agents and Artificial Intelligence (ICAART), Valencia, Spain, pp. 20–27 (2010)
Google Scholar
Schmidt, A., Waas, F., Kersten, M.L., Carey, M.J., Manolescu, I., Busse, R.: XMark: A Benchmark for XML Data Management. In: The 28th International Conference on Very Large Data Bases (VLDB), Hong Kong, China, pp. 974–985 (2002)
Google Scholar
W3C World Wide Web Consortium. XQuery and XPath Full Text 1.0 (Candidate Recommendation July 09, 2009), http://www.w3.org/TR/xpath-full-text-10/
Meo, R., Psaila, G.: An XML-Based Database for Knowledge Discovery. In: Grust, T., Höpfner, H., Illarramendi, A., Jablonski, S., Fischer, F., Müller, S., Patranjan, P.-L., Sattler, K.-U., Spiliopoulou, M., Wijsen, J. (eds.) EDBT 2006. LNCS, vol. 4254, pp. 814–828. Springer, Heidelberg (2006)
Chapter Google Scholar
Romei, A., Ruggieri, S., Turini, F.: KDDML: a middleware language and system for knowledge discovery in databases. Data Knowl. Eng. 57(2), 179–220 (2006)
Article Google Scholar
Euler, T., Klinkenberg, R., Mierswa, I., Scholz, M., Wurst, M.: YALE: rapid prototyping for complex data mining tasks. In: Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), Philadelphia, PA, USA, pp. 935–940 (2006)
Google Scholar
Braga, D., Campi, A., Ceri, S., Klemettinen, M., Lanzi, P.: Discovering interesting information in XML data with association rules. In: The Eighteenth Annual ACM Symposium on Applied Computing (SAC), Melbourne, Florida, pp. 450–454 (2003)
Google Scholar
Feng, L., Tharam, S.D.: Mining Interesting XML-Enabled Association Rules with Templates. In: Goethals, B., Siebes, A. (eds.) KDID 2004. LNCS, vol. 3377, pp. 66–88. Springer, Heidelberg (2005)
Chapter Google Scholar
W3C World Wide Web Consortium. OWL Web Ontology Language (Recommendation February 10, 2004), http://www.w3.org/TR/owl-features

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Pisa, Largo B. Pontecorvo, 3, Pisa, 56127, Italy
Andrea Romei & Franco Turini

Authors

Andrea Romei
View author publications
You can also search for this author in PubMed Google Scholar
Franco Turini
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Departament of Systems and Informatics, Polytechnic Institute of Setúbal – INSTICC, Rua do Vale de Chaves - Estefanilha, 2910-761, Setúbal, Portugal
Joaquim Filipe
IST - Technical University of Lisbon, Av.Rovisco Pais, 1, 1049-001, Lisbon, Portugal
Ana Fred
School of Computing, Staffordshire University, Baconside, Stafford, UK
Bernadette Sharp

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Romei, A., Turini, F. (2011). Language Support to XML Data Mining: A Case Study. In: Filipe, J., Fred, A., Sharp, B. (eds) Agents and Artificial Intelligence. ICAART 2010. Communications in Computer and Information Science, vol 129. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19890-8_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-19890-8_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19889-2
Online ISBN: 978-3-642-19890-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics