Abstract
There are several reasons that justify the study of a powerful, expressive and efficient XML-based framework for intelligent data analysis. First of all, the proliferation of XML sources offer good opportunities to mine new data. Second, native XML databases appear to be a natural alternative to relational databases when the purpose is querying both data and the extracted models in an uniform manner. This work offers a new query language for XML Data Mining. In presenting the language, we show its versatility, expressiveness and efficiency by proposing a concise, yet comprehensive set of queries which cover the major aspects of the data mining. Queries are designed over the well-known xmark XML database, that is a scalable benchmark dataset modeling an Internet auction site.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Imielinski, T., Mannila, H.: A database perspective on knowledge discovery. Comm. of the ACM 39(11), 58–64 (1996)
The Data Mining Group. The Predictive Model Markup Language (PMML), version 4.0, http://www.dmg.org/v4-0/GeneralStructure.html
W3C World Wide Web Consortium. XQuery 1.0: An XML Query Language (Recommendation January 23, 2007), http://www.w3.org/TR/Query
Romei, A., Turini, F.: XML data mining. Softw., Pract. Exper. 40(2), 101–130 (2010)
Romei, A., Turini, F.: XQuake - An XQuery-like Language for Mining XML Data. In: The Second International Conference on Agents and Artificial Intelligence (ICAART), Valencia, Spain, pp. 20–27 (2010)
Schmidt, A., Waas, F., Kersten, M.L., Carey, M.J., Manolescu, I., Busse, R.: XMark: A Benchmark for XML Data Management. In: The 28th International Conference on Very Large Data Bases (VLDB), Hong Kong, China, pp. 974–985 (2002)
W3C World Wide Web Consortium. XQuery and XPath Full Text 1.0 (Candidate Recommendation July 09, 2009), http://www.w3.org/TR/xpath-full-text-10/
Meo, R., Psaila, G.: An XML-Based Database for Knowledge Discovery. In: Grust, T., Höpfner, H., Illarramendi, A., Jablonski, S., Fischer, F., Müller, S., Patranjan, P.-L., Sattler, K.-U., Spiliopoulou, M., Wijsen, J. (eds.) EDBT 2006. LNCS, vol. 4254, pp. 814–828. Springer, Heidelberg (2006)
Romei, A., Ruggieri, S., Turini, F.: KDDML: a middleware language and system for knowledge discovery in databases. Data Knowl. Eng. 57(2), 179–220 (2006)
Euler, T., Klinkenberg, R., Mierswa, I., Scholz, M., Wurst, M.: YALE: rapid prototyping for complex data mining tasks. In: Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), Philadelphia, PA, USA, pp. 935–940 (2006)
Braga, D., Campi, A., Ceri, S., Klemettinen, M., Lanzi, P.: Discovering interesting information in XML data with association rules. In: The Eighteenth Annual ACM Symposium on Applied Computing (SAC), Melbourne, Florida, pp. 450–454 (2003)
Feng, L., Tharam, S.D.: Mining Interesting XML-Enabled Association Rules with Templates. In: Goethals, B., Siebes, A. (eds.) KDID 2004. LNCS, vol. 3377, pp. 66–88. Springer, Heidelberg (2005)
W3C World Wide Web Consortium. OWL Web Ontology Language (Recommendation February 10, 2004), http://www.w3.org/TR/owl-features
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Romei, A., Turini, F. (2011). Language Support to XML Data Mining: A Case Study. In: Filipe, J., Fred, A., Sharp, B. (eds) Agents and Artificial Intelligence. ICAART 2010. Communications in Computer and Information Science, vol 129. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19890-8_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-19890-8_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19889-2
Online ISBN: 978-3-642-19890-8
eBook Packages: Computer ScienceComputer Science (R0)