An Improved Database Classification Algorithm for Multi-database Mining

Li, Hong; Hu, XueGang; Zhang, YanMing

doi:10.1007/978-3-642-02270-8_35

Hong Li^19,20,
XueGang Hu²¹ &
YanMing Zhang^19,20

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5598))

Included in the following conference series:

International Workshop on Frontiers in Algorithmics

972 Accesses
9 Citations

Abstract

Database classification is a data preprocessing technique for multi-database mining. To reduce search costs in the data from all databases, we need to identify those databases which are most likely relevant to a data mining application. Based on the related research, the algorithm GreedyClass and BestClassification [7]are improved in order to optimize the time complexity of algorithm and to obtainthe best classification from m given databases. Theoretical analysis and experimental results show the efficiency of the proposed algorithm.

The work was supported by the natural science fund from Anhui Education Department (serial number: KJ2008B122). Any opinions, findings, and conclusions or recommendations expressed in this paper are those of the authors and do not necessarily reflect the views of the funding agencies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Wu, X.D., Zhang, S.: Synthesizing High-Frequency Rules from Different Data Sources. J. IEEE Transactions on Knowledge and Data Engineering 15(2), 353–367 (2003)
Article Google Scholar
Zhang, C., Zhang, S.: Association Rules Mining: Models and Algorithms. LNCS, vol. 2307, p. 243. Springer, Heidelberg (2002)
MATH Google Scholar
Li, H.: Strategy of Parallel Association Rules Mining Based on Cluster System. J. Journal of Hefei University of Thchnology 30(3), 274–277 (2007)
Google Scholar
Liu, H., Lu, H., Yao, J.: Identifying Relevant Databases for Multidatabase Mining.R. In: Proceedings of Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 210–221 (1998)
Google Scholar
Liu, H., Lu, H., Yao, J.: Toward multi-database mining: identifying relevant databases. J. IEEE Transactions Knowledge Data Engineering 13(4), 541–553 (2001)
Article Google Scholar
Tang, Y.F., Niu, L., Zhong, Z., Zhang, C.Q.: Application-independent Database Classification Research in Multi-database Mining. J. Journal of GuanGxi normal university 21(4), 32–36 (2003)
Google Scholar
Wu, X.D., Zhang, C.Q., Zhang, S.C.: Database classification for multi-database mining. J. Elsevier Computer Science. Information Systems 30, 71–88 (2005)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Technology, Hefei University, 230001, China
Hong Li & YanMing Zhang
Hefei University Key Laboratory of Network and Intelligent Information Processing, China
Hong Li & YanMing Zhang
School of Computer & Information, Hefei University of technology, 230001, China
XueGang Hu

Authors

Hong Li
View author publications
You can also search for this author in PubMed Google Scholar
XueGang Hu
View author publications
You can also search for this author in PubMed Google Scholar
YanMing Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science,, City University of Hong Kong, No. 83 Tat Chee Avenue, Kowloon Tong, Hong Kong, China
Xiaotie Deng
Computer Science Department, Cornell University, 5144 Upson Hall, NY 14853, Ithaca, USA
John E. Hopcroft
Provincial Key Laboratory of High-Performance Computing, Jiangxi Normal University, 330027, Nanchang, China
Jinyun Xue

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, H., Hu, X., Zhang, Y. (2009). An Improved Database Classification Algorithm for Multi-database Mining. In: Deng, X., Hopcroft, J.E., Xue, J. (eds) Frontiers in Algorithmics. FAW 2009. Lecture Notes in Computer Science, vol 5598. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02270-8_35

Download citation

DOI: https://doi.org/10.1007/978-3-642-02270-8_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02269-2
Online ISBN: 978-3-642-02270-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics