A Purity Measure Based Transductive Learning Algorithm
The growing human ability to gather data has led to a growing effort to label that data for use in specific applications such as classification and regression. Automatic labeling methods, such as semi-supervised transductive learning algorithms, are therefore a major concern of the machine learning and data mining community. This paper proposes a graph-based algorithm that uses the purity measure to help spread labels throughout the graph. The purity measure quantifies how intertwined different subspaces of the data are with respect to their classes. Because high purity values indicate little mixing among patterns of different classes, maximizing purity helps find well-separated connected subgraphs, which in turn facilitates the label spreading process. Results on benchmark data sets, compared against state-of-the-art methods, show the potential of the proposed algorithm.
Keywords: Graph-based Transduction, Purity Measure, KNN Mutual Graph, Semi-supervised Learning
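The abstract does not define the mutual KNN graph named in the keywords, so the following is only a minimal sketch under the commonly used definition: two points are linked when each is among the other's k nearest neighbors. The function names `mutual_knn_graph` and `connected_components` and the toy data are hypothetical, chosen for illustration. The sketch does not implement the purity measure or the paper's algorithm; it only shows how well-separated groups can appear as connected subgraphs.

```python
from math import dist


def mutual_knn_graph(points, k):
    """Build adjacency sets for a mutual k-nearest-neighbor graph.

    Assumed definition: i and j are linked only if j is among the k
    nearest neighbors of i AND i is among the k nearest neighbors of j.
    """
    n = len(points)
    knn = []
    for i in range(n):
        # Rank every other point by Euclidean distance to point i.
        others = sorted(
            (j for j in range(n) if j != i),
            key=lambda j: dist(points[i], points[j]),
        )
        knn.append(set(others[:k]))
    # Keep an edge only when the neighbor relation holds in both directions.
    return {i: {j for j in knn[i] if i in knn[j]} for i in range(n)}


def connected_components(adj):
    """Return the connected components of an undirected graph as sets."""
    seen, components = set(), []
    for start in adj:
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            v = stack.pop()
            if v in component:
                continue
            component.add(v)
            stack.extend(adj[v] - component)
        seen |= component
        components.append(component)
    return components


# Toy data (hypothetical): two well-separated groups of three points each.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
          (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
graph = mutual_knn_graph(points, k=2)
components = connected_components(graph)
```

On this toy data the two groups end up in separate components, so a label attached to any one point could in principle be spread to the rest of its component without crossing into the other group.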