Abstract
Unsupervised feature selection has attracted research attention in the machine learning and data mining communities for decades. In this paper, we propose an unsupervised feature selection method that learns a feature coefficient matrix to select the most distinctive features. Specifically, the proposed algorithm integrates the Maximum Margin Criterion with a sparsity-based model in a joint framework, so that class margin and feature correlation are taken into account simultaneously. To maximize the total data separability while minimizing the within-class scatter, we embed K-means into the framework to generate pseudo class labels in the unsupervised feature selection setting. Meanwhile, a sparsity-based model, the \(\ell _{2,p}\)-norm, is imposed on the regularization term to effectively discover the sparse structure of the feature coefficient matrix. In this way, noisy and irrelevant features are removed by ruling out those features whose corresponding coefficients are zero. To alleviate the local optima caused by the random initialization of K-means, we propose a convergence-guaranteed algorithm with an updating strategy for the cluster indicator matrix that iteratively pursues the optimal solution. Performance is extensively evaluated on six benchmark data sets. Our comprehensive experimental results demonstrate that the proposed method outperforms all compared approaches.
Copyright information
© 2015 Springer International Publishing Switzerland
Cite this paper
Wang, S., Nie, F., Chang, X., Yao, L., Li, X., Sheng, Q.Z. (2015). Unsupervised Feature Analysis with Class Margin Optimization. In: Appice, A., Rodrigues, P., Santos Costa, V., Soares, C., Gama, J., Jorge, A. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2015. Lecture Notes in Computer Science(), vol 9284. Springer, Cham. https://doi.org/10.1007/978-3-319-23528-8_24
Print ISBN: 978-3-319-23527-1
Online ISBN: 978-3-319-23528-8