Skip to main content

Representative-Based Active Learning with Max-Min Distance

  • Conference paper
  • First Online:
Rough Sets (IJCRS 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9920))

Included in the following conference series:

Abstract

Active learning has been a hot topic because labeled data are useful, however expensive. Many existing approaches are based on decision trees, Naïve Bayes algorithms, etc. In this paper, we propose a representative-based active learning algorithm with max-min distance. Our algorithm has two techniques interacting with each other. One is the representative-based classification inspired by covering-based neighborhood rough sets. The other is critical instance selection with max-min distance. Experimental results on six UCI datasets indicate that, with the same number of labeled instances, our algorithm is comparable with or better than the ID3, C4.5 and Naïve Bayes algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Settles, B.: Active learning. Synth. Lect. Artif. Intell. Mach. Learn. 6(1), 1–114 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  2. Basu, S.: Semi-supervised learning. J. Roy. Stat. Soc. 6493(10), 2465–2472 (2010)

    Google Scholar 

  3. Jin, R., Si, L.: A Bayesian approach toward active learning for collaborative filtering, pp. 278–285 (2004)

    Google Scholar 

  4. Saartsechansky, M., Provost, F.: Active sampling for class probability estimation and ranking. Mach. Learn. 54(2), 153–178 (2004)

    Article  MATH  Google Scholar 

  5. Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986)

    Google Scholar 

  6. Quinlan, J.R.: C4.5: Programs for Machine Learning. Elsevier, San Francisco (2014)

    Google Scholar 

  7. McCallum, A., Nigam, K.: A comparison of event models for Naive Bayes text classification. In: AAAI 1998 Workshop on Learning for Text Categorization, vol. 752, pp. 41–48 (1998)

    Google Scholar 

  8. Utgoff, P.E.: An incremental Id3. In: Machine Learning Proceedings, pp. 107–120 (1988)

    Google Scholar 

  9. Dwyer, K., Holte, R.: Decision tree instability and active learning. In: Kok, J.N., Koronacki, J., Mantaras, R.L., Matwin, S., Mladenič, D., Skowron, A. (eds.) ECML 2007. LNCS (LNAI), vol. 4701, pp. 128–139. Springer, Heidelberg (2007). doi:10.1007/978-3-540-74958-5_15

    Chapter  Google Scholar 

  10. Wang, L.M., Yuan, S.M., Li, L., Li, H.J.: Boosting Naïve Bayes by active learning. In: Proceedings of 2004 International Conference on Machine Learning and Cybernetics, vol. 3, pp. 1383–1386 (2004)

    Google Scholar 

  11. Hu, Q.H., Yu, D.R., Xie, Z.X.: Neighborhood classifiers. Expert Syst. Appl. 34(2), 866–876 (2008)

    Article  Google Scholar 

  12. Zakowski, W.: Approximations in the space (u, \(\pi \)). Demonstratio Math. 16(3), 761–769 (1983)

    MathSciNet  MATH  Google Scholar 

  13. Zhu, W.: Topological approaches to covering rough sets. Inf. Sci. 177(6), 1499–1508 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  14. Hu, Q.H., Yu, D.R., Liu, J.F., Wu, C.X.: Neighborhood rough set based heterogeneous feature subset selection. Inf. Sci. 178(18), 3577–3594 (2008)

    Article  MathSciNet  MATH  Google Scholar 

  15. Xu, Z., Yu, K., Tresp, V., Xu, X., Wang, J.: Representative sampling for text classification using support vector machines. ECIR 2633, 393–407 (2013)

    MATH  Google Scholar 

  16. Huang, S.J., Jin, R., Zhou, Z.H.: Active learning by querying informative and representative examples. IEEE Trans. Pattern Anal. Mach. Intell. 36(10), 1936–1949 (2014)

    Article  Google Scholar 

  17. Zhang, B.W., Min, F., Ciucci, D.: Representative-based classification through covering-based neighborhood rough sets. Appl. Intell. 43(4), 840–854 (2015)

    Article  Google Scholar 

  18. Blake, C., Merz, C.J.: UCI repository of machine learning databases (1998)

    Google Scholar 

  19. Ea, S., Peterson, R.: Decision Systems for Inventory Management and Production Planning. Wiely, New York (1985)

    Google Scholar 

  20. Zhao, H., Zhu, W.: Optimal cost-sensitive granularization based on rough sets for variable costs. Knowl.-Based Syst. 65, 72–82 (2014)

    Article  Google Scholar 

  21. Min, F., Zhu, W.: Attribute reduction of data with error ranges and test costs. Inf. Sci. 211, 48–67 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  22. Yao, Y.Y., Zhao, Y.: Conflict analysis based on discernibility and indiscernibility. In: IEEE Symposium on Foundations of Computational Intelligence, pp. 302–307 (2007)

    Google Scholar 

Download references

Acknowledgements

This work is supported in part by National Natural Science Foundation of China (Grant No. 61379089), and the Natural Science Foundation of Department of Education of Sichuan Province (Grant No. 16ZA0060).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fan Min .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Liu, FL., Min, F., Wen, LY., Wang, HJ. (2016). Representative-Based Active Learning with Max-Min Distance. In: Flores, V., et al. Rough Sets. IJCRS 2016. Lecture Notes in Computer Science(), vol 9920. Springer, Cham. https://doi.org/10.1007/978-3-319-47160-0_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-47160-0_33

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-47159-4

  • Online ISBN: 978-3-319-47160-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics