Multi-granularity Semi-random Data Partitioning

Liu, Han; Cocea, Mihaela

doi:10.1007/978-3-319-70058-8_6

Han Liu⁴ &
Mihaela Cocea⁵

Part of the book series: Studies in Big Data ((SBD,volume 35))

1305 Accesses

Abstract

In this chapter, we introduce the concepts of semi-heuristic data partitioning, and present a proposed multi-granularity framework for semi-heuristic data partitioning. We also discuss the advantages of the proposed framework in terms of dealing with class imbalance and the sample representativeness issue, from granular computing perspectives.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Hardcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Esfahani, M.S., and E.R. Dougherty. 2014. Effect of separate sampling on classification accuracy. Bioinformatics 30 (2): 242–250.
Article Google Scholar
K. Lang, E. Liberty, and K. Shmakov. 2016. Stratified sampling meets machine learning. In Proceedings of the 33rd International Conference on Machine Learning. New York: JMLR.org, 2320–2329.
Google Scholar
C.-E. S\(\ddot{{\rm {a}}}\)rndal B. Swensson, and J. Wretman. 1992. Model Assisted Survey Sampling. New York: Springer.
Google Scholar
H. Liu and M. Cocea. Semi-random partitioning of data into training and test sets in granular computing context. Granular Computing, 2 (4) 2017.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Informatics, Cardiff University, Cardiff, UK
Han Liu
School of Computing, University of Portsmouth, Portsmouth, UK
Mihaela Cocea

Authors

Han Liu
View author publications
You can also search for this author in PubMed Google Scholar
Mihaela Cocea
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Han Liu .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Liu, H., Cocea, M. (2018). Multi-granularity Semi-random Data Partitioning. In: Granular Computing Based Machine Learning. Studies in Big Data, vol 35. Springer, Cham. https://doi.org/10.1007/978-3-319-70058-8_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-70058-8_6
Published: 05 November 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70057-1
Online ISBN: 978-3-319-70058-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics