Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Microaggregation

  • Josep Domingo-Ferrer
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_1496

Definition

Microaggregation is a family of masking methods for statistical disclosure control of numerical microdata (although variants for categorical data exist). The rationale behind microaggregation is that confidentiality rules in use allow publication of microdata sets if records correspond to groups of k or more individuals, where no individual dominates (i.e., contributes too much to) the group and k is a threshold value. Strict application of such confidentiality rules leads to replacing individual values with values computed on small aggregates (microaggregates) prior to publication. This is the basic principle of microaggregation.

To obtain microaggregates in a microdata set with n records, these are combined to form g groups of size at least k. For each attribute, the average value over each group is computed and is used to replace each of the original averaged values. Groups are formed using a criterion of maximal similarity. Once the procedure has been completed, the...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Domingo-Ferrer J, Mateo-Sanz JM. Practical data-oriented microaggregation for statistical disclosure control. IEEE Trans Knowl Data Eng. 2002;14(1):189–201.CrossRefGoogle Scholar
  2. 2.
    Domingo-Ferrer J, Sebé F, Solanas A. A polynomial-time approximation to optimal multivariate microaggregation. Comput Math Appl. 2008;55(4):714–32.MathSciNetzbMATHCrossRefGoogle Scholar
  3. 3.
    Domingo-Ferrer J, Torra V. Ordinal, continuous and heterogenerous k-anonymity through microaggregation. Data Min Knowl Dis. 2005;11(2):195–212.CrossRefGoogle Scholar
  4. 4.
    Hundepool A, Van de Wetering A, Ramaswamy R, Franconi L, Capobianchi A, DeWolf P-P, Domingo-Ferrer J, Torra V, Brand R, Giessing S. μ-ARGUS version 4.0 software and user’s manual. Statistics Netherlands, Voorburg, May 2005. http://neon.vb.cbs.nl/casc

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Universitat Rovira i VirgiliTarragonaSpain

Section editors and affiliations

  • Elena Ferrari
    • 1
  1. 1.DiSTAUniv. of InsubriaVareseItaly