Combating discrimination using Bayesian networks

Artificial Intelligence and Law

Abstract

Discrimination in decision making is prohibited on many attributes (religion, gender, etc.), but it is often present in historical decisions. Using such discriminatory historical decisions as training data can perpetuate discrimination, even if the protected attributes are not directly present in the data. This work focuses on discovering discrimination in instances and preventing discrimination in classification. First, we propose a discrimination discovery method based on modeling the probability distribution of a class using Bayesian networks. The method uses the estimated distribution to measure the effect of a protected attribute (e.g., gender) in a subset of the dataset. Second, we propose a classification method that corrects for the discovered discrimination without using protected attributes in the decision process. We evaluate the discrimination discovery and discrimination prevention approaches on two different datasets. The empirical results show that a substantial amount of the discrimination identified in instances is prevented in future decisions.
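
The idea of measuring the effect of a protected attribute through an estimated class distribution can be illustrated with a minimal sketch. It is not the method of the paper: it assumes the simplest possible Bayesian network (naive Bayes, where the class is the only parent of every attribute), and it uses a made-up toy dataset. The names `fit_naive_bayes`, `posterior`, and `protected_effect`, and the columns `gender`, `degree`, and `hired`, are all hypothetical. The effect is taken here as the change in the estimated probability of the favourable class when only the protected attribute of an instance is switched.

```python
from collections import Counter


def fit_naive_bayes(rows, class_key, alpha=1.0):
    """Count-based naive Bayes: P(class) and P(attribute = value | class)."""
    attrs = [k for k in rows[0] if k != class_key]
    return {
        "class_key": class_key,
        "attrs": attrs,
        "n": len(rows),
        "alpha": alpha,  # Laplace smoothing constant
        "classes": Counter(r[class_key] for r in rows),
        "values": {a: {r[a] for r in rows} for a in attrs},
        "cond": Counter((a, r[a], r[class_key]) for r in rows for a in attrs),
    }


def posterior(model, instance, target):
    """P(class = target | instance) under the naive Bayes factorisation."""
    scores = {}
    for c, n_c in model["classes"].items():
        p = n_c / model["n"]
        for a in model["attrs"]:
            num = model["cond"][(a, instance[a], c)] + model["alpha"]
            den = n_c + model["alpha"] * len(model["values"][a])
            p *= num / den
        scores[c] = p
    return scores[target] / sum(scores.values())


def protected_effect(model, instance, protected, value_a, value_b, target):
    """Difference in P(target) when only the protected attribute changes."""
    with_a = dict(instance, **{protected: value_a})
    with_b = dict(instance, **{protected: value_b})
    return posterior(model, with_a, target) - posterior(model, with_b, target)


# Hypothetical toy data, invented for illustration only.
rows = [
    {"gender": "m", "degree": "yes", "hired": "yes"},
    {"gender": "m", "degree": "yes", "hired": "yes"},
    {"gender": "m", "degree": "no", "hired": "yes"},
    {"gender": "m", "degree": "no", "hired": "no"},
    {"gender": "f", "degree": "yes", "hired": "yes"},
    {"gender": "f", "degree": "yes", "hired": "no"},
    {"gender": "f", "degree": "no", "hired": "no"},
    {"gender": "f", "degree": "no", "hired": "no"},
]

model = fit_naive_bayes(rows, class_key="hired")
effect = protected_effect(
    model, {"gender": "f", "degree": "yes"}, "gender", "m", "f", target="yes"
)
print(f"Estimated effect of gender on P(hired = yes): {effect:.3f}")
```

A positive value means that, for an otherwise identical instance, the model assigns a higher probability of the favourable outcome to the first protected value than to the second. A richer network structure would let the protected attribute act through other attributes as well, which this simplified sketch does not capture.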

Acknowledgments

We wish to thank Alysa C. Rollock, J.D., Vice President for Ethics and Compliance at Purdue University, for discussion and pointers to relevant U.S. law. We also wish to thank journal reviewers for their helpful comments.

Author information

Corresponding author

Correspondence to Koray Mancuhan.

About this article

Cite this article

Mancuhan, K., Clifton, C. Combating discrimination using Bayesian networks. Artif Intell Law 22, 211–238 (2014). https://doi.org/10.1007/s10506-014-9156-4

  • DOI: https://doi.org/10.1007/s10506-014-9156-4
