Causal inference for social discrimination reasoning
The discovery of discriminatory bias in human or automated decision making is a task of increasing importance and difficulty, exacerbated by the pervasive use of machine learning and data mining. Currently, discrimination discovery largely relies upon correlation analysis of decision records, disregarding the impact of confounding biases. We present a method for causal discrimination discovery based on propensity score analysis, a statistical tool for filtering out the effect of confounding variables. We introduce causal measures of discrimination that quantify the effect of group membership on decisions, and we highlight causal discrimination and favoritism patterns by learning regression trees over these novel measures. We validate our approach on two real-world datasets. Our proposed framework for causal discrimination discovery has the potential to enhance the transparency of machine learning with tools for detecting discriminatory bias both in the training data and in the learning algorithms.
Keywords: Social discrimination; Fairness, accountability, and transparency; Propensity score; Causal analysis
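To make the contrast between correlation analysis and propensity score analysis concrete, the following is a minimal sketch and not the authors' measure or implementation. It assumes a single discrete confounder, estimates the propensity score as the within-stratum frequency of group membership, and uses generic inverse probability weighting. The function names, the record layout `(x, g, d)`, and the toy data are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def naive_difference(records):
    """Raw difference in denial rates: protected minus unprotected."""
    denied = {0: 0, 1: 0}
    total = {0: 0, 1: 0}
    for _x, g, d in records:
        total[g] += 1
        denied[g] += d
    return denied[1] / total[1] - denied[0] / total[0]


def propensity_by_stratum(records):
    """Estimate P(g = 1 | x) as a frequency within each value of x."""
    protected = defaultdict(int)
    total = defaultdict(int)
    for x, g, _d in records:
        total[x] += 1
        protected[x] += g
    return {x: protected[x] / total[x] for x in total}


def ipw_difference(records):
    """Normalized inverse-probability-weighted difference in denial rates.

    Assumes every stratum has a propensity strictly between 0 and 1.
    """
    e = propensity_by_stratum(records)
    w_sum = {0: 0.0, 1: 0.0}
    wd_sum = {0: 0.0, 1: 0.0}
    for x, g, d in records:
        # Protected records are weighted by 1/e(x), the others by 1/(1 - e(x)).
        w = 1.0 / e[x] if g == 1 else 1.0 / (1.0 - e[x])
        w_sum[g] += w
        wd_sum[g] += w * d
    return wd_sum[1] / w_sum[1] - wd_sum[0] / w_sum[0]


def make_toy_records():
    """Hypothetical data: denial depends only on x, but g correlates with x.

    Each record is (x, g, d): x = confounder (0 = low, 1 = high qualification),
    g = 1 for the protected group, d = 1 if the decision is a denial.
    """
    spec = [
        # (x, g, number of records, number denied)
        (0, 1, 80, 40),
        (0, 0, 20, 10),
        (1, 1, 20, 2),
        (1, 0, 80, 8),
    ]
    records = []
    for x, g, n, n_denied in spec:
        records += [(x, g, 1)] * n_denied + [(x, g, 0)] * (n - n_denied)
    return records


if __name__ == "__main__":
    recs = make_toy_records()
    print("naive difference:", round(naive_difference(recs), 4))
    print("IPW difference:  ", round(ipw_difference(recs), 4))
```

In this toy data the denial rate is identical for both groups within each qualification level, so the raw gap of 0.24 is produced entirely by the confounder and the weighted difference is approximately zero. A real analysis would typically estimate the propensity score with a model over many covariates rather than by stratum frequencies.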
Compliance with Ethical Standards
Conflict of interests
The authors declare that they have no conflict of interest.