ABSTRACT
We propose a criterion for discrimination against a specified sensitive attribute in supervised learning, where the goal is to predict some target based on available features. Assuming data about the predictor, target, and membership in the protected group are available, we show how to optimally adjust any learned predictor so as to remove discrimination according to our definition. Our framework also improves incentives by shifting the cost of poor classification from disadvantaged groups to the decision maker, who can respond by improving the classification accuracy. We enourage readers to consult the more complete manuscript on the arXiv.
References
- Solon Barocas and Andrew Selbst. Big data's disparate impact. California Law Review, 104, 2016.Google Scholar
- John Podesta, Penny Pritzker, Ernest J. Moniz, John Holdren, and Jefrey Zients. Big data: Seizing opportunities and preserving values. Executive Office of the President, May 2014.Google Scholar
- Big data: A report on algorithmic systems, opportunity, and civil rights. Executive Office of the President, May 2016.Google Scholar
- Dino Pedreshi, Salvatore Ruggieri, and Franco Turini. Discrimination-aware data mining. In Proc. 14th ACM SIGKDD, 2008. Google Scholar
Digital Library
- T. Calders, F. Kamiran, and M. Pechenizkiy. Building classifiers with independency constraints. In In Proc. IEEE International Conference on Data Mining Workshops, pages 13-18, 2009. Google Scholar
Digital Library
- Indre Zliobaite. On the relation between accuracy and fairness in binary classification. CoRR, abs/1505.05723, 2015.Google Scholar
- Muhammad Bilal Zafar, Isabel Valera, Manuel Gomez Rodriguez, and Krishna P Gummadi. Learning fair classifiers. CoRR, abs:1507.05259, 2015.Google Scholar
- Cynthia Dwork, Moritz Hardt, Toniann Pitassi, Omer Reingold, and Richard S. Zemel. Fairness through awareness. In Proc. ACM ITCS, pages 214-226, 2012. Google Scholar
Digital Library
- Jon M. Kleinberg, Sendhil Mullainathan, and Manish Raghavan. Inherent trade-offs in the fair determination of risk scores. CoRR, abs/1609.05807, 2016.Google Scholar
- Larry Wasserman. All of Statistics: A Concise Course in Statistical Inference. Springer, 2010. Google Scholar
Digital Library
- US Federal Reserve. Report to the congress on credit scoring and its effects on the availability and affordability of credit, 2007.Google Scholar
Index Terms
(auto-classified)Equality of opportunity in supervised learning




Comments