ABSTRACT
With the support of the legally-grounded methodology of situation testing, we tackle the problems of discrimination discovery and prevention from a dataset of historical decisions by adopting a variant of k-NN classification. A tuple is labeled as discriminated if we can observe a significant difference of treatment among its neighbors belonging to a protected-by-law group and its neighbors not belonging to it. Discrimination discovery boils down to extracting a classification model from the labeled tuples. Discrimination prevention is tackled by changing the decision value for tuples labeled as discriminated before training a classifier. The approach of this paper overcomes legal weaknesses and technical limitations of existing proposals.
References
- A. Agresti. Categorical Data Analysis. Wiley-Interscience, 2002.Google Scholar
Cross Ref
- Australian Legislation. (a) Equal Opportunity Act -- Victoria State, 2010, (b) Anti-Discrimination Act -- Queensland State, 1991.Google Scholar
- G. S. Becker. The Economics of Discrimination. University of Chicago Press, 2nd edition, 1971.Google Scholar
- M. Bendick. Situation testing for employment discrimination in the United States of America. Horizons Strategiques, 5:17--39, 2007.Google Scholar
- T. Calders and S. Verwer. Three naive Bayes approaches for discrimination-free classification. Data Mining & Knowledge Discovery, 21(2):277--292, 2010. Google Scholar
Digital Library
- W. W. Cohen. Fast effective rule induction. In Proc. of ICML 1995, pages 115--123. Morgan Kaufmann, 1995.Google Scholar
Cross Ref
- E. Ellis. EU Anti-Discrimination Law. Oxford University Press, 2005.Google Scholar
- ENAR. European Network Against Racism, Fact Sheet 33: Multiple Discrimination, 2007. http://www.enar-eu.orghttp://www.enar-eu.org.Google Scholar
- European Union Legislation. (a) Race Equality Directive, 2000; (b) Employment Equality Directive, 2000; (c) Equal Treatment of Persons, 2009.Google Scholar
- H. Fang and A. Moro. Theories of statistical discrimination and affirmative action: A survey. In Handbook of Social Economics, Vol 1B. North-Holland, 2010.Google Scholar
- J. L. Fleiss, B. Levin, and M. C. Paik. Statistical Methods for Rates and Proportions. Wiley, 2003.Google Scholar
Cross Ref
- A. Frank and A. Asuncion. UCI machine learning repository, 2011. http://archive.ics.uci.edu/mlhttp://archive.ics.uci.edu/ml.Google Scholar
- J. L. Gastwirth. Statistical reasoning in the legal setting. The American Statistician, 46(1):55--69, 1992.Google Scholar
- N. Lerner. Group Rights and Discrimination in International Law. Martinus Nijhoff Publishers, 1991.Google Scholar
Cross Ref
- R. G. Newcombe. Interval estimation for the difference between independent proportions: comparison of eleven methods. Statistics in Medicine, 17:873--89, 1998.Google Scholar
Cross Ref
- M. J. Piette and P. F. White. Approaches for dealing with small sample sizes in employment discrimination litigation. Journal of Forensic Economics, 12:43--56, 1999.Google Scholar
Cross Ref
- J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA, 1993. Google Scholar
Digital Library
- W. M. Rodgers, editor. Handbook on the Economics of Discrimination. Edward Elgar Publishing, 2006.Google Scholar
Cross Ref
- I. Rorive. Proving Discrimination Cases - the Role of Situation Testing. Centre For Equal Rights & Migration Policy Group, 2009. http://www.migpolgroup.com/publications_detail.php?id=230http://www.migpolgroup.com.Google Scholar
- S. Ruggieri, D. Pedreschi, and F. Turini. Data mining for discrimination discovery. ACM Trans. on Knowledge Discovery from Data, 4(2):Article 9, 2010. Google Scholar
Digital Library
- S. Ruggieri, D. Pedreschi, and F. Turini. DCUBE: Discrimination discovery in databases. In Proc. of SIGMOD 2010, pages 1127--1130. ACM, 2010. Google Scholar
Digital Library
- T. Sowell, editor. Affirmative Action Around the World: An Empirical Analysis. Yale University Press, 2005.Google Scholar
- U.K. Legislation. (a) Sex Discrimination Act, 1975, (b) Race Relation Act, 1976.Google Scholar
- U.S. Federal Legislation. (a) Equal Credit Opportunity Act, 1974; (b) Fair Housing Act, 1968; (c) Employment Act, 1967; (d) Equal Pay Act, 1963; (e) Pregnancy Discrimination Act, 1978; (f) Civil Right Act, 1964, 1991.Google Scholar
Index Terms
k-NN as an implementation of situation testing for discrimination discovery and prevention





Comments