ABSTRACT
Classification, the development of rules for the allocation of observations to groups, is a fundamental machine learning task. A classic example is an automated system for a lending institution that decides whether to accept or reject a credit application. One might desire a machine that allows the non-classification of certain observations that exhibit attributes of belonging to more than one group. This option would allow inspection by an expert for "difficult" cases, or serve as an indication that more data needs to be collected. Classification with an option to reserve judgment on an observation is known as constrained discrimination.
We consider a two-stage model for multi-category constrained discrimination in which limits on misclassification rates of training observations may be pre-specified. The mechanism by which the misclassification limits are satisfied is a rejection option, also known as a reserved judgment group, for observations not demonstrating properties of membership to any of the groups.
References
- J. A. Anderson. "Constrained discrimination between k populations." Journal of the Royal Statistical Society. Series B (Methodological), 31:123--139, 1969.Google Scholar
Cross Ref
- L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone. Classification and Regression Trees. Wadsworth and Brooks/Cole, 1984.Google Scholar
- J. D. Broffit, R. H. Randles, and R. V. Hogg. "Distribution-free partial discriminant analysis." Journal of the American Statistical Association, 71:934--939, 1976.Google Scholar
Cross Ref
- J. P. Brooks and E. K. Lee. "Computing a multi-category constrained discrimination rule via mixed-integer programming and combinatorial optimization." Working paper.Google Scholar
- L. Devroye, L. Györfi, and G. Lugosi. A Probabilistic Theory of Pattern Recognition. Springer, 1996.Google Scholar
Cross Ref
- R. O. Duda, P. E. Hart, and D. G. Stork. Pattern Classification. Wiley, 2001. Google Scholar
Digital Library
- F. A. Feltus, E. K. Lee, J. F. Costello, C. Plass, and P. M. Vertino. "Predicting aberrant CpG island methylation." Proceedings of the National Academy of Sciences, 100:12253--12258, 2003.Google Scholar
Cross Ref
- F. A. Feltus, E. K. Lee, J. F. Costello, C. Plass, and P. M. Vertino. "Dna motifs associated with aberrant CpG island methylation." Genomics, 87:572--579, 2006.Google Scholar
Cross Ref
- R. J. Gallagher, E. K. Lee, and D. A. Patterson. "Constrained discriminant analysis via 0/1 mixed integer programming." Annals of Operations Research, 74:65--88, 1997.Google Scholar
Cross Ref
- L. Györfi, Z. Györfi, and I. Vajda. "Bayesian decision with rejection." Problems of Control and Information Theory, 8:445--452, 1979.Google Scholar
- J. D. F. Habbema, J. Hermans, and A. T. Van Der Burgt. "Cases of doubt in allocation problems." Biometrika, 61:313--324, 1974.Google Scholar
Cross Ref
- D. J. Hand and W. E. Henley. "Statistical classification methods in consumer credit scoring: a review." J. R. Statist. Soc. A, 160:523--541.Google Scholar
Cross Ref
- E. K. Lee, A. Y. C. Fung, J. P. Brooks, and M. Zaider. "Automated planning volume definition in soft-tissue sarcoma adjuvant brachytherapy." Biology in Physics and Medicine, 47:1891--1910, 2002.Google Scholar
Cross Ref
- E. K. Lee, R. J. Gallagher, A. M. Campbell, and M. R. Prausnitz. "Prediction of ultrasound-mediated disruption of cell membranes using machine learning techniques and statistical analysis of acoustic spectra." IEEE Transactions on Biomedical Engineering, 51:1--9, 2004.Google Scholar
Cross Ref
- E. K. Lee, R. J. Gallagher, and D. A. Patterson. "A linear programming approach to discriminant analysis with a reserved-judgment region." INFORMS Journal on Computing, 15:23--41, 2003. Google Scholar
Digital Library
- E. K. Lee, "Optimization-Based Predictive Models in Medicine and Biology." Optimization in Medicine, Springer Computer Science Series, 2006, to appear.Google Scholar
- O. L. Mangasarian and W. H. Wolberg. "Cancer diagnosis via linear programming." SIAM News, 23:1--18, 1990.Google Scholar
- D. J. Newman, S. Hettich, C. L. Blake, and C. J. Merz. UCI repository of machine learning databases, 1998. http://www.ics.uci.edu/~mlearn/MLRepository.html.Google Scholar
- C. P. Quesenberry and M. P. Gessaman. "Nonparametric discrimination using tolerance regions." Annals of Mathematical Statistics, 39:664--673, 1968.Google Scholar
Cross Ref
- J. R. Quinlan. "Simplifying decision trees." International Journal of Man-Machine Studies, 27:221--234, 1987. Google Scholar
Digital Library
- V. Vapnik. Statistical Learning Theory. Wiley, 1998.Google Scholar
Digital Library
Index Terms
(auto-classified)Mixed integer programming constrained discrimination model for credit screening



Comments