THE  ERROR-REJECT  TRADEOFF


                       Lars Kai Hansen
             CONNECT, Electronics Institute B349
             The Technical University of Denmark
                  DK-2800 Lyngby, Denmark,
                      lkhansen@ei.dtu.dk

                     Christian Liisberg
                  Applied Bio Cybernetics
                        Overdrevsvej
                 DK-3390 Hundested, Denmark 
                    abcyabc@pop.denet.dk

                       Peter Salamon
               Dept. of Mathematical Sciences
                 San Diego State University 
                   San Diego CA 92182 USA,
                    salamon@math.sdsu.edu     


Abstract:

We investigate the error versus reject tradeoff for classifiers. 
Our analysis is motivated by the remarkable similarity in error-reject 
tradeoff curves for widely differing algorithms classifying handwritten 
characters. We present the data in a new scaled version that makes this 
universal character particularly evident.
Based on Chow's theory of the error-reject tradeoff and its underlying 
Bayesian analysis we argue that such universality is in fact to be 
expected for general classification problems. Furthermore, we extend 
Chow's theory to classifiers working from finite samples on a broad, 
albeit limited, class of problems. 
The problems we consider are _effectively binary_, i.e., classification 
problems for which almost all inputs involve a choice between the right 
classification and at most one predominant alternative. 
We show that for such problems at most half of the initially rejected 
inputs would have been erroneously classified. We show further that 
such problems arise naturally as small perturbations of the PAC model 
for large training sets. The perturbed model leads us to conclude that 
the dominant source of error comes from pairwise overlapping categories. 
For infinite training sets, the overlap is due to noise and/or poor 
preprocessing. For finite training sets there is an additional 
contribution from the inevitable displacement of the decision boundaries 
due to finiteness of the sample. In either case, a rejection mechanism which 
rejects inputs in a shell surrounding the decision boundaries leads to 
a universal form for the error-reject tradeoff. Finally we analyze a 
specific reject mechanism based on the extent of consensus among an
ensemble of classifiers. For the ensemble reject mechanism we find  
an analytic expression for the error-reject tradeoff based on a maximum 
entropy estimate of the problem difficulty distribution.

Keywords: error-reject tradeoff, handwritten digits, 
          ensembles, neural networks.