Unknown

Dataset Information

0

Prediction and outlier detection in classification problems.


ABSTRACT: We consider the multi-class classification problem when the training data and the out-of-sample test data may have different distributions and propose a method called BCOPS (balanced and conformal optimized prediction sets). BCOPS constructs a prediction set C(x) as a subset of class labels, possibly empty. It tries to optimize the out-of-sample performance, aiming to include the correct class and to detect outliers x as often as possible. BCOPS returns no prediction (corresponding to C(x) equal to the empty set) if it infers x to be an outlier. The proposed method combines supervised learning algorithms with conformal prediction to minimize a misclassification loss averaged over the out-of-sample distribution. The constructed prediction sets have a finite sample coverage guarantee without distributional assumptions. We also propose a method to estimate the outlier detection rate of a given procedure. We prove asymptotic consistency and optimality of our proposals under suitable assumptions and illustrate our methods on real data examples.

SUBMITTER: Guan L 

PROVIDER: S-EPMC9305480 | biostudies-literature | 2022 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Prediction and outlier detection in classification problems.

Guan Leying L   Tibshirani Robert R  

Journal of the Royal Statistical Society. Series B, Statistical methodology 20220215 2


We consider the multi-class classification problem when the training data and the out-of-sample test data may have different distributions and propose a method called BCOPS (balanced and conformal optimized prediction sets). BCOPS constructs a prediction set <i>C</i>(<i>x</i>) as a subset of class labels, possibly empty. It tries to optimize the out-of-sample performance, aiming to include the correct class and to detect outliers <i>x</i> as often as possible. BCOPS returns no prediction (corres  ...[more]

Similar Datasets

| S-EPMC7284557 | biostudies-literature
| S-EPMC3191277 | biostudies-literature
| S-EPMC11323683 | biostudies-literature
| S-EPMC9890313 | biostudies-literature
| S-EPMC8549655 | biostudies-literature
| S-EPMC10326160 | biostudies-literature
| S-EPMC11232596 | biostudies-literature
| S-EPMC11392271 | biostudies-literature
2013-09-05 | GSE22850 | GEO
| S-EPMC5843703 | biostudies-literature