Unknown

Dataset Information

0

Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning.


ABSTRACT: Empirical risk minimization (ERM) is the workhorse of machine learning, whether for classification and regression or for off-policy policy learning, but its model-agnostic guarantees can fail when we use adaptively collected data, such as the result of running a contextual bandit algorithm. We study a generic importance sampling weighted ERM algorithm for using adaptively collected data to minimize the average of a loss function over a hypothesis class and provide first-of-their-kind generalization guarantees and fast convergence rates. Our results are based on a new maximal inequality that carefully leverages the importance sampling structure to obtain rates with the good dependence on the exploration rate in the data. For regression, we provide fast rates that leverage the strong convexity of squared-error loss. For policy learning, we provide regret guarantees that close an open gap in the existing literature whenever exploration decays to zero, as is the case for bandit-collected data. An empirical investigation validates our theory.

SUBMITTER: Bibaut A 

PROVIDER: S-EPMC9799962 | biostudies-literature | 2021 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning.

Bibaut Aurélien A   Kallus Nathan N   Dimakopoulou Maria M   Chambaz Antoine A   van der Laan Mark M  

Advances in neural information processing systems 20211201


Empirical risk minimization (ERM) is the workhorse of machine learning, whether for classification and regression or for off-policy policy learning, but its model-agnostic guarantees can fail when we use adaptively collected data, such as the result of running a contextual bandit algorithm. We study a generic importance sampling weighted ERM algorithm for using adaptively collected data to minimize the average of a loss function over a hypothesis class and provide first-of-their-kind generalizat  ...[more]

Similar Datasets

| S-EPMC7058059 | biostudies-literature
| S-EPMC9858109 | biostudies-literature
| S-EPMC10343141 | biostudies-literature
2019-11-13 | GSE140262 | GEO
| S-EPMC8982718 | biostudies-literature
| S-EPMC8259695 | biostudies-literature
| S-EPMC10440826 | biostudies-literature
| S-EPMC8725656 | biostudies-literature
| S-EPMC11585117 | biostudies-literature
| S-EPMC6550282 | biostudies-literature