Ontology highlight
ABSTRACT:
SUBMITTER: Bibaut A
PROVIDER: S-EPMC9799962 | biostudies-literature | 2021 Dec
REPOSITORIES: biostudies-literature
Bibaut Aurélien A Kallus Nathan N Dimakopoulou Maria M Chambaz Antoine A van der Laan Mark M
Advances in neural information processing systems 20211201
Empirical risk minimization (ERM) is the workhorse of machine learning, whether for classification and regression or for off-policy policy learning, but its model-agnostic guarantees can fail when we use adaptively collected data, such as the result of running a contextual bandit algorithm. We study a generic importance sampling weighted ERM algorithm for using adaptively collected data to minimize the average of a loss function over a hypothesis class and provide first-of-their-kind generalizat ...[more]