{ "id": "1502.00598", "version": "v1", "published": "2015-02-02T20:00:13.000Z", "updated": "2015-02-02T20:00:13.000Z", "title": "Thompson sampling with the online bootstrap", "authors": [ "Maurits Kaptein", "Davide Ianuzzi" ], "comment": "7 Pages, 4 Figures", "categories": [ "cs.LG" ], "abstract": "We often encounter situations in which an experimenter wants to find, by sequential experimentation, $x_{max} = \\arg\\max_{x} f(x)$, where $f(x)$ is a (possibly unknown) function of a well controllable variable $x$. Taking inspiration from physics and engineering, we have designed a new method to address this problem. In this paper, we first introduce the method in continuous time, and then present two algorithms for use in sequential experiments. Through a series of simulation studies, we show that the method is effective for finding maxima of unknown functions by experimentation, even when the maximum of the functions drifts or when the signal to noise ratio is low.", "revisions": [ { "version": "v1", "updated": "2015-02-02T20:00:13.000Z" } ], "analyses": { "keywords": [ "online bootstrap", "thompson sampling", "sequential experimentation", "noise ratio", "encounter situations" ], "note": { "typesetting": "TeX", "pages": 7, "language": "en", "license": "arXiv", "status": "editable", "adsabs": "2015arXiv150200598K" } } }