Hostname: page-component-5c6d5d7d68-wbk2r Total loading time: 0 Render date: 2024-08-16T19:44:17.676Z Has data issue: false hasContentIssue false

A two armed bandit type problem revisited

Published online by Cambridge University Press:  15 November 2005

Gilles Pagès*
Affiliation:
Laboratoire de Probabilités et Modèles Aléatoires, UMR 7599, Université Paris 6, case 188, 4, place Jussieu, 75252 Paris Cedex 5, France; gpa@ccr.jussieu.fr
Get access

Abstract

In Benaïm and Ben Arous (2003) is solved a multi-armed bandit problem arising in the theory of learning in games. We propose a short and elementary proof of this result based on a variant of the Kronecker lemma.

Type
Research Article
Copyright
© EDP Sciences, SMAI, 2005

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Benaïm, M., Dynamics of stochastic algorithms, in Séminaire de probabilités XXXIII, J. Azéma et al. Eds., Springer-Verlag, Berlin. Lect. Notes Math. 1708 (1999) 168.
Benaïm, M. and Ben Arous, G., A two armed bandit type problem. Game Theory 32 (2003) 316. CrossRef