A switching strategy is proposed for the bandit problem with infinitely many arms. An arm is played as long as the value of a statistic computed from the arm's sample mean and variance does not exceed a certain threshold; once it does, a new arm is tried. Optimality properties of the strategy are discussed under assumptions on the prior distribution of the mean reward, and a heuristic is suggested for the case of an unspecified prior.
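To illustrate the flavor of such a switching rule, the following sketch plays Bernoulli arms drawn from a uniform prior and abandons an arm when a score built from its sample mean and variance exceeds a threshold. The specific statistic (a standardized shortfall from a target mean `mu_star`), the target value, the threshold, and the minimum sample size are all illustrative assumptions, not the statistic or constants analyzed in the paper.

```python
import random

def switching_bandit(draw_arm, pulls, mu_star, threshold, min_samples=5):
    """Infinite-armed bandit with a simple switching rule.

    draw_arm: returns a fresh arm; an arm is a zero-argument callable
    yielding a random reward.  Keep playing the current arm while the
    statistic stays at or below `threshold`; switch otherwise.
    (Illustrative statistic -- not the one from the paper.)
    """
    total = 0.0
    arm = draw_arm()
    rewards = []
    for _ in range(pulls):
        r = arm()
        total += r
        rewards.append(r)
        n = len(rewards)
        if n >= min_samples:
            mean = sum(rewards) / n
            var = sum((x - mean) ** 2 for x in rewards) / n
            # Standardized shortfall of the sample mean from the
            # (assumed) target mu_star; large values suggest a poor arm.
            stat = (mu_star - mean) / ((var / n + 1e-12) ** 0.5)
            if stat > threshold:   # arm looks too poor: try a new one
                arm = draw_arm()
                rewards = []
    return total

random.seed(0)

def draw_arm():
    # New arm with mean reward drawn from a uniform prior on (0, 1).
    p = random.random()
    return lambda: 1.0 if random.random() < p else 0.0

reward = switching_bandit(draw_arm, pulls=2000, mu_star=0.8, threshold=2.0)
```

Because each new arm is a fresh draw from the prior, the rule trades a bounded exploration cost on bad arms against the chance of settling on a near-optimal one.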