학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

Globally Informative Thompson Sampling for Structured Bandit Problems with Application to CrowdTranscoding

Resource Type: Conference
Authors: Liu, Xingchi; Derakhshani, Mahsa; Zhu, Ziming; Lambotharan, Sangarapillai
Source: 2021 International Conference on Artificial Intelligence in Information and Communication (ICAIIC) Artificial Intelligence in Information and Communication (ICAIIC), 2021 International Conference on. :210-215 Apr, 2021
Subject: Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Fields, Waves and Electromagnetics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Correlation
Simulation
Computational modeling
Decision making
Stochastic processes
Streaming media
Benchmark testing
Multi-armed bandit
Thompson sampling
Structured bandit
Edge computing
Language

Online Access

Full Text (IEEE)

초록

Multi-armed bandit is a widely-studied model for sequential decision-making problems. The most studied model in the literature is stochastic bandits wherein the reward of each arm follows an independent distribution. However, there is a wide range of applications where the rewards of different alternatives are correlated to some extent. In this paper, a class of structured bandit problems is studied in which rewards of different arms are functions of the same unknown parameter vector. To minimize the cumulative learning regret, we propose a globally-informative Thompson sampling algorithm to learn and leverage the correlation among arms, which can deal with unknown multi-dimensional parameter and non-monotonic reward functions. Our studies demonstrate that the proposed algorithm achieves significant improvement in the learning speed. In particular, the designed algorithm is used to solve an edge transcoder selection problem in crowdsourced live video streaming systems and shows superior performance as compared to the existing schemes.

공지

DAU Library

학술논문

요약정보

Globally Informative Thompson Sampling for Structured Bandit Problems with Application to CrowdTranscoding

Online Access

초록