Dynamic Program for Linear Program IC Problem

Linear Programming for Finite State Multi-Armed Bandit Problems

We consider the multi-armed bandit problem. We show that when the state space is finite the computation of the dynamic allocation indices can be handled by linear programming methods.

一些您可能无法访问的结果已被隐去。

显示无法访问的结果

反馈

Linear Programming for Finite State Multi-Armed Bandit Problems

今日热点