CH2-Bellman

【RL】CH2-Bellman equation

### The discounted return

$$
\begin{aligned}
G_t & = R_{t+1}+\gamma R_{t+2}+\gamma^2 R_{t+3}+\ldots \\
& = R_{t+1}+\gamma\left(R_{t+2}+\gamma R_{t+3}+\ldots\right)
\end{aligned}
$$

......
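The bracketed term in the second line has the same form as the return one step later, which gives the standard recursion $G_t = R_{t+1} + \gamma G_{t+1}$. The sketch below is a minimal illustration of that recursion, not code from the original post. It assumes a finite episode (all rewards after the last step are zero) and a list `rewards` where `rewards[t]` stands for $R_{t+1}$; the function names `discounted_returns` and `discounted_return_direct` are hypothetical.

```python
def discounted_returns(rewards, gamma):
    """Compute G_t for every t by sweeping backwards through the episode.

    Assumption: rewards[t] holds R_{t+1}, and the episode is finite,
    so the return after the final step is 0.
    """
    returns = [0.0] * len(rewards)
    g = 0.0  # return after the last step (assumed zero)
    for t in reversed(range(len(rewards))):
        # Recursion: G_t = R_{t+1} + gamma * G_{t+1}
        g = rewards[t] + gamma * g
        returns[t] = g
    return returns


def discounted_return_direct(rewards, gamma, t=0):
    """Compute G_t from the definition: sum over k of gamma^k * R_{t+k+1}."""
    return sum(gamma ** k * r for k, r in enumerate(rewards[t:]))


if __name__ == "__main__":
    rewards = [1.0, 2.0, 3.0]
    gamma = 0.5
    print(discounted_returns(rewards, gamma))
    print(discounted_return_direct(rewards, gamma))
```

The backward sweep touches each reward once, while the direct sum recomputes the powers of $\gamma$ for every starting time; both should agree on any finite reward sequence.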