Ch1

[Notes] Online Convex Optimization - Ch1. Introduction

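1.1 The Online Convex Optimization Setting. Online convex optimization (OCO) is a fairly new framework that combines ideas from game theory, statistical learning, and convex optimization. The problem is stated as follows: consider a turn-based game; in round \(t\in [T]\), the player first chooses from ......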

[RL] CH1-Basic Concepts

1.7 Markov decision processes. This section presents these concepts in a more formal way under the framework of Markov decision processes (MDPs). An ......