Reinforcement
Robust Deep Reinforcement Learning through Adversarial Loss
郑重声明:原文参见标题,如有侵权,请联系作者,将会撤销发布! 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Abstract 最近的研究表明,深度强化学习智能体很容易受到智能体输入上的小对抗性扰动的影响 ......
Heuristic-Guided Reinforcement Learning
**发表时间:**2021 (NeurIPS 2021) **文章要点:**这篇文章提出了一个Heuristic-Guided Reinforcement Learning (HuRL)的框架,用domain knowledge或者offline data构建heuristic,将问题变成一个sho ......
Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations
郑重声明:原文参见标题,如有侵权,请联系作者,将会撤销发布! NeurIPS 2020 ......
Teachable Reinforcement Learning via Advice Distillation
**发表时间:**2021 (NeurIPS 2021) **文章要点:**这篇文章提出了一种学习policy的监督范式,大概思路就是先结构化advice,然后先学习解释advice,再从advice中学policy。这个advice来自于外部的teacher,相当于一种human-in-the-l ......
Adversarial Robust Deep Reinforcement Learning Requires Redefining Robustness
郑重声明:原文参见标题,如有侵权,请联系作者,将会撤销发布! ......
强化学习 Reinforcement Learning
强化学习 Reinforcement Learning 强化学习是一种机器学习思想,其关心一个智能体如何采取行动以达到最大化激励回报。 基本的强化学习模型以马尔可夫决策过程建模。 马尔可夫决策过程 Markov Decision Process 系统要素 A 行动空间; S状态空间; $P^a_{s ......
Demonstration-Conditioned Reinforcement Learning for Few-Shot Imitation
**发表时间:**2021(ICML 2021) **文章要点:**这篇文章提出了demonstration-conditioned reinforcement learning (DCRL)来做Few-Shot Imitation,将demonstration和当前状态作为输入,通过强化学习最大化 ......
02.Deep Reinforcement Learning for Quantitative Trading Challenges and Opportunities
Deep Reinforcement Learning for Quantitative Trading Challenges and Opportunities 量化交易的深度强化学习:挑战与机遇 IEEE 背景 量化交易:量化交易是指借助现代统计学和数学的方法,利用计算机技术来进行交易的证券投资 ......