off-policy learning planning policy

EXPLORING MODEL-BASED PLANNING WITH POLICY NETWORKS

**发表时间：**2020（ICLR 2020） **文章要点：**这篇文章说现在的planning方法都是在动作空间里randomly generated，这样很不高效（其实瞎扯了，很多不是随机的方法啊）。作者提出在model based RL里用policy网络来做online planning ......

MODEL-BASED EXPLORING PLANNING NETWORKS POLICY更新时间 2023-04-27

User installations are disabled via policy on the machine. 安装python

User installations are disabled via policy on the machine. 解决办法 1、在运行里输入gpedit.msc;（group policy)组策略 2、计算机配置管理>>管理模板>>windows组件>>windows Installer>>禁止 ......

installations disabled machine policy python更新时间 2023-04-27

Representation Learning for Attributed Multiplex Heterogeneous Network

Cen Y., Zou X., Zhang J., Yang H., Zhou J. and Tang J. Representation learning for attributed multiplex heterogeneous network. KDD, 2019. 概本文在 Attrib ......

Representation Heterogeneous Attributed Multiplex Learning更新时间 2023-04-27

2022AAAI_Semantically Contrastive Learning for Low-light Image Enhancement(SCL_LLE)

1. motivation 利用语义对比学习 2. network (1) 输入的是低光图像首先经过图像增强的网络（Zero-DCE), 再将它传入语义分割网络中（2）语义分割网络用的是DeepLabv3+ ......

AAAI_Semantically Semantically Contrastive Enhancement Low-light更新时间 2023-04-26

MEMORY REPLAY WITH DATA COMPRESSION FOR CONTINUAL LEARNING--阅读笔记

MEMORY REPLAY WITH DATA COMPRESSION FOR CONTINUAL LEARNING--阅读笔记摘要：在这项工作中，我们提出了使用数据压缩（MRDC）的内存重放，以降低旧的训练样本的存储成本，从而增加它们可以存储在内存缓冲区中的数量。观察到压缩数据的质量和数量之间 ......

COMPRESSION CONTINUAL LEARNING 笔记 MEMORY更新时间 2023-04-25

Deep-Learning-Based Spatio-Temporal-Spectral Integrated Fusion of Heterogeneous Remote Sensing Images

Deep-Learning-Based Spatio-Temporal-Spectral Integrated Fusion of Heterogeneous Remote Sensing Images abstract 为了解决STF中的生成heterogeneous images问题: 为此，本 ......

Spatio-Temporal-Spectral Deep-Learning-Based Heterogeneous Integrated Learning更新时间 2023-04-25

Medicine River-------------Learning Journals 8

htttp://www.enotes.com ......

Medicine Learning Journals River更新时间 2023-04-24

Adversarial Robust Deep Reinforcement Learning Requires Redefining Robustness

郑重声明：原文参见标题，如有侵权，请联系作者，将会撤销发布！ ......

Reinforcement Adversarial Redefining Robustness Learning更新时间 2023-04-23

Learning Off-Policy with Online Planning

**发表时间：**2021（CoRL 2021） **文章要点：**这篇文章提出Off-Policy with Online Planning (LOOP)算法，将H-step lookahead with a learned model和terminal value function learne ......

Off-Policy Learning Planning Policy Online更新时间 2023-04-23

论文解读（VAT）《Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning》

论文信息论文标题：Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning论文作者：Takeru Miyato, S. Maeda, Masanori Koya ......

Supervised Semi-Supervised Regularization Adversarial Training更新时间 2023-04-22

论文解读（PGD）《Towards deep learning models resistant to adversarial attacks》

论文信息论文标题：Towards deep learning models resistant to adversarial attacks论文作者：Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, Ad ......

adversarial resistant learning Towards attacks更新时间 2023-04-22

基于RL(Q-Learning)的迷宫寻路算法

强化学习是一种机器学习方法，旨在通过智能体在与环境交互的过程中不断优化其行动策略来实现特定目标。与其他机器学习方法不同，强化学习涉及到智能体对环境的观测、选择行动并接收奖励或惩罚。因此，强化学习适用于那些需要自主决策的复杂问题，比如游戏、机器人控制、自动驾驶等。强化学习可以分为基于价值的方法和基于策 ......

迷宫算法 Q-Learning Learning RL更新时间 2023-04-21

MySQL Execution Plan--DISTINCT语句优化

问题描述在很多业务场景中业务需要过滤掉重复数据，对于MySQL数据库可以有多种SQL写法能实现这种需求，如：使用DISTINCT，如： SELECT DISTINCT username FROM hotel_owner WHERE username IN ('yqdsyey4474','xrnh ......

语句 Execution DISTINCT MySQL Plan更新时间 2023-04-21

1、题目：Engineering Design Thinking, Teaching, and Learning

期刊信息（1）作者：Dym，Clive L.，Agogino，Alice M.，Eris，Ozgur，Frey，Daniel D.，Leifer，Larry J. （2）期刊：Journal of Engineering Education：94-1-103-120，01/2005 （3）DOI： ......

Engineering Thinking Teaching Learning 题目更新时间 2023-04-20

M3AE: Multimodal Representation Learning for Brain Tumor Segmentation with Missing Modalities

摘要提出SimCLR，用于视觉表征的对比学习，简化了最近提出的对比自监督学习算法，为了理解是什么使对比预测任务能够学习有用的表示，系统研究了提出框架的主要组成部分，发现：（1）数据增强的组成在定义有效的预测任务中起着关键的作用（2）在表示和对比损失之间引入一个可学习的非线性变换，大大提高了已学 ......

Representation Segmentation Multimodal Modalities Learning更新时间 2023-04-20

阅读文献《SCNet：Deep Learning-Based Downlink Channel Prediction for FDD Massive MIMO System》

该文献的作者是清华大学的高飞飞老师，于2019年11月发表在IEEE COMMUNICATIONS LETTERS上。文章给出了当用户位置到信道的映射是双射时上行到下行的确定映射函数；还提出了一个**稀疏复值神经网络( sparse complex-valued neural network，SC ......

Learning-Based Prediction 文献 Learning Downlink更新时间 2023-04-20

文献阅读《AcsiNet: Attention-Based Deep Learning Network for CSI Prediction in FDD MIMO Systems》

这篇文献的作者是南华大学的林文斌老师，于2023年3月3日发表在IEEE WIRELESS COMMUNICATIONS LETTERS。文章直接对上行 CSI 矩阵使用离散傅里叶逆变换进行压缩，然后将其输入一个基于注意力（attention-based）的深度学习网络，该网络可以专注于关键的 C ......

Attention-Based Prediction Attention 文献 Learning更新时间 2023-04-18

Lecture#14 Query Planning & Optimization

SQL是声明性的，这意味着用户告诉 DBMS 他们想要什么答案，而不是如何得到答案。因此，DBMS 需要将 SQL 语句转换为可执行的查询计划。但不同的查询计划的效率可能出现多个数量级的差别，如 Join Algorithms 一节中的 Simple Nested Loop Join 与 Hash ......

Optimization Planning Lecture Query amp更新时间 2023-04-18

GCR Gradient Coreset based Replay Buffer Selection for Continual Learning

GCR: Gradient Coreset based Replay Buffer Selection for Continual Learning 摘要：本文提出了一种创新的重放缓冲区选择和更新策略，梯度核心集重放（GCR），使用一种设计优化标准。该方法选择和维持一个“coreset” ，它非常 ......

Continual Selection Gradient Learning Coreset更新时间 2023-04-17

Codeforces Round 625 (Div. 1, based on Technocup 2020 Final Round) A. Journey Planning（dp）

https://codeforces.com/contest/1320/problem/A ###A. Journey Planning 题目大意：给定一组数，问我们ai-aj==i-j的时候就可以把ai的值加起来，问我们可以凑到的最大总值是多少？ input 6 10 7 1 9 10 15 o ......

Round Codeforces Technocup Planning Journey更新时间 2023-04-17

论文解读《Automatically discovering and learning new visual categories with ranking statistics》

论文信息论文标题：Automatically discovering and learning new visual categories with ranking statistics论文作者：K. Han, Sylvestre-Alvise Rebuffi, Sébastien Ehrhard ......

Automatically discovering categories statistics learning更新时间 2023-04-17

五天学会Deep Learning

五天学完deep learning。。。。。。是时候来证明chatGPT和new bing的能力了。。。。。。 DAY1 Sigmoid function Sigmoid 函数是一种常用的激活函数，它将输入值映射到 0 和 1 之间。它的公式为 f(x) = 1 / (1 + e^-x)。Sigmo ......

Learning Deep更新时间 2023-04-17

论文解读（PAWS）《Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples》

论文信息论文标题：Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples论文作者：Mahmoud Assran, Mathi ......

Non-Parametrically Semi-Supervised Parametrically Assignments Predicting更新时间 2023-04-17

共600篇 :17/20页 首页上一页14151617181920下一页尾页

526互联