首页 > 其他分享 >读论文《IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures》——（续）实

读论文《IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures》——（续）实

时间：2022-10-11 21:13:27浏览次数：91

标签：Weighted Importance 论文 Learner Scalable Actor Architectures

论文地址：

https://arxiv.org/pdf/1802.01561v2.pdf

论文《IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures》是基于论文《Safe and efficient off-policy reinforcement learning》改进后的分布式版本，基础论文《Safe and efficient off-policy reinforcement learning》的地址为：

https://arxiv.org/pdf/1606.02647.pdf

相关资料：

Deepmind Lab环境的python扩展库的安装：

https://www.cnblogs.com/devilmaycry812839668/p/16750126.html

读论文《IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures》

=========================================

官方的代码地址：（现已无法运行）

https://gitee.com/devilmaycry812839668/scalable_agent

需要注意的一点是这个offical的代码由于多年无人维护，现在已经无法运行，只做留档之用。

=========================================

标签：Weighted,Importance,论文,Learner,Scalable,Actor,Architectures
From： https://www.cnblogs.com/devilmaycry812839668/p/16782564.html

相关文章

Fairness without Demographics through Adversarially Reweighted Learning
目录概符号说明本文方法代码LahotiP.,BeutelA.,ChenJ.,LeeK.,ProstF.,ThainN.,WangX.andCHiE.H.Fairnesswithoutdemographicsthroughadversariall......
读论文《IMPALA: Scalable Distributed Deep-RL with Importance WeightedActor-Learn
论文地址：https://arxiv.org/pdf/1802.01561v2.pdf ========================================= ========================================= ......
Importance Sampling and Rejection Sampling
目录ImportanceSamplingRejectionSamplingChenY.Lecture4:ImportanceSamplingandRejectionSampling.ImportanceSampling设想我们希望估计这样的一个值:......
Adaptive Importance Sampling to Accelerate Training of a Neural Probabilistic La
目录概符号说明Motivation本文方法更简洁的形式BengioY.andSen\acute{e}calJ.S.Adaptiveimportancesamplingtoacceleratetrainingofaneuralprobabilistic......
Weighted Distribution
约束提供了对随机化的控制，用户可以从中控制随机化的值。有很多方法可以控制这些值。其中之一是加权分布。加权分布在约束块中创建分布，使得某些值的选择频率高于其他值。加......
hdu7215 Weighted Beautiful Tree
problem一个点的点权的可能为不变或者变为连着的边的边权。然后dp、dp[u][0]表示变成大于等于w[u]边的最小代价。dp[u][1]表示变成小于等于w[u]边的最小代价。然后对......

赞助商

阅读排行