DeepMind Control Suite: dm_control 1.0.13

时间：2023-06-26 18:00:55浏览次数：42

标签：Control control 13 name step velocity action position array

dm_control: DeepMind Infrastructure for Physics-Based Simulation

DeepMind的软件堆栈，用于基于物理的模拟和强化学习环境，使用MuJoCo物理。

1、基准任务

for domain_name, task_name in suite.BENCHMARKING:
　　print(domain_name, task_name)
　　env = suite.load(domain_name, task_name)

　　action_spec = env.action_spec()
　　time_step = env.reset()

　　action = np.random.uniform(action_spec.minimum,
　　action_spec.maximum,
　　size=action_spec.shape)
　　time_step = env.step(action)
　　print(time_step.last(), time_step.reward, time_step.discount, time_step.observation)

acrobot swingup
- orientations: array(4)
- velocity: array(2)
acrobot swingup_sparse
- orientations: array(4)
- velocity: array(2)
ball_in_cup catch
- position: array(4)
- velocity: array(4)
cartpole balance
- position: array(3)
- velocity: array(2)
cartpole balance_sparse
- position: array(3)
- velocity: array(2)
cartpole swingup
- position: array(3)
- velocity: array(2)
cartpole swingup_sparse
- position: array(3)
- velocity: array(2)
cheetah run
- position: array(8)
- velocity: array(9)
finger spin
- position: array(4)
- velocity: array(3)
- touch: array(2)
finger turn_easy
- position: array(4)
- velocity: array(3)
- touch: array(2)
- target_position: array(2)
- dist_to_target: float
finger turn_hard
- position: array(4)
- velocity: array(3)
- touch: array(2)
- target_position: array(2)
- dist_to_target: float
fish upright
- joint_angles: array(7)
- upright: float
- velocity: array(13)
fish swim
- joint_angles: array(7)
- upright: float
- target: array(3)
- velocity: array(13)
hopper stand
- position: array(6)
- velocity: array(7)
- touch: array(2)
hopper hop
- position: array(6)
- velocity: array(7)
- touch: array(2)
humanoid stand
humanoid walk
humanoid run
manipulator bring_ball
pendulum swingup
point_mass easy
reacher easy
reacher hard
swimmer swimmer6
swimmer swimmer15
walker stand
walker walk
walker run

标签：Control,control,13,name,step,velocity,action,position,array
From： https://www.cnblogs.com/lucifer1997/p/17506393.html

积分竟然比微分早了1300年！一文讲清积分的历史
将思考应用于实际，用自己的力量去推导面积、体积，这才是积分的乐趣，也是学习积分的真正意义。小学所学的图形面积、体积的计算，实际上是与积分世界相连通的。积分并不是高中教材中突然半路杀出的“程咬金”，初等教育中相关内容的学习，已经为迈入积分世界做了充分的热身。而对于微分，大部分......
JavaScript进阶13笔记
数字和日期数字在JavaScript里面，数字均为双精度浮点类型，即一个介于±2^−1023和±2^+1024之间的数字，或约为±10^−308到±10^+308，数字精度为53位。整数数值仅在±(2^53-1)的范围内可以表示准确。即一个介于±2^−1023和±2^+1024之间的数字，或约为±10^−30......
i5/i7该选谁？差距大不大？i5-13490F、i7-13790F深度测试
一、i5、i7还是性能差不多吗？自从2017年Zen架构发布开始，Intel与AMD在CPU性能竞争上就进入了激烈的内卷。随着双方在产品竞争上日趋白热化，同世代不同档次CPU产品的性能差距被明显拉大。那么，过去那种“i5、i7性能差不多，用i5性价比比较高”的观点是否依然成立？今天就带来Intel i5-13490......
WIN11 INTEL 12-13代大小核默认调度策略
WIN11INTEL12-13代大小核默认调度策略生效的异类策略：使用异类策略0.异类线程调度策略：自动。异类短运行线程调度策略：自动。......
LoginController
usingMicrosoft.AspNetCore.Http;usingMicrosoft.AspNetCore.Mvc;usingPT.BLL;usingPT.Common;usingPT.Model;usingPT.Web.Mvc.App_Start;usingSystem;usingSystem.Collections.Generic;usingSystem.Linq;usingSystem.Threading.Tasks;namespacePT.Web.......
DataMigrateController
usingSystem;usingSystem.Collections.Generic;usingSystem.Data;usingSystem.Linq;usingSystem.Text;usingSystem.Text.RegularExpressions;usingSystem.Threading.Tasks;usingMicrosoft.AspNetCore.Mvc;usingPT.BLL;usingPT.Common;usingPT.Model;usi......
IPListController
usingMicrosoft.AspNetCore.Http;usingMicrosoft.AspNetCore.Mvc;usingPT.BLL;usingPT.Common;usingPT.Model;usingPT.Web.Mvc.App_Start;usingSystem;usingSystem.Collections.Generic;usingSystem.Data;usingSystem.Linq;usingSystem.Text;usingSyst......
FileManageController
usingSystem;usingSystem.Collections.Generic;usingSystem.Data;usingSystem.IO;usingSystem.Linq;usingSystem.Threading.Tasks;usingMicrosoft.AspNetCore.Hosting;usingMicrosoft.AspNetCore.Http;usingMicrosoft.AspNetCore.Mvc;usingNPOI.HSSF.UserM......
Android13 安装最新版 Frida
本文所有教程及源码、软件仅为技术研究。不涉及计算机信息系统功能的删除、修改、增加、干扰，更不会影响计算机信息系统的正常运行。不得将代码用于非法用途，如侵立删！Android13安装最新版Frida环境win10Pixel4Android13Python3.9Frida16.1.0Frida-tools12.1.3PC安装......
CF1393E2 Twilight and Ancient Scroll
显然有一个\(|S|\log|S|\)的dp做法，但是瓶颈在给字符串排序。也就是真正的瓶颈在于求lcp。AFewSuns给出了一种不需要科技的做法，orz。第一个排序的部分，令\(t_{i,j}\)代表第\(i\)个字符串去掉第\(j\)个字符后的字符串，要给所有\(t_{i,j}\)排序。注意到相同颜色段是可......

DeepMind Control Suite: dm_control 1.0.13

dm_control: DeepMind Infrastructure for Physics-Based Simulation

相关文章

赞助商

阅读排行