Andrew Ng, Coursera, Machine Learning Specialization, Machine Learning: Unsupervised Learning, Recommenders, Reinforcement Learning 第

Date: 2022-12-03 13:23:12


Practice quiz: Reinforcement learning introduction

Question 1: You are using reinforcement learning to control a four-legged robot. The position of the robot would be its _____.

[Correct] state

Question 2: You are controlling a Mars rover. You will be very, very happy if it gets to state 1 (significant scientific discovery), slightly happy if it gets to state 2 (small scientific discovery), and unhappy if it gets to state 3 (rover is permanently damaged). To reflect this, choose a reward function so that:

[Correct] R(1) > R(2) > R(3), where R(1) and R(2) are positive and R(3) is negative.
[Explanation] Good job!

Question 3: You are using reinforcement learning to fly a helicopter. Using a discount factor of 0.75, your helicopter starts in some state and receives rewards -100 on the first step, -100 on the second step, and 1000 on the third and final step (where it has reached a terminal state). What is the return?

[Correct] -100 - 0.75*100 + 0.75^2*1000
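
The expression in Question 3 is a discounted sum: the reward at step t (counting from 0) is weighted by the discount factor raised to the power t. A minimal sketch that evaluates it numerically; the function name `discounted_return` is chosen here for illustration and does not come from the course:

```python
def discounted_return(rewards, gamma):
    """Sum the rewards, weighting the reward at 0-based step t by gamma**t."""
    return sum(reward * gamma ** t for t, reward in enumerate(rewards))


# Rewards from the helicopter question: -100, -100, then 1000 at the terminal step.
helicopter_return = discounted_return([-100, -100, 1000], gamma=0.75)
print(helicopter_return)  # 387.5
```

The first reward is not discounted, which is why the expression starts with a bare -100.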

Question 4: Given the rewards and actions below, compute the return from state 3 with a discount factor of γ = 0.25.

[Correct] 6.25
[Explanation] Starting from state 3, the rewards are collected in states 3, 2, and 1. The return is 0 + 0.25 × 0 + 0.25^2 × 100 = 6.25.
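
The same return can also be built up backward from the terminal state using the recursion G_t = R_t + γ G_{t+1}, which is the idea behind the Bellman equation used later in the course. A small sketch, taking the reward sequence 0, 0, 100 from the explanation above; the helper name `return_by_recursion` is hypothetical:

```python
def return_by_recursion(rewards, gamma):
    """Compute the return backward with G_t = R_t + gamma * G_{t+1}."""
    g = 0.0  # the return after the terminal step is zero
    for reward in reversed(rewards):
        g = reward + gamma * g
    return g


# Rewards received in states 3, 2, and 1, as stated in the explanation.
state3_return = return_by_recursion([0, 0, 100], gamma=0.25)
print(state3_return)  # 6.25
```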

Practice quiz: State-action value function

Question 1: Which of the following accurately describes the state-action value function Q(s,a)?

[Correct] It is the return if you start from state s, take action a (once), then behave optimally after that.

Question 2: You are controlling a robot that has 3 actions: ← (left), → (right) and STOP. From a given state s, you have computed Q(s, ←) = -10, Q(s, →) = -20, Q(s, STOP) = 0. What is the optimal action to take in state s?

[Correct] STOP
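
The optimal action is the one with the largest Q(s, a). A short sketch of that selection, with the three actions written as plain strings (a naming choice made here for illustration, not taken from the quiz):

```python
# Q-values for the single state s given in the question.
q_values = {"LEFT": -10, "RIGHT": -20, "STOP": 0}

# Pick the action whose Q-value is largest.
best_action = max(q_values, key=q_values.get)
print(best_action)  # STOP
```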

Question 3: For this problem, γ = 0.25. The diagram below shows the return and the optimal action from each state. Please compute Q(5, ←).

[Correct] 0.625
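
The diagram is not reproduced in this post. The sketch below therefore assumes a layout that is consistent with both stated answers (6.25 from state 3 earlier, and 0.625 here): six states in a row, reward 100 in state 1, reward 40 in state 6, reward 0 elsewhere, with states 1 and 6 terminal and deterministic moves. All of these values are assumptions, not facts taken from the quiz page. Under them, Q(5, ←) is the reward in state 5 plus γ times the optimal return from state 4.

```python
GAMMA = 0.25
# Assumed rewards and terminal states (the original diagram is missing).
REWARDS = {1: 100, 2: 0, 3: 0, 4: 0, 5: 0, 6: 40}
TERMINAL = {1, 6}
MOVES = {"left": -1, "right": +1}


def optimal_returns(n_sweeps=20):
    """Value iteration: repeatedly apply the Bellman update to every state."""
    v = {s: 0.0 for s in REWARDS}
    for _ in range(n_sweeps):
        new_v = {}
        for s in REWARDS:
            if s in TERMINAL:
                new_v[s] = float(REWARDS[s])
            else:
                best_next = max(v[s + step] for step in MOVES.values())
                new_v[s] = REWARDS[s] + GAMMA * best_next
        v = new_v
    return v


def q_value(state, action, v):
    """Q(s, a) = R(s) + gamma * (optimal return from the next state)."""
    next_state = state + MOVES[action]
    return REWARDS[state] + GAMMA * v[next_state]


v = optimal_returns()
print(v[4])                   # 2.5, reached by heading right toward state 6
print(q_value(5, "left", v))  # 0.625
```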

Practice quiz: Continuous state spaces

Question 1: The Lunar Lander is a continuous state MDP because:

[Correct] The state contains numbers such as position and velocity that are continuous valued.

Question 2: In the learning algorithm described in the videos, we repeatedly create an artificial training set to which we apply supervised learning where the input x = (s,a) and the target, constructed using Bellman’s equations, is y = _____?

[Correct] See the figure above.
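
The figure with the answer is not reproduced in this post. Assuming the target is the usual Bellman form, y = R(s) + γ max over a' of Q(s', a'), the training-set construction can be sketched as follows. The function `bellman_targets`, the toy `guess_q` function, the action names, the sample experiences, and the treatment of terminal transitions (y = R(s) with no bootstrap term) are all illustrative assumptions rather than the course's code:

```python
def bellman_targets(experiences, q_function, actions, gamma):
    """Build (x, y) pairs with x = (s, a) and y = R(s) + gamma * max_a' Q(s', a')."""
    training_set = []
    for state, action, reward, next_state, done in experiences:
        if done:
            # Assumed convention: no future return after a terminal transition.
            y = reward
        else:
            y = reward + gamma * max(q_function(next_state, a) for a in actions)
        training_set.append(((state, action), y))
    return training_set


# Hypothetical current guess of Q, standing in for a neural network.
def guess_q(state, action):
    return 1.0 if action == "main" else 0.0


ACTIONS = ["nothing", "left", "main", "right"]
experiences = [
    # (s, a, R(s), s', done) with made-up two-number states
    ((0.0, 1.0), "left", -1.0, (0.1, 0.9), False),
    ((0.1, 0.9), "main", 100.0, (0.0, 0.0), True),
]

training_set = bellman_targets(experiences, guess_q, ACTIONS, gamma=0.5)
for x, y in training_set:
    print(x, y)
```

A supervised learner would then be fit to map each x to its y, and the process repeated with the improved Q estimate.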

Question 3: You have reached the final practice quiz of this class! What does that mean? (Please check all the answers, because all of them are correct!)

[Correct] The DeepLearning.AI and Stanford Online teams would like to give you a round of applause!
[Correct] You deserve to celebrate!
[Correct] Andrew sends his heartfelt congratulations to you!
[Correct] What an accomplishment -- you made it!

From: https://www.cnblogs.com/chuqianyu/p/16947468.html
