Today9

2024-07-07 Reinforcement Learning (Value Function Approximation) - Today9
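Value function approximation replaces the value table with a parameterized function, most often a neural network, and learns its parameters in order to find the optimal solution. The main topics are the algorithm for state value function estimation, Sarsa combined with value function approximation, Q-learning combined with value function approximation, and Deep Q-learning. Because tables cannot handle a very large state space or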