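Course Outline
Introduction
Reinforcement Learning Basics
Basic Reinforcement Learning Techniques
Introduction to BURLAP
Convergence of Value Iteration and Policy Iteration
Reward Shaping
Exploration
Generalization
Partially Observable Markov Decision Processes (POMDPs)
Options
Logistics
TD Lambda
Policy Gradient
Deep Q-Learning
Topics in Game Theory
Summary and Conclusion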
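Requirements
- Proficiency in Python
- Understanding of college-level calculus and linear algebra
- Basic understanding of probability and statistics
- Experience building machine learning models in Python and NumPy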
Customer Reviews (4)
Hunter was excellent: very engaging, very knowledgeable, and approachable. Very well done.
Rick Johnson - Laramie County Community College
Course - Artificial Intelligence (AI) Overview
I liked the new insights in deep machine learning.
Josip Arneric
Course - Neural Network in R
Ann created a great environment to ask questions and learn. We had a lot of fun and also learned a lot at the same time.
Gudrun Bickelq
Course - Introduction to the use of neural networks
It was very interactive and more relaxed and informal than expected. We covered lots of topics in the time, and the trainer was always receptive to talking in more detail or more generally about the topics and how they were related. I feel the training has given me the tools to continue learning, as opposed to it being a one-off session where learning stops once you've finished, which is very important given the scale and complexity of the topic.
Jonathan Blease
Course - Artificial Neural Networks, Machine Learning, Deep Thinking