课程大纲
介绍
强化学习基础
强化学习基本技术
BURLAP简介
值迭代和策略迭代的收敛
奖赏塑形(Reward Shaping)
探索(Exploration)
泛化(Generalization)
部分可观察的马尔可夫决策过程(POMDP)
选择(Options)
Logistics
TD Lambda
策略梯度(Policy Gradient)
深度Q学习
博弈论(Game Theory)专题
总结和结论
要求
- 熟练掌握Python
- 了解大学微积分和线性代数
- 基本了解概率和统计
- 用Python和Numpy创建机器学习模型的经验
客户评论 (4)
Hunter非常出色,非常有吸引力,知识渊博且平易近人。表现非常出色。
Rick Johnson - Laramie County Community College
课程 - Artificial Intelligence (AI) Overview
机器翻译
培训师是该领域的专业人士,能够出色地将理论与实际应用相结合
Fahad Malalla - Tatweer Petroleum
课程 - Applied AI from Scratch in Python
机器翻译
The interactive part, tailored to our specific needs.
Thomas Stocker
课程 - Introduction to the use of neural networks
机器翻译
It was very interactive and more relaxed and informal than expected. We covered lots of topics in the time and the trainer was always receptive to talking more in detail or more generally about the topics and how they were related. I feel the training has given me the tools to continue learning as opposed to it being a one off session where learning stops once you've finished which is very important given the scale and complexity of the topic.
Jonathan Blease
课程 - Artificial Neural Networks, Machine Learning, Deep Thinking
机器翻译