Fundamentals of Reinforcement Learning培训

Reinforcement Learning (RL) 是一种机器学习技术,计算机程序(代理人)通过执行行动并收到对行动结果的反馈来学习如何在环境中行事。对于每一个好行为,代理人都会得到积极的反馈,而对于每一个坏行为,代理人都会得到负面反馈(罚款)。

这个导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向导向

在本研讨会结束后,参与者将能够:

安装并应用需要实施的图书馆和编程语言(0)。
创建一个能够通过反馈而不是通过监督学习的软件代理人。
编程一个代理人解决问题,在决策是序列和终端。
应用知识设计软件,可以以类似于人类如何学习的方式学习。

课程格式

互动讲座和讨论。
很多练习和练习。
在现场实验室环境中进行手动实施。

课程定制选项

要申请此课程的定制培训,请联系我们安排。

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

相关课程

用Python进行深度强化学习

Reinforcement Learning with Java

Introduction to Data Science and AI using Python

AI in Digital Marketing

Artificial Intelligence (AI) for Managers

Artificial Intelligence (AI) for Robotics

Introduction to Artificial Intelligence (AI)

AI and Robotics for Nuclear - Extended

AI and Robotics for Nuclear

AI in business and Society & The future of AI - AI/Robotics

Introduction to Bing AI: Enhancing Search with Artificial Intelligence

IBM Cloud Pak for Data

Fundamentals of Intelligent Driving

Intelligent Testing

OptaPlanner in Practice

课程分类

该网站在其他国家/地区

Europe

Österreich (Austria)

Schweiz (Switzerland)

Deutschland (Germany)

Magyarország (Hungary)

España (Spain)

Nederland (Netherlands)

România (Romania)

Sverige (Sweden)

Belgique (Belgium)

Polska (Poland)

Asia Pacific

香港 (Hong Kong)

台灣 (Taiwan)

North America

México (Mexico)

South America

Brasil (Brazil)

Africa / Middle East

United Arab Emirates

Other sites

加入我们（人力资源）

NobleProg加盟商

DaDesktop - Cloud Desktop