Get in Touch

Course Outline

Introduction

  • Learning through positive reinforcement

Core Components of Reinforcement Learning

Key Terminology (Actions, States, Rewards, Policy, Value, Q-Value, etc.)

Survey of Tabular Solution Methods

Building a Software Agent

Exploring Value-based, Policy-based, and Model-based Approaches

Implementing the Markov Decision Process (MDP)

How Policies Dictate Agent Behavior

Application of Monte Carlo Methods

Temporal-Difference Learning

n-step Bootstrapping

Approximate Solution Methods

On-policy Prediction with Approximation

On-policy Control with Approximation

Off-policy Methods with Approximation

Understanding Eligibility Traces

Utilizing Policy Gradient Methods

Summary and Conclusion

Requirements

  • Background in machine learning
  • Proficiency in programming

Target Audience

  • Data scientists
 21 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories