Thank you for sending your enquiry! One of our team members will contact you shortly.
Thank you for sending your booking! One of our team members will contact you shortly.
Course Outline
Introduction
- Learning through positive reinforcement
Core Components of Reinforcement Learning
Key Terminology (Actions, States, Rewards, Policy, Value, Q-Value, etc.)
Survey of Tabular Solution Methods
Building a Software Agent
Exploring Value-based, Policy-based, and Model-based Approaches
Implementing the Markov Decision Process (MDP)
How Policies Dictate Agent Behavior
Application of Monte Carlo Methods
Temporal-Difference Learning
n-step Bootstrapping
Approximate Solution Methods
On-policy Prediction with Approximation
On-policy Control with Approximation
Off-policy Methods with Approximation
Understanding Eligibility Traces
Utilizing Policy Gradient Methods
Summary and Conclusion
Requirements
- Background in machine learning
- Proficiency in programming
Target Audience
- Data scientists
21 Hours