Table of Contents
- What is Reinforcement Learning?
- OpenAI Gym
- Deep Learning with PyTorch
- The Cross-Entropy Method
- Tabular Learning and the Bellman Equation
- Deep Q-Networks
- DQN Extensions
- Stocks Trading Using RL
- Policy Gradients – An Alternative
- The Actor-Critic Method
- Asynchronous Advantage Actor-Critic
- Chatbots Training with RL
- Web Navigation
- Continuous Action Space
- Trust Regions – TRPO, PPO, and ACKTR
- Black-Box Optimization in RL
- Beyond Model-Free – Imagination
- AlphaGo Zero

