Skip to content

Latest commit

 

History

History
116 lines (70 loc) · 4.93 KB

File metadata and controls

116 lines (70 loc) · 4.93 KB

Reinforcement Learning Code with PyTorch

Papers

Algorithms

01. Model-Free Reinforcement Learning

Deep Q-Network (DQN)

Double DQN (DDQN)

Advantage Actor-Critic (A2C)

Asynchronous Advantage Actor-Critic (A3C)

Deep Deterministic Policy Gradient (DDPG)

Truncated Natural Policy Gradient (TNPG)

Trust Region Policy Optimization (TRPO)

TRPO + Generalized Advantage Estimator (GAE)

Proximal Policy Optimization (PPO)

PPO + Generalized Advantage Estimator (GAE)

Soft Actor-Critic (SAC)


02. Inverse Reinforcement Learning

Apprenticeship Learning via Inverse Reinforcement Learning (APP)

Maximum Entropy Inverse Reinforcement Learning (MaxEnt)

Generative Adversarial Imitation Learning (GAIL)

Variational Adversarial Imitation Learning (VAIL)


Learning curve

CartPole

Pendulum

Hopper


Reference