Policy Gradient

2018-11-30 RL - Deep Deterministic Policy Gradient (DDPG)
2018-11-25 RL - Proximal Policy Optimization (PPO)
2018-11-22 RL - Trust Region Policy Optimization (TRPO)
2018-01-06 RL - Policy Gradient