This example includes the implementations of the following reinforcement learning algorithms:

ES
- Cross Entropy Method (CEM)
- Covariance Matrix Adaptation Evolution Strategy (CMA-ES)
- OpenAI-ES

RL
- Vanilla Policy Gradient (VPG)
- Proximal Policy Optimization (PPO)
- Deep Deterministic Policy Gradients (DDPG)
- Twin Delayed DDPG (TD3)
- Soft Actor-Critic (SAC)

Benchmarks
- ES
- RL