Reinforcement Learning

AnimalAI - Imitation Learning

Implemented a reformed architecture that uses Proximal Policy Optimization for training the agent and uses Behavior Cloning for incorporating expert trajectories.

Policy Gradient Algorithms

Policy gradient algorithms on Lunar lander environment

Linear Inverse RL Algorithms

Implementation of LIRL algorithms on Mountain Car Environment.