Varshil Gandhi
Reinforcement Learning
AnimalAI - Imitation Learning
Implemented a modified architecture that trains the agent with Proximal Policy Optimization (PPO) and incorporates expert trajectories through Behavior Cloning.
Policy Gradient Algorithms
Implementation of policy gradient algorithms on the Lunar Lander environment.
Linear Inverse RL Algorithms
Implementation of linear inverse reinforcement learning (LIRL) algorithms on the Mountain Car environment.