Reinforcement Learning: CartPole, Lunar Lander and Bipedal Walker

Created by team Ack-Ack Learn on 2022-07-23T14:45:28.697Z

Using the stable_baselines3 library, we tried to solve the problems proposed in the challenge: CartPole, Lunar Lander and Bipedal Walker. We used a Proximal Policy Optimization (PPO) model with a standard multilayer perceptron (MLP) policy, and we varied the number of training iterations to improve performance.
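The setup described above can be sketched roughly as follows. This is a minimal illustration, not the team's actual script: the environment IDs (`CartPole-v1`, `LunarLander-v2`, `BipedalWalker-v3`) and the timestep budgets in `TIMESTEPS` are assumptions, since the write-up does not state them, and `train_and_evaluate` is a hypothetical helper name. The environment IDs may also differ depending on the installed Gym or Gymnasium version.

```python
# Hypothetical training budgets per environment. The write-up only says the
# number of iterations was varied; these values are placeholders to tune.
TIMESTEPS = {
    "CartPole-v1": 50_000,
    "LunarLander-v2": 500_000,
    "BipedalWalker-v3": 1_000_000,
}


def train_and_evaluate(env_id, total_timesteps, n_eval_episodes=10):
    """Train PPO with an MLP policy on env_id and return (mean, std) reward."""
    # Imported inside the function so the configuration above can be
    # inspected without the RL dependencies installed.
    from stable_baselines3 import PPO
    from stable_baselines3.common.evaluation import evaluate_policy

    # "MlpPolicy" selects the library's standard multilayer perceptron policy.
    model = PPO("MlpPolicy", env_id, verbose=0)

    # total_timesteps is the knob that controls how long training runs.
    model.learn(total_timesteps=total_timesteps)

    # Average the episode reward over a few evaluation episodes.
    mean_reward, std_reward = evaluate_policy(
        model, model.get_env(), n_eval_episodes=n_eval_episodes
    )
    return mean_reward, std_reward
```

Calling `train_and_evaluate("CartPole-v1", TIMESTEPS["CartPole-v1"])` would train one agent and report its mean evaluation reward. Repeating the call with different `total_timesteps` values is one simple way to compare training budgets.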