Reinforcement Learning OpenAI Gym

Introduction to the hackathon Challenge

Teams Formation

Code time

Time's up! Wrap up & submit your solution

Team presentations and winner announcement

Ack-Ack Learn

Deraam

gius_cat

Mario

Using the stable_baselines3 library, we tried to solve the problems proposed in the challenge.
We used a Proximal Policy Optimization (PPO) Model. The Policy we used is a standard MLP.
We tried to change the number of iteration to achieve a better performance.



Reinforcement Learning: CartPole, Lunar Lander and Bipedal Walker

"Ack-Ack Learn team on Reinforcement Learning OpenAI Gym Hackathon"

Team Idea

Submission