Mesa-optimizer team's submission to the RL hackathon
Used A2C and DQN for Lunar Lander, DQN for CartPole, and TQC for Bipedal Walker.
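
The pairing of environments and algorithms above can be written down as a small lookup table. This is only a sketch: the Gymnasium-style environment IDs (including their version suffixes) and the helper name `algorithms_for` are assumptions made for illustration and are not taken from the submission itself.

```python
# Environment -> algorithms, as listed in the summary above.
# ASSUMPTION: the environment IDs are Gymnasium-style names; the
# submission does not state which environment versions were used.
ALGORITHMS_BY_ENV = {
    "LunarLander-v2": ["A2C", "DQN"],
    "CartPole-v1": ["DQN"],
    "BipedalWalker-v3": ["TQC"],
}


def algorithms_for(env_id):
    """Return the algorithms listed for an environment (hypothetical helper)."""
    # Unknown environments yield an empty list rather than raising.
    return list(ALGORITHMS_BY_ENV.get(env_id, []))


if __name__ == "__main__":
    for env_id in ALGORITHMS_BY_ENV:
        print(env_id, "->", ", ".join(algorithms_for(env_id)))
```

A table like this keeps the environment-to-algorithm choice in one place, so a training script could loop over it instead of hard-coding each combination.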