Reinforcement Learning OpenAI Gym team: Bananas