Reinforcement Learning OpenAI Gym team: NoPolicy