Reinforcement Learning Pytorch Example