Interior Point Method Reinforcement Learning