Interior Point Method Reinforcement Learning Diagram