Policy-Based Reinforcement Learning Diagram