Reinforcement Learning With Human Feedback Diagram