Reinforcement Learning With Human Feedback Diagram On A 2005