Example Of Dynamic Programming Algorithms For Reinforcement