Dynamic Programming Vs Reinforcement Learning