Policy Evaluation In Reinforcement Learning Course