Algorithm For Reinforcement Learning