State Value Function Reinforcement Learning