Single Agent Reinforcement Learning