Reinforcement Learning And Optimization