Reinforcement Learning and Stochastic Optimization