Maximum Diffusion Reinforcement Learning Example