Maximum Diffusion Reinforcement Learning