Max Diffusion Reinforcement Learning