Diffusion Model Reinforcement Learning