Scalable Diffusion Models With Transformers Dit