Diffusion Guided Language Modeling From Scratch