Small Language Models From Scratch