How Bad Is Training On Synthetic Data?