Data Parallelism How To Train Deep Learning Models On Multiple Gpus