Big Data Exercise Python GitHub Repo