Spark Drop Duplicates Pyspark