Testing Data Pipelines With Apache