Statistical Inference In Massive Data Sets