Data Preprocessing For Unstructured Hat