Cleaning and Learning Over Dirty Tabular Data