Data cleaning

After a dataset has been collected from a source, “data cleaning” are the preliminary operations (“pre processing”) to spot, remove and / or correct errors or other undesirable aspects in the dataset. The processing of the data (for any useful purpose) can then begin.