Data Cleansing
Data Cleansing is the process of detecting and correcting (or removing) corrupt or inaccurate records from a record set, table, or database.
Data cleansing is typically an important step in the implementation of a new Asset Management System where the data used in the initial set up of the system comes from a range of sources.
Common Issues
Listed below are a number of common issues that may result in the need for data cleansing:
- Spelling mistakes and alternate names
- Incorrectly categorised items
- Duplicated items
- Inconsistent units. e.g some data entered in metres and some entered in millimetres
- Illegal Characters
Data Cleansing Tips & Tricks
Group & Count
It is often useful to count the number of instances of each value in a particular field. A single instance or a small number of instances of a given value are often an indication of an error.