Data Cleansing

Data Cleansing is the process of detecting and correcting (or removing) corrupt or inaccurate records from a record set, table, or database.

Data cleansing is typically an important step in the implementation of a new Asset Management System where the data used in the initial set up of the system comes from a range of sources.

Common Issues

Listed below are a number of common issues that may result in the need for data cleansing:

  • Spelling mistakes and alternate names
  • Incorrectly categorised items
  • Duplicated items
  • Inconsistent units. e.g some data entered in metres and some entered in millimetres
  • Illegal Characters

Data Cleansing Tips & Tricks

Group & Count

It is often useful to count the number of instances of each value in a particular field. A single instance or a small number of instances of a given value are often an indication of an error.

Search and Replace

Concatenate

Notepad

Related Pages

External Links & References

  1. Wikipedia
  2. Google Search
d
Unless otherwise stated, the content of this page is licensed under Creative Commons Attribution-ShareAlike 3.0 License