What is Data Profiling?
The first approach to data is to know its behaviour. Each variable has a particular distribution, diferent shapes of the curve, diferent central tendency measurements and diferent dispersion. Data profiling process allows us to know the characteristics of each variable. Later, we would know the relationship between each variable and the others.
So, data profiling clarifies the structure, content, relationships, and derivation rules of the data. A good level of statistical knowledge is needed. It needs too a wide experience with data sets and proficience with logical rules. After this, analist can face the complexity of data cleaning process and apply powerfull statistical tools.