Data Profiling for Beginners with Talend Open Profiler: Data Quality Tutorial
Data profiling and data quality assessment are some of the most important techniques in the data quality management process.
The ability to understand your data and assess its fitness to support the business services your organisation executes is paramount.
In this feature we show you how to create a data profiling process by providing a data quality tutorial that combines:
Free database, free data profiling tool and free pattern analysis tools from Talend, Oracle and Data Quality Pro
A sample data set in various formats from a real property database
A full 21 page tutorial with step-by-step instructions and worked examples that are suitable for all abilities
Alternative Free Data Profiling Tools
UPDATE: If you wish to try free data quality software check out the additional tools available in our (free) Practitioner Pack that is available when you subscribe (for free) to Data Quality Pro.
What does this data profiling tutorial cover?
The focus on the tutorial is to introduce you to some of the basic techniques and tools that are required to build your own data profiling strategy.
Structured approach to data quality
Data profiling is really about building a set of well managed and controlled data quality rules as part of a structured data quality management programme.
This allows your organisation to constantly assess business data and alert your data quality custodians of potential danger points caused by defects.
Free data quality tools
By using the free Talend Open Profiler, the freely downloadable version of Oracle and our own data quality pattern analyser, any member of Data Quality Pro can now create and monitor a wide variety of data quality rules for their own live data.
Multiple benefits
In addition, data profiling is one of the first steps for any data integration or data migration project so members of our sister community Data Migration Pro will also reap great benefits of following this tutorial if they do not yet have a profiling solution in place.
Download this free data profiling tutorial
To access the tutorial just follow these simple 3 steps:
Subscribe to Data Quality Pro (it’s free) to get your free Practitioner Pack
Receive your email containing the link with a password to the Practitioner Pack resources
Scroll down the Practitioner Pack and download the Talend bundle of resources
NOTE: You will need ideally an Oracle or MySQL, DB2, SQL Server database to use the Talend Open Data Profiler. All of these database formats can be downloaded and Oracle/MySQL download instructions are provided in the tutorial.