How SQL Change Automation helps you deliver value faster
Sql saturday 179 kharkov
1. Data Quality Services (DQS)
First steps to Data Mining approach
Konstantin Khomyakov
konstantin.khomyakov@gmail.com
2. Agenda
What is DQS ?
Why use DQS ?
Installation
Knowledge base sources
KB for Cleansing
Matching Policy
Demo
3. What is DQS ?
A set of tools and services that allow data
experts improve Data Quality
Components:
- Cleansing
- Matching
- Profiling
- Monitoring
4. Why Use DQS ?
To allow users with data domain knowledge to
improve the quality of the data
Manually define, match, cleanse
Can ‘learn’
Can incorporate 3rd party data
Can integrate with other data processes (SSIS)
5. Installation DQS
SQL Server 2012
Not installed by default
Must run ‘DQS Server Installer’ post SQL Install
7. KB for Cleansing
Using the DQS KB to do Cleansing
Create or Open a Data Quality Project
Map the DQS KB to the new data
Perform Cleansing
Manage/View Results
Export corrected results
8. Matching Policy
A matching policy consists of matching rules that
asses how well one record matches to another
Specify in the rule whether records values have
to be an exact or similar
Train a policy by running and tuning each rule
separately