SlideShare uma empresa Scribd logo
1 de 23
Advance Data Quality Management
Basice Overview
Khaled Mosharraf. Msc
mosharrafkhaled@gmx.de
A.K.M Bhalul Haque. M.Sc
b.haque@gmx.de
FH Kiel, Germany
2016
Agenda
• Motivation / Introduction
• Data Quality Definitions
• Foundation of Data Quality
• Data Quality Assessments
• Measuring Data Quality
• DQ-Organisation
• Data Policies
• Data Governance
• DQ Policies
• Data Profiling
Kiel University of Applied Sciences
Introduction
Today is world of heterogeneity.
We have different technologies.
We operate on different platforms.
We have large amount of data being generated
everyday in all sorts of organizations and
Enterprises.
And we do have problems with data.
Kiel University of Applied Sciences
The previous slide we discuss
about introduction part and
data quality definitions.
If you missed it please check
that slide
Kiel University of Applied Sciences
Foundation of Data Quality
I. Data Production System
II. IQ-Dimensions
III. IQ-Categories / IQ-Pattern
Kiel University of Applied Sciences
Maintenance of data quality
• Data quality results from the process of going
through the data and scrubbing it, standardizing it,
and de duplicating records, as well as doing some of
the data enrichment.
Maintain complete data.
Clean up your data by standardizing it using rules.
Use fancy algorithms to detect duplicates. Eg: ICS
and Informatics Computer System.
Avoid entry of duplicate leads and contacts.
Merge existing duplicate records.
Use roles for security.
Kiel University of Applied Sciences
Data Production System
• Data collector
• Data custodain
• Data consummer
Kiel University of Applied Sciences
Data Production System
Kiel University of Applied Sciences
IQ Dimanssion
• Relevance
• Accuracy
• Timellness
• Compliteness
• Coherence
• Format
• Accessibility
• Compatibillity
• Security
• Validity
• Accessibility
• Appropriate Amount of Data
• Believability
• Concise Representation
• Consistent Representation
• Ease of Manipulation
• Free of Error
• Interpretability
• Objectivity
• Relevancy
• Understandability
• Value-Added
Kiel University of Applied Sciences
Information Quality Dimensions
Dimensions
• Accessibility
The extent to which data is available, or easily and quickly
retrievable
• Appropriate Amount of Data
The extent to which the volume of data is appropriate for
the task at hand
• Believability
The extent to which data is regarded as true and credible
• Completeness
The extent to which data is not missing and is of sufficient
breadth and depth for the task at hand
Kiel University of Applied Sciences
• Concise Representation
The extent to which data is compactly represented
• Consistent Representation
The extent to which data is presented in the same
format
• Ease of Manipulation
The extent to which data is easy to manipulate and
apply to different tasks
• Free of Error
The extent to which data is correct and reliable
Kiel University of Applied Sciences
• Interpretability
The extent to which data is in appropriate languages,
symbols, and units, and the definitions are clear
• Objectivity
The extent to which data is unbiased, unprejudiced, and
impartial
• Relevancy
The extent to which data is applicable and helpful for the
task at hand
• Security
The extent to which access to data is restricted
appropriately to maintain its security
Kiel University of Applied Sciences
• Timeliness
The extent to which data is sufficiently up-to-date
for the task at hand
• Understandability
The extent to which data is easily comprehended
• Value-Added
The extent to which data is beneficial and
provides advantages from its use
Kiel University of Applied Sciences
Questions
• How do organisations define data quality?
• What data quality problems arise in
organizations?
• How do organizations identify, analyze, and
resolve data quality problems?
• How do organizations encourage employees to
work on a proactive management of DQ / IQ?
• Are there common data quality patterns?
• Across Organisations
• Across DQ-projects
Kiel University of Applied Sciences
IQ Categories / Patterns
Intrinsic IQ
• Information have quality in their own right
Contextual IQ
• Information quality must be considered within
the context of the task
Accessibility IQ / Representational IQ
• Emphasize the importance of the role of
systems
Kiel University of Applied Sciences
Intrinsic IQ
• Mismatch between several sources of the
“same” data
• “consistency” vs. “accuracy”
• Believability issues
• Poor reputation of sources
• Poor reputation for quality
• Subjective production of data
• Human judgment / knowledge in coding
Kiel University of Applied Sciences
Intrinsic IQ
Kiel University of Applied Sciences
Contextual IQ
Mismatch between information available and what
information is relevant for information consumers
• Missing data –the easy case
• Data bundling and analyzability –the hard case
Issue is aggregation
• Across record (transaction) analysis of data
• e.g. Corporate Actions in banking
• Often across distributed systems
Incompatible, distributed systems (HMO)
Kiel University of Applied Sciences
Contextual IQ
Kiel University of Applied Sciences
Accessability IQ / Representational IQ
Technical Accessibility
• Physical access
• Computing resources
Time to Access / Ease of Access:
• Amount of data
• Privacy, confidentiality
Interpretability and Understandability:
• Coding
Representation and its Analyzability:
• Image and text data
Kiel University of Applied Sciences
Accessability IQ / Representational IQ
Kiel University of Applied Sciences
Data Quality Problems
Kiel University of Applied Sciences
Thank You

Mais conteúdo relacionado

Mais procurados

Automating Data Quality Processes at Reckitt
Automating Data Quality Processes at ReckittAutomating Data Quality Processes at Reckitt
Automating Data Quality Processes at Reckitt
Databricks
 
Data-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityData-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data Quality
DATAVERSITY
 
The data explosion along the care cycle (Dell Healthcare)
The data explosion along the care cycle (Dell Healthcare)The data explosion along the care cycle (Dell Healthcare)
The data explosion along the care cycle (Dell Healthcare)
Eric Van 't Hoff
 
Building a Data Quality Program from Scratch
Building a Data Quality Program from ScratchBuilding a Data Quality Program from Scratch
Building a Data Quality Program from Scratch
dmurph4
 

Mais procurados (20)

Data Quality
Data QualityData Quality
Data Quality
 
Data-Ed Webinar: Data Quality Success Stories
Data-Ed Webinar: Data Quality Success StoriesData-Ed Webinar: Data Quality Success Stories
Data-Ed Webinar: Data Quality Success Stories
 
Data Quality
Data QualityData Quality
Data Quality
 
How to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data QualityHow to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data Quality
 
Data Quality: principles, approaches, and best practices
Data Quality: principles, approaches, and best practicesData Quality: principles, approaches, and best practices
Data Quality: principles, approaches, and best practices
 
Automating Data Quality Processes at Reckitt
Automating Data Quality Processes at ReckittAutomating Data Quality Processes at Reckitt
Automating Data Quality Processes at Reckitt
 
Data Governance Strategies - With Great Power Comes Great Accountability
Data Governance Strategies - With Great Power Comes Great AccountabilityData Governance Strategies - With Great Power Comes Great Accountability
Data Governance Strategies - With Great Power Comes Great Accountability
 
Data Quality in the Banking Industry: Turning Regulatory Compliance into Busi...
Data Quality in the Banking Industry: Turning Regulatory Compliance into Busi...Data Quality in the Banking Industry: Turning Regulatory Compliance into Busi...
Data Quality in the Banking Industry: Turning Regulatory Compliance into Busi...
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
 
DAS Slides: Data Quality Best Practices
DAS Slides: Data Quality Best PracticesDAS Slides: Data Quality Best Practices
DAS Slides: Data Quality Best Practices
 
Data Quality
Data QualityData Quality
Data Quality
 
Data-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityData-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data Quality
 
Data Quality for Non-Data People
Data Quality for Non-Data PeopleData Quality for Non-Data People
Data Quality for Non-Data People
 
Data Quality: A Raising Data Warehousing Concern
Data Quality: A Raising Data Warehousing ConcernData Quality: A Raising Data Warehousing Concern
Data Quality: A Raising Data Warehousing Concern
 
The data explosion along the care cycle (Dell Healthcare)
The data explosion along the care cycle (Dell Healthcare)The data explosion along the care cycle (Dell Healthcare)
The data explosion along the care cycle (Dell Healthcare)
 
Creating an Effective MDM Strategy for Salesforce
Creating an Effective MDM Strategy for SalesforceCreating an Effective MDM Strategy for Salesforce
Creating an Effective MDM Strategy for Salesforce
 
Data quality metrics infographic
Data quality metrics infographicData quality metrics infographic
Data quality metrics infographic
 
Gartner: Master Data Management Functionality
Gartner: Master Data Management FunctionalityGartner: Master Data Management Functionality
Gartner: Master Data Management Functionality
 
Building a Data Quality Program from Scratch
Building a Data Quality Program from ScratchBuilding a Data Quality Program from Scratch
Building a Data Quality Program from Scratch
 
Data Lifecycle Management
Data Lifecycle ManagementData Lifecycle Management
Data Lifecycle Management
 

Semelhante a Foundation of data quality

Sharon Dawes (CTG Albany) Open data quality: a practical view
Sharon Dawes (CTG Albany) Open data quality: a practical viewSharon Dawes (CTG Albany) Open data quality: a practical view
Sharon Dawes (CTG Albany) Open data quality: a practical view
Open City Foundation
 
Data discovery and sharing at UCLH
Data discovery and sharing at UCLHData discovery and sharing at UCLH
Data discovery and sharing at UCLH
Jisc
 
Data presentation and transfer
Data presentation and transferData presentation and transfer
Data presentation and transfer
Iyad Abou Rabii
 

Semelhante a Foundation of data quality (20)

Data-Ed Webinar: Data Quality Success Stories
Data-Ed Webinar: Data Quality Success StoriesData-Ed Webinar: Data Quality Success Stories
Data-Ed Webinar: Data Quality Success Stories
 
Data Profiling: The First Step to Big Data Quality
Data Profiling: The First Step to Big Data QualityData Profiling: The First Step to Big Data Quality
Data Profiling: The First Step to Big Data Quality
 
Architecting Academic Intelligence
Architecting Academic IntelligenceArchitecting Academic Intelligence
Architecting Academic Intelligence
 
Sharon Dawes (CTG Albany) Open data quality: a practical view
Sharon Dawes (CTG Albany) Open data quality: a practical viewSharon Dawes (CTG Albany) Open data quality: a practical view
Sharon Dawes (CTG Albany) Open data quality: a practical view
 
Survival Guide: Taming the Data Quality Beast
Survival Guide: Taming the Data Quality BeastSurvival Guide: Taming the Data Quality Beast
Survival Guide: Taming the Data Quality Beast
 
Crowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
Crowdsourcing Approaches to Big Data Curation - Rio Big Data MeetupCrowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
Crowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
 
Transform Your Downstream Cloud Analytics with Data Quality 
Transform Your Downstream Cloud Analytics with Data Quality Transform Your Downstream Cloud Analytics with Data Quality 
Transform Your Downstream Cloud Analytics with Data Quality 
 
Foundational Strategies for Trust in Big Data Part 2: Understanding Your Data
Foundational Strategies for Trust in Big Data Part 2: Understanding Your DataFoundational Strategies for Trust in Big Data Part 2: Understanding Your Data
Foundational Strategies for Trust in Big Data Part 2: Understanding Your Data
 
Trendspotting: Helping you make sense of large information sources
Trendspotting: Helping you make sense of large information sourcesTrendspotting: Helping you make sense of large information sources
Trendspotting: Helping you make sense of large information sources
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)
 
Ensuring data quality
Ensuring data qualityEnsuring data quality
Ensuring data quality
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing
 
Managing data responsibly to enable research interity
Managing data responsibly to enable research interityManaging data responsibly to enable research interity
Managing data responsibly to enable research interity
 
Data discovery and sharing at UCLH
Data discovery and sharing at UCLHData discovery and sharing at UCLH
Data discovery and sharing at UCLH
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Data presentation and transfer
Data presentation and transferData presentation and transfer
Data presentation and transfer
 
Chapter 4 Organizational Aspects of Data Management.ppt
Chapter 4 Organizational Aspects of Data Management.pptChapter 4 Organizational Aspects of Data Management.ppt
Chapter 4 Organizational Aspects of Data Management.ppt
 
Informs Conference, Huntington Beach
Informs Conference, Huntington BeachInforms Conference, Huntington Beach
Informs Conference, Huntington Beach
 
New Paradigm for Ensuring and Improving Data Quality and Usability
New Paradigm for Ensuring and Improving Data Quality and UsabilityNew Paradigm for Ensuring and Improving Data Quality and Usability
New Paradigm for Ensuring and Improving Data Quality and Usability
 

Mais de Khaled Mosharraf

Mais de Khaled Mosharraf (6)

PCI DSS introduction by khaled mosharraf,
PCI DSS introduction by khaled mosharraf,PCI DSS introduction by khaled mosharraf,
PCI DSS introduction by khaled mosharraf,
 
Pixel Bar Charts A New Technique for Visualizing Large Multi-Attribute Data S...
Pixel Bar Charts A New Technique for Visualizing Large Multi-Attribute Data S...Pixel Bar Charts A New Technique for Visualizing Large Multi-Attribute Data S...
Pixel Bar Charts A New Technique for Visualizing Large Multi-Attribute Data S...
 
Open ssl heart bleed weakness.
Open ssl heart bleed weakness.Open ssl heart bleed weakness.
Open ssl heart bleed weakness.
 
Six sigma
Six sigmaSix sigma
Six sigma
 
Introduction to anonymity network tor
Introduction to anonymity network torIntroduction to anonymity network tor
Introduction to anonymity network tor
 
Beginners Node.js
Beginners Node.jsBeginners Node.js
Beginners Node.js
 

Último

Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
cnajjemba
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
vexqp
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
vexqp
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 

Último (20)

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
 
Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATIONCapstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 

Foundation of data quality

  • 1. Advance Data Quality Management Basice Overview Khaled Mosharraf. Msc mosharrafkhaled@gmx.de A.K.M Bhalul Haque. M.Sc b.haque@gmx.de FH Kiel, Germany 2016
  • 2. Agenda • Motivation / Introduction • Data Quality Definitions • Foundation of Data Quality • Data Quality Assessments • Measuring Data Quality • DQ-Organisation • Data Policies • Data Governance • DQ Policies • Data Profiling Kiel University of Applied Sciences
  • 3. Introduction Today is world of heterogeneity. We have different technologies. We operate on different platforms. We have large amount of data being generated everyday in all sorts of organizations and Enterprises. And we do have problems with data. Kiel University of Applied Sciences
  • 4. The previous slide we discuss about introduction part and data quality definitions. If you missed it please check that slide Kiel University of Applied Sciences
  • 5. Foundation of Data Quality I. Data Production System II. IQ-Dimensions III. IQ-Categories / IQ-Pattern Kiel University of Applied Sciences
  • 6. Maintenance of data quality • Data quality results from the process of going through the data and scrubbing it, standardizing it, and de duplicating records, as well as doing some of the data enrichment. Maintain complete data. Clean up your data by standardizing it using rules. Use fancy algorithms to detect duplicates. Eg: ICS and Informatics Computer System. Avoid entry of duplicate leads and contacts. Merge existing duplicate records. Use roles for security. Kiel University of Applied Sciences
  • 7. Data Production System • Data collector • Data custodain • Data consummer Kiel University of Applied Sciences
  • 8. Data Production System Kiel University of Applied Sciences
  • 9. IQ Dimanssion • Relevance • Accuracy • Timellness • Compliteness • Coherence • Format • Accessibility • Compatibillity • Security • Validity • Accessibility • Appropriate Amount of Data • Believability • Concise Representation • Consistent Representation • Ease of Manipulation • Free of Error • Interpretability • Objectivity • Relevancy • Understandability • Value-Added Kiel University of Applied Sciences
  • 10. Information Quality Dimensions Dimensions • Accessibility The extent to which data is available, or easily and quickly retrievable • Appropriate Amount of Data The extent to which the volume of data is appropriate for the task at hand • Believability The extent to which data is regarded as true and credible • Completeness The extent to which data is not missing and is of sufficient breadth and depth for the task at hand Kiel University of Applied Sciences
  • 11. • Concise Representation The extent to which data is compactly represented • Consistent Representation The extent to which data is presented in the same format • Ease of Manipulation The extent to which data is easy to manipulate and apply to different tasks • Free of Error The extent to which data is correct and reliable Kiel University of Applied Sciences
  • 12. • Interpretability The extent to which data is in appropriate languages, symbols, and units, and the definitions are clear • Objectivity The extent to which data is unbiased, unprejudiced, and impartial • Relevancy The extent to which data is applicable and helpful for the task at hand • Security The extent to which access to data is restricted appropriately to maintain its security Kiel University of Applied Sciences
  • 13. • Timeliness The extent to which data is sufficiently up-to-date for the task at hand • Understandability The extent to which data is easily comprehended • Value-Added The extent to which data is beneficial and provides advantages from its use Kiel University of Applied Sciences
  • 14. Questions • How do organisations define data quality? • What data quality problems arise in organizations? • How do organizations identify, analyze, and resolve data quality problems? • How do organizations encourage employees to work on a proactive management of DQ / IQ? • Are there common data quality patterns? • Across Organisations • Across DQ-projects Kiel University of Applied Sciences
  • 15. IQ Categories / Patterns Intrinsic IQ • Information have quality in their own right Contextual IQ • Information quality must be considered within the context of the task Accessibility IQ / Representational IQ • Emphasize the importance of the role of systems Kiel University of Applied Sciences
  • 16. Intrinsic IQ • Mismatch between several sources of the “same” data • “consistency” vs. “accuracy” • Believability issues • Poor reputation of sources • Poor reputation for quality • Subjective production of data • Human judgment / knowledge in coding Kiel University of Applied Sciences
  • 17. Intrinsic IQ Kiel University of Applied Sciences
  • 18. Contextual IQ Mismatch between information available and what information is relevant for information consumers • Missing data –the easy case • Data bundling and analyzability –the hard case Issue is aggregation • Across record (transaction) analysis of data • e.g. Corporate Actions in banking • Often across distributed systems Incompatible, distributed systems (HMO) Kiel University of Applied Sciences
  • 19. Contextual IQ Kiel University of Applied Sciences
  • 20. Accessability IQ / Representational IQ Technical Accessibility • Physical access • Computing resources Time to Access / Ease of Access: • Amount of data • Privacy, confidentiality Interpretability and Understandability: • Coding Representation and its Analyzability: • Image and text data Kiel University of Applied Sciences
  • 21. Accessability IQ / Representational IQ Kiel University of Applied Sciences
  • 22. Data Quality Problems Kiel University of Applied Sciences