SlideShare a Scribd company logo
1 of 25
Download to read offline
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 1
Unlock Potential
William McKnight
President
McKnight Consulting Group
(214) 514-1444
wmcknight@mcknightcg.com
www.mcknightcg.com
@williammcknight
Measuring Data Quality Return on
Investment
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 2
Today’s Agenda
How to Define the Quality Expectations
How to Profile Data Against the Defined Expectations
How to Measure Data Quality Impact Across Various Thresholds
How to Improve Quality of Data and Improve the Business
47% of data records had at least one critical error. HBR study 2020.
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 3
Theme
• Data Quality is Essential to Business Success
» “Correct” data is a widespread need
• Yet, data quality lacks consistent definition
» You can’t improve what you can’t measure
» Tangible benefits accrue from improved efficacy of the
applications using the data
• To realize value
» define the quality expectations, profile data against the
defined expectations, measure data quality improvement
across various thresholds and improve the quality of data and
improve the business
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 4
Data Quality
► A lack of intolerable defects in the data
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 5
Agenda
Define the Quality Expectations
Profile Data Against the Defined Expectations
Measure Data Quality Impact Across Various
Thresholds
Improve Quality of Data and Improve the
Business
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 6
Consider these business imperatives
Information-based in-store and call center cross- and up-selling
(NEEDS clean customer and product data)
Credit card fraud detection (NEEDS clean customer and
transaction data)
Supply chain efficiencies and just-in-time production capabilities
(NEEDS clean product and location data)
Predictive churn management (NEEDS clean customer and
transaction data)
Many others
These have failed or underperformed because
of incomplete, incorrect, inconsistent data
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 7
Investments in Data Quality
Investments Yield “Cleaner” Data
Business objectives cannot be met without
quality data in support
Data Quality Returns are in the improved
efficacy of projects targeting business
objectives
Data Quality should be an integral part of most
projects
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 8
Iterate Data Observability Throughout
the Environment
ERP App Call Center App DW Analytical App
CRM
0 3 mos. 6 mos. 9 mos. 12 mos. 15 mos.
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 9
Data Quality Rule Categories: 1
Category Example
Referential integrity Order is placed for customer # 123, who exists in
the customer table
Uniqueness Only one customer exists with customer # 123
Cardinality We may expect to find between one and three
addresses for a customer – no more and no less
Subtype/supertype
constructs
A customer can be divorced or not, but only
divorced customers have a value in the ex-spouse
name field
Value reasonability Last-year purchases should not be > $500,000 or < 0
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 10
Data Quality Rule Categories: 2
Consistent Value Sets In a Name field, we would not expect to find certain
characters such as % and $
Formatting No period after middle initial
Data derivation While one system may contain discount amount and
unit price, the discount percentage, a simple
division calculation, could be a unrelated calculation
Completeness Sales data covers each day from beginning to
current date
Correctness William mcnight instead of William McKnight
Conformance to a clean
set of values
In the gender column we would expect to find M, F,
U (unknown)
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 11
Agenda
Define the Quality Expectations
Profile Data Against the Defined Expectations
Measure Data Quality Impact Across Various
Thresholds
Improve Quality of Data and Improve the
Business
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 12
Data Profiling Example
Gender Occurrences Overall Pct.
M 42855 43
F 44583 45
U 5986 6
1 4070 4
2 1264 1
Y 986 1
X 254 0
! 1 0
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 13
Data Quality Scoring
Scoring defines how well your data meets
business expectations
Adherence
Possibilities
Valid values M, F, U:
43%+45%+8%=94%
i.e., for
Gender:
Multiple rules used to determine overall
system score
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 14
Agenda
Define the Quality Expectations
Profile Data Against the Defined Expectations
Measure Data Quality Impact Across Various
Thresholds
Improve Quality of Data and Improve the
Business
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 15
1. Fewer bad contact data
2. Improved customer segmentation
3. Highest Marketing Initiative ROI reached
4. Cleaner data for other applications
How can Improved DQ Improve Initiatives?
Example: Targeted Marketing
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 16
Cost of Poor Data Quality
People X average pay rate for people doing DQ work today
According to research by Sirius Decisions, between 10 percent and 25
percent of B2B marketing database contacts contain critical errors
Records X 20% X $100 = $ Millions/year opportunity
$100 per data record attributed to:
Printing and mailing, emailing bad addresses
Losing disgruntled customers
Taking up extra space with duplicate records
Incorrect marketing segmentation and personalization
Marketing and CRM for duplicate records
On average corporate data grows at 40% per year
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 17
Case Example: Office Products
Retailer Marketing to Prospects in
Database
DQ
Score
Prospects
Reached
Return on
Marketing
Avg.
profit
Return
(prospects X
ROM X avg.
profit)
investment
*
ROI (return-
investment/
investment) *
90 110000 0.04 250 1100000 400000 175.00%
85 105000 0.03 250 787500 390000 101.92%
80 100000 0.025 250 625000 380000 64.47%
*considering only outreach costs, not considering DQ improvements
and better targeting
Higher scores means
more contact gets through
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 18
Case Example: Where was the DQ
investment to go?
Reverse-append existing customers for
addresses
Reverse-append existing customers for
demographics
Purchase additional prospects
Use of multiple and different third-party data
providers (and corresponding de-duplication of
results)
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 19
Agenda
Define the Quality Expectations
Profile Data Against the Defined Expectations
Measure Data Quality Impact Across Various
Thresholds
Improve Quality of Data and Improve the
Business
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 20
Data Quality Improvement
Four Actions to Perform for Data Quality
Screen Data Entry
Quarantine Data to Prevent Improper Use
Report on Quality Violations to Raise
Awareness
Change or Repair Incorrect Data to Conform,
when possible
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 21
Data Collection Systems
Exactly once versus At Least Once
At Least Once
Guarantees Order of Delivery
Apache Kafka/Amazon MSK
Kinesis Data Streams
Does not Guarantee Order of Delivery
SQS in Standard Mode
Kinesis Data Firehose
Exactly Once with Guaranteed Order
SQS in FIFO Mode
Dynamo DB Streams
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 22
Cost/Benefit of Adding Data Quality
Efforts
DQ Score investment *
ROI (return-
investment/
investment) *
additional
cost to
improve DQ
to the level
Revised Project ROI
(return-
(investment+DQ
cost)/ investment+
DQ cost)
99 410000 198.02% 300000 72.10%
90 400000 175.00% 200000 83.33%
85 390000 101.92% 75000 69.35%
80** 380000 64.47% 0 64.47%
*without considering costing for DQ improvements **Default score – will happen without data quality improvements
Best
Value
Proposition
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 23
Summary of the Steps to
Data Quality Success
1. Define the Quality Expectations
2. Profile Data Against Defined Expectations
3. Measure Data Quality Impact Across Various
Thresholds
4. Improve Quality of Data and Improve the
Business
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 24
Recommendations
Data quality can and should have a value proposition
Data quality is never an accident
Consider data quality when considering applications
Measure the level of your data quality
Data quality is a business-driven imperative
Business: definition of quality rules
Business: validation of quality scores
Business: validation of quality actions
Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 25
Measuring Data Quality Return on
Investment
Presented by:
William McKnight
President
McKnight Consulting Group LLC
(214) 514-1444
wmcknight@mcknightcg.com
www.mcknightcg.com
Twitter @williammcknight

More Related Content

What's hot

Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management DATAVERSITY
 
DAS Slides: Data Governance - Combining Data Management with Organizational ...
DAS Slides: Data Governance -  Combining Data Management with Organizational ...DAS Slides: Data Governance -  Combining Data Management with Organizational ...
DAS Slides: Data Governance - Combining Data Management with Organizational ...DATAVERSITY
 
Introduction to Data Governance
Introduction to Data GovernanceIntroduction to Data Governance
Introduction to Data GovernanceJohn Bao Vuu
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureDATAVERSITY
 
Data Management vs. Data Governance Program
Data Management vs. Data Governance ProgramData Management vs. Data Governance Program
Data Management vs. Data Governance ProgramDATAVERSITY
 
LDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
LDM Slides: How Data Modeling Fits into an Overall Enterprise ArchitectureLDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
LDM Slides: How Data Modeling Fits into an Overall Enterprise ArchitectureDATAVERSITY
 
Data Modeling Best Practices - Business & Technical Approaches
Data Modeling Best Practices - Business & Technical ApproachesData Modeling Best Practices - Business & Technical Approaches
Data Modeling Best Practices - Business & Technical ApproachesDATAVERSITY
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best PracticesDATAVERSITY
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best PracticesBoris Otto
 
Emerging Trends in Data Architecture – What’s the Next Big Thing
Emerging Trends in Data Architecture – What’s the Next Big ThingEmerging Trends in Data Architecture – What’s the Next Big Thing
Emerging Trends in Data Architecture – What’s the Next Big ThingDATAVERSITY
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
 
Data Governance for the Executive
Data Governance for the ExecutiveData Governance for the Executive
Data Governance for the ExecutiveDATAVERSITY
 
Business Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected ApproachBusiness Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected ApproachDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Data Management, Metadata Management, and Data Governance – Working Together
Data Management, Metadata Management, and Data Governance – Working TogetherData Management, Metadata Management, and Data Governance – Working Together
Data Management, Metadata Management, and Data Governance – Working TogetherDATAVERSITY
 
RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?DATAVERSITY
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogDATAVERSITY
 

What's hot (20)

Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management
 
DAS Slides: Data Governance - Combining Data Management with Organizational ...
DAS Slides: Data Governance -  Combining Data Management with Organizational ...DAS Slides: Data Governance -  Combining Data Management with Organizational ...
DAS Slides: Data Governance - Combining Data Management with Organizational ...
 
Introduction to Data Governance
Introduction to Data GovernanceIntroduction to Data Governance
Introduction to Data Governance
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
 
Data Management vs. Data Governance Program
Data Management vs. Data Governance ProgramData Management vs. Data Governance Program
Data Management vs. Data Governance Program
 
8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy
 
LDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
LDM Slides: How Data Modeling Fits into an Overall Enterprise ArchitectureLDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
LDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
 
Why data governance is the new buzz?
Why data governance is the new buzz?Why data governance is the new buzz?
Why data governance is the new buzz?
 
Data Modeling Best Practices - Business & Technical Approaches
Data Modeling Best Practices - Business & Technical ApproachesData Modeling Best Practices - Business & Technical Approaches
Data Modeling Best Practices - Business & Technical Approaches
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 
Emerging Trends in Data Architecture – What’s the Next Big Thing
Emerging Trends in Data Architecture – What’s the Next Big ThingEmerging Trends in Data Architecture – What’s the Next Big Thing
Emerging Trends in Data Architecture – What’s the Next Big Thing
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
DMBOK and Data Governance
DMBOK and Data GovernanceDMBOK and Data Governance
DMBOK and Data Governance
 
Data Governance for the Executive
Data Governance for the ExecutiveData Governance for the Executive
Data Governance for the Executive
 
Business Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected ApproachBusiness Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected Approach
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Data Management, Metadata Management, and Data Governance – Working Together
Data Management, Metadata Management, and Data Governance – Working TogetherData Management, Metadata Management, and Data Governance – Working Together
Data Management, Metadata Management, and Data Governance – Working Together
 
RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data Catalog
 

Similar to Measuring Data Quality Return on Investment

Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
CDI-MDMSummit.290213824
CDI-MDMSummit.290213824CDI-MDMSummit.290213824
CDI-MDMSummit.290213824ypai
 
Optimizing Solution Value – Dynamic Data Quality, Governance, and MDM
Optimizing Solution Value – Dynamic Data Quality, Governance, and MDMOptimizing Solution Value – Dynamic Data Quality, Governance, and MDM
Optimizing Solution Value – Dynamic Data Quality, Governance, and MDMPrecisely
 
Maximize ROI of Insurance Digital Transformation Initiatives with Proven Data...
Maximize ROI of Insurance Digital Transformation Initiatives with Proven Data...Maximize ROI of Insurance Digital Transformation Initiatives with Proven Data...
Maximize ROI of Insurance Digital Transformation Initiatives with Proven Data...Precisely
 
Data Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesPrecisely
 
Data Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesPrecisely
 
Turning Big Data into Better Business Outcomes
Turning Big Data into Better Business OutcomesTurning Big Data into Better Business Outcomes
Turning Big Data into Better Business OutcomesCisco Canada
 
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)Denodo
 
Cognitivo - Tackling the enterprise data quality challenge
Cognitivo - Tackling the enterprise data quality challengeCognitivo - Tackling the enterprise data quality challenge
Cognitivo - Tackling the enterprise data quality challengeAlan Hsiao
 
Analytics ROI Best Practices
Analytics ROI Best PracticesAnalytics ROI Best Practices
Analytics ROI Best PracticesDATAVERSITY
 
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Webinar #2 - Transforming Challenges into Opportunities for Credit UnionsWebinar #2 - Transforming Challenges into Opportunities for Credit Unions
Webinar #2 - Transforming Challenges into Opportunities for Credit UnionsDenodo
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality RightDATAVERSITY
 
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Denodo
 
Data-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityData-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityDATAVERSITY
 
Empowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business IntelligenceEmpowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business IntelligenceDATAVERSITY
 
DDMA / T-Mobile: Datakwaliteit
DDMA / T-Mobile: DatakwaliteitDDMA / T-Mobile: Datakwaliteit
DDMA / T-Mobile: DatakwaliteitDDMA
 
Improve Your Data Quality & Eliminate Duplicates
Improve Your Data Quality &  Eliminate DuplicatesImprove Your Data Quality &  Eliminate Duplicates
Improve Your Data Quality & Eliminate DuplicatesData 8
 
Virtualization Adoption: The Secrets of a Successful Journey: A Case Study of...
Virtualization Adoption: The Secrets of a Successful Journey: A Case Study of...Virtualization Adoption: The Secrets of a Successful Journey: A Case Study of...
Virtualization Adoption: The Secrets of a Successful Journey: A Case Study of...David Resnic
 

Similar to Measuring Data Quality Return on Investment (20)

Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
CDI-MDMSummit.290213824
CDI-MDMSummit.290213824CDI-MDMSummit.290213824
CDI-MDMSummit.290213824
 
Optimizing Solution Value – Dynamic Data Quality, Governance, and MDM
Optimizing Solution Value – Dynamic Data Quality, Governance, and MDMOptimizing Solution Value – Dynamic Data Quality, Governance, and MDM
Optimizing Solution Value – Dynamic Data Quality, Governance, and MDM
 
Maximize ROI of Insurance Digital Transformation Initiatives with Proven Data...
Maximize ROI of Insurance Digital Transformation Initiatives with Proven Data...Maximize ROI of Insurance Digital Transformation Initiatives with Proven Data...
Maximize ROI of Insurance Digital Transformation Initiatives with Proven Data...
 
Data Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
 
Data Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
 
Turning Big Data into Better Business Outcomes
Turning Big Data into Better Business OutcomesTurning Big Data into Better Business Outcomes
Turning Big Data into Better Business Outcomes
 
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
 
Cognitivo - Tackling the enterprise data quality challenge
Cognitivo - Tackling the enterprise data quality challengeCognitivo - Tackling the enterprise data quality challenge
Cognitivo - Tackling the enterprise data quality challenge
 
Seagate
SeagateSeagate
Seagate
 
Analytics ROI Best Practices
Analytics ROI Best PracticesAnalytics ROI Best Practices
Analytics ROI Best Practices
 
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Webinar #2 - Transforming Challenges into Opportunities for Credit UnionsWebinar #2 - Transforming Challenges into Opportunities for Credit Unions
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality Right
 
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
 
Data Governance
Data GovernanceData Governance
Data Governance
 
Data-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityData-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data Quality
 
Empowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business IntelligenceEmpowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business Intelligence
 
DDMA / T-Mobile: Datakwaliteit
DDMA / T-Mobile: DatakwaliteitDDMA / T-Mobile: Datakwaliteit
DDMA / T-Mobile: Datakwaliteit
 
Improve Your Data Quality & Eliminate Duplicates
Improve Your Data Quality &  Eliminate DuplicatesImprove Your Data Quality &  Eliminate Duplicates
Improve Your Data Quality & Eliminate Duplicates
 
Virtualization Adoption: The Secrets of a Successful Journey: A Case Study of...
Virtualization Adoption: The Secrets of a Successful Journey: A Case Study of...Virtualization Adoption: The Secrets of a Successful Journey: A Case Study of...
Virtualization Adoption: The Secrets of a Successful Journey: A Case Study of...
 

More from DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...DATAVERSITY
 
Including All Your Mission-Critical Data in Modern Apps and Analytics
Including All Your Mission-Critical Data in Modern Apps and AnalyticsIncluding All Your Mission-Critical Data in Modern Apps and Analytics
Including All Your Mission-Critical Data in Modern Apps and AnalyticsDATAVERSITY
 
Assessing New Database Capabilities – Multi-Model
Assessing New Database Capabilities – Multi-ModelAssessing New Database Capabilities – Multi-Model
Assessing New Database Capabilities – Multi-ModelDATAVERSITY
 

More from DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
 
Including All Your Mission-Critical Data in Modern Apps and Analytics
Including All Your Mission-Critical Data in Modern Apps and AnalyticsIncluding All Your Mission-Critical Data in Modern Apps and Analytics
Including All Your Mission-Critical Data in Modern Apps and Analytics
 
Assessing New Database Capabilities – Multi-Model
Assessing New Database Capabilities – Multi-ModelAssessing New Database Capabilities – Multi-Model
Assessing New Database Capabilities – Multi-Model
 

Recently uploaded

April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 

Recently uploaded (20)

April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 

Measuring Data Quality Return on Investment

  • 1. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 1 Unlock Potential William McKnight President McKnight Consulting Group (214) 514-1444 wmcknight@mcknightcg.com www.mcknightcg.com @williammcknight Measuring Data Quality Return on Investment
  • 2. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 2 Today’s Agenda How to Define the Quality Expectations How to Profile Data Against the Defined Expectations How to Measure Data Quality Impact Across Various Thresholds How to Improve Quality of Data and Improve the Business 47% of data records had at least one critical error. HBR study 2020.
  • 3. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 3 Theme • Data Quality is Essential to Business Success » “Correct” data is a widespread need • Yet, data quality lacks consistent definition » You can’t improve what you can’t measure » Tangible benefits accrue from improved efficacy of the applications using the data • To realize value » define the quality expectations, profile data against the defined expectations, measure data quality improvement across various thresholds and improve the quality of data and improve the business
  • 4. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 4 Data Quality ► A lack of intolerable defects in the data
  • 5. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 5 Agenda Define the Quality Expectations Profile Data Against the Defined Expectations Measure Data Quality Impact Across Various Thresholds Improve Quality of Data and Improve the Business
  • 6. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 6 Consider these business imperatives Information-based in-store and call center cross- and up-selling (NEEDS clean customer and product data) Credit card fraud detection (NEEDS clean customer and transaction data) Supply chain efficiencies and just-in-time production capabilities (NEEDS clean product and location data) Predictive churn management (NEEDS clean customer and transaction data) Many others These have failed or underperformed because of incomplete, incorrect, inconsistent data
  • 7. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 7 Investments in Data Quality Investments Yield “Cleaner” Data Business objectives cannot be met without quality data in support Data Quality Returns are in the improved efficacy of projects targeting business objectives Data Quality should be an integral part of most projects
  • 8. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 8 Iterate Data Observability Throughout the Environment ERP App Call Center App DW Analytical App CRM 0 3 mos. 6 mos. 9 mos. 12 mos. 15 mos.
  • 9. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 9 Data Quality Rule Categories: 1 Category Example Referential integrity Order is placed for customer # 123, who exists in the customer table Uniqueness Only one customer exists with customer # 123 Cardinality We may expect to find between one and three addresses for a customer – no more and no less Subtype/supertype constructs A customer can be divorced or not, but only divorced customers have a value in the ex-spouse name field Value reasonability Last-year purchases should not be > $500,000 or < 0
  • 10. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 10 Data Quality Rule Categories: 2 Consistent Value Sets In a Name field, we would not expect to find certain characters such as % and $ Formatting No period after middle initial Data derivation While one system may contain discount amount and unit price, the discount percentage, a simple division calculation, could be a unrelated calculation Completeness Sales data covers each day from beginning to current date Correctness William mcnight instead of William McKnight Conformance to a clean set of values In the gender column we would expect to find M, F, U (unknown)
  • 11. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 11 Agenda Define the Quality Expectations Profile Data Against the Defined Expectations Measure Data Quality Impact Across Various Thresholds Improve Quality of Data and Improve the Business
  • 12. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 12 Data Profiling Example Gender Occurrences Overall Pct. M 42855 43 F 44583 45 U 5986 6 1 4070 4 2 1264 1 Y 986 1 X 254 0 ! 1 0
  • 13. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 13 Data Quality Scoring Scoring defines how well your data meets business expectations Adherence Possibilities Valid values M, F, U: 43%+45%+8%=94% i.e., for Gender: Multiple rules used to determine overall system score
  • 14. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 14 Agenda Define the Quality Expectations Profile Data Against the Defined Expectations Measure Data Quality Impact Across Various Thresholds Improve Quality of Data and Improve the Business
  • 15. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 15 1. Fewer bad contact data 2. Improved customer segmentation 3. Highest Marketing Initiative ROI reached 4. Cleaner data for other applications How can Improved DQ Improve Initiatives? Example: Targeted Marketing
  • 16. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 16 Cost of Poor Data Quality People X average pay rate for people doing DQ work today According to research by Sirius Decisions, between 10 percent and 25 percent of B2B marketing database contacts contain critical errors Records X 20% X $100 = $ Millions/year opportunity $100 per data record attributed to: Printing and mailing, emailing bad addresses Losing disgruntled customers Taking up extra space with duplicate records Incorrect marketing segmentation and personalization Marketing and CRM for duplicate records On average corporate data grows at 40% per year
  • 17. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 17 Case Example: Office Products Retailer Marketing to Prospects in Database DQ Score Prospects Reached Return on Marketing Avg. profit Return (prospects X ROM X avg. profit) investment * ROI (return- investment/ investment) * 90 110000 0.04 250 1100000 400000 175.00% 85 105000 0.03 250 787500 390000 101.92% 80 100000 0.025 250 625000 380000 64.47% *considering only outreach costs, not considering DQ improvements and better targeting Higher scores means more contact gets through
  • 18. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 18 Case Example: Where was the DQ investment to go? Reverse-append existing customers for addresses Reverse-append existing customers for demographics Purchase additional prospects Use of multiple and different third-party data providers (and corresponding de-duplication of results)
  • 19. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 19 Agenda Define the Quality Expectations Profile Data Against the Defined Expectations Measure Data Quality Impact Across Various Thresholds Improve Quality of Data and Improve the Business
  • 20. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 20 Data Quality Improvement Four Actions to Perform for Data Quality Screen Data Entry Quarantine Data to Prevent Improper Use Report on Quality Violations to Raise Awareness Change or Repair Incorrect Data to Conform, when possible
  • 21. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 21 Data Collection Systems Exactly once versus At Least Once At Least Once Guarantees Order of Delivery Apache Kafka/Amazon MSK Kinesis Data Streams Does not Guarantee Order of Delivery SQS in Standard Mode Kinesis Data Firehose Exactly Once with Guaranteed Order SQS in FIFO Mode Dynamo DB Streams
  • 22. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 22 Cost/Benefit of Adding Data Quality Efforts DQ Score investment * ROI (return- investment/ investment) * additional cost to improve DQ to the level Revised Project ROI (return- (investment+DQ cost)/ investment+ DQ cost) 99 410000 198.02% 300000 72.10% 90 400000 175.00% 200000 83.33% 85 390000 101.92% 75000 69.35% 80** 380000 64.47% 0 64.47% *without considering costing for DQ improvements **Default score – will happen without data quality improvements Best Value Proposition
  • 23. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 23 Summary of the Steps to Data Quality Success 1. Define the Quality Expectations 2. Profile Data Against Defined Expectations 3. Measure Data Quality Impact Across Various Thresholds 4. Improve Quality of Data and Improve the Business
  • 24. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 24 Recommendations Data quality can and should have a value proposition Data quality is never an accident Consider data quality when considering applications Measure the level of your data quality Data quality is a business-driven imperative Business: definition of quality rules Business: validation of quality scores Business: validation of quality actions
  • 25. Copyright © 2021 McKnight Consulting Group, LLC All Rights Reserved – Confidential and Proprietary Slide 25 Measuring Data Quality Return on Investment Presented by: William McKnight President McKnight Consulting Group LLC (214) 514-1444 wmcknight@mcknightcg.com www.mcknightcg.com Twitter @williammcknight