SlideShare a Scribd company logo
1 of 14
Data Enrichment: The
Key to Turbocharging
your AI/ML Data
Workflow
Tim McKenzie | Director, Solution Architecture
1
Location and Data are solving real-world challenges
in a complex, digital economy
2
• Underwriting
• Risk Accumulation
• Catastrophe Modelling
• Claims Processing
• Fraud analysis
• Customer Insights
• Network and coverage
planning
• Opportunity analysis
• Location-based
marketing & advertising
• Asset management
INSURANCE TELECOMMUNICATIONS
• Citizen communications
• Service optimization
• Election operations
• Census operations
• Emergency response
and management
• Home search
• Data cleaning
• Data preparation
• Automated valuations
• Geotargeting
• Audience profile creation
• Mobile marketing &
advertising
• Geofence campaigns
GOVERNMENT REAL ESTATE AD TECH
• Retail location analysis
• Location-based
marketing & advertising
• Store finder
• Service area analysis
RETAIL
• Address data capture
• Customer insight
• Reduce abandonments
• Logistics and delivery
• Location-based
marketing & advertising
ECOMMERCE
• Mortgage processing
• Customer Insight
• Master Data
Management
• Financial crimes
and compliance
• Branch location analytics
FINANCIAL SERVICES
Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
Location data challenges
• Location is Messy: Addresses, Lat/Long,
Shapes, Lines, Formats
• Complexity of Joining Location Based Data
Sources (3rd Party and Internal)
• Data Sourcing Challenges: Many Providers,
Many Formats, Many Pricing, and Licensing
Differences
• Global Extensibility: Data Sources Tend to
Be Regional Yet Use Cases are Often
Global
• Need to Identify and Process Multi-Family
and Condo Properties
• De-centralized repositories of data
• Complex properties can often have multiple
valid addresses, parcels, and buildings.
• Legal descriptions in variety of format
leading to discrepancy, inefficiencies, errors,
and non-compliance
3
“For every minute spent in
organizing, an hour is earned.”
Benjamin Franklin
Inventor, Statesman, Insurer
Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
Data prep slows data science
3%
19%
9%
4%
5%
What data
scientists spend
the most time
doing
Building datasets
Cleaning and organizing data
Collecting datasets
Mining data for patterns
Refining algorithms
Other
accounts for about 80%
of the work of data
scientists
4 Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
Location enabling strategies for data analytics
03.
Analyze
Apply data science at
scale to gain a
competitive advantage
02.
Enrich
Leverage trusted ID to
join massive amounts of
your own and 3rd party
data sources
01.
Organize
Assign a trusted ID that is
unique and persistent to
each address
5 Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
Fast, easy, and consistent data enrichment
6
Precisely’s Geo Addressing with hyper-accurate Master Location Data (MLD) reference data
• Belgium & Luxembourg
• Canada
• Finland
• France
• Germany
• Great Britain
• Ireland
• Netherlands
• Sweden
• Singapore
• United States
• More coming soon!
International
Coverage
Data
Sources
• Postal Authorities
• Government
datasets: local city,
county, and state
• Global Vendors
• Local Players
• Open Sources
• Proprietary
Sources
• Largest & Best available
• Unparalleled &
• Parent-child relationship,
• Unique and Persistent Identifier,
• Multi-sourced,
• Simplify data enrichment process,
MLD Attributes
Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
Cloud-based location analytics technology
7
Spatial
Functions
30+ Common
Spatial Processes
Global
Geocoding
Forward & Reverse
Global Geocoding
and Trusted ID
Global
Addressing
Validate,
standardize and
parse global
addresses
Global Tax
Jurisdiction
How do extreme
weather events
affect the
“creditworthiness”
of my portfolio?
Map
Visualization
What alternative
data helps me
better understand
investment
opportunities?
Global Street
Routing
Where are my
customers and
how do they want
to interact with
me?
Data Enrichment – A global product portfolio
Addresses & Property
Verified and validated address and
property data for map display and
analytics
Boundaries
Administrative, community, and
industry-specific boundaries for data
enrichment and territory analysis
Demographics
Demographic and consumer context
data for better understanding people
and behavior
Points of Interest
Detailed business, leisure, and
geographic features for location
and competitive intelligence
Streets
Robust street-level data for mapping,
analysis, routing, and geocoding
Risk
Natural hazard boundaries related to
flood, fire, earthquakes, and weather
Expertly curated datasets containing thousands of attributes for faster, confident decisions
8 Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
Uniquely positioned to address data enrichment needs
Global coverage location enrichment data. Our portfolio includes:
• 400+ datasets
• 250+ countries and territories
• 100s of millions of data points
Datasets that are interoperable and are managed to quality standard, with consistent documentation, and
support e.g.
• Property Graph
• Market and Community Link
Ability to enrich with dynamic data (Dynamic Weather and Dynamic Demographics)
• Data that includes time as a dimension
• Creating insights from data that is updated at regular and short time intervals (e.g. 5 min)
Data experience through deep-domain expertise
• Adding data through, development, partnerships, and acquisitions
Best-in-class addressing and property datasets with a unique and persistent ID
• Link Precisely and customer address, buildings, demographics, risk, and more data using the PreciselyID,
a unique and persistent location identifier
9 Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
Understanding the
data challenge
10
• Accessing the right raw data
• Keeping up with continuously changing data feeds
• Building features from raw data
• Combining features into training data
• Calculating and serving features in production
• Monitoring features in production
Key data challenges that organizations
face when productionizing ML systems
Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
Location-enabled analytics
Bank Branch & ATM
Call Center/ Web
Customers by Product
Commercial & Mortgage
Active Mortgages
Historical Defaults
Geocoding and location
intelligence capabilities to
organize and enrich your data
Financial Transactions
All of your sources
Any structure
or frequency
Analytics capabilities for
any use case or persona
Ad Hoc Data Science
Low-cost, rapid experimentation with
new data and models.
Explainable Machine Learning
High volume, fine-grained analysis at scale
served in the tightest of service windows.
BI Reporting & Dashboarding
Power real-time dashboarding directly,
or feed data to a data warehouse for
high-concurrency reporting.
Real-time Applications
Provide real-time data to downstream
applications or power applications via APIs.
PreciselyID
ADMIN
BOUNDARIES
BANK DEPOSITS
MOBILE
MOVEMENT
WEATHER
EVENTS
HAZARD &
RISK DATA
AMENITIES &
COMPETITION
EVERY US/CAN
ADDRESS
BUSINESS
LOCATIONS
PROPERTY
ATTRIBUTES
SCHOOLS &
NEIGHBORHOODS
POPULATION
DEMOGRAPHICS
PARCELS &
BUILDINGS
Analytics Platform
What is a “feature-based”
architecture?
12
A feature store is an ML-specific data system that:
• Runs data pipelines that transform raw data into
feature values
• Stores and manages the feature data itself, and
• Serves feature data consistently for training and
inference purposes
A feature is data used as an input
signal to a predictive model
Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
13
Processing
Storage
Inputs
Location specific records Shape files Streaming records
Address Fabric
Analytics
Processing
• Model outputs
• Scores
• Computed columns
• Analysis outcome
Batch Geocoding
with the Operational
Addressing SDKs
• Vaildate input addresses
• Validate other data
• Locate addresses
• Match inputs
• Assign PreciselyID
• Relate data around
PrecisleyID
Batch Spatial
Processing
with the Location
Intelligence SDK
• Flatten shape files
• Compute PIP
• Compute D2P, D2L
• Compute basic scores
• Generate geohash
• Relate data around geohash
(where application)
Realtime Processing
with the Precisely SDKs
• Operational Addressing APIs
• Assign PreciselyID
• Generate geohash
• Relate data
Message Bus
Feature Store
In-stream Analytics Layer
Model outputs, scores, computed columns,
analysis outcomes
PrecisleyID Address
P0000MK1IAAD 287 E 300 S. Provo, UT 84606
P0000MK1DPRD 410 N University Ave. Provo, UT 84601
Vendor
data files
Customer Loyalty Records
Equipment Inventories
Franchise
Zones
Pricing Delivery
Territories
Mobile Trace
Data
POS/IOT
Data
Administration, Governance, Security, Connectivity, Schema, Catalog
Model
Training
EDW
precisely
Data subscriptions
with PreciselyID
PrecisleyID Address Name Type Score Location MICode PointCode DemoRgn
P0000MK1IAAD 287 E 300 S. Provo, UT 84606 Empas LLC REST 91.529 UT108 10020100 101067669 8926
P0000MK1DPRD 410 N University Ave. Provo, UT 84601 THAI HUT REST 65.981 UT108 10020100 100854441 4144
…. ….. ….. ….. ….. …. …. ….. ….
Thank you
Tim McKenzie
Tim.McKenzie@precisely.com
Phone: 678-428-1770

More Related Content

Similar to Learn How to Turbocharge Your AI/ML Data Workflows with Data Enrichment

How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewDenodo
 
(Data) Integrity Matters: Four Ways You Can Build Trust in Your Data
(Data) Integrity Matters: Four Ways You Can Build Trust in Your Data(Data) Integrity Matters: Four Ways You Can Build Trust in Your Data
(Data) Integrity Matters: Four Ways You Can Build Trust in Your DataPrecisely
 
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big HaystackBig Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big HaystackPrecisely
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DATAVERSITY
 
Do You Trust Your Machine Learning Outcomes?
 Do You Trust Your Machine Learning Outcomes?  Do You Trust Your Machine Learning Outcomes?
Do You Trust Your Machine Learning Outcomes? Precisely
 
Deliveinrg explainable AI
Deliveinrg explainable AIDeliveinrg explainable AI
Deliveinrg explainable AIGary Allemann
 
Unlocking the Full Potential of Your Telecom Data with Data Integrity
Unlocking the Full Potential of Your Telecom Data with Data IntegrityUnlocking the Full Potential of Your Telecom Data with Data Integrity
Unlocking the Full Potential of Your Telecom Data with Data IntegrityPrecisely
 
Modernizing Integration with Data Virtualization
Modernizing Integration with Data VirtualizationModernizing Integration with Data Virtualization
Modernizing Integration with Data VirtualizationDenodo
 
Enrich Your Data Your Way
Enrich Your Data Your WayEnrich Your Data Your Way
Enrich Your Data Your WayPrecisely
 
The New Trillium DQ: Big Data Insights When and Where You Need Them
The New Trillium DQ: Big Data Insights When and Where You Need ThemThe New Trillium DQ: Big Data Insights When and Where You Need Them
The New Trillium DQ: Big Data Insights When and Where You Need ThemPrecisely
 
Accelerate Confident Decision-Making with Data Enrichment
Accelerate Confident Decision-Making with Data EnrichmentAccelerate Confident Decision-Making with Data Enrichment
Accelerate Confident Decision-Making with Data EnrichmentPrecisely
 
Liberate Legacy Data Sources with Precisely and Databricks
Liberate Legacy Data Sources with Precisely and DatabricksLiberate Legacy Data Sources with Precisely and Databricks
Liberate Legacy Data Sources with Precisely and DatabricksPrecisely
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationDenodo
 
Then & Now: Strategic Considerations for Data Quality
Then & Now: Strategic Considerations for Data QualityThen & Now: Strategic Considerations for Data Quality
Then & Now: Strategic Considerations for Data QualityPrecisely
 
Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Neo4j
 
Using ML and Azure to improve Customer Lifetime Value
Using ML and Azure to improve Customer Lifetime ValueUsing ML and Azure to improve Customer Lifetime Value
Using ML and Azure to improve Customer Lifetime ValueNavin Albert
 
Unlock Data driven Insights in Databricks Using Location Intelligence
Unlock Data driven Insights in Databricks Using Location IntelligenceUnlock Data driven Insights in Databricks Using Location Intelligence
Unlock Data driven Insights in Databricks Using Location IntelligencePrecisely
 
Die Big Data Fabric als Enabler für Machine Learning & AI
Die Big Data Fabric als Enabler für Machine Learning & AIDie Big Data Fabric als Enabler für Machine Learning & AI
Die Big Data Fabric als Enabler für Machine Learning & AIDenodo
 
How to Make Complex Spatial Processing Simple
How to Make Complex Spatial Processing SimpleHow to Make Complex Spatial Processing Simple
How to Make Complex Spatial Processing SimplePrecisely
 

Similar to Learn How to Turbocharge Your AI/ML Data Workflows with Data Enrichment (20)

I
II
I
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 View
 
(Data) Integrity Matters: Four Ways You Can Build Trust in Your Data
(Data) Integrity Matters: Four Ways You Can Build Trust in Your Data(Data) Integrity Matters: Four Ways You Can Build Trust in Your Data
(Data) Integrity Matters: Four Ways You Can Build Trust in Your Data
 
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big HaystackBig Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
 
Do You Trust Your Machine Learning Outcomes?
 Do You Trust Your Machine Learning Outcomes?  Do You Trust Your Machine Learning Outcomes?
Do You Trust Your Machine Learning Outcomes?
 
Deliveinrg explainable AI
Deliveinrg explainable AIDeliveinrg explainable AI
Deliveinrg explainable AI
 
Unlocking the Full Potential of Your Telecom Data with Data Integrity
Unlocking the Full Potential of Your Telecom Data with Data IntegrityUnlocking the Full Potential of Your Telecom Data with Data Integrity
Unlocking the Full Potential of Your Telecom Data with Data Integrity
 
Modernizing Integration with Data Virtualization
Modernizing Integration with Data VirtualizationModernizing Integration with Data Virtualization
Modernizing Integration with Data Virtualization
 
Enrich Your Data Your Way
Enrich Your Data Your WayEnrich Your Data Your Way
Enrich Your Data Your Way
 
The New Trillium DQ: Big Data Insights When and Where You Need Them
The New Trillium DQ: Big Data Insights When and Where You Need ThemThe New Trillium DQ: Big Data Insights When and Where You Need Them
The New Trillium DQ: Big Data Insights When and Where You Need Them
 
Accelerate Confident Decision-Making with Data Enrichment
Accelerate Confident Decision-Making with Data EnrichmentAccelerate Confident Decision-Making with Data Enrichment
Accelerate Confident Decision-Making with Data Enrichment
 
Liberate Legacy Data Sources with Precisely and Databricks
Liberate Legacy Data Sources with Precisely and DatabricksLiberate Legacy Data Sources with Precisely and Databricks
Liberate Legacy Data Sources with Precisely and Databricks
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
Then & Now: Strategic Considerations for Data Quality
Then & Now: Strategic Considerations for Data QualityThen & Now: Strategic Considerations for Data Quality
Then & Now: Strategic Considerations for Data Quality
 
Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017
 
Using ML and Azure to improve Customer Lifetime Value
Using ML and Azure to improve Customer Lifetime ValueUsing ML and Azure to improve Customer Lifetime Value
Using ML and Azure to improve Customer Lifetime Value
 
Unlock Data driven Insights in Databricks Using Location Intelligence
Unlock Data driven Insights in Databricks Using Location IntelligenceUnlock Data driven Insights in Databricks Using Location Intelligence
Unlock Data driven Insights in Databricks Using Location Intelligence
 
Die Big Data Fabric als Enabler für Machine Learning & AI
Die Big Data Fabric als Enabler für Machine Learning & AIDie Big Data Fabric als Enabler für Machine Learning & AI
Die Big Data Fabric als Enabler für Machine Learning & AI
 
How to Make Complex Spatial Processing Simple
How to Make Complex Spatial Processing SimpleHow to Make Complex Spatial Processing Simple
How to Make Complex Spatial Processing Simple
 

More from Precisely

Zukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter MassendatenZukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter MassendatenPrecisely
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Crucial Considerations for AI-ready Data.pdf
Crucial Considerations for AI-ready Data.pdfCrucial Considerations for AI-ready Data.pdf
Crucial Considerations for AI-ready Data.pdfPrecisely
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10Precisely
 
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Precisely
 
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...Precisely
 
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fTestjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fPrecisely
 
Data Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity TrendsData Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity TrendsPrecisely
 
AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarPrecisely
 
Optimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAPOptimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAPPrecisely
 
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige InvestitionenSAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige InvestitionenPrecisely
 
Automatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIsAutomatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIsPrecisely
 
Moving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyMoving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyPrecisely
 
Effective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to KnowEffective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to KnowPrecisely
 
Automate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center ExcellenceAutomate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center ExcellencePrecisely
 
5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation Management5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation ManagementPrecisely
 
Unlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter TomorrowUnlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter TomorrowPrecisely
 
Navigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar DeckNavigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar DeckPrecisely
 
Mainframe Sort Operations: Gaining the Insights You Need for Peak Performance
Mainframe Sort Operations: Gaining the Insights You Need for Peak PerformanceMainframe Sort Operations: Gaining the Insights You Need for Peak Performance
Mainframe Sort Operations: Gaining the Insights You Need for Peak PerformancePrecisely
 

More from Precisely (20)

Zukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter MassendatenZukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter Massendaten
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Crucial Considerations for AI-ready Data.pdf
Crucial Considerations for AI-ready Data.pdfCrucial Considerations for AI-ready Data.pdf
Crucial Considerations for AI-ready Data.pdf
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10
 
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
 
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
 
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fTestjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
 
Data Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity TrendsData Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity Trends
 
AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity Webinar
 
Optimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAPOptimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAP
 
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige InvestitionenSAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
 
Automatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIsAutomatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIs
 
Moving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyMoving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and Precisely
 
Effective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to KnowEffective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to Know
 
Automate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center ExcellenceAutomate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center Excellence
 
5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation Management5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation Management
 
Unlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter TomorrowUnlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter Tomorrow
 
Navigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar DeckNavigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar Deck
 
Mainframe Sort Operations: Gaining the Insights You Need for Peak Performance
Mainframe Sort Operations: Gaining the Insights You Need for Peak PerformanceMainframe Sort Operations: Gaining the Insights You Need for Peak Performance
Mainframe Sort Operations: Gaining the Insights You Need for Peak Performance
 

Recently uploaded

Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 

Recently uploaded (20)

DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 

Learn How to Turbocharge Your AI/ML Data Workflows with Data Enrichment

  • 1. Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow Tim McKenzie | Director, Solution Architecture 1
  • 2. Location and Data are solving real-world challenges in a complex, digital economy 2 • Underwriting • Risk Accumulation • Catastrophe Modelling • Claims Processing • Fraud analysis • Customer Insights • Network and coverage planning • Opportunity analysis • Location-based marketing & advertising • Asset management INSURANCE TELECOMMUNICATIONS • Citizen communications • Service optimization • Election operations • Census operations • Emergency response and management • Home search • Data cleaning • Data preparation • Automated valuations • Geotargeting • Audience profile creation • Mobile marketing & advertising • Geofence campaigns GOVERNMENT REAL ESTATE AD TECH • Retail location analysis • Location-based marketing & advertising • Store finder • Service area analysis RETAIL • Address data capture • Customer insight • Reduce abandonments • Logistics and delivery • Location-based marketing & advertising ECOMMERCE • Mortgage processing • Customer Insight • Master Data Management • Financial crimes and compliance • Branch location analytics FINANCIAL SERVICES Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
  • 3. Location data challenges • Location is Messy: Addresses, Lat/Long, Shapes, Lines, Formats • Complexity of Joining Location Based Data Sources (3rd Party and Internal) • Data Sourcing Challenges: Many Providers, Many Formats, Many Pricing, and Licensing Differences • Global Extensibility: Data Sources Tend to Be Regional Yet Use Cases are Often Global • Need to Identify and Process Multi-Family and Condo Properties • De-centralized repositories of data • Complex properties can often have multiple valid addresses, parcels, and buildings. • Legal descriptions in variety of format leading to discrepancy, inefficiencies, errors, and non-compliance 3 “For every minute spent in organizing, an hour is earned.” Benjamin Franklin Inventor, Statesman, Insurer Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
  • 4. Data prep slows data science 3% 19% 9% 4% 5% What data scientists spend the most time doing Building datasets Cleaning and organizing data Collecting datasets Mining data for patterns Refining algorithms Other accounts for about 80% of the work of data scientists 4 Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
  • 5. Location enabling strategies for data analytics 03. Analyze Apply data science at scale to gain a competitive advantage 02. Enrich Leverage trusted ID to join massive amounts of your own and 3rd party data sources 01. Organize Assign a trusted ID that is unique and persistent to each address 5 Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
  • 6. Fast, easy, and consistent data enrichment 6 Precisely’s Geo Addressing with hyper-accurate Master Location Data (MLD) reference data • Belgium & Luxembourg • Canada • Finland • France • Germany • Great Britain • Ireland • Netherlands • Sweden • Singapore • United States • More coming soon! International Coverage Data Sources • Postal Authorities • Government datasets: local city, county, and state • Global Vendors • Local Players • Open Sources • Proprietary Sources • Largest & Best available • Unparalleled & • Parent-child relationship, • Unique and Persistent Identifier, • Multi-sourced, • Simplify data enrichment process, MLD Attributes Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
  • 7. Cloud-based location analytics technology 7 Spatial Functions 30+ Common Spatial Processes Global Geocoding Forward & Reverse Global Geocoding and Trusted ID Global Addressing Validate, standardize and parse global addresses Global Tax Jurisdiction How do extreme weather events affect the “creditworthiness” of my portfolio? Map Visualization What alternative data helps me better understand investment opportunities? Global Street Routing Where are my customers and how do they want to interact with me?
  • 8. Data Enrichment – A global product portfolio Addresses & Property Verified and validated address and property data for map display and analytics Boundaries Administrative, community, and industry-specific boundaries for data enrichment and territory analysis Demographics Demographic and consumer context data for better understanding people and behavior Points of Interest Detailed business, leisure, and geographic features for location and competitive intelligence Streets Robust street-level data for mapping, analysis, routing, and geocoding Risk Natural hazard boundaries related to flood, fire, earthquakes, and weather Expertly curated datasets containing thousands of attributes for faster, confident decisions 8 Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
  • 9. Uniquely positioned to address data enrichment needs Global coverage location enrichment data. Our portfolio includes: • 400+ datasets • 250+ countries and territories • 100s of millions of data points Datasets that are interoperable and are managed to quality standard, with consistent documentation, and support e.g. • Property Graph • Market and Community Link Ability to enrich with dynamic data (Dynamic Weather and Dynamic Demographics) • Data that includes time as a dimension • Creating insights from data that is updated at regular and short time intervals (e.g. 5 min) Data experience through deep-domain expertise • Adding data through, development, partnerships, and acquisitions Best-in-class addressing and property datasets with a unique and persistent ID • Link Precisely and customer address, buildings, demographics, risk, and more data using the PreciselyID, a unique and persistent location identifier 9 Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
  • 10. Understanding the data challenge 10 • Accessing the right raw data • Keeping up with continuously changing data feeds • Building features from raw data • Combining features into training data • Calculating and serving features in production • Monitoring features in production Key data challenges that organizations face when productionizing ML systems Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
  • 11. Location-enabled analytics Bank Branch & ATM Call Center/ Web Customers by Product Commercial & Mortgage Active Mortgages Historical Defaults Geocoding and location intelligence capabilities to organize and enrich your data Financial Transactions All of your sources Any structure or frequency Analytics capabilities for any use case or persona Ad Hoc Data Science Low-cost, rapid experimentation with new data and models. Explainable Machine Learning High volume, fine-grained analysis at scale served in the tightest of service windows. BI Reporting & Dashboarding Power real-time dashboarding directly, or feed data to a data warehouse for high-concurrency reporting. Real-time Applications Provide real-time data to downstream applications or power applications via APIs. PreciselyID ADMIN BOUNDARIES BANK DEPOSITS MOBILE MOVEMENT WEATHER EVENTS HAZARD & RISK DATA AMENITIES & COMPETITION EVERY US/CAN ADDRESS BUSINESS LOCATIONS PROPERTY ATTRIBUTES SCHOOLS & NEIGHBORHOODS POPULATION DEMOGRAPHICS PARCELS & BUILDINGS Analytics Platform
  • 12. What is a “feature-based” architecture? 12 A feature store is an ML-specific data system that: • Runs data pipelines that transform raw data into feature values • Stores and manages the feature data itself, and • Serves feature data consistently for training and inference purposes A feature is data used as an input signal to a predictive model Data Enrichment: The Key to Turbocharging your AI/ML Data Workflow
  • 13. 13 Processing Storage Inputs Location specific records Shape files Streaming records Address Fabric Analytics Processing • Model outputs • Scores • Computed columns • Analysis outcome Batch Geocoding with the Operational Addressing SDKs • Vaildate input addresses • Validate other data • Locate addresses • Match inputs • Assign PreciselyID • Relate data around PrecisleyID Batch Spatial Processing with the Location Intelligence SDK • Flatten shape files • Compute PIP • Compute D2P, D2L • Compute basic scores • Generate geohash • Relate data around geohash (where application) Realtime Processing with the Precisely SDKs • Operational Addressing APIs • Assign PreciselyID • Generate geohash • Relate data Message Bus Feature Store In-stream Analytics Layer Model outputs, scores, computed columns, analysis outcomes PrecisleyID Address P0000MK1IAAD 287 E 300 S. Provo, UT 84606 P0000MK1DPRD 410 N University Ave. Provo, UT 84601 Vendor data files Customer Loyalty Records Equipment Inventories Franchise Zones Pricing Delivery Territories Mobile Trace Data POS/IOT Data Administration, Governance, Security, Connectivity, Schema, Catalog Model Training EDW precisely Data subscriptions with PreciselyID PrecisleyID Address Name Type Score Location MICode PointCode DemoRgn P0000MK1IAAD 287 E 300 S. Provo, UT 84606 Empas LLC REST 91.529 UT108 10020100 101067669 8926 P0000MK1DPRD 410 N University Ave. Provo, UT 84601 THAI HUT REST 65.981 UT108 10020100 100854441 4144 …. ….. ….. ….. ….. …. …. ….. ….