Good decisions make great companies. That's why the data-driven mantra keeps gaining momentum. Increasingly, smart business people are taking a data-first approach for both strategic planning and tactical decision-making. They spend ample time exploring their data to better understand their options. In doing so, they capitalize on real opportunities, while avoiding low-value projects.
Register for this episode of The Briefing Room to hear veteran Analyst Dr. Robin Bloor explain why a data-first mindset can help companies optimize their resources and thus make better decisions. He'll be briefed by Rishi Patel and Erin Haselkorn of
The Briefing Room with Dr. Robin Bloor and Experian
Experian, who will showcase Experian Pandora, which enables the kind of discovery that businesses need to better understand their data. They'll explain how Pandora can help professionals build a business case for their ideas and plans.
4. u Reveal the essential characteristics of enterprise
software, good and bad
u Provide a forum for detailed analysis of today s innovative
technologies
u Give vendors a chance to explain their product to savvy
analysts
u Allow audience members to pose serious questions... and
get answers!
Mission
8. Experian Data Quality
u Experian Data Quality offers a comprehensive suite of
data quality solutions, including cleansing,
standardization, matching, monitoring, enrichment
and profiling
u Its real-time address verification helps maintain
accurate customer information for name, physical
address, email and phone
u Experian Pandora allows businesses to prototype data
quality rules and transform data on the fly
9. Guests
Rishi Patel, Senior Sales Engineer, Experian Data Quality
Rishi has over 10 years experience in data quality software from development and
implementation to best practices and solution strategy. He is an active member in the
data quality community and focuses on building out highly skilled consultancy practices
within Experian focused on enterprise applications and architecture. He works on go-to-
market strategies and technical subject matter expertise in new and emerging
technologies for Experian Data Quality such as Experian Pandora.
Erin Haselkorn, Analyst Relations Manager, Experian Data Quality
As the Analyst Relations Manager for Experian Data Quality, Erin Haselkorn leverages her
understanding of data quality to help organizations better understand leading data
management strategies and how to create actionable insights. She is the author of
numerous data quality research reports, guest blog posts and articles. During her eight
years at Experian Data Quality, Erin has helped numerous clients gain a deeper
understanding of their customers through data and analytics.
31. Data Value
Data per se has no value – it is raw
material.
The PROCESSING of data in its
myriad ways generates the value.
32. The Data Pyramid
u Most of us are aware of this refinement of data and the
processes involved. Difficulties arise from:
u Fragmentation (of data, information, knowledge &
understanding)
u The incessant supply of new data
Rules, Policies
Guidelines, Procedures
Linked data, Structured data,
Visualization, Glossaries, Schemas, Ontologies
Signals, Measurements, Recordings,
Events, Transactions, Calculations, Aggregations
New
Data
Refinement
33. The Hadoop/Spark “Lake” Scenario
u Multiple external and
internal data sources
u Presume IT Security
u Assume the full gamut of
Data Wrangling tools (LHS)
u Assume data management
tools (RHS)
u Assume Analytics and BI
tools either local or at the
data warehouse
u It all adds up to data
governance
Data Sources
Analytics
Service
Mgt
Life Cycle
Mgt
MetaData
Discovery
MDM
MetaData
Mgt
Data
Cleansing
Data
Lineage
A
C
C
E
S
S
W
R
A
N
G
L
I
N
G
Staging Area
(Hadoop)
Data Warehouse
or other location
Data Streams
ETL
ETL
34. The Analytics Business Process
§ The main point to note about
analytics is that it is still iterative
§ The process changed because of:
o Data Availability
o Parallel Technology
o Scalable Software
o Open Source Tools
o M/C Learning
§ It is naturally becoming
integrated into the Data Lake
Data
Access
Data
Prep
Model
Analyze
Deploy
Execute
35. A Practical View
The “data wrangling” activities
transform data into information in
preparation for transforming it into
knowledge
36. u How would you define data governance – would
you include provenance/lineage?
u How does Experian integrate with data streams
(or doesn’t it)?
u In respect of scale, what is your largest
implementation by data volume and what was
the industry sector/problem space?
u Who do you serve, the business analysts or the
data scientist?
37. u Is your capability only relevant to analytics or
does it have broader areas of application?
u Technically, what makes it fast?
u Please comment on analytical workloads:
- What do you see as the natural IT bottlenecks?
- What do you see as the natural business
bottlenecks?
u Who do you partner with?