Most organization who going through Digital Transformation need to break down their data silos as well as leverage existing and new data sources. Here is how to build a data lake for data change in your organization.
3. <Digital Tranformation Stats>
By end
2017, > 70%
of G500
By 2020,
50% of the
G2000
Digital Transformation is no Longer an Option
Are You Prepared?
But only
26% of
Organizations
Accenture and Forrester Digital
Transformation in the Age of the
Customer studyIDC Futurescape
4. The Data Lake is the New Digital Backbone
• Break down data silos
• Structured and
unstructured
• Granular data
• Machine learning
13. Demo flow
Data Lake
Incoming Lead Data
(Raw)
Amazon EMR Cluster Data Lake
Output Lead Data (Processed)
With Segmentation
1
Ingestion with Smart Data
Quality
2
Smart Data Pipeline
with Machine Learning
15. On-premise Data Lakes
On-Premise
Data Sources
Ingest Prepare Process Access Consume
Cloud
Data Sources
Governance
Processing
Storage
On-prem Datalake
16. Hybrid Data Lakes
On-Premise
Data Sources
Ingest Prepare Process Access Consume
Cloud
Data Sources
Governance
Cloud Processing
Processing
Cloud Storage
Storage
On-prem Datalake
Cloud Datalake
Distribute
17. Cloud Data Lakes – A Concrete Example
Ingest Prepare Process Access Consume
Governance
Cloud Processing
Cloud Storage
On-Premise
Data Sources
Cloud
Data Sources
S3
EMR
Cloud Storage
Cloud Dataflow
Azure DL Store
HDInsight
18. The Path to Agility
Ingestion+basic
visualization
DataQuality
SelfService
Data
Governance
Real-time
Machine
Learning
19. Deliver Value Along The Way
Start with quick wins & business outcome in mind
Get a cadence of constantly delivering value
Focus on game changer value drivers
Get the company onboard