DEBUNKING THE MYTHS 
Speaker 10 of 17 
Martin Willcox 
@Willcoxmnk 
What is a Data Lake, Anyway?
Followed by 
Anthony Miller
One of the Big Data labels that we risk over-loading to 
complete abstraction is the idea of a “Data Lake”…
2 © 2014 Teradata 
“…store all data present 
and future and create a 
centralised data archive 
location.” 
“A large 
object-based 
repository that 
holds data in 
its native 
format” 
“Sometimes 
called the bit 
bucket or the 
landing zone” 
“All Water 
and Little 
Substance” 
“As more and more applications 
are created that derive value 
from… new types of data… the 
Data Lake forms”
“Data lakes can 
help resolve the 
nagging problem of 
accessibility and 
data integration” 
…and some of the discussions sound eerily familiar 
Data accessibility 
and integration? 
Isn’t that what the 
Data Warehouse is 
for?
So is the Data Lake a new architectural construct? 
Or are we just re-platforming Data Marts? 
Simple, single subject area Dimensional 
Data Marts – with all of the dimensions 
pre-joined to the fact table? One-per-workload 
/ application? 
Is this really the future of Enterprise 
Analytics? Or circa 1995 silo, 
departmental Decision Support Systems 
warmed-over?
Take the merits of the different technologies out of the 
equation – and this is what some of us are thinking… 
…but there are no free lunches in Information 
Management – merely more and different options 
Explicit, or implicit, there 
is always, always, always 
(at least one) schema 
Agile application 
development, versus 
agile data acquisition 
None of the information 
management 
strategies / technologies 
are magic - “pay me 
now, or pay me later”
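The "pay me now, or pay me later" point can be illustrated with a minimal schema-on-read sketch. Everything in it is hypothetical and not taken from the slides: the sample records, the field names (`customer`, `cust_id`, `amount`, `total`) and the helper `read_with_schema` are assumptions chosen only to show that data stored without a declared schema still has to be fitted to one at the moment it is read.

```python
import json

# Hypothetical raw events, stored exactly as they arrived (no schema enforced on write).
RAW_EVENTS = [
    '{"customer": "c1", "amount": "19.99"}',
    '{"customer": "c2", "amount": 5}',
    '{"cust_id": "c3", "total": "7.50"}',  # another source used different field names
    'not json at all',                     # noise gets stored too
]


def read_with_schema(line):
    """Apply the implicit schema at read time; return None if the record does not fit."""
    try:
        record = json.loads(line)
    except json.JSONDecodeError:
        return None
    if not isinstance(record, dict):
        return None
    # The reader has to know (or discover) which fields mean the same thing.
    customer = record.get("customer", record.get("cust_id"))
    amount = record.get("amount", record.get("total"))
    if customer is None or amount is None:
        return None
    try:
        return {"customer": str(customer), "amount": float(amount)}
    except (TypeError, ValueError):
        return None


parsed = [read_with_schema(line) for line in RAW_EVENTS]
usable = [r for r in parsed if r is not None]
rejected = len(parsed) - len(usable)
print(f"usable={len(usable)} rejected={rejected}")
```

Writing the data was cheap because nothing was checked; the cost of reconciling field names, types and noise moved to every reader instead.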
Big Data Are Plural 
For the foreseeable future, we will need multiple Information 
Management strategies - and multiple Information 
Management technologies 
DATA WAREHOUSE 
DISCOVERY PLATFORM 
Integration 
becomes a 
critical concern 
DATA 
PLATFORM 
– Gartner – 
Logical Data Warehouse 
– Forrester – 
Enterprise Data Hub 
– Teradata – 
Unified Data Architecture
A definition of the Data Lake (Data Reservoir) 
A centralised, consolidated, persistent store of raw, un-modelled and un-transformed data from 
multiple sources / silos (without an explicit, pre-defined schema, without externally defined metadata – 
and without guarantees about the quality, provenance and security of the data) 
Agile data acquisition – 
a haystack to go looking 
for needles… 
…with a natural storage 
model for complex, 
multi-structured data… 
…support for efficient 
non-relational 
computation… 
…and provision for cost-effective 
storage of large 
and noisy data-sets.
Now that is new, interesting and (potentially) very, very useful… 
Data. Science
STOP PRESS: Laws of Physics* Unchanged! 
(* More specifically, the 2nd Law of Thermodynamics) 
Left to its own devices, does nature tend to give us a single, beautiful lake? Or a messy patchwork of lakes, plural? 
None of the new information management strategies and technologies is by itself a cure 
for information entropy – data silos form naturally, just like lakes form naturally
Summary and conclusions

Martin Willcox - What is a Data Lake, Anyway?
