Enviar pesquisa
Carregar
Introduction to hadoop
•
1 gostou
•
824 visualizações
Marc Cluet
Seguir
Lynx Consultants training about Hadoop
Leia menos
Leia mais
Tecnologia
Denunciar
Compartilhar
Denunciar
Compartilhar
1 de 46
Baixar agora
Baixar para ler offline
Recomendados
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
Stefan Lipp
Hybrid my sql_hadoop_datawarehouse
Hybrid my sql_hadoop_datawarehouse
Laine Campbell
The Perils and Triumphs of using Cassandra at a .NET/Microsoft Shop
The Perils and Triumphs of using Cassandra at a .NET/Microsoft Shop
Jeff Smoley
Application Architectures with Hadoop
Application Architectures with Hadoop
hadooparchbook
Oracle Open World 2017 Delphix and DBVisit
Oracle Open World 2017 Delphix and DBVisit
Kellyn Pot'Vin-Gorman
Strata EU tutorial - Architectural considerations for hadoop applications
Strata EU tutorial - Architectural considerations for hadoop applications
hadooparchbook
PLNOG19 - Piotr Wojciechowski - Sieć w chmurze publicznej i hybrydowej dla si...
PLNOG19 - Piotr Wojciechowski - Sieć w chmurze publicznej i hybrydowej dla si...
PROIDEA
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Esther Kundin
Recomendados
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
Stefan Lipp
Hybrid my sql_hadoop_datawarehouse
Hybrid my sql_hadoop_datawarehouse
Laine Campbell
The Perils and Triumphs of using Cassandra at a .NET/Microsoft Shop
The Perils and Triumphs of using Cassandra at a .NET/Microsoft Shop
Jeff Smoley
Application Architectures with Hadoop
Application Architectures with Hadoop
hadooparchbook
Oracle Open World 2017 Delphix and DBVisit
Oracle Open World 2017 Delphix and DBVisit
Kellyn Pot'Vin-Gorman
Strata EU tutorial - Architectural considerations for hadoop applications
Strata EU tutorial - Architectural considerations for hadoop applications
hadooparchbook
PLNOG19 - Piotr Wojciechowski - Sieć w chmurze publicznej i hybrydowej dla si...
PLNOG19 - Piotr Wojciechowski - Sieć w chmurze publicznej i hybrydowej dla si...
PROIDEA
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Esther Kundin
SQL Saturday San Diego
SQL Saturday San Diego
Kellyn Pot'Vin-Gorman
Soft-Shake 2013 : Enabling Realtime Queries to End Users
Soft-Shake 2013 : Enabling Realtime Queries to End Users
Benoit Perroud
Big data - Online Training
Big data - Online Training
Learntek1
Data Warehouse on Hadoop Based System In Action
Data Warehouse on Hadoop Based System In Action
Frank Y
Big data Hadoop
Big data Hadoop
Ayyappan Paramesh
Nashville analytics summit aug9 no sql mike king dell v1.5
Nashville analytics summit aug9 no sql mike king dell v1.5
Mike King
Webinar 5-reasons-object-storage.pptx
Webinar 5-reasons-object-storage.pptx
Cloudian
Stockage des données : quel système pour quel usage ?
Stockage des données : quel système pour quel usage ?
Zouheir Cadi
Realtime Analytics with Hadoop and HBase
Realtime Analytics with Hadoop and HBase
larsgeorge
Protect your private data with ORC column encryption
Protect your private data with ORC column encryption
Owen O'Malley
PLNOG 9: Ron Broersma - Enterprise IPv6 Deployment
PLNOG 9: Ron Broersma - Enterprise IPv6 Deployment
PROIDEA
Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Cloudera, Inc.
Improvements in Hadoop Security
Improvements in Hadoop Security
DataWorks Summit
Red hatpartner2013edb futureofdatabase
Red hatpartner2013edb futureofdatabase
EDB
What Should I Do? Choosing SQL, NoSQL or Both for Scalable Web Applications
What Should I Do? Choosing SQL, NoSQL or Both for Scalable Web Applications
Todd Hoff
Introduction to Hadoop - ACCU2010
Introduction to Hadoop - ACCU2010
Gavin Heavyside
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Hortonworks
C* Summit 2013: The Perils and Triumphs of using Cassandra at a .NET/Microsof...
C* Summit 2013: The Perils and Triumphs of using Cassandra at a .NET/Microsof...
DataStax Academy
Doing More With Less: The Economics of Open Source Database Adoption
Doing More With Less: The Economics of Open Source Database Adoption
EDB
Innovation in the Cloud - Rackspace Zurich Event
Innovation in the Cloud - Rackspace Zurich Event
Marc Cluet
Hadoop operations
Hadoop operations
Marc Cluet
Autoscaling Best Practices - WebPerf Barcelona Oct 2014
Autoscaling Best Practices - WebPerf Barcelona Oct 2014
Marc Cluet
Mais conteúdo relacionado
Mais procurados
SQL Saturday San Diego
SQL Saturday San Diego
Kellyn Pot'Vin-Gorman
Soft-Shake 2013 : Enabling Realtime Queries to End Users
Soft-Shake 2013 : Enabling Realtime Queries to End Users
Benoit Perroud
Big data - Online Training
Big data - Online Training
Learntek1
Data Warehouse on Hadoop Based System In Action
Data Warehouse on Hadoop Based System In Action
Frank Y
Big data Hadoop
Big data Hadoop
Ayyappan Paramesh
Nashville analytics summit aug9 no sql mike king dell v1.5
Nashville analytics summit aug9 no sql mike king dell v1.5
Mike King
Webinar 5-reasons-object-storage.pptx
Webinar 5-reasons-object-storage.pptx
Cloudian
Stockage des données : quel système pour quel usage ?
Stockage des données : quel système pour quel usage ?
Zouheir Cadi
Realtime Analytics with Hadoop and HBase
Realtime Analytics with Hadoop and HBase
larsgeorge
Protect your private data with ORC column encryption
Protect your private data with ORC column encryption
Owen O'Malley
PLNOG 9: Ron Broersma - Enterprise IPv6 Deployment
PLNOG 9: Ron Broersma - Enterprise IPv6 Deployment
PROIDEA
Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Cloudera, Inc.
Improvements in Hadoop Security
Improvements in Hadoop Security
DataWorks Summit
Red hatpartner2013edb futureofdatabase
Red hatpartner2013edb futureofdatabase
EDB
What Should I Do? Choosing SQL, NoSQL or Both for Scalable Web Applications
What Should I Do? Choosing SQL, NoSQL or Both for Scalable Web Applications
Todd Hoff
Introduction to Hadoop - ACCU2010
Introduction to Hadoop - ACCU2010
Gavin Heavyside
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Hortonworks
C* Summit 2013: The Perils and Triumphs of using Cassandra at a .NET/Microsof...
C* Summit 2013: The Perils and Triumphs of using Cassandra at a .NET/Microsof...
DataStax Academy
Doing More With Less: The Economics of Open Source Database Adoption
Doing More With Less: The Economics of Open Source Database Adoption
EDB
Mais procurados
(19)
SQL Saturday San Diego
SQL Saturday San Diego
Soft-Shake 2013 : Enabling Realtime Queries to End Users
Soft-Shake 2013 : Enabling Realtime Queries to End Users
Big data - Online Training
Big data - Online Training
Data Warehouse on Hadoop Based System In Action
Data Warehouse on Hadoop Based System In Action
Big data Hadoop
Big data Hadoop
Nashville analytics summit aug9 no sql mike king dell v1.5
Nashville analytics summit aug9 no sql mike king dell v1.5
Webinar 5-reasons-object-storage.pptx
Webinar 5-reasons-object-storage.pptx
Stockage des données : quel système pour quel usage ?
Stockage des données : quel système pour quel usage ?
Realtime Analytics with Hadoop and HBase
Realtime Analytics with Hadoop and HBase
Protect your private data with ORC column encryption
Protect your private data with ORC column encryption
PLNOG 9: Ron Broersma - Enterprise IPv6 Deployment
PLNOG 9: Ron Broersma - Enterprise IPv6 Deployment
Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Improvements in Hadoop Security
Improvements in Hadoop Security
Red hatpartner2013edb futureofdatabase
Red hatpartner2013edb futureofdatabase
What Should I Do? Choosing SQL, NoSQL or Both for Scalable Web Applications
What Should I Do? Choosing SQL, NoSQL or Both for Scalable Web Applications
Introduction to Hadoop - ACCU2010
Introduction to Hadoop - ACCU2010
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
C* Summit 2013: The Perils and Triumphs of using Cassandra at a .NET/Microsof...
C* Summit 2013: The Perils and Triumphs of using Cassandra at a .NET/Microsof...
Doing More With Less: The Economics of Open Source Database Adoption
Doing More With Less: The Economics of Open Source Database Adoption
Destaque
Innovation in the Cloud - Rackspace Zurich Event
Innovation in the Cloud - Rackspace Zurich Event
Marc Cluet
Hadoop operations
Hadoop operations
Marc Cluet
Autoscaling Best Practices - WebPerf Barcelona Oct 2014
Autoscaling Best Practices - WebPerf Barcelona Oct 2014
Marc Cluet
Puppet Camp London Fall 2015 - Service Discovery and Puppet
Puppet Camp London Fall 2015 - Service Discovery and Puppet
Marc Cluet
Ssh that wonderful thing
Ssh that wonderful thing
Marc Cluet
Networking & dns 101
Networking & dns 101
Marc Cluet
Puppet and your Metadata - PuppetCamp London 2015
Puppet and your Metadata - PuppetCamp London 2015
Marc Cluet
Destaque
(7)
Innovation in the Cloud - Rackspace Zurich Event
Innovation in the Cloud - Rackspace Zurich Event
Hadoop operations
Hadoop operations
Autoscaling Best Practices - WebPerf Barcelona Oct 2014
Autoscaling Best Practices - WebPerf Barcelona Oct 2014
Puppet Camp London Fall 2015 - Service Discovery and Puppet
Puppet Camp London Fall 2015 - Service Discovery and Puppet
Ssh that wonderful thing
Ssh that wonderful thing
Networking & dns 101
Networking & dns 101
Puppet and your Metadata - PuppetCamp London 2015
Puppet and your Metadata - PuppetCamp London 2015
Semelhante a Introduction to hadoop
Semantic web meetup 14.november 2013
Semantic web meetup 14.november 2013
Jean-Pierre König
Introduction to hadoop V2
Introduction to hadoop V2
TarjeiRomtveit
Introduction to the Hadoop Ecosystem (IT-Stammtisch Darmstadt Edition)
Introduction to the Hadoop Ecosystem (IT-Stammtisch Darmstadt Edition)
Uwe Printz
Introduction to BIg Data and Hadoop
Introduction to BIg Data and Hadoop
Amir Shaikh
Big Data Strategy for the Relational World
Big Data Strategy for the Relational World
Andrew Brust
Application architectures with hadoop – big data techcon 2014
Application architectures with hadoop – big data techcon 2014
Jonathan Seidman
Application architectures with Hadoop – Big Data TechCon 2014
Application architectures with Hadoop – Big Data TechCon 2014
hadooparchbook
Microsoft's Big Play for Big Data- Visual Studio Live! NY 2012
Microsoft's Big Play for Big Data- Visual Studio Live! NY 2012
Andrew Brust
Microsoft's Big Play for Big Data
Microsoft's Big Play for Big Data
Andrew Brust
Polyglot Persistence - Two Great Tastes That Taste Great Together
Polyglot Persistence - Two Great Tastes That Taste Great Together
John Wood
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
tcloudcomputing-tw
Hortonworks Big Data & Hadoop
Hortonworks Big Data & Hadoop
Mark Ginnebaugh
Hadoop and Hive in Enterprises
Hadoop and Hive in Enterprises
markgrover
Run Your First Hadoop 2.x Program
Run Your First Hadoop 2.x Program
Skillspeed
How to use Hadoop for operational and transactional purposes by RODRIGO MERI...
How to use Hadoop for operational and transactional purposes by RODRIGO MERI...
Big Data Spain
DBA to Data Scientist
DBA to Data Scientist
pasalapudi
Building a Modern Data Architecture with Enterprise Hadoop
Building a Modern Data Architecture with Enterprise Hadoop
Slim Baltagi
Java One 2017: Open Source Big Data in the Cloud: Hadoop, M/R, Hive, Spark an...
Java One 2017: Open Source Big Data in the Cloud: Hadoop, M/R, Hive, Spark an...
Frank Munz
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFW
Kent Graziano
Horses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
Eric Kavanagh
Semelhante a Introduction to hadoop
(20)
Semantic web meetup 14.november 2013
Semantic web meetup 14.november 2013
Introduction to hadoop V2
Introduction to hadoop V2
Introduction to the Hadoop Ecosystem (IT-Stammtisch Darmstadt Edition)
Introduction to the Hadoop Ecosystem (IT-Stammtisch Darmstadt Edition)
Introduction to BIg Data and Hadoop
Introduction to BIg Data and Hadoop
Big Data Strategy for the Relational World
Big Data Strategy for the Relational World
Application architectures with hadoop – big data techcon 2014
Application architectures with hadoop – big data techcon 2014
Application architectures with Hadoop – Big Data TechCon 2014
Application architectures with Hadoop – Big Data TechCon 2014
Microsoft's Big Play for Big Data- Visual Studio Live! NY 2012
Microsoft's Big Play for Big Data- Visual Studio Live! NY 2012
Microsoft's Big Play for Big Data
Microsoft's Big Play for Big Data
Polyglot Persistence - Two Great Tastes That Taste Great Together
Polyglot Persistence - Two Great Tastes That Taste Great Together
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Hortonworks Big Data & Hadoop
Hortonworks Big Data & Hadoop
Hadoop and Hive in Enterprises
Hadoop and Hive in Enterprises
Run Your First Hadoop 2.x Program
Run Your First Hadoop 2.x Program
How to use Hadoop for operational and transactional purposes by RODRIGO MERI...
How to use Hadoop for operational and transactional purposes by RODRIGO MERI...
DBA to Data Scientist
DBA to Data Scientist
Building a Modern Data Architecture with Enterprise Hadoop
Building a Modern Data Architecture with Enterprise Hadoop
Java One 2017: Open Source Big Data in the Cloud: Hadoop, M/R, Hive, Spark an...
Java One 2017: Open Source Big Data in the Cloud: Hadoop, M/R, Hive, Spark an...
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFW
Horses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
Mais de Marc Cluet
Your Kernel and You
Your Kernel and You
Marc Cluet
Managing DevOps teams, staying alive
Managing DevOps teams, staying alive
Marc Cluet
The DevOps journey - How to get there painlessly
The DevOps journey - How to get there painlessly
Marc Cluet
Elastic Beanstalk, usos prácticos y conceptos
Elastic Beanstalk, usos prácticos y conceptos
Marc Cluet
Service discovery and puppet
Service discovery and puppet
Marc Cluet
Consul First Steps
Consul First Steps
Marc Cluet
Microservices and the Cloud - DevOps Cardiff Meetup
Microservices and the Cloud - DevOps Cardiff Meetup
Marc Cluet
Microservices and the Cloud
Microservices and the Cloud
Marc Cluet
How to implement microservices
How to implement microservices
Marc Cluet
A Metadata Ocean in Chef and Puppet
A Metadata Ocean in Chef and Puppet
Marc Cluet
Autoscaling Best Practices
Autoscaling Best Practices
Marc Cluet
Rackspace Hack Night - Vagrant & Packer
Rackspace Hack Night - Vagrant & Packer
Marc Cluet
Introduction to DevOps - Rackspace tech night
Introduction to DevOps - Rackspace tech night
Marc Cluet
Juju + Puppet (Puppetconf 2011)
Juju + Puppet (Puppetconf 2011)
Marc Cluet
Scalable, good, cheap
Scalable, good, cheap
Marc Cluet
Mais de Marc Cluet
(15)
Your Kernel and You
Your Kernel and You
Managing DevOps teams, staying alive
Managing DevOps teams, staying alive
The DevOps journey - How to get there painlessly
The DevOps journey - How to get there painlessly
Elastic Beanstalk, usos prácticos y conceptos
Elastic Beanstalk, usos prácticos y conceptos
Service discovery and puppet
Service discovery and puppet
Consul First Steps
Consul First Steps
Microservices and the Cloud - DevOps Cardiff Meetup
Microservices and the Cloud - DevOps Cardiff Meetup
Microservices and the Cloud
Microservices and the Cloud
How to implement microservices
How to implement microservices
A Metadata Ocean in Chef and Puppet
A Metadata Ocean in Chef and Puppet
Autoscaling Best Practices
Autoscaling Best Practices
Rackspace Hack Night - Vagrant & Packer
Rackspace Hack Night - Vagrant & Packer
Introduction to DevOps - Rackspace tech night
Introduction to DevOps - Rackspace tech night
Juju + Puppet (Puppetconf 2011)
Juju + Puppet (Puppetconf 2011)
Scalable, good, cheap
Scalable, good, cheap
Último
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
wesley chun
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
The Digital Insurer
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
ThousandEyes
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
apidays
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
The Digital Insurer
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
SynarionITSolutions
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
Remote DBA Services
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
apidays
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
Radu Cotescu
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
MIND CTI
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
Boston Institute of Analytics
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
DianaGray10
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
sudhanshuwaghmare1
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
presentation ICT roal in 21st century education
presentation ICT roal in 21st century education
jfdjdjcjdnsjd
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
Gabriella Davis
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Principled Technologies
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
Igalia
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
Andrey Devyatkin
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
Último
(20)
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
presentation ICT roal in 21st century education
presentation ICT roal in 21st century education
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
Introduction to hadoop
1.
Marc Cluet –
Lynx Consultants What’s behind Big Data
2.
What we’ll cover? ¡
Understand Hadoop components ¡ Understand different technologies involved ¡ Embrace Big Data! Lynx Consultants © 2013
3.
What is Big
Data? Lynx Consultants © 2013
4.
What is Big
Data? ¡ SQL has a limited ability to process changing data § SQL schemas are the truth, data needs to fit that Lynx Consultants © 2013
5.
What is Big
Data? ¡ Big Data is the solution! § Data can be truly dynamic Lynx Consultants © 2013
6.
What is Big
Data? ¡ Big Data is the solution! § Data can be truly dynamic § Designed to handle Terabytes of data Lynx Consultants © 2013
7.
What is Big
Data? ¡ Big Data is the solution! § Data can be truly dynamic § Designed to handle Terabytes of data § Designed for fault tolerance and securing data Lynx Consultants © 2013
8.
What is Big
Data? ¡ Big Data is the solution! § Data can be truly dynamic § Designed to handle Terabytes of data § Designed for fault tolerance and securing data § Designed around exploiting hardware to the fullest Lynx Consultants © 2013
9.
What is Big
Data? ¡ Big Data is the solution! § Data can be truly dynamic § Designed to handle Terabytes of data § Designed for fault tolerance and securing data § Designed around exploiting hardware to the fullest § Designed around Map/Reduce Lynx Consultants © 2013
10.
Who runs Big
Data? ¡ A few small companies Lynx Consultants © 2013
11.
Who runs Big
Data? ¡ A few small companies Lynx Consultants © 2013
12.
Who runs Big
Data? ¡ A few small companies Lynx Consultants © 2013
13.
Who runs Big
Data? ¡ A few small companies Lynx Consultants © 2013
14.
Who runs Big
Data? ¡ A few small companies Lynx Consultants © 2013
15.
Who runs Big
Data? ¡ A few small companies Lynx Consultants © 2013
16.
Who runs Big
Data? ¡ A few small companies Lynx Consultants © 2013
17.
Who runs Big
Data? ¡ A few small companies Lynx Consultants © 2013
18.
Who runs Big
Data? ¡ A few small companies Lynx Consultants © 2013
19.
Who runs Big
Data? ¡ A few small companies Lynx Consultants © 2013
20.
Who runs Big
Data? ¡ A few small companies Lynx Consultants © 2013
21.
Who runs Big
Data? ¡ A few small companies Lynx Consultants © 2013
22.
What is Hadoop? Lynx
Consultants © 2013
23.
What is Hadoop? ¡
Hadoop is one of the big players for Big Data § Developed as an Open Source implementation to implement Google BigTable Lynx Consultants © 2013
24.
What is Hadoop? ¡
Hadoop is one of the big players for Big Data § Developed as an Open Source implementation to implement Google BigTable § Mainly developed at Yahoo! Lynx Consultants © 2013
25.
What is Hadoop? ¡
Hadoop is one of the big players for Big Data § Developed as an Open Source implementation to implement Google BigTable § Mainly developed at Yahoo! § Current companies behind it: Hortonworks and Cloudera Lynx Consultants © 2013
26.
What are the
features of Hadoop? ¡ HDFS – Hadoop Distributed File System § HDFS is a distributed filesystem across many nodes § Has many copies of your data (default: 3) § If one node goes down makes sure all the data is rebalanced Lynx Consultants © 2013
27.
What are the
features of Hadoop? ¡ HDFS – Hadoop Distributed File System Lynx Consultants © 2013
28.
What are the
features of Hadoop? ¡ HDFS – Hadoop Distributed File System ¡ Hbase – Hadoop NoSQL Database § Schemaless Key-‐Value storage § All data exportable in JSON Lynx Consultants © 2013
29.
What are the
features of Hadoop? ¡ HDFS – Hadoop Distributed File System ¡ Hbase – Hadoop NoSQL Database Lynx Consultants © 2013
30.
What are the
features of Hadoop? ¡ HDFS – Hadoop Distributed File System ¡ Hbase – Hadoop NoSQL Database ¡ Map/Reduce – The key to it all § This was invented by Google § Given a dataset we Map all that match a criteria § Then we Reduce this to a result Lynx Consultants © 2013
31.
What are the
features of Hadoop? ¡ Map/Reduce – The key to it all Lynx Consultants © 2013
32.
What are the
features of Hadoop? ¡ HDFS – Hadoop Distributed File System ¡ Hbase – Hadoop NoSQL Database ¡ Map/Reduce – The key to it all ¡ Hive – SQL for NoSQL § Hive provides a SQL language called HiveSQL § Provides a good entrance for SQL users :) Lynx Consultants © 2013
33.
What are the
features of Hadoop? ¡ HDFS – Hadoop Distributed File System ¡ Hbase – Hadoop NoSQL Database ¡ Map/Reduce – The key to it all ¡ Hive – SQL for NoSQL ¡ Pig – Map/Reduce made easy § Creates data results given a reduced language § Reinvents SQL somehow Lynx Consultants © 2013
34.
What are the
features of Hadoop? ¡ Hive Lynx Consultants © 2013
35.
What are the
features of Hadoop? ¡ Pig Lynx Consultants © 2013
36.
What are the
features of Hadoop? ¡ HDFS – Hadoop Distributed File System ¡ Hbase – Hadoop NoSQL Database ¡ Map/Reduce – The key to it all ¡ Hive – SQL for NoSQL ¡ Pig – Map/Reduce made easy ¡ Flume – Fault Tolerant transport Lynx Consultants © 2013
37.
What are the
features of Hadoop? ¡ Flume § Divides in Sources, Channels, Sinks § Can have multiple of everything, makes it fault tolerant § Many sources! ▪ Avro, Exec, JMS, Syslog, HTTP, NetCat, Your Own (Java) Lynx Consultants © 2013
38.
What are the
features of Hadoop? ¡ Flume § Divides in Sources, Channels, Sinks § Can have multiple of everything, makes it fault tolerant § Many sources! § Many channels! ▪ Memory, File, Your Own (Java) Lynx Consultants © 2013
39.
What are the
features of Hadoop? ¡ Flume § Divides in Sources, Channels, Sinks § Can have multiple of everything, makes it fault tolerant § Many sources! § Many channels! § Many sinks! ▪ Avro, HDFS, Logger, IRC, File, Hbase, ElasticSearch, S3, Community sinks, Your Own (Java) Lynx Consultants © 2013
40.
What are the
features of Hadoop? ¡ Flume Lynx Consultants © 2013
41.
How Hadoop looks
like in a DC ¡ Components § Primary Namenode § Secondary Namenode § Data Node Lynx Consultants © 2013
42.
How Hadoop looks
like in a DC ¡ Components § Primary Namenode ▪ Controls all the cluster, knows where the data resides ▪ Runs the job tracker to keep track of Map/Reduce jobs ▪ Biggest point of failure, shadowing it is a potential option § Secondary Namenode § Data Node Lynx Consultants © 2013
43.
How Hadoop looks
like in a DC ¡ Components § Primary Namenode § Secondary Namenode ▪ Performs secondary cleanup options § Data Node Lynx Consultants © 2013
44.
How Hadoop looks
like in a DC ¡ Components § Primary Namenode § Secondary Namenode § Data Node ▪ Stores all the information ▪ Runs Map/Reduce Lynx Consultants © 2013
45.
How Hadoop looks
like in a DC ¡ Components Lynx Consultants © 2013
46.
Questions? Lynx Consultants ©
2013
Baixar agora