Enviar pesquisa
Carregar
Big data: Loading your data with flume and sqoop
•
15 gostaram
•
8,407 visualizações
Christophe Marchal
Seguir
Studying Hortonworks stack, I created this 10 minutes presentation. http://hortonworks.com
Leia menos
Leia mais
Tecnologia
Denunciar
Compartilhar
Denunciar
Compartilhar
1 de 22
Baixar agora
Baixar para ler offline
Recomendados
Apache flume by Swapnil Dubey
Apache flume by Swapnil Dubey
Swapnil Dubey
Deploying Apache Flume to enable low-latency analytics
Deploying Apache Flume to enable low-latency analytics
DataWorks Summit
Sqoop2 refactoring for generic data transfer - NYC Sqoop Meetup
Sqoop2 refactoring for generic data transfer - NYC Sqoop Meetup
gethue
New Data Transfer Tools for Hadoop: Sqoop 2
New Data Transfer Tools for Hadoop: Sqoop 2
DataWorks Summit
Large-Scale Stream Processing in the Hadoop Ecosystem
Large-Scale Stream Processing in the Hadoop Ecosystem
DataWorks Summit/Hadoop Summit
Apache Sqoop: A Data Transfer Tool for Hadoop
Apache Sqoop: A Data Transfer Tool for Hadoop
Cloudera, Inc.
The Future of Apache Storm
The Future of Apache Storm
DataWorks Summit/Hadoop Summit
ApacheCon-Flume-Kafka-2016
ApacheCon-Flume-Kafka-2016
Jayesh Thakrar
Recomendados
Apache flume by Swapnil Dubey
Apache flume by Swapnil Dubey
Swapnil Dubey
Deploying Apache Flume to enable low-latency analytics
Deploying Apache Flume to enable low-latency analytics
DataWorks Summit
Sqoop2 refactoring for generic data transfer - NYC Sqoop Meetup
Sqoop2 refactoring for generic data transfer - NYC Sqoop Meetup
gethue
New Data Transfer Tools for Hadoop: Sqoop 2
New Data Transfer Tools for Hadoop: Sqoop 2
DataWorks Summit
Large-Scale Stream Processing in the Hadoop Ecosystem
Large-Scale Stream Processing in the Hadoop Ecosystem
DataWorks Summit/Hadoop Summit
Apache Sqoop: A Data Transfer Tool for Hadoop
Apache Sqoop: A Data Transfer Tool for Hadoop
Cloudera, Inc.
The Future of Apache Storm
The Future of Apache Storm
DataWorks Summit/Hadoop Summit
ApacheCon-Flume-Kafka-2016
ApacheCon-Flume-Kafka-2016
Jayesh Thakrar
Introduction to streaming and messaging flume,kafka,SQS,kinesis
Introduction to streaming and messaging flume,kafka,SQS,kinesis
Omid Vahdaty
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
DataWorks Summit/Hadoop Summit
Large scale near real-time log indexing with Flume and SolrCloud
Large scale near real-time log indexing with Flume and SolrCloud
DataWorks Summit
Introduction to Sqoop Aaron Kimball Cloudera Hadoop User Group UK
Introduction to Sqoop Aaron Kimball Cloudera Hadoop User Group UK
Skills Matter
Flexible and Real-Time Stream Processing with Apache Flink
Flexible and Real-Time Stream Processing with Apache Flink
DataWorks Summit
Stream Processing made simple with Kafka
Stream Processing made simple with Kafka
DataWorks Summit/Hadoop Summit
Apache Flume - Streaming data easily to Hadoop from any source for Telco oper...
Apache Flume - Streaming data easily to Hadoop from any source for Telco oper...
DataWorks Summit
HBaseConEast2016: Coprocessors – Uses, Abuses and Solutions
HBaseConEast2016: Coprocessors – Uses, Abuses and Solutions
Michael Stack
Highlights Of Sqoop2
Highlights Of Sqoop2
Alexander Alten-Lorenz
Querying the Internet of Things: Streaming SQL on Kafka/Samza and Storm/Trident
Querying the Internet of Things: Streaming SQL on Kafka/Samza and Storm/Trident
DataWorks Summit/Hadoop Summit
HBaseConEast2016: HBase and Spark, State of the Art
HBaseConEast2016: HBase and Spark, State of the Art
Michael Stack
October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and ...
October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and ...
Yahoo Developer Network
Apache kafka
Apache kafka
Shravan (Sean) Pabba
Introduction to Spark Streaming
Introduction to Spark Streaming
Knoldus Inc.
Spark+flume seattle
Spark+flume seattle
Hari Shreedharan
Realtime Detection of DDOS attacks using Apache Spark and MLLib
Realtime Detection of DDOS attacks using Apache Spark and MLLib
Ryan Bosshart
Near-realtime analytics with Kafka and HBase
Near-realtime analytics with Kafka and HBase
dave_revell
From oracle to hadoop with Sqoop and other tools
From oracle to hadoop with Sqoop and other tools
Guy Harrison
HBaseCon 2012 | Solbase - Kyungseog Oh, Photobucket
HBaseCon 2012 | Solbase - Kyungseog Oh, Photobucket
Cloudera, Inc.
Ingest and Stream Processing - What will you choose?
Ingest and Stream Processing - What will you choose?
DataWorks Summit/Hadoop Summit
Apache sqoop with an use case
Apache sqoop with an use case
Davin Abraham
ITSS Overview
ITSS Overview
IMC Institute
Mais conteúdo relacionado
Mais procurados
Introduction to streaming and messaging flume,kafka,SQS,kinesis
Introduction to streaming and messaging flume,kafka,SQS,kinesis
Omid Vahdaty
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
DataWorks Summit/Hadoop Summit
Large scale near real-time log indexing with Flume and SolrCloud
Large scale near real-time log indexing with Flume and SolrCloud
DataWorks Summit
Introduction to Sqoop Aaron Kimball Cloudera Hadoop User Group UK
Introduction to Sqoop Aaron Kimball Cloudera Hadoop User Group UK
Skills Matter
Flexible and Real-Time Stream Processing with Apache Flink
Flexible and Real-Time Stream Processing with Apache Flink
DataWorks Summit
Stream Processing made simple with Kafka
Stream Processing made simple with Kafka
DataWorks Summit/Hadoop Summit
Apache Flume - Streaming data easily to Hadoop from any source for Telco oper...
Apache Flume - Streaming data easily to Hadoop from any source for Telco oper...
DataWorks Summit
HBaseConEast2016: Coprocessors – Uses, Abuses and Solutions
HBaseConEast2016: Coprocessors – Uses, Abuses and Solutions
Michael Stack
Highlights Of Sqoop2
Highlights Of Sqoop2
Alexander Alten-Lorenz
Querying the Internet of Things: Streaming SQL on Kafka/Samza and Storm/Trident
Querying the Internet of Things: Streaming SQL on Kafka/Samza and Storm/Trident
DataWorks Summit/Hadoop Summit
HBaseConEast2016: HBase and Spark, State of the Art
HBaseConEast2016: HBase and Spark, State of the Art
Michael Stack
October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and ...
October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and ...
Yahoo Developer Network
Apache kafka
Apache kafka
Shravan (Sean) Pabba
Introduction to Spark Streaming
Introduction to Spark Streaming
Knoldus Inc.
Spark+flume seattle
Spark+flume seattle
Hari Shreedharan
Realtime Detection of DDOS attacks using Apache Spark and MLLib
Realtime Detection of DDOS attacks using Apache Spark and MLLib
Ryan Bosshart
Near-realtime analytics with Kafka and HBase
Near-realtime analytics with Kafka and HBase
dave_revell
From oracle to hadoop with Sqoop and other tools
From oracle to hadoop with Sqoop and other tools
Guy Harrison
HBaseCon 2012 | Solbase - Kyungseog Oh, Photobucket
HBaseCon 2012 | Solbase - Kyungseog Oh, Photobucket
Cloudera, Inc.
Ingest and Stream Processing - What will you choose?
Ingest and Stream Processing - What will you choose?
DataWorks Summit/Hadoop Summit
Mais procurados
(20)
Introduction to streaming and messaging flume,kafka,SQS,kinesis
Introduction to streaming and messaging flume,kafka,SQS,kinesis
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
Large scale near real-time log indexing with Flume and SolrCloud
Large scale near real-time log indexing with Flume and SolrCloud
Introduction to Sqoop Aaron Kimball Cloudera Hadoop User Group UK
Introduction to Sqoop Aaron Kimball Cloudera Hadoop User Group UK
Flexible and Real-Time Stream Processing with Apache Flink
Flexible and Real-Time Stream Processing with Apache Flink
Stream Processing made simple with Kafka
Stream Processing made simple with Kafka
Apache Flume - Streaming data easily to Hadoop from any source for Telco oper...
Apache Flume - Streaming data easily to Hadoop from any source for Telco oper...
HBaseConEast2016: Coprocessors – Uses, Abuses and Solutions
HBaseConEast2016: Coprocessors – Uses, Abuses and Solutions
Highlights Of Sqoop2
Highlights Of Sqoop2
Querying the Internet of Things: Streaming SQL on Kafka/Samza and Storm/Trident
Querying the Internet of Things: Streaming SQL on Kafka/Samza and Storm/Trident
HBaseConEast2016: HBase and Spark, State of the Art
HBaseConEast2016: HBase and Spark, State of the Art
October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and ...
October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and ...
Apache kafka
Apache kafka
Introduction to Spark Streaming
Introduction to Spark Streaming
Spark+flume seattle
Spark+flume seattle
Realtime Detection of DDOS attacks using Apache Spark and MLLib
Realtime Detection of DDOS attacks using Apache Spark and MLLib
Near-realtime analytics with Kafka and HBase
Near-realtime analytics with Kafka and HBase
From oracle to hadoop with Sqoop and other tools
From oracle to hadoop with Sqoop and other tools
HBaseCon 2012 | Solbase - Kyungseog Oh, Photobucket
HBaseCon 2012 | Solbase - Kyungseog Oh, Photobucket
Ingest and Stream Processing - What will you choose?
Ingest and Stream Processing - What will you choose?
Destaque
Apache sqoop with an use case
Apache sqoop with an use case
Davin Abraham
ITSS Overview
ITSS Overview
IMC Institute
Big Data Analytics using Mahout
Big Data Analytics using Mahout
IMC Institute
สมุดกิจกรรม Code for Kids
สมุดกิจกรรม Code for Kids
IMC Institute
Thai Software & Software Market Survey 2015
Thai Software & Software Market Survey 2015
IMC Institute
Introduction to Apache Sqoop
Introduction to Apache Sqoop
Avkash Chauhan
Big data processing using Hadoop with Cloudera Quickstart
Big data processing using Hadoop with Cloudera Quickstart
IMC Institute
Advanced Sqoop
Advanced Sqoop
Yogesh Kulkarni
Mobile User and App Analytics in China
Mobile User and App Analytics in China
IMC Institute
Install Apache Hadoop for Development/Production
Install Apache Hadoop for Development/Production
IMC Institute
Machine Learning using Apache Spark MLlib
Machine Learning using Apache Spark MLlib
IMC Institute
Kanban boards step by step
Kanban boards step by step
Giulio Roggero
Flume vs. kafka
Flume vs. kafka
Omid Vahdaty
Destaque
(13)
Apache sqoop with an use case
Apache sqoop with an use case
ITSS Overview
ITSS Overview
Big Data Analytics using Mahout
Big Data Analytics using Mahout
สมุดกิจกรรม Code for Kids
สมุดกิจกรรม Code for Kids
Thai Software & Software Market Survey 2015
Thai Software & Software Market Survey 2015
Introduction to Apache Sqoop
Introduction to Apache Sqoop
Big data processing using Hadoop with Cloudera Quickstart
Big data processing using Hadoop with Cloudera Quickstart
Advanced Sqoop
Advanced Sqoop
Mobile User and App Analytics in China
Mobile User and App Analytics in China
Install Apache Hadoop for Development/Production
Install Apache Hadoop for Development/Production
Machine Learning using Apache Spark MLlib
Machine Learning using Apache Spark MLlib
Kanban boards step by step
Kanban boards step by step
Flume vs. kafka
Flume vs. kafka
Semelhante a Big data: Loading your data with flume and sqoop
Data Integration
Data Integration
Datio Big Data
Bigdata
Bigdata
sweetysweety8
A Data Streaming Architecture with Apache Flink (berlin Buzzwords 2016)
A Data Streaming Architecture with Apache Flink (berlin Buzzwords 2016)
Robert Metzger
Kafka connect-london-meetup-2016
Kafka connect-london-meetup-2016
Gwen (Chen) Shapira
Storm – Streaming Data Analytics at Scale - StampedeCon 2014
Storm – Streaming Data Analytics at Scale - StampedeCon 2014
StampedeCon
QCon London - Stream Processing with Apache Flink
QCon London - Stream Processing with Apache Flink
Robert Metzger
Building Scalable Data Pipelines - 2016 DataPalooza Seattle
Building Scalable Data Pipelines - 2016 DataPalooza Seattle
Evan Chan
GOTO Night Amsterdam - Stream processing with Apache Flink
GOTO Night Amsterdam - Stream processing with Apache Flink
Robert Metzger
Flink history, roadmap and vision
Flink history, roadmap and vision
Stephan Ewen
Stream Processing with Apache Flink (Flink.tw Meetup 2016/07/19)
Stream Processing with Apache Flink (Flink.tw Meetup 2016/07/19)
Apache Flink Taiwan User Group
Data ingestion
Data ingestion
nitheeshe2
SQOOP PPT
SQOOP PPT
Dushhyant Kumar
K. Tzoumas & S. Ewen – Flink Forward Keynote
K. Tzoumas & S. Ewen – Flink Forward Keynote
Flink Forward
Introduction to sqoop
Introduction to sqoop
Uday Vakalapudi
Cloud lunch and learn real-time streaming in azure
Cloud lunch and learn real-time streaming in azure
Timothy Spann
Visual Mapping of Clickstream Data
Visual Mapping of Clickstream Data
DataWorks Summit
Hadoop pycon2011uk
Hadoop pycon2011uk
Aditya Sakhuja
Discover HDP2.1: Apache Storm for Stream Data Processing in Hadoop
Discover HDP2.1: Apache Storm for Stream Data Processing in Hadoop
Hortonworks
Discover HDP 2.1: Apache Falcon for Data Governance in Hadoop
Discover HDP 2.1: Apache Falcon for Data Governance in Hadoop
Hortonworks
Robust stream processing with Apache Flink
Robust stream processing with Apache Flink
Aljoscha Krettek
Semelhante a Big data: Loading your data with flume and sqoop
(20)
Data Integration
Data Integration
Bigdata
Bigdata
A Data Streaming Architecture with Apache Flink (berlin Buzzwords 2016)
A Data Streaming Architecture with Apache Flink (berlin Buzzwords 2016)
Kafka connect-london-meetup-2016
Kafka connect-london-meetup-2016
Storm – Streaming Data Analytics at Scale - StampedeCon 2014
Storm – Streaming Data Analytics at Scale - StampedeCon 2014
QCon London - Stream Processing with Apache Flink
QCon London - Stream Processing with Apache Flink
Building Scalable Data Pipelines - 2016 DataPalooza Seattle
Building Scalable Data Pipelines - 2016 DataPalooza Seattle
GOTO Night Amsterdam - Stream processing with Apache Flink
GOTO Night Amsterdam - Stream processing with Apache Flink
Flink history, roadmap and vision
Flink history, roadmap and vision
Stream Processing with Apache Flink (Flink.tw Meetup 2016/07/19)
Stream Processing with Apache Flink (Flink.tw Meetup 2016/07/19)
Data ingestion
Data ingestion
SQOOP PPT
SQOOP PPT
K. Tzoumas & S. Ewen – Flink Forward Keynote
K. Tzoumas & S. Ewen – Flink Forward Keynote
Introduction to sqoop
Introduction to sqoop
Cloud lunch and learn real-time streaming in azure
Cloud lunch and learn real-time streaming in azure
Visual Mapping of Clickstream Data
Visual Mapping of Clickstream Data
Hadoop pycon2011uk
Hadoop pycon2011uk
Discover HDP2.1: Apache Storm for Stream Data Processing in Hadoop
Discover HDP2.1: Apache Storm for Stream Data Processing in Hadoop
Discover HDP 2.1: Apache Falcon for Data Governance in Hadoop
Discover HDP 2.1: Apache Falcon for Data Governance in Hadoop
Robust stream processing with Apache Flink
Robust stream processing with Apache Flink
Mais de Christophe Marchal
Elasticsearch avoiding hotspots
Elasticsearch avoiding hotspots
Christophe Marchal
Performance
Performance
Christophe Marchal
Alluxio
Alluxio
Christophe Marchal
Elasticsearch cluster deep dive
Elasticsearch cluster deep dive
Christophe Marchal
Elasticsearch
Elasticsearch
Christophe Marchal
Reactive programming with Rxjava
Reactive programming with Rxjava
Christophe Marchal
Terraform
Terraform
Christophe Marchal
Consul in 5 minutes
Consul in 5 minutes
Christophe Marchal
Spark in 15 min
Spark in 15 min
Christophe Marchal
Microservices Architecture: Nirvana or Nightmare
Microservices Architecture: Nirvana or Nightmare
Christophe Marchal
Capistrano
Capistrano
Christophe Marchal
Aws, play! couch db scaling soa in the cloud
Aws, play! couch db scaling soa in the cloud
Christophe Marchal
Devops e a nova cultura - TDC Florianopolis 2015
Devops e a nova cultura - TDC Florianopolis 2015
Christophe Marchal
Devops and the New Culture
Devops and the New Culture
Christophe Marchal
CUDA
CUDA
Christophe Marchal
Monads in practice
Monads in practice
Christophe Marchal
Productivity and scalability with Play and Scala
Productivity and scalability with Play and Scala
Christophe Marchal
Reactive application
Reactive application
Christophe Marchal
Internet of things and arduino
Internet of things and arduino
Christophe Marchal
Hbase
Hbase
Christophe Marchal
Mais de Christophe Marchal
(20)
Elasticsearch avoiding hotspots
Elasticsearch avoiding hotspots
Performance
Performance
Alluxio
Alluxio
Elasticsearch cluster deep dive
Elasticsearch cluster deep dive
Elasticsearch
Elasticsearch
Reactive programming with Rxjava
Reactive programming with Rxjava
Terraform
Terraform
Consul in 5 minutes
Consul in 5 minutes
Spark in 15 min
Spark in 15 min
Microservices Architecture: Nirvana or Nightmare
Microservices Architecture: Nirvana or Nightmare
Capistrano
Capistrano
Aws, play! couch db scaling soa in the cloud
Aws, play! couch db scaling soa in the cloud
Devops e a nova cultura - TDC Florianopolis 2015
Devops e a nova cultura - TDC Florianopolis 2015
Devops and the New Culture
Devops and the New Culture
CUDA
CUDA
Monads in practice
Monads in practice
Productivity and scalability with Play and Scala
Productivity and scalability with Play and Scala
Reactive application
Reactive application
Internet of things and arduino
Internet of things and arduino
Hbase
Hbase
Último
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
gvaughan
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
Commit University
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
ScyllaDB
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Mark Simos
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
LoriGlavin3
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
Dilum Bandara
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
charlottematthew16
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
Pixlogix Infotech
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
null - The Open Security Community
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
Mark Billinghurst
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
Alfredo García Lavilla
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
RankYa
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
Alan Dix
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
Mattias Andersson
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
Sri Ambati
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
Lonnie McRorey
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
Kalema Edgar
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
Hervé Boutemy
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
DianaGray10
Último
(20)
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
Big data: Loading your data with flume and sqoop
1.
Loading data in Hadoop
2 with SQOOP and Flume Christophe Marchal | Software Architect
2.
Problem to solve
3.
Hortonworks stack
4.
Batch Loading vs Stream Loading
5.
SQOOP HCatalog
6.
SQOOP 1: Import
7.
SQOOP 1: Export
8.
SCOOP 2
9.
Flume Source Source Source Source Web Web Server Web Server Server Agent Agent Agent Agent Sink Sink Sink Sink Channel Channel Channel Channel HDFS
10.
Multi agent flow
11.
Consolidation flow
12.
Flume vs SQOOP ● distributed ● Data
imports ● reliable (transaction) ● Parallelizes data ● available (backup routes) ● collecting data ● aggregating data transfer ● Copies data quickly
13.
Flume example
14.
Flume example
15.
Flume example
16.
SQOOP: import HDFS
17.
SQOOP: import HDFS
18.
SQOOP: import HDFS
19.
SQOOP: import Hive
20.
SQOOP: import Hive
21.
SQOOP: import Hive
22.
Thanks Christophe Marchal |
Software Architect @toff63
Baixar agora