WebHDFS at King - May 2014 Hadoop MeetUp
huguk
The latest developments at King in their work with WebHDFS.
1.
2.
How to turbo charge your data transfers with WebHDFS
Andy Done, Data Platform Lead
andy.done@king.com
3.
4.
Last time…
5.
Since then…
6.
100 40 Hadoop
7.
1 0.5 Storage
8.
15 10 Events
9.
10 4 ExaSol
10.
11.
2.5 6 Load times
12.
Problem WebHDFS
13.
Old way WebHDFS
14.
Old way
hadoop fs -cat /some/path/* | bulk_load my_table
15.
WebHDFS way WebHDFS
16.
WebHDFS way
IMPORT INTO TABLE my_table FROM
  FILE 'http://namenode/webhdfs/v1/some/path/file_1'
  FILE 'http://namenode/webhdfs/v1/some/path/file_2'
  …
  FILE 'http://namenode/webhdfs/v1/some/path/file_n'
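The file URLs on this slide follow the WebHDFS REST layout (`/webhdfs/v1/<path>`). As a rough sketch of how such a list of URLs could be produced, the snippet below parses a directory listing in the shape returned by the WebHDFS `LISTSTATUS` operation and builds one read URL per file. The host and port `namenode:50070`, the function name, and the sample listing are illustrative assumptions, not taken from the talk. The REST API documents reads as `?op=OPEN`; the slide's URLs omit the query string, possibly for brevity.

```python
import json
from urllib.parse import quote


def webhdfs_file_urls(liststatus_json, directory, base="http://namenode:50070"):
    """Build one op=OPEN URL per regular file in a LISTSTATUS response.

    `base` is a hypothetical NameNode address; adjust it for a real cluster.
    """
    statuses = json.loads(liststatus_json)["FileStatuses"]["FileStatus"]
    prefix = base + "/webhdfs/v1" + quote(directory.rstrip("/"))
    return [
        prefix + "/" + quote(status["pathSuffix"]) + "?op=OPEN"
        for status in statuses
        if status["type"] == "FILE"  # skip sub-directories
    ]


# Hand-written sample in the shape of
# GET <base>/webhdfs/v1/some/path?op=LISTSTATUS
sample = json.dumps({"FileStatuses": {"FileStatus": [
    {"pathSuffix": "file_1", "type": "FILE"},
    {"pathSuffix": "file_2", "type": "FILE"},
    {"pathSuffix": "tmp", "type": "DIRECTORY"},
]}})

urls = webhdfs_file_urls(sample, "/some/path")
```

Each resulting URL could then be placed on its own FILE line of an import statement like the one on the slide.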
17.
WebHDFS benefits
• Simple
• Efficient
• Ubiquitous
• Parallelisable
• Bidirectional
• Fast
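To illustrate the "Parallelisable" point: because every file has its own HTTP URL, a client can read several files concurrently. The sketch below is one possible way to do that with a thread pool from the Python standard library; it is not the mechanism described in the talk, and the names `fetch`, `fetch_all`, and the `workers` default are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.request import urlopen


def fetch(url):
    """Read one file over HTTP and return its bytes.

    urlopen follows the redirect that a WebHDFS NameNode issues
    to the DataNode holding the data.
    """
    with urlopen(url) as response:
        return response.read()


def fetch_all(urls, fetch_one=fetch, workers=4):
    """Fetch every URL concurrently; results come back in input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(fetch_one, urls))
```

Passing a different `fetch_one` makes it possible to try the function without a cluster, for example `fetch_all(["a", "b"], fetch_one=str.upper)`.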
18.
Conclusion
19.
Thank you
20.
We're hiring!