Welcome to Hadoop World: NYC 2009
Hadoop is Everywhere
                          Presents:




Christophe Bisciglia
Founder christophe@cloudera.com
Hadoop World Details and Event Updates
Too Late to Print
▪   WiFi Details
    ▪   SSID: HadoopWorld
    ▪   Password: hadoop09
▪   Twitter: #hadoopworld
▪   Break Out Sessions
    ▪   Applications (This Room)
    ▪   Dev / Admin: Terrace Ballroom (Across Lobby)
    ▪   Extensions: Vanderbilt Suite (One Floor Up)
▪   UI BOF
    ▪   Lead: Philip Zeyliger, Cloudera
    ▪   Vanderbilt Suite, Afternoon Break
▪   HBase BOF
    ▪   Lead: Michael Stack, Microsoft
    ▪   Terrace Ballroom, Afternoon Break
Hadoop World Sponsors
Thanks!
Why Hadoop World?
Time to Upgrade Your Data Management Strategy
▪   Hadoop isn’t just for Web Companies anymore
    ▪   Terabytes are commonplace
    ▪   Enables consumption of all enterprise data
    ▪   Wide adoption across verticals
▪   Hadoop is driven by the Community
    ▪   Most registrants are new to Hadoop
    ▪   Sharing experience is critical - and incredibly valuable
    ▪   Users and Developers exchanging needs and ideas
Growing Up with Hadoop
You’ve come a long way baby...
▪   Early Days
    ▪   2004: Google Publishes MapReduce/GFS
    ▪   2005: Hadoop Prototype
        ▪   Doug Cutting and Mike Cafarella
    ▪   2006: Hadoop Running on 20 nodes
        ▪   Internet Archive and UW



    [Photo: Doug Cutting. Photo credit: New York Times]

▪   Formative Years
    ▪   2006: Yahoo! Begins Major Investment
    ▪   2007: Yahoo! Runs Hadoop on 2000 nodes
    ▪   2008: Yahoo! uses Hadoop to claim Terasort Benchmark

▪   5 Major Releases for Hadoop in last year
    ▪   More Reliable
    ▪   More Scalable
    ▪   More Manageable

▪   New Sub-Projects Embrace New Users
    ▪   Hive: SQL Data Warehouse for Hadoop
    ▪   Pig: Data Analysis Language

▪   Sqoop: Database import for Hadoop
    ▪   Developed by Aaron Kimball, Cloudera
    ▪   Works over JDBC
    ▪   Extensible for better performance

▪   RDBMS Vendors Embrace Hadoop
    ▪   MapReduce is great for Analytics
    ▪   Hadoop is the MapReduce Standard
    ▪   [Vendor logo] integrates directly with Hadoop

▪   Adoption Spanning Globe
    ▪   HUGs outside the US
    ▪   Over 10x Companies “PoweredBy”
    ▪   Not Just for Web Companies Anymore
Cloudera’s Distribution for Hadoop
Delivering Hadoop to a Larger Community

▪   Diagram labels:
    ▪   Hadoop Community
    ▪   Latest Stable Hadoop Release
    ▪   Stable Upcoming Features (by customer request)
    ▪   Distribution for Hadoop
    ▪   Source Code Powering Y!
    ▪   Improvements for EC2 and S3
    ▪   New Features from Cloudera
    ▪   Cloudera Enhancements, Bug Fixes
    ▪   Contributed to Apache

▪   Distribution for Hadoop
    ▪   Cross-Platform Packaging, Integration and Testing
    ▪   Hive, Pig, Sqoop, ...
    ▪   Support
▪   Private Cloud: Packages
▪   Public Cloud: Images
Comparing Growth Rates since March 2009
Standard Packaging Drives Adoption

▪   Consistent Downloads from Apache
▪   Cloudera Packages Drive New Usage
▪   Enables New Hadoop Applications

Chart: growth relative to March 2009 (axis labels: March 2009, May 2009, July 09, Aug 09, Sept 09)
▪   Cloudera Downloads: 100%, 238%, 384%, 762%, 1,026%, 1,392%, 1,835%
▪   Apache Downloads: 100%, 96%, 95%, 133%, 93%, 97%, 95%

Normalized by unique users accessing hadoop.apache.org/core/releases.html and Cloudera Package Repositories in March 2009
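
The normalization described in the chart footnote can be illustrated with a minimal sketch: each period's unique-user count is expressed as a percentage of the baseline period, so the baseline reads 100%. The function name and the sample counts below are hypothetical and are not figures from the talk.

```python
def normalize_to_baseline(counts, baseline_index=0):
    """Express each count as a rounded percentage of the baseline period's count."""
    baseline = counts[baseline_index]
    if baseline <= 0:
        raise ValueError("baseline count must be positive")
    return [round(100 * c / baseline) for c in counts]


# Hypothetical unique-user counts per period (NOT data from the talk).
hypothetical_counts = [400, 500, 1000]
print(normalize_to_baseline(hypothetical_counts))  # [100, 125, 250]
```

Normalizing both series to their own March 2009 value puts them on a common scale, so relative growth can be compared even when the absolute user counts differ.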
Cloudera’s Business to Date
Support, Training and Professional Services
▪   Dozens of Support Customers
    ▪   Using Hadoop for real enterprise workloads

▪   Training and Certification
    ▪   Hundreds of engineers trained
    ▪   Sysadmin and Manager programs launched at Hadoop World

▪   Professional Services
