O slideshow foi denunciado.
Utilizamos seu perfil e dados de atividades no LinkedIn para personalizar e exibir anúncios mais relevantes. Altere suas preferências de anúncios quando desejar.

Managing the Dewey Decimal System

115 visualizações

Publicada em

OCLC has been using HBase since 2012 to enable single-search-box access to over a billion items from your library and the world’s library collection. This talk will provide an overview of how HBase is structured to provide this information and some of the challenges they have encountered to scale to support the world catalog and how they have overcome them.

Publicada em: Tecnologia
  • Seja o primeiro a comentar

  • Seja a primeira pessoa a gostar disto

Managing the Dewey Decimal System

  1. 1. Confidential – Restricted Cloudera’s Vision for HBase Krishna Maheshwari Director, Product Management
  2. 2. Confidential – Restricted 2 Where are we today With bulleted list • #17 DBMS by popularity1, #5 by revenue2 • Large ecosystem (Nifi, Kafka, Sqoop, Hive, Impala, SOLR, Ranger, Atlas, etc) • Supports NoSQL, SQL, Geospatual, Graph, TimeSeries, Key Value and other use cases • Sold by: Cloudera, IBM, Microsoft, Amazon, Teradata, Oracle and more 1. As per db-engines 2. Cloudera anlaysis
  3. 3. Confidential – Restricted 3 What has HBase enabled? • Operationalizing ML / AI to revolutionize healthcare, public utilities, etc • Serving webscale content • Empowering big data analytics for operational and offline uses • Acting as a resilient store of record
  4. 4. Confidential – Restricted 4 What’s changed since HBase began • Acceptable trade-offs – Agility vs ownership – Simplicity vs control • Infrastructure as code • Rise of “HTAP” systems • Everyone offers NoSQL Big data getting bigger
  5. 5. Confidential – Restricted 5 Next 10 years • Auto-resiliency, auto-scaling • Self-optimization through AI/ML • Multi-modal • Performance
  6. 6. Confidential – Restricted 6 User complaints can act as guideposts • Hard to setup • Complex to configure and tune • Not quite multi-tenant • Slow at analytics • Doesn’t scale-up
  7. 7. Confidential – Restricted 7 Where will Cloudera focus? • Operational use cases • Integration • Infrastructure as code • Performance
  8. 8. Confidential – Restricted THANK YOU