SlideShare a Scribd company logo
1 of 10
CC 2.0 by Bryan Pearson | http://flic.kr/p/RUfEt
11.
                                   April
                                   2013

                                    2




          HBase is an
   open source, distributed,
  column-oriented data store
 modeled after Google’s BigTable


HBase Introduction

About HBase
11.
                                        April
                                        2013

                                         3
•  Sorted map data store
•  Table consists of rows, each has a
   row key (primary key)
•  Each row may have any number of
   columns (Map<byte[], byte []>)
•  Rows are sorted lexicographically
   based on row key



HBase Introduction

Data Model
11.
                                                                                 April
                                                                                 2013
 Different types of data separated                                                 4
 into different “column families”                            Data is all byte[]

         Row key     Data
         amuller     info: { ‘height’: ‘2.0m’, ‘state’: ‘ZH’ }
                     roles: { ‘IBM’: ‘Sales Manager’ }
         cguegi      info: { ‘height’: ‘1.85m’, ‘state’: ‘BE’ }
                     roles: { ‘Sentric’: ‘Architect’@ts=2011,
                               ‘Sentric’: ‘Mentor’@ts=2012,
                               ‘SBDUG’: ‘Founder’ }

 Different rows may have different
 sets of columns (table is sparse)

                                    Single cell may have different values
                                    at different timestampes

HBase Introduction

Sorted Map (Logical View)
11.
                                            info Column Family                    April
                                                                                  2013

                             Row key   Column key      Timestamp      Value        5
                             amuller   info:height     1333883187     2.0m
                             amuller   info:state      1273871824     ZH
Sorted on disk by row key,
column key, descending ts

                             cguegi    info:height     1325755229     1.85m
                             cguegi    info:state      1325751049     TG

                                             roles Column Family
                             Row key   Column key      Timestamp      Value
                             amuller   roles:IBM       1320105636     Developer
                             cguegi    roles:SBDUG     1330561785     Founder
                             cguegi    roles:Sentric   1325376723     Mentor
                             cguegi    roles:Sentric   1293840959     Architect


                                                     Unix timestamp
    HBase Introduction

    Sorted Map (Physical View)
11.
                                                                April
                                                                2013

                                                                 6


              HBase                         API

                                         RegionServer

                       Master               HFile    Memstore
                                             Write-Ahead Log




                           HDFS                     ZooKeeper

          [HBase: The Definitive Guide]




HBase Introduction

HBase Architecture
11.
                                             April
                                             2013

                                              7
•     Favors Consistency over Availability
•     Great Hadoop integration
•     Ordered range partitions
•     Automatically shards/scales
•     Sparse column storage




HBase Introduction

HBase vs other “NoSQL”
CC 2.0 by Aurelien Guichard | http://flic.kr/p/cjg9yw
11.
                                    April
                                    2013


                                     9
•     http://hbase.apache.org
•     http://www.sentric.ch
•     http://bigdata-usergroup.ch
•     http://about.me/cguegi




HBase Introduction

Resources
11.
                                                                                                                      April
                                                                                                                      2013

                                                                                                                      10




Source: http://blogs.the451group.com/information_management/2013/02/04/updated-database-lanscape-map-february-2013/



HBase Introduction

Database Landscape Map

More Related Content

Similar to Apache HBase: Introduction to a column-oriented data store

Optimization on Key-value Stores in Cloud Environment
Optimization on Key-value Stores in Cloud EnvironmentOptimization on Key-value Stores in Cloud Environment
Optimization on Key-value Stores in Cloud Environment
Fei Dong
 
Integration of HIve and HBase
Integration of HIve and HBaseIntegration of HIve and HBase
Integration of HIve and HBase
Hortonworks
 
Hadoop cluster configuration
Hadoop cluster configurationHadoop cluster configuration
Hadoop cluster configuration
prabakaranbrick
 
Facebook keynote-nicolas-qcon
Facebook keynote-nicolas-qconFacebook keynote-nicolas-qcon
Facebook keynote-nicolas-qcon
Yiwei Ma
 
支撑Facebook消息处理的h base存储系统
支撑Facebook消息处理的h base存储系统支撑Facebook消息处理的h base存储系统
支撑Facebook消息处理的h base存储系统
yongboy
 
Hadoop and h base in the real world
Hadoop and h base in the real worldHadoop and h base in the real world
Hadoop and h base in the real world
Joey Echeverria
 

Similar to Apache HBase: Introduction to a column-oriented data store (20)

Intro to HBase
Intro to HBaseIntro to HBase
Intro to HBase
 
Optimization on Key-value Stores in Cloud Environment
Optimization on Key-value Stores in Cloud EnvironmentOptimization on Key-value Stores in Cloud Environment
Optimization on Key-value Stores in Cloud Environment
 
Mar 2012 HUG: Hive with HBase
Mar 2012 HUG: Hive with HBaseMar 2012 HUG: Hive with HBase
Mar 2012 HUG: Hive with HBase
 
No SQL introduction
No SQL introductionNo SQL introduction
No SQL introduction
 
Apache HBase 1.0 Release
Apache HBase 1.0 ReleaseApache HBase 1.0 Release
Apache HBase 1.0 Release
 
Apache HBase™
Apache HBase™Apache HBase™
Apache HBase™
 
Integration of Hive and HBase
Integration of Hive and HBaseIntegration of Hive and HBase
Integration of Hive and HBase
 
Integration of HIve and HBase
Integration of HIve and HBaseIntegration of HIve and HBase
Integration of HIve and HBase
 
Ten tools for ten big data areas 04_Apache Hive
Ten tools for ten big data areas 04_Apache HiveTen tools for ten big data areas 04_Apache Hive
Ten tools for ten big data areas 04_Apache Hive
 
HBase Mongo_DB Project
HBase Mongo_DB ProjectHBase Mongo_DB Project
HBase Mongo_DB Project
 
Hypertable Distilled by edydkim.github.com
Hypertable Distilled by edydkim.github.comHypertable Distilled by edydkim.github.com
Hypertable Distilled by edydkim.github.com
 
Hadoop cluster configuration
Hadoop cluster configurationHadoop cluster configuration
Hadoop cluster configuration
 
Intro to HBase Internals & Schema Design (for HBase users)
Intro to HBase Internals & Schema Design (for HBase users)Intro to HBase Internals & Schema Design (for HBase users)
Intro to HBase Internals & Schema Design (for HBase users)
 
Hadoop and HBase in the Real World
Hadoop and HBase in the Real WorldHadoop and HBase in the Real World
Hadoop and HBase in the Real World
 
Techincal Talk Hbase-Ditributed,no-sql database
Techincal Talk Hbase-Ditributed,no-sql databaseTechincal Talk Hbase-Ditributed,no-sql database
Techincal Talk Hbase-Ditributed,no-sql database
 
Facebook keynote-nicolas-qcon
Facebook keynote-nicolas-qconFacebook keynote-nicolas-qcon
Facebook keynote-nicolas-qcon
 
Facebook Messages & HBase
Facebook Messages & HBaseFacebook Messages & HBase
Facebook Messages & HBase
 
支撑Facebook消息处理的h base存储系统
支撑Facebook消息处理的h base存储系统支撑Facebook消息处理的h base存储系统
支撑Facebook消息处理的h base存储系统
 
Hadoop and h base in the real world
Hadoop and h base in the real worldHadoop and h base in the real world
Hadoop and h base in the real world
 
Apache HBase - Introduction & Use Cases
Apache HBase - Introduction & Use CasesApache HBase - Introduction & Use Cases
Apache HBase - Introduction & Use Cases
 

More from Christian Gügi

Online Media Data Stream Processing with Kafka
Online Media Data Stream Processing with KafkaOnline Media Data Stream Processing with Kafka
Online Media Data Stream Processing with Kafka
Christian Gügi
 

More from Christian Gügi (7)

Real-Time Fraud Detection in Payment Transactions
Real-Time Fraud Detection in Payment TransactionsReal-Time Fraud Detection in Payment Transactions
Real-Time Fraud Detection in Payment Transactions
 
Building Scalable Big Data Pipelines
Building Scalable Big Data PipelinesBuilding Scalable Big Data Pipelines
Building Scalable Big Data Pipelines
 
Case Study: In-Store Analysis
Case Study: In-Store AnalysisCase Study: In-Store Analysis
Case Study: In-Store Analysis
 
Apachecon Europe 2012: Operating HBase - Things you need to know
Apachecon Europe 2012: Operating HBase - Things you need to knowApachecon Europe 2012: Operating HBase - Things you need to know
Apachecon Europe 2012: Operating HBase - Things you need to know
 
Online Media Data Stream Processing with Kafka
Online Media Data Stream Processing with KafkaOnline Media Data Stream Processing with Kafka
Online Media Data Stream Processing with Kafka
 
Near Real Time Processing of Social Media Data with HBase
Near Real Time Processing of Social Media Data with HBaseNear Real Time Processing of Social Media Data with HBase
Near Real Time Processing of Social Media Data with HBase
 
Using HBase Coprocessors to implement Prospective Search - Berlin Buzzwords -...
Using HBase Coprocessors to implement Prospective Search - Berlin Buzzwords -...Using HBase Coprocessors to implement Prospective Search - Berlin Buzzwords -...
Using HBase Coprocessors to implement Prospective Search - Berlin Buzzwords -...
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

Apache HBase: Introduction to a column-oriented data store

  • 1. CC 2.0 by Bryan Pearson | http://flic.kr/p/RUfEt
  • 2. 11. April 2013 2 HBase is an open source, distributed, column-oriented data store modeled after Google’s BigTable HBase Introduction About HBase
  • 3. 11. April 2013 3 •  Sorted map data store •  Table consists of rows, each has a row key (primary key) •  Each row may have any number of columns (Map<byte[], byte []>) •  Rows are sorted lexicographically based on row key HBase Introduction Data Model
  • 4. 11. April 2013 Different types of data separated 4 into different “column families” Data is all byte[] Row key Data amuller info: { ‘height’: ‘2.0m’, ‘state’: ‘ZH’ } roles: { ‘IBM’: ‘Sales Manager’ } cguegi info: { ‘height’: ‘1.85m’, ‘state’: ‘BE’ } roles: { ‘Sentric’: ‘Architect’@ts=2011, ‘Sentric’: ‘Mentor’@ts=2012, ‘SBDUG’: ‘Founder’ } Different rows may have different sets of columns (table is sparse) Single cell may have different values at different timestampes HBase Introduction Sorted Map (Logical View)
  • 5. 11. info Column Family April 2013 Row key Column key Timestamp Value 5 amuller info:height 1333883187 2.0m amuller info:state 1273871824 ZH Sorted on disk by row key, column key, descending ts cguegi info:height 1325755229 1.85m cguegi info:state 1325751049 TG roles Column Family Row key Column key Timestamp Value amuller roles:IBM 1320105636 Developer cguegi roles:SBDUG 1330561785 Founder cguegi roles:Sentric 1325376723 Mentor cguegi roles:Sentric 1293840959 Architect Unix timestamp HBase Introduction Sorted Map (Physical View)
  • 6. 11. April 2013 6 HBase API RegionServer Master HFile Memstore Write-Ahead Log HDFS ZooKeeper [HBase: The Definitive Guide] HBase Introduction HBase Architecture
  • 7. 11. April 2013 7 •  Favors Consistency over Availability •  Great Hadoop integration •  Ordered range partitions •  Automatically shards/scales •  Sparse column storage HBase Introduction HBase vs other “NoSQL”
  • 8. CC 2.0 by Aurelien Guichard | http://flic.kr/p/cjg9yw
  • 9. 11. April 2013 9 •  http://hbase.apache.org •  http://www.sentric.ch •  http://bigdata-usergroup.ch •  http://about.me/cguegi HBase Introduction Resources
  • 10. 11. April 2013 10 Source: http://blogs.the451group.com/information_management/2013/02/04/updated-database-lanscape-map-february-2013/ HBase Introduction Database Landscape Map