SlideShare a Scribd company logo
1 of 18
Introduction to HBase
Anil Gupta
@bigdatanoob
What is NoSql?
RDBMS vs NoSql
HBase
HBase Components
Architecture
HBase Cluster
HBase Data Model
Key -> Value
Region
Outline
NoSQL is acronym for Not Only SQL. These databases are
non-relational. This term was coined in 1998.
They do not use SQL as their primary language.
NoSQL is not a replacement of Relational
Database.
NoSQL is designed for distributed data stores
NoSQL was designed to store semi-structured
and sparse data
NoSQL RDBMS
Hardware Farm of Commodity(upto
several thousand)
1-3 High End or
Proprietary(costly)
Data Type Semi-structured and
Sparse
Structured and dense
Data Size PetaBytes(1015) TeraBytes(1012 bytes)
Auto-Sharding Yes No
Flexible Schema Yes No
Referential Integrity No Yes
Support for Joins No Yes
Support for Aggregations Basic Advance
HBase is an open-source, distributed, versioned,
key-value database modeled after Google's
Bigtable.
is optional for
HBase has real-time read/writes(in milliseconds)
HBase is highly fault tolerant(HA) and scalable
+ Random Read/Write
access= + Apache
Zookeeper
Selling Points of HBase
Highly Scalable
Auto-sharding
Strongly Consistent
Out of the box support for Historical Data
Very high read throughput
Readily compatible with Hadoop
Highly Fault-tolerant(HA)
HBase Components
1. HBase Master(HMaster): HMaster is the
Master Server.
 HMaster is responsible for monitoring all
RegionServers
 Performs load balancing a.k.a sharding
 Assigns regions to RegionServers
 All the metadata changes go through Master
 Periodically checks and cleans up the .META.
table
 Multiple HMaster can run in cluster but only one
HMaster will be active at any time.
HBase Components(cont.)
2. RegionServer(HRegionServer):
HRegionServer is the implementation of the
worker module.
 Runs as Java Service on worker nodes.
 Machine running a RegionServer is considered
a worker node.
 Serves get/put/scan requests
 Responsible for splitting and compacting regions
 Runs on DataNode
 Multiple RegionServers run in a cluster
Zookeeper in HBase
ZooKeeper: It allows distributed processes to
coordinate with each other through a shared
hierarchical name space. It is distributed and
highly reliable service.
In HBase it is responsible for following:
 Provide availability status of RegionServers
 To ensure single active HMaster in the cluster
 Provide location of “-ROOT-” table
 Selection of new HMaster in case of failure of
an active HMaster
HBase Architecture
HBase Cluster
Worker
Node
Worker
Node
Worker Node
DataNodeDataNode
TaskTracker
HRegionServe
r
DataNode
TaskTracker
HRegionServe
r
Worker Node
DataNode
Worker Node
DataNode
RegionServer
Worker Node
DataNode
Worker Node
DataNode
Worker Node
DataNode
Worker Node
DataNode
Name
Node
HMaster
Zoo
keeper
HMaster
RegionServer
RegionServer
RegionServerRegionServer
RegionServerRegionServer
Name
Node
Column Family and Column Qualifier
Column Family: Columns Qualifiers in HBase are grouped
into column families.
The colon character (:) delimits the column qualifier family
from the column family.
Combination of <Column Family>: <Column Qualifier> is
equivalent to a Column name.
Physically, all column qualifiers of a column family are stored
together on the file system.
• Column Qualifiers within a family are sorted lexicographically and
stored together
Example: txn:amt , Here “txn” is the Column Family and “amt” is
the Column Qualifier.
HBase Data Model
• Table maintains data in lexicographic order by RowKey.
• Everything except table names are stored as byte array
• Only column families are defined at the creation time of table
 Each family can have any number of columns(to a
maximum of few millions)
 Each row can have different columns in a column family
 Each column consists of any number of versions
 Columns only exist when inserted because HBase does
not have NULL values
(RowKey, Column Family:Column Qualifier,
Timestamp) is a “Key” in HBase.
“Value” is stored corresponding to a “Key”
Timestamp is used to support storing of Historical
Data
Table is always indexed on RowKey
Key -> Value in HBase
Region
Tables in HBase are divided into multiple Regions.
1 Region = 1 Partition of Table
Regions are hosted by RegionServers
1 RegionServer can host 100’s of Regions
RegionServer can host Regions from multiple
tables.
After a major compaction, every region has 1 HFile
for each column family.
Random Facts About
HBase
Data in HBase is stored in HFile Format
Values are stored as Byte Array in HFiles
HLog is the file format used for storing “Write
Ahead Logging” in HBase.
References
http://hbase.apache.org/
https://hadoop.apache.org/
http://www.larsgeorge.com/2009/10/hbase-
architecture-101-storage.html
Questions?

More Related Content

What's hot (20)

HBase in Practice
HBase in Practice HBase in Practice
HBase in Practice
 
Introduction to HiveQL
Introduction to HiveQLIntroduction to HiveQL
Introduction to HiveQL
 
6.hive
6.hive6.hive
6.hive
 
Introduction to HBase - NoSqlNow2015
Introduction to HBase - NoSqlNow2015Introduction to HBase - NoSqlNow2015
Introduction to HBase - NoSqlNow2015
 
Intro to HBase
Intro to HBaseIntro to HBase
Intro to HBase
 
Apache hive
Apache hiveApache hive
Apache hive
 
Hive(ppt)
Hive(ppt)Hive(ppt)
Hive(ppt)
 
Nosql databases
Nosql databasesNosql databases
Nosql databases
 
Hadoop Security Architecture
Hadoop Security ArchitectureHadoop Security Architecture
Hadoop Security Architecture
 
Apache Phoenix + Apache HBase
Apache Phoenix + Apache HBaseApache Phoenix + Apache HBase
Apache Phoenix + Apache HBase
 
Hive
HiveHive
Hive
 
Security and Data Governance using Apache Ranger and Apache Atlas
Security and Data Governance using Apache Ranger and Apache AtlasSecurity and Data Governance using Apache Ranger and Apache Atlas
Security and Data Governance using Apache Ranger and Apache Atlas
 
Apache Hadoop and HBase
Apache Hadoop and HBaseApache Hadoop and HBase
Apache Hadoop and HBase
 
RDBMS vs NoSQL
RDBMS vs NoSQLRDBMS vs NoSQL
RDBMS vs NoSQL
 
Yarn.ppt
Yarn.pptYarn.ppt
Yarn.ppt
 
Facebook Messages & HBase
Facebook Messages & HBaseFacebook Messages & HBase
Facebook Messages & HBase
 
Introduction to HDFS
Introduction to HDFSIntroduction to HDFS
Introduction to HDFS
 
Hive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep DiveHive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep Dive
 
Apache phoenix: Past, Present and Future of SQL over HBAse
Apache phoenix: Past, Present and Future of SQL over HBAseApache phoenix: Past, Present and Future of SQL over HBAse
Apache phoenix: Past, Present and Future of SQL over HBAse
 
Hadoop Overview & Architecture
Hadoop Overview & Architecture  Hadoop Overview & Architecture
Hadoop Overview & Architecture
 

Viewers also liked

Viewers also liked (6)

HiveServer2
HiveServer2HiveServer2
HiveServer2
 
HiveServer2 for Apache Hive
HiveServer2 for Apache HiveHiveServer2 for Apache Hive
HiveServer2 for Apache Hive
 
The Google File System (GFS)
The Google File System (GFS)The Google File System (GFS)
The Google File System (GFS)
 
GFS
GFSGFS
GFS
 
GOOGLE FILE SYSTEM
GOOGLE FILE SYSTEMGOOGLE FILE SYSTEM
GOOGLE FILE SYSTEM
 
SQL to Hive Cheat Sheet
SQL to Hive Cheat SheetSQL to Hive Cheat Sheet
SQL to Hive Cheat Sheet
 

Similar to Introduction To HBase

Similar to Introduction To HBase (20)

Hbase
HbaseHbase
Hbase
 
CCS334 BIG DATA ANALYTICS UNIT 5 PPT ELECTIVE PAPER
CCS334 BIG DATA ANALYTICS UNIT 5 PPT  ELECTIVE PAPERCCS334 BIG DATA ANALYTICS UNIT 5 PPT  ELECTIVE PAPER
CCS334 BIG DATA ANALYTICS UNIT 5 PPT ELECTIVE PAPER
 
Introduction to HBase
Introduction to HBaseIntroduction to HBase
Introduction to HBase
 
Hbase.pptx
Hbase.pptxHbase.pptx
Hbase.pptx
 
Hbase Quick Review Guide for Interviews
Hbase Quick Review Guide for InterviewsHbase Quick Review Guide for Interviews
Hbase Quick Review Guide for Interviews
 
4. hbase overview
4. hbase overview4. hbase overview
4. hbase overview
 
Hbase introduction
Hbase introductionHbase introduction
Hbase introduction
 
Techincal Talk Hbase-Ditributed,no-sql database
Techincal Talk Hbase-Ditributed,no-sql databaseTechincal Talk Hbase-Ditributed,no-sql database
Techincal Talk Hbase-Ditributed,no-sql database
 
Hbase 20141003
Hbase 20141003Hbase 20141003
Hbase 20141003
 
Hadoop - Apache Hbase
Hadoop - Apache HbaseHadoop - Apache Hbase
Hadoop - Apache Hbase
 
Data Storage Management
Data Storage ManagementData Storage Management
Data Storage Management
 
01 hbase
01 hbase01 hbase
01 hbase
 
Big data hbase
Big data hbase Big data hbase
Big data hbase
 
Hbase
HbaseHbase
Hbase
 
Hbase
HbaseHbase
Hbase
 
Facebook keynote-nicolas-qcon
Facebook keynote-nicolas-qconFacebook keynote-nicolas-qcon
Facebook keynote-nicolas-qcon
 
支撑Facebook消息处理的h base存储系统
支撑Facebook消息处理的h base存储系统支撑Facebook消息处理的h base存储系统
支撑Facebook消息处理的h base存储系统
 
Hbase
HbaseHbase
Hbase
 
Introduction to Apache HBase
Introduction to Apache HBaseIntroduction to Apache HBase
Introduction to Apache HBase
 
Hbasepreso 111116185419-phpapp02
Hbasepreso 111116185419-phpapp02Hbasepreso 111116185419-phpapp02
Hbasepreso 111116185419-phpapp02
 

Recently uploaded

Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionOnePlan Solutions
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension AidPhilip Schwarz
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech studentsHimanshiGarg82
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfproinshot.com
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024Mind IT Systems
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale
 

Recently uploaded (20)

Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 

Introduction To HBase

  • 1. Introduction to HBase Anil Gupta @bigdatanoob
  • 2. What is NoSql? RDBMS vs NoSql HBase HBase Components Architecture HBase Cluster HBase Data Model Key -> Value Region Outline
  • 3. NoSQL is acronym for Not Only SQL. These databases are non-relational. This term was coined in 1998. They do not use SQL as their primary language. NoSQL is not a replacement of Relational Database. NoSQL is designed for distributed data stores NoSQL was designed to store semi-structured and sparse data
  • 4. NoSQL RDBMS Hardware Farm of Commodity(upto several thousand) 1-3 High End or Proprietary(costly) Data Type Semi-structured and Sparse Structured and dense Data Size PetaBytes(1015) TeraBytes(1012 bytes) Auto-Sharding Yes No Flexible Schema Yes No Referential Integrity No Yes Support for Joins No Yes Support for Aggregations Basic Advance
  • 5. HBase is an open-source, distributed, versioned, key-value database modeled after Google's Bigtable. is optional for HBase has real-time read/writes(in milliseconds) HBase is highly fault tolerant(HA) and scalable + Random Read/Write access= + Apache Zookeeper
  • 6. Selling Points of HBase Highly Scalable Auto-sharding Strongly Consistent Out of the box support for Historical Data Very high read throughput Readily compatible with Hadoop Highly Fault-tolerant(HA)
  • 7. HBase Components 1. HBase Master(HMaster): HMaster is the Master Server.  HMaster is responsible for monitoring all RegionServers  Performs load balancing a.k.a sharding  Assigns regions to RegionServers  All the metadata changes go through Master  Periodically checks and cleans up the .META. table  Multiple HMaster can run in cluster but only one HMaster will be active at any time.
  • 8. HBase Components(cont.) 2. RegionServer(HRegionServer): HRegionServer is the implementation of the worker module.  Runs as Java Service on worker nodes.  Machine running a RegionServer is considered a worker node.  Serves get/put/scan requests  Responsible for splitting and compacting regions  Runs on DataNode  Multiple RegionServers run in a cluster
  • 9. Zookeeper in HBase ZooKeeper: It allows distributed processes to coordinate with each other through a shared hierarchical name space. It is distributed and highly reliable service. In HBase it is responsible for following:  Provide availability status of RegionServers  To ensure single active HMaster in the cluster  Provide location of “-ROOT-” table  Selection of new HMaster in case of failure of an active HMaster
  • 11. HBase Cluster Worker Node Worker Node Worker Node DataNodeDataNode TaskTracker HRegionServe r DataNode TaskTracker HRegionServe r Worker Node DataNode Worker Node DataNode RegionServer Worker Node DataNode Worker Node DataNode Worker Node DataNode Worker Node DataNode Name Node HMaster Zoo keeper HMaster RegionServer RegionServer RegionServerRegionServer RegionServerRegionServer Name Node
  • 12. Column Family and Column Qualifier Column Family: Columns Qualifiers in HBase are grouped into column families. The colon character (:) delimits the column qualifier family from the column family. Combination of <Column Family>: <Column Qualifier> is equivalent to a Column name. Physically, all column qualifiers of a column family are stored together on the file system. • Column Qualifiers within a family are sorted lexicographically and stored together Example: txn:amt , Here “txn” is the Column Family and “amt” is the Column Qualifier.
  • 13. HBase Data Model • Table maintains data in lexicographic order by RowKey. • Everything except table names are stored as byte array • Only column families are defined at the creation time of table  Each family can have any number of columns(to a maximum of few millions)  Each row can have different columns in a column family  Each column consists of any number of versions  Columns only exist when inserted because HBase does not have NULL values
  • 14. (RowKey, Column Family:Column Qualifier, Timestamp) is a “Key” in HBase. “Value” is stored corresponding to a “Key” Timestamp is used to support storing of Historical Data Table is always indexed on RowKey Key -> Value in HBase
  • 15. Region Tables in HBase are divided into multiple Regions. 1 Region = 1 Partition of Table Regions are hosted by RegionServers 1 RegionServer can host 100’s of Regions RegionServer can host Regions from multiple tables. After a major compaction, every region has 1 HFile for each column family.
  • 16. Random Facts About HBase Data in HBase is stored in HFile Format Values are stored as Byte Array in HFiles HLog is the file format used for storing “Write Ahead Logging” in HBase.