Ramkrishna Vasudevan and Anoop Sam John
The first part of the talk covers the success story of deploying the latest improvements to the off-heap Bucket Cache in one of the biggest clusters at Alibaba. It highlights how off-heap reads from the Bucket Cache improved average QPS and eliminated the frequent QPS dips caused by GC.
The second part covers the effort that went into making the HBase write path use off-heap memory effectively, the design changes in size accounting, and the performance gains achieved at the end of the task.
hbase hbasecon hbaseconasia2017 https://www.eventbrite.com/e/hbasecon-asia-2017-tickets-34935546159#
2. HBase Cache
▪ HBase cache for better read performance
– Keeps data local to the process
▪ L1: on-heap LRU cache
– Larger cache sizes => larger Java heap => GC issues
▪ L2: Bucket Cache (configuration sketch below)
– LRU eviction
– Backed by off-heap memory or a file
– Can be larger than the L1 allocation
– Not constrained by the Java heap size
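As a rough illustration of what enabling the L2 Bucket Cache involves, the sketch below sets the two core properties programmatically. In practice these live in hbase-site.xml, and the 12 GB size is only an example matching the Alibaba setup described later.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;

public class BucketCacheConfigSketch {
  public static void main(String[] args) {
    // Illustrative only: these properties normally live in hbase-site.xml.
    Configuration conf = HBaseConfiguration.create();
    // Back the L2 cache with off-heap memory (alternatives: "file:<path>", "mmap:<path>").
    conf.set("hbase.bucketcache.ioengine", "offheap");
    // Bucket cache capacity in MB; can exceed the Java heap because it is off heap.
    conf.set("hbase.bucketcache.size", "12288"); // 12 GB, as in the Alibaba cluster
    // The JVM must also allow enough direct memory, e.g. via
    // -XX:MaxDirectMemorySize in hbase-env.sh.
  }
}
```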
3. Bucket Cache - Reads
▪ Read block by block
▪ On-heap block copy for every read
▪ Lower throughput than L1
▪ GC impact from the copied blocks
▪ Latency is not predictable
[Diagram: read requests for Region1 and Region2 pass through the scanner layers in the RegionServer; each block is copied out of the off-heap Bucket Cache (Java ByteBuffer bucket slots) into an on-heap block before it is served.]
4. Bucket Cache – Optimized Off-heap Reads
▪ Serves data directly from the off-heap cache
▪ Less heap needed
▪ Better throughput
▪ Predictable latency
▪ Jira: HBASE-11425
▪ Available in HBase 2.0
[Diagram: read requests pass through the scanner layers, which read Java ByteBuffers directly from the off-heap Bucket Cache; each cached block carries a refcount, and scanner callbacks release it when the read completes, so no on-heap copy is made.]
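The refcounting in the diagram can be sketched as below. This is a simplified illustration with hypothetical class and method names, not HBase's actual internal API, but the idea matches the slide: a block stays pinned in its bucket slot while any scanner holds a reference, and the release callback frees it only when the count reaches zero.

```java
import java.nio.ByteBuffer;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical sketch: pins an off-heap cached block while scanners read it. */
final class RefCountedBlock {
  private final ByteBuffer offHeapData; // slice of a bucket slot
  private final AtomicInteger refCount = new AtomicInteger(1); // held by the cache itself

  RefCountedBlock(ByteBuffer offHeapData) {
    this.offHeapData = offHeapData;
  }

  /** Scanner layers call this before reading; no on-heap copy is made. */
  ByteBuffer retain() {
    refCount.incrementAndGet();
    return offHeapData.asReadOnlyBuffer();
  }

  /** Callback invoked when a scanner finishes, or by the cache on eviction. */
  void release() {
    if (refCount.decrementAndGet() == 0) {
      freeBucketSlot(offHeapData); // safe: no reader still holds the slot
    }
  }

  private static void freeBucketSlot(ByteBuffer b) {
    // Placeholder: a real cache would return the slot to its bucket allocator.
  }
}
```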
5. Alibaba: Old Performance Data with L2 Cache
▪ From an A/B test cluster with 400+ nodes, on simulated data
▪ 12 GB off-heap L2 cache
▪ Inconsistent throughput; latency spikes during GC
▪ HBase reads from L2 copy block data on heap
– More garbage and more GC workload
6. Performance Data After Off-heaping the Read Path
▪ Backported the patches to an hbase-1.2 based codebase
▪ Average throughput improved from 18 to 25 million QPS – roughly a 39% improvement
▪ Linear throughput and predictable latency
▪ In production since Oct 2016
▪ Used on 2016 Singles’ Day, with peaks of 100K QPS on a single RegionServer
7. HBase Writes
▪ Accumulate cells in the memstore
– Flushes by default at size >= 128 MB
▪ Global memstore heap size
– Defaults to 40% of Xmx
▪ MSLAB chunks & pool (see the sketch after the diagram)
– On-heap chunks of 2 MB
– Cell data in the memstore is copied over to chunks
– Reduces fragmentation
– Pool to reuse chunks
[Diagram: a RegionServer hosting Region 1 … Region N; each region’s memstores allocate 2 MB chunks from a shared MSLAB Chunk Pool.]
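A minimal sketch of the MSLAB idea follows; the class and method names are hypothetical, not HBase's actual internals. Cell bytes are bump-allocated into a fixed 2 MB slab, so the memstore's many small cell allocations collapse into a few large, reusable chunks, which is what reduces fragmentation.

```java
import java.nio.ByteBuffer;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical sketch of an MSLAB-style chunk: a 2 MB slab with bump-pointer allocation. */
final class MslabChunk {
  static final int CHUNK_SIZE = 2 * 1024 * 1024; // 2 MB, as on the slide
  private final ByteBuffer data = ByteBuffer.allocate(CHUNK_SIZE); // on-heap variant
  private final AtomicInteger nextOffset = new AtomicInteger(0);

  /**
   * Copies cell bytes into the chunk and returns their offset, or -1 if the
   * chunk is full and the caller must take a fresh chunk from the pool.
   */
  int copyCell(byte[] cellBytes) {
    int offset = nextOffset.getAndAdd(cellBytes.length);
    if (offset + cellBytes.length > CHUNK_SIZE) {
      return -1; // sketch: the wasted tail is acceptable, the chunk is recycled whole
    }
    ByteBuffer dup = data.duplicate(); // independent position, shared backing memory
    dup.position(offset);
    dup.put(cellBytes);
    return offset;
  }
}
```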
8. Write Path Off-heaping
▪ HBASE-15179, in HBase 2.0
▪ Read request data into reusable off-heap ByteBuffers (pool sketch after this list)
– ByteBuffer pool; response cell blocks already use this
– The same off-heap ByteBuffers are reused – each 64 KB in size
– Reduces garbage and GC work
– Off-heap buffers also avoid the temporary off-heap copy NIO makes when writing heap buffers
▪ Protobuf upgrade
– The latest PB 3.x versions support off-heap ByteBuffers throughout
– Shaded PB jars to avoid possible version conflicts
– New PB ByteStream works over one or more off-heap ByteBuffers
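A pool like the one named above might look like the following; this is a simplified sketch under assumed names, not the actual pool class in HBase. The point is that request bytes land directly in recycled 64 KB direct buffers, so the read-in path creates no per-request heap garbage.

```java
import java.nio.ByteBuffer;
import java.util.concurrent.ConcurrentLinkedQueue;

/** Hypothetical sketch: a pool of reusable 64 KB direct (off-heap) ByteBuffers. */
final class DirectBufferPool {
  private static final int BUF_SIZE = 64 * 1024; // 64 KB, as on the slide
  private final ConcurrentLinkedQueue<ByteBuffer> pool = new ConcurrentLinkedQueue<>();

  /** Reuse a pooled buffer if one is free; otherwise allocate a fresh off-heap one. */
  ByteBuffer take() {
    ByteBuffer buf = pool.poll();
    return buf != null ? buf : ByteBuffer.allocateDirect(BUF_SIZE);
  }

  /** Return a buffer once the request bytes have been consumed. */
  void give(ByteBuffer buf) {
    buf.clear(); // reset position/limit for the next reader
    pool.offer(buf); // sketch: a real pool would cap its size
  }
}
```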
9. Write Path Off-heaping
▪ HBASE-15179
▪ Off-heap MSLAB chunk pool (configuration sketch after the diagram)
– 2 MB off-heap chunks
– Global off-heap memstore size – new config hbase.regionserver.offheap.global.memstore.size
– Separate tracking of cell data size and heap overhead size
– Region flush decision on cell data size alone (128 MB default flush threshold)
– Global-level checking uses both data size and heap overhead
[Diagram: a RegionServer hosting Region 1 … Region N; each region’s memstores allocate chunks from a shared off-heap MSLAB Chunk Pool.]
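Enabling the off-heap memstore comes down to the new config named on the slide; the sketch below sets it programmatically, though in practice it belongs in hbase-site.xml, and the 24 GB value is just the figure used in the tests on the next slide.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;

public class OffheapMemstoreConfigSketch {
  public static void main(String[] args) {
    // Illustrative only: normally set in hbase-site.xml.
    Configuration conf = HBaseConfiguration.create();
    // Global off-heap memstore capacity in MB (new config from HBASE-15179).
    // A non-zero value makes the MSLAB chunk pool allocate off-heap chunks.
    conf.set("hbase.regionserver.offheap.global.memstore.size", "24576"); // 24 GB
  }
}
```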
10. Write Path Off-heaping – Performance Results
▪ PerformanceEvaluation tool
– 200 regions, single node
– 3 cells per row
– 300-byte value size per cell
– 100 GB of data written
– G1GC, 65% IHOP
▪ On-heap tests
– 40 GB max heap
– 60% global memstore size
– MSLAB pool at 100%
▪ Off-heap tests
– 24 GB global off-heap memstores (60% of 40 GB)
– 16 GB max heap
[Chart: off-heap vs. on-heap average write throughput at 10/25/50/75/100 client threads; the off-heap series is consistently higher.]
▪ 6%–22% average throughput gain
11. Accordion: Compacting Memstore
[Diagram: Puts land in the active segment of the Compacting MemStore; in-memory compaction turns it into immutable segments that stay in RAM, and only later are segments flushed to HFiles on disk; Gets/Scans read across all segments.]
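HBase 2.0 exposes the memstore compaction policy through a configuration key; the sketch below shows the cluster-wide default, with the policy names I believe apply (NONE, BASIC, EAGER). Treat the exact values as something to confirm against the docs of your HBase version.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;

public class CompactingMemstoreConfigSketch {
  public static void main(String[] args) {
    Configuration conf = HBaseConfiguration.create();
    // Cluster-wide default in-memory compaction policy (HBase 2.0):
    //   NONE  - classic memstore, no in-memory compaction
    //   BASIC - flatten/compact the segment index only
    //   EAGER - also compact cell data in memory (drops duplicates/old versions)
    conf.set("hbase.hregion.compacting.memstore.type", "BASIC");
  }
}
```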
12. Write Path Off-heaping – Performance Results
▪ Off-heap with compacting memstore
– In-memory flushes/flattening reduce heap overhead
– Allows the memstore to grow bigger
– The off-heap MSLAB pool with compacting memstores performs best
▪ PE tool – 50 threads, writing 25 GB
[Chart: average write throughput vs. value size per cell (100 / 500 / 1024 bytes) for four setups: on heap, on heap + compacting memstore, on heap + compacting memstore + MSLAB off, and off heap + compacting memstore.]