SlideShare uma empresa Scribd logo
1 de 47
Baixar para ler offline
Project Status
HBaseConAsia2018, Beijing
Michael Stack <stack@apache.org>
Yu Li <liyu@apache.org>
2.0.0
Pervasive...
...distributed, scalable, big data store
In a nutshell...
●  ...15,409 commits made by 311 contributors
●  ...representing 800,490 lines of code
●  ...mostly written in Java
●  ...has a well established, mature codebase
●  ...maintained by a very large development team
●  ...with stable Y-O-Y commits
●  ...took an estimated 222 years of effort (COCOMO model)
●  ...starting with its first commit in April, 2007 (>10 years old!)
Source https://www.openhub.net/p/hbase
LOC
Source https://www.openhub.net/p/hbase
Issues
Commits per month
Source https://www.openhub.net/p/hbase
Contributors
Source https://www.openhub.net/p/hbase
“The HBase project represents solid, useful computer science that solves
problems and runs businesses every day. To keep that going, we need to
keep bringing new ideas and approaches into the project...we need to
continue to attract people from all backgrounds and parts of the world. I'd love
to see more women, more people of color, and even more worldwide
diversity. I'd like to see more contributions from people not employed by big
data platform companies.
“If we continue to strive for a diversity of ideas and experiences, we'll keep
innovating so that HBase remains relevant for years to come.”
Misty Linville, Vice-President of the Apache HBase Project
New PMC Chairperson!
●  Apache HBase is an Open Source Apache project.
●  It’s what we want to make of it.
●  No owners!
●  Anyone can help!
●  All welcome!
●  The more, the merrier!
Our project
2.0.0
Active branches
2.0.0
Active Branches Latest Branche Release Release Manager
branch-1.2 1.2.6.1 (EOL’d) Sean Busbey
branch-1.3 1.3.2.1 (Yahoo) Francis Liu
branch-1.4 1.4.6 (Current Stable) Andrew Purtell
branch-1.5 Coming... Andrew Purtell
branch-2.0 2.0.1 Michael Stack
branch-2.1 2.1.0 Duo Zhang
branch-2.2 <null> <null>
branch-3 <null> <null>
2.0.0
hbase-2.0.0
2.0.0: Long-time coming
●  Branched four years ago
●  Released end-of-April, 2018
●  Took > 1 year to stabilize
○  hbase-2.0.0 released, April 29th, 2018
○  hbase-2.0.0-beta2 released, March 22nd, 2018
○  hbase-2.0.0-beta1 released January, 16th, 2018
○  hbase-2.0.0-alpha4 released November 4th, 2017
○  hbase-2.0.0-alpha3 released September 17th, 2017
○  hbase-2.0.0-alpha2 released August 21st, 2017
○  hbase-2.0.0-alpha1 released June 22nd, 2017
●  Multiple Release Managers
○  Matteo Bertozzi, Stephen Yuan Jiang, yours truly...
2.0.02.0.0
Lets not do this again!
●  Backed-up mountains of “Tech Debt”
○  Rotted Unit Tests
■  “...99/100 it was the test, not Apache Infra”
○  Performance regressions
■  Not out of the woods yet…
2.0.0
2.0.0
S
Goals: Compatibility
●  Double-down on Semantic Versioning, semver
○  Adopted in hbase-1.0.0
○  MAJOR.MINOR.PATCH[-IDENTIFIER]
■  E.g. 2.0.0-alpha1
2.0.0
Goals: Compatibility
●  But…
○  Semantic Versioning is about API only
■  What about….
●  Internal/External Interfaces
○  Where is Client Interface when Spark/MapReduce
●  It’s complicated...
2.0.0
Goals: Compatibility
●  From Hadoop… Yetus, annotations
○  InterfaceAudience.Public
■  Get/Put/Scan/Connection
○  InterfaceAudience.LimitedPrivate
■  Coprocessors, Replication, etc.
○  InterfaceAudience.Private
■  Internal only
●  What about…
○  Source/Binary compatibility
○  Serializations
■  Wire
■  Formats in HDFS/Zookeeper
○  Dependencies
●  See refguide semver section
○  http://hbase.apache.org/book.html#hbase.versioning
2.0.0
Goals: Compatibility
●  Grey areas…
○  Coprocessors
■  Free access HBase core
■  Change to hbase internals => broken Coprocessors
■  InterfaceAudience.LimitedPrivate
○  Published metrics/jmx
○  Protobufs
■  hbase-protocol/hbase-protocol-shaded
2.0.0
Goals: Compatibility in 2.0.0
●  We adhere to SemVer for DML in 2.x
○  Not for DDL
●  hbase-1.x client can work against hbase-2.x cluster
○  Even 1.x Coprocessor Endpoints work on an hbase-2.x cluster
○  Read-only DDL/Admin of hbase-2.x from hbase-1.x client
○  Replication 1 2 works
●  Extensive curation of what is public/private
●  Purged Guava/Protobuf from API
●  Coprocessors
○  Revamped
●  No Singularity! No downtime! Rolling upgrade from hbase1!
○  Experimental! From 1.4.x to 2.1.x has been tested.
2.0.0
Goals: Compatibility
●  Still plenty to do
○  Ongoing effort...
○  3.0.0!
2.0.0
Goals: Others
●  Scale
○  More Regions, bigger clusters
●  Performance
○  Inline read/write but also macro-aspect: restart, assign, etc.
○  Better resource utilization
■  I/O, RAM
●  Fix primary root of operational woes/bugs
○  Master Region Assignment
2.0.0
Insides
●  Scale
○  More Regions, bigger clusters
●  Performance
○  Inline read/write but also macro restart, assign, etc.
○  Better resource utilization
■  I/O, RAM
●  Fix primary root of operational woes/bugs
○  Master Region Assignment
●  Cleanup
○  Spark narrative
○  Interfaces
2.0.0
Insides
●  Currently >4500 issues resolved
○  ~3k exclusive to 2.0.0+
2.0.0
Insides: Prerequisites
●  JDK8 only
●  Hadoop-2.7.7 minimum*
○  Works against the coming Hadoop-3.x
*Be wary of “...not stable / production ready” Hadoops
2.0.0
Insides: Features
●  New Master Core (A.K.A AMv2)
●  Off-heap Read/Write path
●  In-memory Compaction (“Accordion”)
●  And more...
.0.02.0.0
Insides: Assignment Manager
●  New Master Core (A.K.A AMv2)
○  Assignment Manager v1 (AMv1) root of many operational headaches
●  Prompt assign of millions of Regions, faster startup, larger scale
●  Scrutable/Standalone Testable
●  One hbase:meta writer only, the Master
●  No more intermediate state in ZK
○  At other end of an RPC...
○  Only final state published to hbase:meta
○  No more distributed state: some in Master memory, some in ZK, some in HDFS.
●  New degree of Resilience
2.0.0
Insides: Off-heap
●  Smaller JVM heaps, less copying
○  But more accounting!
●  Off-heap Read Path
○  HDFS=>BucketCache=>Outbound Socket
○  ~latency
■  Cache more
■  Less GC, less erratic
●  Off-heap Write Path
○  RPC=>HDFS data kept off-heap
■  Async DFS WAL Client
●  Off-heap
○  Socket Socket
○  Off-heap fragmentation anyone?
○  On by default?
2.0.0
OFF
2.0.0
Insides: Offheap
2.0.02.0.0
Before:
After:
Insides: Accordion
●  In-memory LSM
○  In-memory flush from ConcurrentSkipListMap to read-only pipeline of ‘Segments’
■  Optional in-memory compaction of Segments
●  Prune deletes/versions early
●  Benefit depends on # of versions
■  Memory ‘breathes’ like an Accordion’s bellows…
■  Less flushing, write amplification
●  Flavors:
○  NONE, BASIC, ENHANCED, ADAPTIVE
●  Symbiosis w/ Offheaping
○  MemStore SLABs off-heap
○  Compact in-memory layout
■  CellChunkMap, NOT CSLM
2.0.0
Insides: Miscellaneous
●  New Netty Server/Client chassis
●  Filesystem Quotas on Table/Namespace sizes
○  Per Table, per Namespace -- not per User
○  Violation Policy
○  Via Shell
●  Medium Object Blobs (MOB)
○  Documents, Images, etc.
○  100k => 10MB
○  Consistency, security, snapshot and HBase replication between clusters
2.0.0
Insides: Miscellaneous
●  Async DFSClient
○  Fanout, fail-fast
○  Less resources
●  New Async Client
○  NOT asynchbase
○  Complete, Admin ops too.
●  Update+Relocation of core dependencies
○  hbase-thirdparty
■  Guava 0.12 => 0.22
■  Protobuf 2.5 => 3.3
■  Netty
2.0.0
Insides: Miscellaneous
●  RegionServer Groups (rsgroups)
○  Coarse Isolation
○  Tables x RegionServers
●  WALs and HFiles in different Filesystems
○  AWS EMR HBase on S3 offering (Amazon S3 Storage Mode)
2.0.0
Insides: Miscellaneous
●  HBASE-14061 Support CF-level Storage Policy
●  HBASE-17408 Introduce per request limit by number of mutations (ChiaPing Tsai)
●  HBASE-17320 Add inclusive/exclusive support for startRow and endRow of scan
●  HBASE-17174 Refactor the AsyncProcess, BufferedMutatorImpl, and HTable
●  HBASE-16010 Put draining function through Admin API (Matt Warhaftig)
●  HBASE-17262 Refactor RpcServer so as to make it extendable and/or pluggable
●  HBASE-17282 Reduce the redundant requests to meta table
●  HBASE-15432 TableInputFormat - support multi column family scan (Xuesen Liang)
●  HBASE-17313 Add BufferedMutatorParams#clone method (Joep Rottinghuis)
●  HBASE-17277 Allow alternate BufferedMutator implemenation Specify the name of an alternate BufferedMutator implementation by either:
●  HBASE-16700 Allow for coprocessor whitelisting
●  HBASE-17194 Assign the new region to the idle server after splitting (ChiaPing Tsai)
●  HBASE-17110 Improve SimpleLoadBalancer to always take server-level balance into account
●  Revert "HBASE-15513 hbase.hregion.memstore.chunkpool.maxsize is 0.0 by default (Vladimir Rodionov)"
●  HBASE-16733 add hadoop 3.0.0-alpha1 to precommit checks
●  HBASE-13259 mmap() based BucketCache IOEngine
●  HBASE-16447 Replication by namespaces config in peer
●  HBASE-15968 New behavior of versions considering mvcc and ts rather than ts only
●  HBASE-15816 Provide client with ability to set priority on Operations
●  HBASE-16213 A new HFileBlock structure for fast random get
●  HBASE-14004 [Replication] Inconsistency between Memstore and WAL may result in data in remote cluster that is not in the origin
●  HBASE-10367 RegionServer graceful stop / decommissioning
●  HBASE-16290 Dump summary of callQueue content; can help debugging
●  HBASE-14790 Implement a new DFSOutputStream for logging WAL only
●  HBASE-18410 FilterList Improvement
●  HBASE-8329 Limit compaction speed -- throttling is enabled by default.
●  HBASE-14969 Add throughput controller for flush
●  HBASE-19358 Improve the stability of splitting log when do fail over
●  HBASE-18882 [TEST] Run MR branch-1 jobs against hbase2 cluster
●  ...
2.0.0
How Coprocessors Changed
●  Pass Interfaces instead of Implementations
○  TableDescriptor instead of HTableDescriptor and Region instead of HRegion
○  Implementations can change w/o breaking CPs
●  Refactor so less boilerplate, more compile-time checking.
●  Some operations changed locus
○  E.g. splits are done by Master in hbase2, RegionServer in hbase1.
●  Purge of PB from CP API
○  Passed raw Protobufs to Coprocessors. Pass POJOs/Interfaces instead.
●  CP API Pruning
○  Removed hooks on internals that were too private to expose.
○  Also hooks that allowed Coprocessor do “crazy” stuff: e.g. create own StoreFiles
●  Deprecated methods have been removed
2.0.0
Upgrading to 2.0.x
●  Read Upgrading Chapter in refguide
○  Includes list of notable changes…
■  Changed defaults, namings, etc., based off user feedback
■  Configs/metrics misspellings fixed, some renames
○  Removals
■  DLR
■  prefix-tree encoding
■  Deprecated pre-1.0.0 features and classes
○  Coprocessors Upgrade Recipe
■  Rebuild against new API otherwise they will fail to load and HBase processes will die
●  New CHANGES.md and RELEASENOTES.md (courtesy of Apache Yetus).
●  Before upgrade, run Pre-Upgrade Validator
○  $ bin/hbase pre-upgrade …
●  HBCK1
2.0.0
Goals: Others, revisited
●  Scale
○  More Regions, bigger clusters
●  Performance
○  Inline read/write but also macro aspect: restart, assign, etc.
○  Better resource utilization
■  I/O, RAM
●  Fix primary root of operational woes/bugs
○  Master Region Assignment
2.0.0
●  2.0.x
○  Bug-Fixing
■  AMv2 Corner cases, teething...
○  Perf
■  Ongoing
○  Time-based releases
■  Every 6-weeks or so…
■  Just stabilizing…
●  HBCK2
○  New standalone project
2.0.0
2.0.0
hbase-2.1.x
2.1.x
●  Duo Zhang, Release Manager
●  Pv2 based replication peer modification
○  Operation will be synchronous
●  Serial Replication
○  Preserve edit order crossing the chasm
●  Sensible-looking CLASSPATH for your client
●  Lots of AMv2 fixes...
Future (3.0.0+): Ease of Use
●  Native SQL support
○  Internal
■  Lightweight SQL support
●  Something like CQL (Apache Calcite, Presto, Apache Derby?)
■  Official Secondary-Index
●  Better implementation with Procedure v2
○  External
■  Better integration with Spark SQL
●  An official spark connector in the hbase project
■  Phoenix
●  SQL-tier over HBase
●  Configuration
○  Complete dynamic online configuration change; no need of a rolling restart.
○  Templates and Auto-tuning; ergonomic adjustment to suit current load.
Future (3.0.0+): Performance
●  Aim at making the best Java LSM storage engine
●  E2E asynchronous
○  Since 2.0.0, we have async client, async WAL
■  Async HDFS API underway
○  We will have: SEDA request handling on server side
■  Better write pipeline
■  Complete at Alibaba, upstream coming soon
○  E2E asynchronous is coming...
●  New Cell format
○  Plumbed end-to-end
○  Better-integrated sequenceid & Cell-Tags
○  More CPU-cache friendly
Future (3.0.0+): Scale and Stability
●  Split meta table
○  Distribute i/o and so we can store more metadata
●  New Distributed Log Replay redo
●  Clusters with 1M/10M regions
○  Smaller regions, less write amplification
Future (3.0.0+): Internal
●  Framework Refactor
○  Distribution Management Framework
■  Procedure V2 polish
○  SEDA framework
○  Core abstraction
■  Support “embedded” mode
■  Support other storage engine like rocksdb
○  FileSystem Interface Redo
■  Bring into hbase atomic operations like rename rather than ask HDFS do it for us
●  Eliminate trips to NameNode
■  Remove dependency on HDFS, better support Cloud/S3
●  Hybrid Logical Clocks
●  Externalized Compaction
○  Spark
Future (3.0.0+): Join Us!
●  More help needed
○  We need more contributors/committers to make the bright future!
●  Don’t let language block you
○  PMC would like to see world diversity
○  A Chinese HBase group already setup
○  More HBase China meetups coming
■  Hangzhou/Shanghai/Shenzhen...
●  Don’t be shy
○  Why not upstream your great work?
○  What’s in your way?
○  Just let us know
●  Don’t hesitate
○  Join us!
Thank you
Images:
○  Insides: http://alienanthology.wikia.com/wiki/File:Ash%27s_insides.png
○  Goal: https://commons.wikimedia.org/wiki/File:Goal-com-logo-eps-vector-image.png
○  Future: https://thenextweb.com/insider/2015/12/30/artificial-intelligence-free-money-and-the-end-of-capitalism/
○  Sea: https://playgroundforlife.files.wordpress.com/2010/08/gerhard-richter-seascape-1998-sfmoma.jpg

Mais conteúdo relacionado

Mais procurados

HBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
HBaseCon 2012 | Building a Large Search Platform on a Shoestring BudgetHBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
HBaseCon 2012 | Building a Large Search Platform on a Shoestring BudgetCloudera, Inc.
 
HBaseConAsia2018 Track3-6: HBase at Meituan
HBaseConAsia2018 Track3-6: HBase at MeituanHBaseConAsia2018 Track3-6: HBase at Meituan
HBaseConAsia2018 Track3-6: HBase at MeituanMichael Stack
 
HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...
HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...
HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...Michael Stack
 
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程HBaseCon
 
HBaseConEast2016: Splice machine open source rdbms
HBaseConEast2016: Splice machine open source rdbmsHBaseConEast2016: Splice machine open source rdbms
HBaseConEast2016: Splice machine open source rdbmsMichael Stack
 
HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...
HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...
HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...Michael Stack
 
HBaseConAsia2018 Track1-3: HBase at Xiaomi
HBaseConAsia2018 Track1-3: HBase at XiaomiHBaseConAsia2018 Track1-3: HBase at Xiaomi
HBaseConAsia2018 Track1-3: HBase at XiaomiMichael Stack
 
HBaseCon 2015: HBase Operations in a Flurry
HBaseCon 2015: HBase Operations in a FlurryHBaseCon 2015: HBase Operations in a Flurry
HBaseCon 2015: HBase Operations in a FlurryHBaseCon
 
Keynote: The Future of Apache HBase
Keynote: The Future of Apache HBaseKeynote: The Future of Apache HBase
Keynote: The Future of Apache HBaseHBaseCon
 
HBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsight
HBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsightHBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsight
HBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsightHBaseCon
 
Argus Production Monitoring at Salesforce
Argus Production Monitoring at SalesforceArgus Production Monitoring at Salesforce
Argus Production Monitoring at SalesforceHBaseCon
 
Optimizing Apache HBase for Cloud Storage in Microsoft Azure HDInsight
Optimizing Apache HBase for Cloud Storage in Microsoft Azure HDInsightOptimizing Apache HBase for Cloud Storage in Microsoft Azure HDInsight
Optimizing Apache HBase for Cloud Storage in Microsoft Azure HDInsightHBaseCon
 
Facebook - Jonthan Gray - Hadoop World 2010
Facebook - Jonthan Gray - Hadoop World 2010Facebook - Jonthan Gray - Hadoop World 2010
Facebook - Jonthan Gray - Hadoop World 2010Cloudera, Inc.
 
HBaseCon 2015: State of HBase Docs and How to Contribute
HBaseCon 2015: State of HBase Docs and How to ContributeHBaseCon 2015: State of HBase Docs and How to Contribute
HBaseCon 2015: State of HBase Docs and How to ContributeHBaseCon
 
Rolling Out Apache HBase for Mobile Offerings at Visa
Rolling Out Apache HBase for Mobile Offerings at Visa Rolling Out Apache HBase for Mobile Offerings at Visa
Rolling Out Apache HBase for Mobile Offerings at Visa HBaseCon
 
HBaseCon 2015: Graph Processing of Stock Market Order Flow in HBase on AWS
HBaseCon 2015: Graph Processing of Stock Market Order Flow in HBase on AWSHBaseCon 2015: Graph Processing of Stock Market Order Flow in HBase on AWS
HBaseCon 2015: Graph Processing of Stock Market Order Flow in HBase on AWSHBaseCon
 
HBaseCon 2015- HBase @ Flipboard
HBaseCon 2015- HBase @ FlipboardHBaseCon 2015- HBase @ Flipboard
HBaseCon 2015- HBase @ FlipboardMatthew Blair
 
Time-Series Apache HBase
Time-Series Apache HBaseTime-Series Apache HBase
Time-Series Apache HBaseHBaseCon
 

Mais procurados (20)

HBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
HBaseCon 2012 | Building a Large Search Platform on a Shoestring BudgetHBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
HBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
 
HBaseConAsia2018 Track3-6: HBase at Meituan
HBaseConAsia2018 Track3-6: HBase at MeituanHBaseConAsia2018 Track3-6: HBase at Meituan
HBaseConAsia2018 Track3-6: HBase at Meituan
 
HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...
HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...
HBaseConAsia2018 Track2-3: Bringing MySQL Compatibility to HBase using Databa...
 
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
 
HBaseConEast2016: Splice machine open source rdbms
HBaseConEast2016: Splice machine open source rdbmsHBaseConEast2016: Splice machine open source rdbms
HBaseConEast2016: Splice machine open source rdbms
 
HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...
HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...
HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...
 
HBaseConAsia2018 Track1-3: HBase at Xiaomi
HBaseConAsia2018 Track1-3: HBase at XiaomiHBaseConAsia2018 Track1-3: HBase at Xiaomi
HBaseConAsia2018 Track1-3: HBase at Xiaomi
 
HBaseCon 2015: HBase Operations in a Flurry
HBaseCon 2015: HBase Operations in a FlurryHBaseCon 2015: HBase Operations in a Flurry
HBaseCon 2015: HBase Operations in a Flurry
 
Keynote: The Future of Apache HBase
Keynote: The Future of Apache HBaseKeynote: The Future of Apache HBase
Keynote: The Future of Apache HBase
 
HBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsight
HBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsightHBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsight
HBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsight
 
Argus Production Monitoring at Salesforce
Argus Production Monitoring at SalesforceArgus Production Monitoring at Salesforce
Argus Production Monitoring at Salesforce
 
Optimizing Apache HBase for Cloud Storage in Microsoft Azure HDInsight
Optimizing Apache HBase for Cloud Storage in Microsoft Azure HDInsightOptimizing Apache HBase for Cloud Storage in Microsoft Azure HDInsight
Optimizing Apache HBase for Cloud Storage in Microsoft Azure HDInsight
 
Cassandra in e-commerce
Cassandra in e-commerceCassandra in e-commerce
Cassandra in e-commerce
 
Facebook - Jonthan Gray - Hadoop World 2010
Facebook - Jonthan Gray - Hadoop World 2010Facebook - Jonthan Gray - Hadoop World 2010
Facebook - Jonthan Gray - Hadoop World 2010
 
HBaseCon 2015: State of HBase Docs and How to Contribute
HBaseCon 2015: State of HBase Docs and How to ContributeHBaseCon 2015: State of HBase Docs and How to Contribute
HBaseCon 2015: State of HBase Docs and How to Contribute
 
Rolling Out Apache HBase for Mobile Offerings at Visa
Rolling Out Apache HBase for Mobile Offerings at Visa Rolling Out Apache HBase for Mobile Offerings at Visa
Rolling Out Apache HBase for Mobile Offerings at Visa
 
HBaseCon 2015: Graph Processing of Stock Market Order Flow in HBase on AWS
HBaseCon 2015: Graph Processing of Stock Market Order Flow in HBase on AWSHBaseCon 2015: Graph Processing of Stock Market Order Flow in HBase on AWS
HBaseCon 2015: Graph Processing of Stock Market Order Flow in HBase on AWS
 
HBaseCon 2015- HBase @ Flipboard
HBaseCon 2015- HBase @ FlipboardHBaseCon 2015- HBase @ Flipboard
HBaseCon 2015- HBase @ Flipboard
 
What database
What databaseWhat database
What database
 
Time-Series Apache HBase
Time-Series Apache HBaseTime-Series Apache HBase
Time-Series Apache HBase
 

Semelhante a HBaseConAsia2018 Keynote1: Apache HBase Project Status

hbaseconasia2017: hbase-2.0.0
hbaseconasia2017: hbase-2.0.0hbaseconasia2017: hbase-2.0.0
hbaseconasia2017: hbase-2.0.0HBaseCon
 
Logs @ OVHcloud
Logs @ OVHcloudLogs @ OVHcloud
Logs @ OVHcloudOVHcloud
 
Percona XtraBackup - New Features and Improvements
Percona XtraBackup - New Features and ImprovementsPercona XtraBackup - New Features and Improvements
Percona XtraBackup - New Features and ImprovementsMarcelo Altmann
 
Meet hbase 2.0
Meet hbase 2.0Meet hbase 2.0
Meet hbase 2.0enissoz
 
Meet HBase 2.0
Meet HBase 2.0Meet HBase 2.0
Meet HBase 2.0enissoz
 
Tips to drive maria db cluster performance for nextcloud
Tips to drive maria db cluster performance for nextcloudTips to drive maria db cluster performance for nextcloud
Tips to drive maria db cluster performance for nextcloudSeveralnines
 
A Kong retrospective: from 0.10 to 0.13
A Kong retrospective: from 0.10 to 0.13A Kong retrospective: from 0.10 to 0.13
A Kong retrospective: from 0.10 to 0.13Thibault Charbonnier
 
Hadoop 3 @ Hadoop Summit San Jose 2017
Hadoop 3 @ Hadoop Summit San Jose 2017Hadoop 3 @ Hadoop Summit San Jose 2017
Hadoop 3 @ Hadoop Summit San Jose 2017Junping Du
 
Apache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community UpdateApache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community UpdateDataWorks Summit
 
PGConf APAC 2018 - High performance json postgre-sql vs. mongodb
PGConf APAC 2018 - High performance json  postgre-sql vs. mongodbPGConf APAC 2018 - High performance json  postgre-sql vs. mongodb
PGConf APAC 2018 - High performance json postgre-sql vs. mongodbPGConf APAC
 
High performance json- postgre sql vs. mongodb
High performance json- postgre sql vs. mongodbHigh performance json- postgre sql vs. mongodb
High performance json- postgre sql vs. mongodbWei Shan Ang
 
Percona 服务器与 XtraDB 存储引擎
Percona 服务器与 XtraDB 存储引擎Percona 服务器与 XtraDB 存储引擎
Percona 服务器与 XtraDB 存储引擎YUCHENG HU
 
What's New with Ceph - Ceph Day Silicon Valley
What's New with Ceph - Ceph Day Silicon ValleyWhat's New with Ceph - Ceph Day Silicon Valley
What's New with Ceph - Ceph Day Silicon ValleyCeph Community
 
Beyond 1000 bosh Deployments
Beyond 1000 bosh DeploymentsBeyond 1000 bosh Deployments
Beyond 1000 bosh Deploymentsanynines GmbH
 
MySQL Performance - Best practices
MySQL Performance - Best practices MySQL Performance - Best practices
MySQL Performance - Best practices Ted Wennmark
 
Speed Up Uber's Presto with Alluxio
Speed Up Uber's Presto with AlluxioSpeed Up Uber's Presto with Alluxio
Speed Up Uber's Presto with AlluxioAlluxio, Inc.
 
Optimizing Servers for High-Throughput and Low-Latency at Dropbox
Optimizing Servers for High-Throughput and Low-Latency at DropboxOptimizing Servers for High-Throughput and Low-Latency at Dropbox
Optimizing Servers for High-Throughput and Low-Latency at DropboxScyllaDB
 
Introducing NetBSD 5.0
Introducing NetBSD 5.0Introducing NetBSD 5.0
Introducing NetBSD 5.0Andrew Doran
 
Type safe, versioned, and rewindable stream processing with Apache {Avro, K...
Type safe, versioned, and rewindable stream processing  with  Apache {Avro, K...Type safe, versioned, and rewindable stream processing  with  Apache {Avro, K...
Type safe, versioned, and rewindable stream processing with Apache {Avro, K...Hisham Mardam-Bey
 

Semelhante a HBaseConAsia2018 Keynote1: Apache HBase Project Status (20)

hbaseconasia2017: hbase-2.0.0
hbaseconasia2017: hbase-2.0.0hbaseconasia2017: hbase-2.0.0
hbaseconasia2017: hbase-2.0.0
 
Logs @ OVHcloud
Logs @ OVHcloudLogs @ OVHcloud
Logs @ OVHcloud
 
Percona XtraBackup - New Features and Improvements
Percona XtraBackup - New Features and ImprovementsPercona XtraBackup - New Features and Improvements
Percona XtraBackup - New Features and Improvements
 
Meet Apache HBase - 2.0
Meet Apache HBase - 2.0Meet Apache HBase - 2.0
Meet Apache HBase - 2.0
 
Meet hbase 2.0
Meet hbase 2.0Meet hbase 2.0
Meet hbase 2.0
 
Meet HBase 2.0
Meet HBase 2.0Meet HBase 2.0
Meet HBase 2.0
 
Tips to drive maria db cluster performance for nextcloud
Tips to drive maria db cluster performance for nextcloudTips to drive maria db cluster performance for nextcloud
Tips to drive maria db cluster performance for nextcloud
 
A Kong retrospective: from 0.10 to 0.13
A Kong retrospective: from 0.10 to 0.13A Kong retrospective: from 0.10 to 0.13
A Kong retrospective: from 0.10 to 0.13
 
Hadoop 3 @ Hadoop Summit San Jose 2017
Hadoop 3 @ Hadoop Summit San Jose 2017Hadoop 3 @ Hadoop Summit San Jose 2017
Hadoop 3 @ Hadoop Summit San Jose 2017
 
Apache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community UpdateApache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community Update
 
PGConf APAC 2018 - High performance json postgre-sql vs. mongodb
PGConf APAC 2018 - High performance json  postgre-sql vs. mongodbPGConf APAC 2018 - High performance json  postgre-sql vs. mongodb
PGConf APAC 2018 - High performance json postgre-sql vs. mongodb
 
High performance json- postgre sql vs. mongodb
High performance json- postgre sql vs. mongodbHigh performance json- postgre sql vs. mongodb
High performance json- postgre sql vs. mongodb
 
Percona 服务器与 XtraDB 存储引擎
Percona 服务器与 XtraDB 存储引擎Percona 服务器与 XtraDB 存储引擎
Percona 服务器与 XtraDB 存储引擎
 
What's New with Ceph - Ceph Day Silicon Valley
What's New with Ceph - Ceph Day Silicon ValleyWhat's New with Ceph - Ceph Day Silicon Valley
What's New with Ceph - Ceph Day Silicon Valley
 
Beyond 1000 bosh Deployments
Beyond 1000 bosh DeploymentsBeyond 1000 bosh Deployments
Beyond 1000 bosh Deployments
 
MySQL Performance - Best practices
MySQL Performance - Best practices MySQL Performance - Best practices
MySQL Performance - Best practices
 
Speed Up Uber's Presto with Alluxio
Speed Up Uber's Presto with AlluxioSpeed Up Uber's Presto with Alluxio
Speed Up Uber's Presto with Alluxio
 
Optimizing Servers for High-Throughput and Low-Latency at Dropbox
Optimizing Servers for High-Throughput and Low-Latency at DropboxOptimizing Servers for High-Throughput and Low-Latency at Dropbox
Optimizing Servers for High-Throughput and Low-Latency at Dropbox
 
Introducing NetBSD 5.0
Introducing NetBSD 5.0Introducing NetBSD 5.0
Introducing NetBSD 5.0
 
Type safe, versioned, and rewindable stream processing with Apache {Avro, K...
Type safe, versioned, and rewindable stream processing  with  Apache {Avro, K...Type safe, versioned, and rewindable stream processing  with  Apache {Avro, K...
Type safe, versioned, and rewindable stream processing with Apache {Avro, K...
 

Mais de Michael Stack

hbaseconasia2019 HBase Table Monitoring and Troubleshooting System on Cloud
hbaseconasia2019 HBase Table Monitoring and Troubleshooting System on Cloudhbaseconasia2019 HBase Table Monitoring and Troubleshooting System on Cloud
hbaseconasia2019 HBase Table Monitoring and Troubleshooting System on CloudMichael Stack
 
hbaseconasia2019 Recent work on HBase at Pinterest
hbaseconasia2019 Recent work on HBase at Pinteresthbaseconasia2019 Recent work on HBase at Pinterest
hbaseconasia2019 Recent work on HBase at PinterestMichael Stack
 
hbaseconasia2019 Phoenix Practice in China Life Insurance Co., Ltd
hbaseconasia2019 Phoenix Practice in China Life Insurance Co., Ltdhbaseconasia2019 Phoenix Practice in China Life Insurance Co., Ltd
hbaseconasia2019 Phoenix Practice in China Life Insurance Co., LtdMichael Stack
 
hbaseconasia2019 HBase at Didi
hbaseconasia2019 HBase at Didihbaseconasia2019 HBase at Didi
hbaseconasia2019 HBase at DidiMichael Stack
 
hbaseconasia2019 The Practice in trillion-level Video Storage and billion-lev...
hbaseconasia2019 The Practice in trillion-level Video Storage and billion-lev...hbaseconasia2019 The Practice in trillion-level Video Storage and billion-lev...
hbaseconasia2019 The Practice in trillion-level Video Storage and billion-lev...Michael Stack
 
hbaseconasia2019 HBase at Tencent
hbaseconasia2019 HBase at Tencenthbaseconasia2019 HBase at Tencent
hbaseconasia2019 HBase at TencentMichael Stack
 
hbaseconasia2019 Spatio temporal Data Management based on Ali-HBase Ganos and...
hbaseconasia2019 Spatio temporal Data Management based on Ali-HBase Ganos and...hbaseconasia2019 Spatio temporal Data Management based on Ali-HBase Ganos and...
hbaseconasia2019 Spatio temporal Data Management based on Ali-HBase Ganos and...Michael Stack
 
hbaseconasia2019 Bridging the Gap between Big Data System Software Stack and ...
hbaseconasia2019 Bridging the Gap between Big Data System Software Stack and ...hbaseconasia2019 Bridging the Gap between Big Data System Software Stack and ...
hbaseconasia2019 Bridging the Gap between Big Data System Software Stack and ...Michael Stack
 
hbaseconasia2019 Pharos as a Pluggable Secondary Index Component
hbaseconasia2019 Pharos as a Pluggable Secondary Index Componenthbaseconasia2019 Pharos as a Pluggable Secondary Index Component
hbaseconasia2019 Pharos as a Pluggable Secondary Index ComponentMichael Stack
 
hbaseconasia2019 Phoenix Improvements and Practices on Cloud HBase at Alibaba
hbaseconasia2019 Phoenix Improvements and Practices on Cloud HBase at Alibabahbaseconasia2019 Phoenix Improvements and Practices on Cloud HBase at Alibaba
hbaseconasia2019 Phoenix Improvements and Practices on Cloud HBase at AlibabaMichael Stack
 
hbaseconasia2019 OpenTSDB at Xiaomi
hbaseconasia2019 OpenTSDB at Xiaomihbaseconasia2019 OpenTSDB at Xiaomi
hbaseconasia2019 OpenTSDB at XiaomiMichael Stack
 
hbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and Spark
hbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and Sparkhbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and Spark
hbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and SparkMichael Stack
 
hbaseconasia2019 Test-suite for Automating Data-consistency checks on HBase
hbaseconasia2019 Test-suite for Automating Data-consistency checks on HBasehbaseconasia2019 Test-suite for Automating Data-consistency checks on HBase
hbaseconasia2019 Test-suite for Automating Data-consistency checks on HBaseMichael Stack
 
hbaseconasia2019 Distributed Bitmap Index Solution
hbaseconasia2019 Distributed Bitmap Index Solutionhbaseconasia2019 Distributed Bitmap Index Solution
hbaseconasia2019 Distributed Bitmap Index SolutionMichael Stack
 
hbaseconasia2019 HBase Bucket Cache on Persistent Memory
hbaseconasia2019 HBase Bucket Cache on Persistent Memoryhbaseconasia2019 HBase Bucket Cache on Persistent Memory
hbaseconasia2019 HBase Bucket Cache on Persistent MemoryMichael Stack
 
hbaseconasia2019 The Procedure v2 Implementation of WAL Splitting and ACL
hbaseconasia2019 The Procedure v2 Implementation of WAL Splitting and ACLhbaseconasia2019 The Procedure v2 Implementation of WAL Splitting and ACL
hbaseconasia2019 The Procedure v2 Implementation of WAL Splitting and ACLMichael Stack
 
hbaseconasia2019 BDS: A data synchronization platform for HBase
hbaseconasia2019 BDS: A data synchronization platform for HBasehbaseconasia2019 BDS: A data synchronization platform for HBase
hbaseconasia2019 BDS: A data synchronization platform for HBaseMichael Stack
 
hbaseconasia2019 Further GC optimization for HBase 2.x: Reading HFileBlock in...
hbaseconasia2019 Further GC optimization for HBase 2.x: Reading HFileBlock in...hbaseconasia2019 Further GC optimization for HBase 2.x: Reading HFileBlock in...
hbaseconasia2019 Further GC optimization for HBase 2.x: Reading HFileBlock in...Michael Stack
 
hbaseconasia2019 HBCK2: Concepts, trends, and recipes for fixing issues in HB...
hbaseconasia2019 HBCK2: Concepts, trends, and recipes for fixing issues in HB...hbaseconasia2019 HBCK2: Concepts, trends, and recipes for fixing issues in HB...
hbaseconasia2019 HBCK2: Concepts, trends, and recipes for fixing issues in HB...Michael Stack
 
HBaseConAsia2019 Keynote
HBaseConAsia2019 KeynoteHBaseConAsia2019 Keynote
HBaseConAsia2019 KeynoteMichael Stack
 

Mais de Michael Stack (20)

hbaseconasia2019 HBase Table Monitoring and Troubleshooting System on Cloud
hbaseconasia2019 HBase Table Monitoring and Troubleshooting System on Cloudhbaseconasia2019 HBase Table Monitoring and Troubleshooting System on Cloud
hbaseconasia2019 HBase Table Monitoring and Troubleshooting System on Cloud
 
hbaseconasia2019 Recent work on HBase at Pinterest
hbaseconasia2019 Recent work on HBase at Pinteresthbaseconasia2019 Recent work on HBase at Pinterest
hbaseconasia2019 Recent work on HBase at Pinterest
 
hbaseconasia2019 Phoenix Practice in China Life Insurance Co., Ltd
hbaseconasia2019 Phoenix Practice in China Life Insurance Co., Ltdhbaseconasia2019 Phoenix Practice in China Life Insurance Co., Ltd
hbaseconasia2019 Phoenix Practice in China Life Insurance Co., Ltd
 
hbaseconasia2019 HBase at Didi
hbaseconasia2019 HBase at Didihbaseconasia2019 HBase at Didi
hbaseconasia2019 HBase at Didi
 
hbaseconasia2019 The Practice in trillion-level Video Storage and billion-lev...
hbaseconasia2019 The Practice in trillion-level Video Storage and billion-lev...hbaseconasia2019 The Practice in trillion-level Video Storage and billion-lev...
hbaseconasia2019 The Practice in trillion-level Video Storage and billion-lev...
 
hbaseconasia2019 HBase at Tencent
hbaseconasia2019 HBase at Tencenthbaseconasia2019 HBase at Tencent
hbaseconasia2019 HBase at Tencent
 
hbaseconasia2019 Spatio temporal Data Management based on Ali-HBase Ganos and...
hbaseconasia2019 Spatio temporal Data Management based on Ali-HBase Ganos and...hbaseconasia2019 Spatio temporal Data Management based on Ali-HBase Ganos and...
hbaseconasia2019 Spatio temporal Data Management based on Ali-HBase Ganos and...
 
hbaseconasia2019 Bridging the Gap between Big Data System Software Stack and ...
hbaseconasia2019 Bridging the Gap between Big Data System Software Stack and ...hbaseconasia2019 Bridging the Gap between Big Data System Software Stack and ...
hbaseconasia2019 Bridging the Gap between Big Data System Software Stack and ...
 
hbaseconasia2019 Pharos as a Pluggable Secondary Index Component
hbaseconasia2019 Pharos as a Pluggable Secondary Index Componenthbaseconasia2019 Pharos as a Pluggable Secondary Index Component
hbaseconasia2019 Pharos as a Pluggable Secondary Index Component
 
hbaseconasia2019 Phoenix Improvements and Practices on Cloud HBase at Alibaba
hbaseconasia2019 Phoenix Improvements and Practices on Cloud HBase at Alibabahbaseconasia2019 Phoenix Improvements and Practices on Cloud HBase at Alibaba
hbaseconasia2019 Phoenix Improvements and Practices on Cloud HBase at Alibaba
 
hbaseconasia2019 OpenTSDB at Xiaomi
hbaseconasia2019 OpenTSDB at Xiaomihbaseconasia2019 OpenTSDB at Xiaomi
hbaseconasia2019 OpenTSDB at Xiaomi
 
hbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and Spark
hbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and Sparkhbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and Spark
hbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and Spark
 
hbaseconasia2019 Test-suite for Automating Data-consistency checks on HBase
hbaseconasia2019 Test-suite for Automating Data-consistency checks on HBasehbaseconasia2019 Test-suite for Automating Data-consistency checks on HBase
hbaseconasia2019 Test-suite for Automating Data-consistency checks on HBase
 
hbaseconasia2019 Distributed Bitmap Index Solution
hbaseconasia2019 Distributed Bitmap Index Solutionhbaseconasia2019 Distributed Bitmap Index Solution
hbaseconasia2019 Distributed Bitmap Index Solution
 
hbaseconasia2019 HBase Bucket Cache on Persistent Memory
hbaseconasia2019 HBase Bucket Cache on Persistent Memoryhbaseconasia2019 HBase Bucket Cache on Persistent Memory
hbaseconasia2019 HBase Bucket Cache on Persistent Memory
 
hbaseconasia2019 The Procedure v2 Implementation of WAL Splitting and ACL
hbaseconasia2019 The Procedure v2 Implementation of WAL Splitting and ACLhbaseconasia2019 The Procedure v2 Implementation of WAL Splitting and ACL
hbaseconasia2019 The Procedure v2 Implementation of WAL Splitting and ACL
 
hbaseconasia2019 BDS: A data synchronization platform for HBase
hbaseconasia2019 BDS: A data synchronization platform for HBasehbaseconasia2019 BDS: A data synchronization platform for HBase
hbaseconasia2019 BDS: A data synchronization platform for HBase
 
hbaseconasia2019 Further GC optimization for HBase 2.x: Reading HFileBlock in...
hbaseconasia2019 Further GC optimization for HBase 2.x: Reading HFileBlock in...hbaseconasia2019 Further GC optimization for HBase 2.x: Reading HFileBlock in...
hbaseconasia2019 Further GC optimization for HBase 2.x: Reading HFileBlock in...
 
hbaseconasia2019 HBCK2: Concepts, trends, and recipes for fixing issues in HB...
hbaseconasia2019 HBCK2: Concepts, trends, and recipes for fixing issues in HB...hbaseconasia2019 HBCK2: Concepts, trends, and recipes for fixing issues in HB...
hbaseconasia2019 HBCK2: Concepts, trends, and recipes for fixing issues in HB...
 
HBaseConAsia2019 Keynote
HBaseConAsia2019 KeynoteHBaseConAsia2019 Keynote
HBaseConAsia2019 Keynote
 

Último

Summary ID-IGF 2016 National Dialogue - English (tata kelola internet / int...
Summary  ID-IGF 2016 National Dialogue  - English (tata kelola internet / int...Summary  ID-IGF 2016 National Dialogue  - English (tata kelola internet / int...
Summary ID-IGF 2016 National Dialogue - English (tata kelola internet / int...ICT Watch - Indonesia
 
IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119APNIC
 
Unidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxUnidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxmibuzondetrabajo
 
办理澳洲USYD文凭证书学历认证【Q微/1954292140】办理悉尼大学毕业证书真实成绩单GPA修改/办理澳洲大学文凭证书Offer录取通知书/在读证明...
办理澳洲USYD文凭证书学历认证【Q微/1954292140】办理悉尼大学毕业证书真实成绩单GPA修改/办理澳洲大学文凭证书Offer录取通知书/在读证明...办理澳洲USYD文凭证书学历认证【Q微/1954292140】办理悉尼大学毕业证书真实成绩单GPA修改/办理澳洲大学文凭证书Offer录取通知书/在读证明...
办理澳洲USYD文凭证书学历认证【Q微/1954292140】办理悉尼大学毕业证书真实成绩单GPA修改/办理澳洲大学文凭证书Offer录取通知书/在读证明...vmzoxnx5
 
Cybersecurity Threats and Cybersecurity Best Practices
Cybersecurity Threats and Cybersecurity Best PracticesCybersecurity Threats and Cybersecurity Best Practices
Cybersecurity Threats and Cybersecurity Best PracticesLumiverse Solutions Pvt Ltd
 
How to login to Router net ORBI LOGIN...
How to login to Router net ORBI LOGIN...How to login to Router net ORBI LOGIN...
How to login to Router net ORBI LOGIN...rrouter90
 
Company Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxCompany Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxMario
 
Summary IGF 2013 Bali - English (tata kelola internet / internet governance)
Summary  IGF 2013 Bali - English (tata kelola internet / internet governance)Summary  IGF 2013 Bali - English (tata kelola internet / internet governance)
Summary IGF 2013 Bali - English (tata kelola internet / internet governance)ICT Watch - Indonesia
 
TRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxTRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxAndrieCagasanAkio
 

Último (9)

Summary ID-IGF 2016 National Dialogue - English (tata kelola internet / int...
Summary  ID-IGF 2016 National Dialogue  - English (tata kelola internet / int...Summary  ID-IGF 2016 National Dialogue  - English (tata kelola internet / int...
Summary ID-IGF 2016 National Dialogue - English (tata kelola internet / int...
 
IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119
 
Unidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxUnidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptx
 
办理澳洲USYD文凭证书学历认证【Q微/1954292140】办理悉尼大学毕业证书真实成绩单GPA修改/办理澳洲大学文凭证书Offer录取通知书/在读证明...
办理澳洲USYD文凭证书学历认证【Q微/1954292140】办理悉尼大学毕业证书真实成绩单GPA修改/办理澳洲大学文凭证书Offer录取通知书/在读证明...办理澳洲USYD文凭证书学历认证【Q微/1954292140】办理悉尼大学毕业证书真实成绩单GPA修改/办理澳洲大学文凭证书Offer录取通知书/在读证明...
办理澳洲USYD文凭证书学历认证【Q微/1954292140】办理悉尼大学毕业证书真实成绩单GPA修改/办理澳洲大学文凭证书Offer录取通知书/在读证明...
 
Cybersecurity Threats and Cybersecurity Best Practices
Cybersecurity Threats and Cybersecurity Best PracticesCybersecurity Threats and Cybersecurity Best Practices
Cybersecurity Threats and Cybersecurity Best Practices
 
How to login to Router net ORBI LOGIN...
How to login to Router net ORBI LOGIN...How to login to Router net ORBI LOGIN...
How to login to Router net ORBI LOGIN...
 
Company Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxCompany Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptx
 
Summary IGF 2013 Bali - English (tata kelola internet / internet governance)
Summary  IGF 2013 Bali - English (tata kelola internet / internet governance)Summary  IGF 2013 Bali - English (tata kelola internet / internet governance)
Summary IGF 2013 Bali - English (tata kelola internet / internet governance)
 
TRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxTRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptx
 

HBaseConAsia2018 Keynote1: Apache HBase Project Status

  • 1. Project Status HBaseConAsia2018, Beijing Michael Stack <stack@apache.org> Yu Li <liyu@apache.org>
  • 4. In a nutshell... ●  ...15,409 commits made by 311 contributors ●  ...representing 800,490 lines of code ●  ...mostly written in Java ●  ...has a well established, mature codebase ●  ...maintained by a very large development team ●  ...with stable Y-O-Y commits ●  ...took an estimated 222 years of effort (COCOMO model) ●  ...starting with its first commit in April, 2007 (>10 years old!) Source https://www.openhub.net/p/hbase
  • 7. Commits per month Source https://www.openhub.net/p/hbase
  • 9. “The HBase project represents solid, useful computer science that solves problems and runs businesses every day. To keep that going, we need to keep bringing new ideas and approaches into the project...we need to continue to attract people from all backgrounds and parts of the world. I'd love to see more women, more people of color, and even more worldwide diversity. I'd like to see more contributions from people not employed by big data platform companies. “If we continue to strive for a diversity of ideas and experiences, we'll keep innovating so that HBase remains relevant for years to come.” Misty Linville, Vice-President of the Apache HBase Project New PMC Chairperson!
  • 10. ●  Apache HBase is an Open Source Apache project. ●  It’s what we want to make of it. ●  No owners! ●  Anyone can help! ●  All welcome! ●  The more, the merrier! Our project 2.0.0
  • 11. Active branches 2.0.0 Active Branches Latest Branche Release Release Manager branch-1.2 1.2.6.1 (EOL’d) Sean Busbey branch-1.3 1.3.2.1 (Yahoo) Francis Liu branch-1.4 1.4.6 (Current Stable) Andrew Purtell branch-1.5 Coming... Andrew Purtell branch-2.0 2.0.1 Michael Stack branch-2.1 2.1.0 Duo Zhang branch-2.2 <null> <null> branch-3 <null> <null>
  • 13. 2.0.0: Long-time coming ●  Branched four years ago ●  Released end-of-April, 2018 ●  Took > 1 year to stabilize ○  hbase-2.0.0 released, April 29th, 2018 ○  hbase-2.0.0-beta2 released, March 22nd, 2018 ○  hbase-2.0.0-beta1 released January, 16th, 2018 ○  hbase-2.0.0-alpha4 released November 4th, 2017 ○  hbase-2.0.0-alpha3 released September 17th, 2017 ○  hbase-2.0.0-alpha2 released August 21st, 2017 ○  hbase-2.0.0-alpha1 released June 22nd, 2017 ●  Multiple Release Managers ○  Matteo Bertozzi, Stephen Yuan Jiang, yours truly... 2.0.02.0.0
  • 14. Lets not do this again! ●  Backed-up mountains of “Tech Debt” ○  Rotted Unit Tests ■  “...99/100 it was the test, not Apache Infra” ○  Performance regressions ■  Not out of the woods yet… 2.0.0
  • 16. Goals: Compatibility ●  Double-down on Semantic Versioning, semver ○  Adopted in hbase-1.0.0 ○  MAJOR.MINOR.PATCH[-IDENTIFIER] ■  E.g. 2.0.0-alpha1 2.0.0
  • 17. Goals: Compatibility ●  But… ○  Semantic Versioning is about API only ■  What about…. ●  Internal/External Interfaces ○  Where is Client Interface when Spark/MapReduce ●  It’s complicated... 2.0.0
  • 18. Goals: Compatibility ●  From Hadoop… Yetus, annotations ○  InterfaceAudience.Public ■  Get/Put/Scan/Connection ○  InterfaceAudience.LimitedPrivate ■  Coprocessors, Replication, etc. ○  InterfaceAudience.Private ■  Internal only ●  What about… ○  Source/Binary compatibility ○  Serializations ■  Wire ■  Formats in HDFS/Zookeeper ○  Dependencies ●  See refguide semver section ○  http://hbase.apache.org/book.html#hbase.versioning 2.0.0
  • 19. Goals: Compatibility ●  Grey areas… ○  Coprocessors ■  Free access HBase core ■  Change to hbase internals => broken Coprocessors ■  InterfaceAudience.LimitedPrivate ○  Published metrics/jmx ○  Protobufs ■  hbase-protocol/hbase-protocol-shaded 2.0.0
  • 20. Goals: Compatibility in 2.0.0 ●  We adhere to SemVer for DML in 2.x ○  Not for DDL ●  hbase-1.x client can work against hbase-2.x cluster ○  Even 1.x Coprocessor Endpoints work on an hbase-2.x cluster ○  Read-only DDL/Admin of hbase-2.x from hbase-1.x client ○  Replication 1 2 works ●  Extensive curation of what is public/private ●  Purged Guava/Protobuf from API ●  Coprocessors ○  Revamped ●  No Singularity! No downtime! Rolling upgrade from hbase1! ○  Experimental! From 1.4.x to 2.1.x has been tested. 2.0.0
  • 21. Goals: Compatibility ●  Still plenty to do ○  Ongoing effort... ○  3.0.0! 2.0.0
  • 22. Goals: Others ●  Scale ○  More Regions, bigger clusters ●  Performance ○  Inline read/write but also macro-aspect: restart, assign, etc. ○  Better resource utilization ■  I/O, RAM ●  Fix primary root of operational woes/bugs ○  Master Region Assignment 2.0.0
  • 23. Insides ●  Scale ○  More Regions, bigger clusters ●  Performance ○  Inline read/write but also macro restart, assign, etc. ○  Better resource utilization ■  I/O, RAM ●  Fix primary root of operational woes/bugs ○  Master Region Assignment ●  Cleanup ○  Spark narrative ○  Interfaces 2.0.0
  • 24. Insides ●  Currently >4500 issues resolved ○  ~3k exclusive to 2.0.0+ 2.0.0
  • 25. Insides: Prerequisites ●  JDK8 only ●  Hadoop-2.7.7 minimum* ○  Works against the coming Hadoop-3.x *Be wary of “...not stable / production ready” Hadoops 2.0.0
  • 26. Insides: Features ●  New Master Core (A.K.A AMv2) ●  Off-heap Read/Write path ●  In-memory Compaction (“Accordion”) ●  And more... .0.02.0.0
  • 27. Insides: Assignment Manager ●  New Master Core (A.K.A AMv2) ○  Assignment Manager v1 (AMv1) root of many operational headaches ●  Prompt assign of millions of Regions, faster startup, larger scale ●  Scrutable/Standalone Testable ●  One hbase:meta writer only, the Master ●  No more intermediate state in ZK ○  At other end of an RPC... ○  Only final state published to hbase:meta ○  No more distributed state: some in Master memory, some in ZK, some in HDFS. ●  New degree of Resilience 2.0.0
  • 28. Insides: Off-heap ●  Smaller JVM heaps, less copying ○  But more accounting! ●  Off-heap Read Path ○  HDFS=>BucketCache=>Outbound Socket ○  ~latency ■  Cache more ■  Less GC, less erratic ●  Off-heap Write Path ○  RPC=>HDFS data kept off-heap ■  Async DFS WAL Client ●  Off-heap ○  Socket Socket ○  Off-heap fragmentation anyone? ○  On by default? 2.0.0 OFF 2.0.0
  • 30. Insides: Accordion ●  In-memory LSM ○  In-memory flush from ConcurrentSkipListMap to read-only pipeline of ‘Segments’ ■  Optional in-memory compaction of Segments ●  Prune deletes/versions early ●  Benefit depends on # of versions ■  Memory ‘breathes’ like an Accordion’s bellows… ■  Less flushing, write amplification ●  Flavors: ○  NONE, BASIC, ENHANCED, ADAPTIVE ●  Symbiosis w/ Offheaping ○  MemStore SLABs off-heap ○  Compact in-memory layout ■  CellChunkMap, NOT CSLM 2.0.0
  • 31. Insides: Miscellaneous ●  New Netty Server/Client chassis ●  Filesystem Quotas on Table/Namespace sizes ○  Per Table, per Namespace -- not per User ○  Violation Policy ○  Via Shell ●  Medium Object Blobs (MOB) ○  Documents, Images, etc. ○  100k => 10MB ○  Consistency, security, snapshot and HBase replication between clusters 2.0.0
  • 32. Insides: Miscellaneous ●  Async DFSClient ○  Fanout, fail-fast ○  Less resources ●  New Async Client ○  NOT asynchbase ○  Complete, Admin ops too. ●  Update+Relocation of core dependencies ○  hbase-thirdparty ■  Guava 0.12 => 0.22 ■  Protobuf 2.5 => 3.3 ■  Netty 2.0.0
  • 33. Insides: Miscellaneous ●  RegionServer Groups (rsgroups) ○  Coarse Isolation ○  Tables x RegionServers ●  WALs and HFiles in different Filesystems ○  AWS EMR HBase on S3 offering (Amazon S3 Storage Mode) 2.0.0
  • 34. Insides: Miscellaneous ●  HBASE-14061 Support CF-level Storage Policy ●  HBASE-17408 Introduce per request limit by number of mutations (ChiaPing Tsai) ●  HBASE-17320 Add inclusive/exclusive support for startRow and endRow of scan ●  HBASE-17174 Refactor the AsyncProcess, BufferedMutatorImpl, and HTable ●  HBASE-16010 Put draining function through Admin API (Matt Warhaftig) ●  HBASE-17262 Refactor RpcServer so as to make it extendable and/or pluggable ●  HBASE-17282 Reduce the redundant requests to meta table ●  HBASE-15432 TableInputFormat - support multi column family scan (Xuesen Liang) ●  HBASE-17313 Add BufferedMutatorParams#clone method (Joep Rottinghuis) ●  HBASE-17277 Allow alternate BufferedMutator implemenation Specify the name of an alternate BufferedMutator implementation by either: ●  HBASE-16700 Allow for coprocessor whitelisting ●  HBASE-17194 Assign the new region to the idle server after splitting (ChiaPing Tsai) ●  HBASE-17110 Improve SimpleLoadBalancer to always take server-level balance into account ●  Revert "HBASE-15513 hbase.hregion.memstore.chunkpool.maxsize is 0.0 by default (Vladimir Rodionov)" ●  HBASE-16733 add hadoop 3.0.0-alpha1 to precommit checks ●  HBASE-13259 mmap() based BucketCache IOEngine ●  HBASE-16447 Replication by namespaces config in peer ●  HBASE-15968 New behavior of versions considering mvcc and ts rather than ts only ●  HBASE-15816 Provide client with ability to set priority on Operations ●  HBASE-16213 A new HFileBlock structure for fast random get ●  HBASE-14004 [Replication] Inconsistency between Memstore and WAL may result in data in remote cluster that is not in the origin ●  HBASE-10367 RegionServer graceful stop / decommissioning ●  HBASE-16290 Dump summary of callQueue content; can help debugging ●  HBASE-14790 Implement a new DFSOutputStream for logging WAL only ●  HBASE-18410 FilterList Improvement ●  HBASE-8329 Limit compaction speed -- throttling is enabled by default. ●  HBASE-14969 Add throughput controller for flush ●  HBASE-19358 Improve the stability of splitting log when do fail over ●  HBASE-18882 [TEST] Run MR branch-1 jobs against hbase2 cluster ●  ... 2.0.0
  • 35. How Coprocessors Changed ●  Pass Interfaces instead of Implementations ○  TableDescriptor instead of HTableDescriptor and Region instead of HRegion ○  Implementations can change w/o breaking CPs ●  Refactor so less boilerplate, more compile-time checking. ●  Some operations changed locus ○  E.g. splits are done by Master in hbase2, RegionServer in hbase1. ●  Purge of PB from CP API ○  Passed raw Protobufs to Coprocessors. Pass POJOs/Interfaces instead. ●  CP API Pruning ○  Removed hooks on internals that were too private to expose. ○  Also hooks that allowed Coprocessor do “crazy” stuff: e.g. create own StoreFiles ●  Deprecated methods have been removed 2.0.0
  • 36. Upgrading to 2.0.x ●  Read Upgrading Chapter in refguide ○  Includes list of notable changes… ■  Changed defaults, namings, etc., based off user feedback ■  Configs/metrics misspellings fixed, some renames ○  Removals ■  DLR ■  prefix-tree encoding ■  Deprecated pre-1.0.0 features and classes ○  Coprocessors Upgrade Recipe ■  Rebuild against new API otherwise they will fail to load and HBase processes will die ●  New CHANGES.md and RELEASENOTES.md (courtesy of Apache Yetus). ●  Before upgrade, run Pre-Upgrade Validator ○  $ bin/hbase pre-upgrade … ●  HBCK1 2.0.0
  • 37. Goals: Others, revisited ●  Scale ○  More Regions, bigger clusters ●  Performance ○  Inline read/write but also macro aspect: restart, assign, etc. ○  Better resource utilization ■  I/O, RAM ●  Fix primary root of operational woes/bugs ○  Master Region Assignment 2.0.0
  • 38. ●  2.0.x ○  Bug-Fixing ■  AMv2 Corner cases, teething... ○  Perf ■  Ongoing ○  Time-based releases ■  Every 6-weeks or so… ■  Just stabilizing… ●  HBCK2 ○  New standalone project 2.0.0
  • 40. 2.1.x ●  Duo Zhang, Release Manager ●  Pv2 based replication peer modification ○  Operation will be synchronous ●  Serial Replication ○  Preserve edit order crossing the chasm ●  Sensible-looking CLASSPATH for your client ●  Lots of AMv2 fixes...
  • 41.
  • 42. Future (3.0.0+): Ease of Use ●  Native SQL support ○  Internal ■  Lightweight SQL support ●  Something like CQL (Apache Calcite, Presto, Apache Derby?) ■  Official Secondary-Index ●  Better implementation with Procedure v2 ○  External ■  Better integration with Spark SQL ●  An official spark connector in the hbase project ■  Phoenix ●  SQL-tier over HBase ●  Configuration ○  Complete dynamic online configuration change; no need of a rolling restart. ○  Templates and Auto-tuning; ergonomic adjustment to suit current load.
  • 43. Future (3.0.0+): Performance ●  Aim at making the best Java LSM storage engine ●  E2E asynchronous ○  Since 2.0.0, we have async client, async WAL ■  Async HDFS API underway ○  We will have: SEDA request handling on server side ■  Better write pipeline ■  Complete at Alibaba, upstream coming soon ○  E2E asynchronous is coming... ●  New Cell format ○  Plumbed end-to-end ○  Better-integrated sequenceid & Cell-Tags ○  More CPU-cache friendly
  • 44. Future (3.0.0+): Scale and Stability ●  Split meta table ○  Distribute i/o and so we can store more metadata ●  New Distributed Log Replay redo ●  Clusters with 1M/10M regions ○  Smaller regions, less write amplification
  • 45. Future (3.0.0+): Internal ●  Framework Refactor ○  Distribution Management Framework ■  Procedure V2 polish ○  SEDA framework ○  Core abstraction ■  Support “embedded” mode ■  Support other storage engine like rocksdb ○  FileSystem Interface Redo ■  Bring into hbase atomic operations like rename rather than ask HDFS do it for us ●  Eliminate trips to NameNode ■  Remove dependency on HDFS, better support Cloud/S3 ●  Hybrid Logical Clocks ●  Externalized Compaction ○  Spark
  • 46. Future (3.0.0+): Join Us! ●  More help needed ○  We need more contributors/committers to make the bright future! ●  Don’t let language block you ○  PMC would like to see world diversity ○  A Chinese HBase group already setup ○  More HBase China meetups coming ■  Hangzhou/Shanghai/Shenzhen... ●  Don’t be shy ○  Why not upstream your great work? ○  What’s in your way? ○  Just let us know ●  Don’t hesitate ○  Join us!
  • 47. Thank you Images: ○  Insides: http://alienanthology.wikia.com/wiki/File:Ash%27s_insides.png ○  Goal: https://commons.wikimedia.org/wiki/File:Goal-com-logo-eps-vector-image.png ○  Future: https://thenextweb.com/insider/2015/12/30/artificial-intelligence-free-money-and-the-end-of-capitalism/ ○  Sea: https://playgroundforlife.files.wordpress.com/2010/08/gerhard-richter-seascape-1998-sfmoma.jpg