SlideShare uma empresa Scribd logo
1 de 59
Baixar para ler offline
DB Apache Kudu
DB
HybridTime
2 © Cloudera, Inc. All rights reserved.
• ( ) / takahiko at cloudera.com
•
• Cloudera
•
• Internet & Network
• RDBMS 1
• NoSQL 2
• Hadoop 3 ←Now!
3 © Cloudera, Inc. All rights reserved.
• Apache Kudu
• Kudu OLTP OLAP HTAP
DB #dbts2017 Kudu
• BI/DWH DB Kudu
Google Spanner
https://www.slideshare.net/Cloudera_jp/apache-kududb-dbts2017
• HybridTime
DB HybridTime
Kudu
© Cloudera, Inc. All rights reserved.
Apache Kudu
5 © Cloudera, Inc. All rights reserved.
• 275 3PB
• 1000 PB
• /
• 1 GB/
• DB
• BLOB
•
• 1000
Kudu
1
...
6 © Cloudera, Inc. All rights reserved.
Kudu
Kudu
(Impala)
(Kudu) (S3)(HDFS)
(Impala) (Spark)
(Hive)
(MapReduce)
(ADLS)
SQL
( DB )
HMS
7 © Cloudera, Inc. All rights reserved.
SQL Kudu
Impala + Kudu
(Impala)
(Kudu) (S3)(HDFS)
(Impala) (Spark)
(Hive)
(MapReduce)
(ADLS)
HMS
• Kudu SQL
• Impala SQL
8 © Cloudera, Inc. All rights reserved.
• Impala SQL Impala Kudu
• Impala Kudu predicate push down
• Kudu SCAN Impala aggregation
SQL
SQL Impala
3
90
9 © Cloudera, Inc. All rights reserved.
Spark Kudu
Spark + Kudu
(Impala)
(Kudu) (S3)(HDFS)
(Impala) (Spark)
(Hive)
(MapReduce)
(ADLS)
HMS
• Spark SQL
Kudu API
• SparkSQL
10 © Cloudera, Inc. All rights reserved.
• Kudu 1
Kudu
Tablet
Kudu
TabletServer
11 © Cloudera, Inc. All rights reserved.
• 1 3
• 3
• Raft
•
•
Tablet
TabletServer
12 © Cloudera, Inc. All rights reserved.
•
• INSERT/UPDATE/UPSERT/DELETE
• DECIMAL
•
•
• Kerberos
•
•
•
•
Kudu
© Cloudera, Inc. All rights reserved.
HTAP: OLTP OLAP Kudu
14 © Cloudera, Inc. All rights reserved.
• OLTP 1TB RAM
•
OLTP OLAP
• 2
DB DB
insert/update/delete
OLTP OLAP DWH BI
select
ETL
15 © Cloudera, Inc. All rights reserved.
Hadoop
• (PB)
HDFS OLAP
• Impala/Hive SQL
• HDFS OLTP HBase
HBase
Hadoop DB
put / delete OLTP OLAP BIselect
HBase ImpalaHadoop
data ingestion
HDFS
ETL
16 © Cloudera, Inc. All rights reserved.
• OLTP? OLAP?
•
•
• OLTP OLAP
• HTAP(Hybrid Transactional/Analytic Processing)
•
• ...
•
• Kudu HTAP
OLTP OLAP
HTAP
HTAP
17 © Cloudera, Inc. All rights reserved.
• OLTP OLAP 1 DB
•
HTAP DB
Kudu
insert/update/delete
HTAP DWH BI
select
Kudu
18 © Cloudera, Inc. All rights reserved.
(HDFS)
SQL
Impala
(Spark Streaming)
(Flume)
ETL SQL
Hive/Spark
DB DB
(Kudu)
IoT
(Flume)
BI
BI
BI
ETL
MQTT
BrokerIoT
BI
DB DB
/
DB DB
(Kafka)
( )
© Cloudera, Inc. All rights reserved.
DB
20 © Cloudera, Inc. All rights reserved.
• DB
•
• ! f
•
• 12:30:00 < 12:30:03
• 2 < 3
• Log Sequence Number LSN
• LSN
• DB
• DB LSN
• DB
DB
12:30:00 12:30:03
2 3
! f
21 © Cloudera, Inc. All rights reserved.
• Physical Clock
•
•
•
•
• Logical Clock)
•
• ...
•
•
•
DB
B
A
A B
22 © Cloudera, Inc. All rights reserved.
•
•
•
• +1
- Lamport Clock
2
3
6
24
16
61
54
69
70
12 24 48423630
8 32 40 48
50 703020
23 © Cloudera, Inc. All rights reserved.
•
• +1
•
•
•
- Vector Clock
2
3
{1,0,0}
{1,1,0}
{1,2,0}
{1,2,1} {1,2,2}
{1,4,2}
{1,3,0}
{2,3,0} {3,3,0}
{3,3,3}
{1,5,2}
{5,5,4}
{5,5,2}{4,5,2}
24 © Cloudera, Inc. All rights reserved.
•
•
25 © Cloudera, Inc. All rights reserved.
•
•
•
12:30:00
12:29:59
A B
B
!
f
26 © Cloudera, Inc. All rights reserved.
• Spanner: Google’s Globally Distributed Database
• DB
ACID
• GPS
• TrueTime API
error bound
Google Spanner
27 © Cloudera, Inc. All rights reserved.
• API
• GPS
•
• TrueTime API TT.now() TTinterval
• TT.now()
• Google DC 1 7ms 4ms
Google Spanner TrueTime API
earliest latest
TT"#$%&'(): %(&)"%+$, )($%+$
TT,now()
--
28 © Cloudera, Inc. All rights reserved.
• commit wait
• TrueTime API
• e f
2"
• External Consistency
• f e T $ < T &
Google Spanner commit-wait
$
" & "
2"
&
$
&
2"
2"
& → $
© Cloudera, Inc. All rights reserved.
Technical Report: HybridTime - Accessible Global Consistency
with High Clock Uncertainty
30 © Cloudera, Inc. All rights reserved.
• Technical Report: HybridTime - Accessible Global Consistency with High Clock
Uncertainty
•
• Google DC
• HybridTime NTP DB
• Kudu Kudu
HybridTime
•
• 2014 (
)
Kudu
31 © Cloudera, Inc. All rights reserved.
• Google Spanner DC
• Amazon Dynamo Cassandra DB
Eventual Consistency
•
•
•
[ ] DC DB
32 © Cloudera, Inc. All rights reserved.
• Consistency
•
•
• CAP Consistency ACID Consistency/Isolation
• Consistency
• (Anomaly)
• Lost Update, Dirty Read, Non-Repeatable, Phantom Read, Read Skew, Write Skew, etc...
•
• Lost Update SELECT FOR UPDATE
Consistency
33 © Cloudera, Inc. All rights reserved.
• Lamport Clocks Vector Clocks
•
•
•
• RDB Point-in-Time
• Vector Clocks
[ ]
34 © Cloudera, Inc. All rights reserved.
• Spinnaker Paxos
•
•
• Spanner commit-wait
•
• GPS
•
[ ]
35 © Cloudera, Inc. All rights reserved.
• HybridTime
•
•
• Pint-in-time
• Lamport Clock
• HybridTime
• Vector Clocks Lamport Clocks 2
( commit-wait )
[ ] HybridTime
HTC: { , }
36 © Cloudera, Inc. All rights reserved.
•
• NTP
• NTP
• commit-wait
•
• NTP
• commit-wait
•
•
• Kudu DB
HybridTime
37 © Cloudera, Inc. All rights reserved.
• !"# $ i e
• !"'() $ e
• *# $ i e
• 1:
• 2:
[ ] HybridTime
38 © Cloudera, Inc. All rights reserved.
• HybridTime HTC
• (error)
• Spanner TrueTime API
• HybridTime
• NTP
HybridTime
39 © Cloudera, Inc. All rights reserved.
• ntp_adjtime
• timex
• maxerror
HybridTime
40 © Cloudera, Inc. All rights reserved.
• Kudu macOS
• macOS OS
macOS
41 © Cloudera, Inc. All rights reserved.
Kudu
42 © Cloudera, Inc. All rights reserved.
• 1 2
• 2 1
• !" − !$ ...
•
• %$
• & = !" − !$ − %$
NTP
2
1
100*+
160*+
T1
100ms
160ms 160-100 = 60ms
T2
%$
NG
43 © Cloudera, Inc. All rights reserved.
• !"
• RTT: !
#
$
• ! = !" + !$ = '( − '" − ('+ − '$)
#
$
= !" =
-./-0 /(-1/-2)
$
• 3
• 3 = '$ − '" −
-./-0 / -1/-2
$
• 3 =
$ -2/-0
$
−
-./-0 / -1/-2
$
• 3 =
$-2/$-0/-.4-04-1/-2
$
• 3 =
-2/-0/-.4-1
$
• 3 =
-2/-0 4 -1/-.
$
NTP
NTP
T2 T3
T4T1
NTP
10078
15078 16078
11078
16578
11578 12578
17578
7078 8078 8578 9578
+1078 +1078
+578
!" !$
1) 50ms
2) -30ms
RTT20ms
44 © Cloudera, Inc. All rights reserved.
•
•
•
• ! =
#$#%#&& '(#)$%#$*)
)
= 41./
• 50ms -9ms
• RTT
0
)
• ±
0
)
NTP
T2 T3
T4T1
NTP
100./ 101./ 106./ 125./
+1./ +19./
+5./
8# 8)
150./ 151./ 156./ 175./) 50ms
RTT20ms
45 © Cloudera, Inc. All rights reserved.
•
• 1
•
•
•
• NTP !
•
• ! +
#
$
NTP
46 © Cloudera, Inc. All rights reserved.
• NTP
• DC NTP or Google Public NTP
• AWS Amazon Time Sync Service
• Azure Hyper-V time synchronization Google Public NTP
• GCE Google Public NTP
•
NTP
47 © Cloudera, Inc. All rights reserved.
• UPDATE
• 2
• HybridTime
+1
[ ] HybridTime
48 © Cloudera, Inc. All rights reserved.
• !" #, % !" HTC #, %
[ ] HybridTime
49 © Cloudera, Inc. All rights reserved.
• HTC
• HTC
• ) ! → #
• −%& ! < !(()( # < %* #
[ ] Kudu HybridTime
j
i
#
−%& !
%* #−%& !
!
%& !
50 © Cloudera, Inc. All rights reserved.
• KUDU-146 Deal with leap seconds
• leap second
• Stratum 0 NTP Leap Indicator OS
• 23:59:59 -> 23:59:60 -> 00:00:00
• 23:59:59 -> 23:59:59-> 00:00:00
• 2 1
• HybridTime propagate
• Kudu commit-wait
• wait ms
1
• NTP TIME_INS/TIME_OOP max error
• Leap Smearing
https://issues.apache.org/jira/browse/KUDU-430
51 © Cloudera, Inc. All rights reserved.
• HybridTime
•
• Kudu RDB
• ACID
• Commit-wait
• NTP
• CLIENT_PROPAGETED
• HybridTime
• HybridTime propagate
Kudu
52 © Cloudera, Inc. All rights reserved.
• Kudu MVCC Multi-version Concurrency Control
•
• WAL REDO UNDO
•
• READ_LATEST
•
• READ_AT_SNAPSHOT
• MVCC
•
Repeatable Read
Kudu
53 © Cloudera, Inc. All rights reserved.
Kudu
https://blog.cloudera.co.jp/11c3a749a81b
54 © Cloudera, Inc. All rights reserved.
• YCSB
•
• 3 8
• insert 60%, update 20%, single-row read 20%
•
• GCE: nl-standard-8 x10
• RAM 30GB
• Disk 350GB
• NTP
• GCE
[ ]
NTP
55 © Cloudera, Inc. All rights reserved.
[ ]
HybridTime Commit Wait Commit Wait
Clock Error
© Cloudera, Inc. All rights reserved.
57 © Cloudera, Inc. All rights reserved.
• OLTP OLAP 1 DB Kudu
• HybridTime
DB
• HybridTime → P.36
• Kudu
HybridTime Serializable
• OLAP Kudu #dbts2017
• 11 6 Cloudera World Tokyo 2018
Kudu
Kudu
58 © Cloudera, Inc. All rights reserved.
Cloudera World Tokyo 2018
http://www.clouderaworldtokyo.com/
THANK YOU

Mais conteúdo relacionado

Mais procurados

TiDBのトランザクション
TiDBのトランザクションTiDBのトランザクション
TiDBのトランザクションAkio Mitobe
 
マイクロサービスにおける 結果整合性との戦い
マイクロサービスにおける 結果整合性との戦いマイクロサービスにおける 結果整合性との戦い
マイクロサービスにおける 結果整合性との戦いota42y
 
Amazon Athena で実現する データ分析の広がり
Amazon Athena で実現する データ分析の広がりAmazon Athena で実現する データ分析の広がり
Amazon Athena で実現する データ分析の広がりAmazon Web Services Japan
 
pg_bigmで全文検索するときに気を付けたい5つのポイント(第23回PostgreSQLアンカンファレンス@オンライン 発表資料)
pg_bigmで全文検索するときに気を付けたい5つのポイント(第23回PostgreSQLアンカンファレンス@オンライン 発表資料)pg_bigmで全文検索するときに気を付けたい5つのポイント(第23回PostgreSQLアンカンファレンス@オンライン 発表資料)
pg_bigmで全文検索するときに気を付けたい5つのポイント(第23回PostgreSQLアンカンファレンス@オンライン 発表資料)NTT DATA Technology & Innovation
 
Airflowを広告データのワークフローエンジンとして運用してみた話
Airflowを広告データのワークフローエンジンとして運用してみた話Airflowを広告データのワークフローエンジンとして運用してみた話
Airflowを広告データのワークフローエンジンとして運用してみた話Katsunori Kanda
 
A critique of ansi sql isolation levels 解説公開用
A critique of ansi sql isolation levels 解説公開用A critique of ansi sql isolation levels 解説公開用
A critique of ansi sql isolation levels 解説公開用Takashi Kambayashi
 
分散システムについて語らせてくれ
分散システムについて語らせてくれ分散システムについて語らせてくれ
分散システムについて語らせてくれKumazaki Hiroki
 
分散システム第7章(後半)
分散システム第7章(後半)分散システム第7章(後半)
分散システム第7章(後半)Kenta Hattori
 
事例で学ぶApache Cassandra
事例で学ぶApache Cassandra事例で学ぶApache Cassandra
事例で学ぶApache CassandraYuki Morishita
 
AlloyDBを触ってみた!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)
AlloyDBを触ってみた!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)AlloyDBを触ってみた!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)
AlloyDBを触ってみた!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)NTT DATA Technology & Innovation
 
PostgreSQLのリカバリ超入門(もしくはWAL、CHECKPOINT、オンラインバックアップの仕組み)
PostgreSQLのリカバリ超入門(もしくはWAL、CHECKPOINT、オンラインバックアップの仕組み)PostgreSQLのリカバリ超入門(もしくはWAL、CHECKPOINT、オンラインバックアップの仕組み)
PostgreSQLのリカバリ超入門(もしくはWAL、CHECKPOINT、オンラインバックアップの仕組み)Hironobu Suzuki
 
地理分散DBについて
地理分散DBについて地理分散DBについて
地理分散DBについてKumazaki Hiroki
 
Cassandraのしくみ データの読み書き編
Cassandraのしくみ データの読み書き編Cassandraのしくみ データの読み書き編
Cassandraのしくみ データの読み書き編Yuki Morishita
 
20190828 AWS Black Belt Online Seminar Amazon Aurora with PostgreSQL Compatib...
20190828 AWS Black Belt Online Seminar Amazon Aurora with PostgreSQL Compatib...20190828 AWS Black Belt Online Seminar Amazon Aurora with PostgreSQL Compatib...
20190828 AWS Black Belt Online Seminar Amazon Aurora with PostgreSQL Compatib...Amazon Web Services Japan
 

Mais procurados (20)

MVSR Schedulerを作るための指針
MVSR Schedulerを作るための指針MVSR Schedulerを作るための指針
MVSR Schedulerを作るための指針
 
TiDBのトランザクション
TiDBのトランザクションTiDBのトランザクション
TiDBのトランザクション
 
マイクロサービスにおける 結果整合性との戦い
マイクロサービスにおける 結果整合性との戦いマイクロサービスにおける 結果整合性との戦い
マイクロサービスにおける 結果整合性との戦い
 
PostgreSQLの運用・監視にまつわるエトセトラ
PostgreSQLの運用・監視にまつわるエトセトラPostgreSQLの運用・監視にまつわるエトセトラ
PostgreSQLの運用・監視にまつわるエトセトラ
 
Amazon Athena で実現する データ分析の広がり
Amazon Athena で実現する データ分析の広がりAmazon Athena で実現する データ分析の広がり
Amazon Athena で実現する データ分析の広がり
 
At least onceってぶっちゃけ問題の先送りだったよね #kafkajp
At least onceってぶっちゃけ問題の先送りだったよね #kafkajpAt least onceってぶっちゃけ問題の先送りだったよね #kafkajp
At least onceってぶっちゃけ問題の先送りだったよね #kafkajp
 
pg_bigmで全文検索するときに気を付けたい5つのポイント(第23回PostgreSQLアンカンファレンス@オンライン 発表資料)
pg_bigmで全文検索するときに気を付けたい5つのポイント(第23回PostgreSQLアンカンファレンス@オンライン 発表資料)pg_bigmで全文検索するときに気を付けたい5つのポイント(第23回PostgreSQLアンカンファレンス@オンライン 発表資料)
pg_bigmで全文検索するときに気を付けたい5つのポイント(第23回PostgreSQLアンカンファレンス@オンライン 発表資料)
 
Apache Spark 2.4 and 3.0 What's Next?
Apache Spark 2.4 and 3.0  What's Next? Apache Spark 2.4 and 3.0  What's Next?
Apache Spark 2.4 and 3.0 What's Next?
 
Airflowを広告データのワークフローエンジンとして運用してみた話
Airflowを広告データのワークフローエンジンとして運用してみた話Airflowを広告データのワークフローエンジンとして運用してみた話
Airflowを広告データのワークフローエンジンとして運用してみた話
 
A critique of ansi sql isolation levels 解説公開用
A critique of ansi sql isolation levels 解説公開用A critique of ansi sql isolation levels 解説公開用
A critique of ansi sql isolation levels 解説公開用
 
Hadoopエコシステムのデータストア振り返り
Hadoopエコシステムのデータストア振り返りHadoopエコシステムのデータストア振り返り
Hadoopエコシステムのデータストア振り返り
 
分散システムについて語らせてくれ
分散システムについて語らせてくれ分散システムについて語らせてくれ
分散システムについて語らせてくれ
 
Hive on Tezのベストプラクティス
Hive on TezのベストプラクティスHive on Tezのベストプラクティス
Hive on Tezのベストプラクティス
 
分散システム第7章(後半)
分散システム第7章(後半)分散システム第7章(後半)
分散システム第7章(後半)
 
事例で学ぶApache Cassandra
事例で学ぶApache Cassandra事例で学ぶApache Cassandra
事例で学ぶApache Cassandra
 
AlloyDBを触ってみた!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)
AlloyDBを触ってみた!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)AlloyDBを触ってみた!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)
AlloyDBを触ってみた!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)
 
PostgreSQLのリカバリ超入門(もしくはWAL、CHECKPOINT、オンラインバックアップの仕組み)
PostgreSQLのリカバリ超入門(もしくはWAL、CHECKPOINT、オンラインバックアップの仕組み)PostgreSQLのリカバリ超入門(もしくはWAL、CHECKPOINT、オンラインバックアップの仕組み)
PostgreSQLのリカバリ超入門(もしくはWAL、CHECKPOINT、オンラインバックアップの仕組み)
 
地理分散DBについて
地理分散DBについて地理分散DBについて
地理分散DBについて
 
Cassandraのしくみ データの読み書き編
Cassandraのしくみ データの読み書き編Cassandraのしくみ データの読み書き編
Cassandraのしくみ データの読み書き編
 
20190828 AWS Black Belt Online Seminar Amazon Aurora with PostgreSQL Compatib...
20190828 AWS Black Belt Online Seminar Amazon Aurora with PostgreSQL Compatib...20190828 AWS Black Belt Online Seminar Amazon Aurora with PostgreSQL Compatib...
20190828 AWS Black Belt Online Seminar Amazon Aurora with PostgreSQL Compatib...
 

Semelhante a 分散DB Apache Kuduのアーキテクチャ DBの性能と一貫性を両立させる仕組み 「HybridTime」とは

Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Cloudera, Inc.
 
NYC HUG - Application Architectures with Apache Hadoop
NYC HUG - Application Architectures with Apache HadoopNYC HUG - Application Architectures with Apache Hadoop
NYC HUG - Application Architectures with Apache Hadoopmarkgrover
 
Querying multiple distributed storage systems with Apache Hive robustly
Querying multiple distributed storage systems with Apache Hive robustlyQuerying multiple distributed storage systems with Apache Hive robustly
Querying multiple distributed storage systems with Apache Hive robustlyAshish Singh
 
Getting Apache Spark Customers to Production
Getting Apache Spark Customers to ProductionGetting Apache Spark Customers to Production
Getting Apache Spark Customers to ProductionCloudera, Inc.
 
Nodejsvault austin2019
Nodejsvault austin2019Nodejsvault austin2019
Nodejsvault austin2019Taswar Bhatti
 
[HUN] 2023_Hacker_Suli_Meetup_Cloud_DFIR_Alapok.pptx
[HUN] 2023_Hacker_Suli_Meetup_Cloud_DFIR_Alapok.pptx[HUN] 2023_Hacker_Suli_Meetup_Cloud_DFIR_Alapok.pptx
[HUN] 2023_Hacker_Suli_Meetup_Cloud_DFIR_Alapok.pptxhackersuli
 
いそがしいひとのための Microsoft Ignite 2018 + 最新情報 Data & AI 編
いそがしいひとのための Microsoft Ignite 2018 + 最新情報 Data & AI 編いそがしいひとのための Microsoft Ignite 2018 + 最新情報 Data & AI 編
いそがしいひとのための Microsoft Ignite 2018 + 最新情報 Data & AI 編Miho Yamamoto
 
SCALE12X Build a Cloud Day: Chef: The Swiss Army Knife of Cloud Infrastructure
SCALE12X Build a Cloud Day: Chef: The Swiss Army Knife of Cloud InfrastructureSCALE12X Build a Cloud Day: Chef: The Swiss Army Knife of Cloud Infrastructure
SCALE12X Build a Cloud Day: Chef: The Swiss Army Knife of Cloud InfrastructureMatt Ray
 
DB Time, Average Active Sessions, and ASH Math - Oracle performance fundamentals
DB Time, Average Active Sessions, and ASH Math - Oracle performance fundamentalsDB Time, Average Active Sessions, and ASH Math - Oracle performance fundamentals
DB Time, Average Active Sessions, and ASH Math - Oracle performance fundamentalsJohn Beresniewicz
 
Big Data Day LA 2015 - Brainwashed: Building an IDE for Feature Engineering b...
Big Data Day LA 2015 - Brainwashed: Building an IDE for Feature Engineering b...Big Data Day LA 2015 - Brainwashed: Building an IDE for Feature Engineering b...
Big Data Day LA 2015 - Brainwashed: Building an IDE for Feature Engineering b...Data Con LA
 
Fraud Detection using Hadoop
Fraud Detection using HadoopFraud Detection using Hadoop
Fraud Detection using Hadoophadooparchbook
 
Don’t Forget About Your Past—Optimizing Apache Druid Performance With Neil Bu...
Don’t Forget About Your Past—Optimizing Apache Druid Performance With Neil Bu...Don’t Forget About Your Past—Optimizing Apache Druid Performance With Neil Bu...
Don’t Forget About Your Past—Optimizing Apache Druid Performance With Neil Bu...HostedbyConfluent
 
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...DataStax
 
Fraud Detection for Israel BigThings Meetup
Fraud Detection  for Israel BigThings MeetupFraud Detection  for Israel BigThings Meetup
Fraud Detection for Israel BigThings MeetupGwen (Chen) Shapira
 
Strata NY 2014 - Architectural considerations for Hadoop applications tutorial
Strata NY 2014 - Architectural considerations for Hadoop applications tutorialStrata NY 2014 - Architectural considerations for Hadoop applications tutorial
Strata NY 2014 - Architectural considerations for Hadoop applications tutorialhadooparchbook
 
Hadoop Application Architectures tutorial at Big DataService 2015
Hadoop Application Architectures tutorial at Big DataService 2015Hadoop Application Architectures tutorial at Big DataService 2015
Hadoop Application Architectures tutorial at Big DataService 2015hadooparchbook
 
What's New in Apache Spark 2.3 & Why Should You Care
What's New in Apache Spark 2.3 & Why Should You CareWhat's New in Apache Spark 2.3 & Why Should You Care
What's New in Apache Spark 2.3 & Why Should You CareDatabricks
 
Oracle Performance Tuning Fundamentals
Oracle Performance Tuning FundamentalsOracle Performance Tuning Fundamentals
Oracle Performance Tuning FundamentalsCarlos Sierra
 

Semelhante a 分散DB Apache Kuduのアーキテクチャ DBの性能と一貫性を両立させる仕組み 「HybridTime」とは (20)

Empower Hive with Spark
Empower Hive with SparkEmpower Hive with Spark
Empower Hive with Spark
 
Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive


 
NYC HUG - Application Architectures with Apache Hadoop
NYC HUG - Application Architectures with Apache HadoopNYC HUG - Application Architectures with Apache Hadoop
NYC HUG - Application Architectures with Apache Hadoop
 
Querying multiple distributed storage systems with Apache Hive robustly
Querying multiple distributed storage systems with Apache Hive robustlyQuerying multiple distributed storage systems with Apache Hive robustly
Querying multiple distributed storage systems with Apache Hive robustly
 
Getting Apache Spark Customers to Production
Getting Apache Spark Customers to ProductionGetting Apache Spark Customers to Production
Getting Apache Spark Customers to Production
 
Nodejsvault austin2019
Nodejsvault austin2019Nodejsvault austin2019
Nodejsvault austin2019
 
[HUN] 2023_Hacker_Suli_Meetup_Cloud_DFIR_Alapok.pptx
[HUN] 2023_Hacker_Suli_Meetup_Cloud_DFIR_Alapok.pptx[HUN] 2023_Hacker_Suli_Meetup_Cloud_DFIR_Alapok.pptx
[HUN] 2023_Hacker_Suli_Meetup_Cloud_DFIR_Alapok.pptx
 
いそがしいひとのための Microsoft Ignite 2018 + 最新情報 Data & AI 編
いそがしいひとのための Microsoft Ignite 2018 + 最新情報 Data & AI 編いそがしいひとのための Microsoft Ignite 2018 + 最新情報 Data & AI 編
いそがしいひとのための Microsoft Ignite 2018 + 最新情報 Data & AI 編
 
SCALE12X Build a Cloud Day: Chef: The Swiss Army Knife of Cloud Infrastructure
SCALE12X Build a Cloud Day: Chef: The Swiss Army Knife of Cloud InfrastructureSCALE12X Build a Cloud Day: Chef: The Swiss Army Knife of Cloud Infrastructure
SCALE12X Build a Cloud Day: Chef: The Swiss Army Knife of Cloud Infrastructure
 
DB Time, Average Active Sessions, and ASH Math - Oracle performance fundamentals
DB Time, Average Active Sessions, and ASH Math - Oracle performance fundamentalsDB Time, Average Active Sessions, and ASH Math - Oracle performance fundamentals
DB Time, Average Active Sessions, and ASH Math - Oracle performance fundamentals
 
Big Data Day LA 2015 - Brainwashed: Building an IDE for Feature Engineering b...
Big Data Day LA 2015 - Brainwashed: Building an IDE for Feature Engineering b...Big Data Day LA 2015 - Brainwashed: Building an IDE for Feature Engineering b...
Big Data Day LA 2015 - Brainwashed: Building an IDE for Feature Engineering b...
 
Fraud Detection using Hadoop
Fraud Detection using HadoopFraud Detection using Hadoop
Fraud Detection using Hadoop
 
Don’t Forget About Your Past—Optimizing Apache Druid Performance With Neil Bu...
Don’t Forget About Your Past—Optimizing Apache Druid Performance With Neil Bu...Don’t Forget About Your Past—Optimizing Apache Druid Performance With Neil Bu...
Don’t Forget About Your Past—Optimizing Apache Druid Performance With Neil Bu...
 
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
 
Fraud Detection for Israel BigThings Meetup
Fraud Detection  for Israel BigThings MeetupFraud Detection  for Israel BigThings Meetup
Fraud Detection for Israel BigThings Meetup
 
Strata NY 2014 - Architectural considerations for Hadoop applications tutorial
Strata NY 2014 - Architectural considerations for Hadoop applications tutorialStrata NY 2014 - Architectural considerations for Hadoop applications tutorial
Strata NY 2014 - Architectural considerations for Hadoop applications tutorial
 
Spark etl
Spark etlSpark etl
Spark etl
 
Hadoop Application Architectures tutorial at Big DataService 2015
Hadoop Application Architectures tutorial at Big DataService 2015Hadoop Application Architectures tutorial at Big DataService 2015
Hadoop Application Architectures tutorial at Big DataService 2015
 
What's New in Apache Spark 2.3 & Why Should You Care
What's New in Apache Spark 2.3 & Why Should You CareWhat's New in Apache Spark 2.3 & Why Should You Care
What's New in Apache Spark 2.3 & Why Should You Care
 
Oracle Performance Tuning Fundamentals
Oracle Performance Tuning FundamentalsOracle Performance Tuning Fundamentals
Oracle Performance Tuning Fundamentals
 

Mais de Cloudera Japan

Impala + Kudu を用いたデータウェアハウス構築の勘所 (仮)
Impala + Kudu を用いたデータウェアハウス構築の勘所 (仮)Impala + Kudu を用いたデータウェアハウス構築の勘所 (仮)
Impala + Kudu を用いたデータウェアハウス構築の勘所 (仮)Cloudera Japan
 
機械学習の定番プラットフォームSparkの紹介
機械学習の定番プラットフォームSparkの紹介機械学習の定番プラットフォームSparkの紹介
機械学習の定番プラットフォームSparkの紹介Cloudera Japan
 
HDFS Supportaiblity Improvements
HDFS Supportaiblity ImprovementsHDFS Supportaiblity Improvements
HDFS Supportaiblity ImprovementsCloudera Japan
 
Apache Impalaパフォーマンスチューニング #dbts2018
Apache Impalaパフォーマンスチューニング #dbts2018Apache Impalaパフォーマンスチューニング #dbts2018
Apache Impalaパフォーマンスチューニング #dbts2018Cloudera Japan
 
Apache Hadoop YARNとマルチテナントにおけるリソース管理
Apache Hadoop YARNとマルチテナントにおけるリソース管理Apache Hadoop YARNとマルチテナントにおけるリソース管理
Apache Hadoop YARNとマルチテナントにおけるリソース管理Cloudera Japan
 
HBase Across the World #LINE_DM
HBase Across the World #LINE_DMHBase Across the World #LINE_DM
HBase Across the World #LINE_DMCloudera Japan
 
Cloudera のサポートエンジニアリング #supennight
Cloudera のサポートエンジニアリング #supennightCloudera のサポートエンジニアリング #supennight
Cloudera のサポートエンジニアリング #supennightCloudera Japan
 
Train, predict, serve: How to go into production your machine learning model
Train, predict, serve: How to go into production your machine learning modelTrain, predict, serve: How to go into production your machine learning model
Train, predict, serve: How to go into production your machine learning modelCloudera Japan
 
Cloudera in the Cloud #CWT2017
Cloudera in the Cloud #CWT2017Cloudera in the Cloud #CWT2017
Cloudera in the Cloud #CWT2017Cloudera Japan
 
先行事例から学ぶ IoT / ビッグデータの始め方
先行事例から学ぶ IoT / ビッグデータの始め方先行事例から学ぶ IoT / ビッグデータの始め方
先行事例から学ぶ IoT / ビッグデータの始め方Cloudera Japan
 
Clouderaが提供するエンタープライズ向け運用、データ管理ツールの使い方 #CW2017
Clouderaが提供するエンタープライズ向け運用、データ管理ツールの使い方 #CW2017Clouderaが提供するエンタープライズ向け運用、データ管理ツールの使い方 #CW2017
Clouderaが提供するエンタープライズ向け運用、データ管理ツールの使い方 #CW2017Cloudera Japan
 
How to go into production your machine learning models? #CWT2017
How to go into production your machine learning models? #CWT2017How to go into production your machine learning models? #CWT2017
How to go into production your machine learning models? #CWT2017Cloudera Japan
 
Apache Kudu - Updatable Analytical Storage #rakutentech
Apache Kudu - Updatable Analytical Storage #rakutentechApache Kudu - Updatable Analytical Storage #rakutentech
Apache Kudu - Updatable Analytical Storage #rakutentechCloudera Japan
 
Hue 4.0 / Hue Meetup Tokyo #huejp
Hue 4.0 / Hue Meetup Tokyo #huejpHue 4.0 / Hue Meetup Tokyo #huejp
Hue 4.0 / Hue Meetup Tokyo #huejpCloudera Japan
 
Apache Kuduは何がそんなに「速い」DBなのか? #dbts2017
Apache Kuduは何がそんなに「速い」DBなのか? #dbts2017Apache Kuduは何がそんなに「速い」DBなのか? #dbts2017
Apache Kuduは何がそんなに「速い」DBなのか? #dbts2017Cloudera Japan
 
Cloudera Data Science WorkbenchとPySparkで 好きなPythonライブラリを 分散で使う #cadeda
Cloudera Data Science WorkbenchとPySparkで 好きなPythonライブラリを 分散で使う #cadedaCloudera Data Science WorkbenchとPySparkで 好きなPythonライブラリを 分散で使う #cadeda
Cloudera Data Science WorkbenchとPySparkで 好きなPythonライブラリを 分散で使う #cadedaCloudera Japan
 
Cloudera + MicrosoftでHadoopするのがイイらしい。 #CWT2016
Cloudera + MicrosoftでHadoopするのがイイらしい。 #CWT2016Cloudera + MicrosoftでHadoopするのがイイらしい。 #CWT2016
Cloudera + MicrosoftでHadoopするのがイイらしい。 #CWT2016Cloudera Japan
 
Cloud Native Hadoop #cwt2016
Cloud Native Hadoop #cwt2016Cloud Native Hadoop #cwt2016
Cloud Native Hadoop #cwt2016Cloudera Japan
 
大規模データに対するデータサイエンスの進め方 #CWT2016
大規模データに対するデータサイエンスの進め方 #CWT2016大規模データに対するデータサイエンスの進め方 #CWT2016
大規模データに対するデータサイエンスの進め方 #CWT2016Cloudera Japan
 
#cwt2016 Apache Kudu 構成とテーブル設計
#cwt2016 Apache Kudu 構成とテーブル設計#cwt2016 Apache Kudu 構成とテーブル設計
#cwt2016 Apache Kudu 構成とテーブル設計Cloudera Japan
 

Mais de Cloudera Japan (20)

Impala + Kudu を用いたデータウェアハウス構築の勘所 (仮)
Impala + Kudu を用いたデータウェアハウス構築の勘所 (仮)Impala + Kudu を用いたデータウェアハウス構築の勘所 (仮)
Impala + Kudu を用いたデータウェアハウス構築の勘所 (仮)
 
機械学習の定番プラットフォームSparkの紹介
機械学習の定番プラットフォームSparkの紹介機械学習の定番プラットフォームSparkの紹介
機械学習の定番プラットフォームSparkの紹介
 
HDFS Supportaiblity Improvements
HDFS Supportaiblity ImprovementsHDFS Supportaiblity Improvements
HDFS Supportaiblity Improvements
 
Apache Impalaパフォーマンスチューニング #dbts2018
Apache Impalaパフォーマンスチューニング #dbts2018Apache Impalaパフォーマンスチューニング #dbts2018
Apache Impalaパフォーマンスチューニング #dbts2018
 
Apache Hadoop YARNとマルチテナントにおけるリソース管理
Apache Hadoop YARNとマルチテナントにおけるリソース管理Apache Hadoop YARNとマルチテナントにおけるリソース管理
Apache Hadoop YARNとマルチテナントにおけるリソース管理
 
HBase Across the World #LINE_DM
HBase Across the World #LINE_DMHBase Across the World #LINE_DM
HBase Across the World #LINE_DM
 
Cloudera のサポートエンジニアリング #supennight
Cloudera のサポートエンジニアリング #supennightCloudera のサポートエンジニアリング #supennight
Cloudera のサポートエンジニアリング #supennight
 
Train, predict, serve: How to go into production your machine learning model
Train, predict, serve: How to go into production your machine learning modelTrain, predict, serve: How to go into production your machine learning model
Train, predict, serve: How to go into production your machine learning model
 
Cloudera in the Cloud #CWT2017
Cloudera in the Cloud #CWT2017Cloudera in the Cloud #CWT2017
Cloudera in the Cloud #CWT2017
 
先行事例から学ぶ IoT / ビッグデータの始め方
先行事例から学ぶ IoT / ビッグデータの始め方先行事例から学ぶ IoT / ビッグデータの始め方
先行事例から学ぶ IoT / ビッグデータの始め方
 
Clouderaが提供するエンタープライズ向け運用、データ管理ツールの使い方 #CW2017
Clouderaが提供するエンタープライズ向け運用、データ管理ツールの使い方 #CW2017Clouderaが提供するエンタープライズ向け運用、データ管理ツールの使い方 #CW2017
Clouderaが提供するエンタープライズ向け運用、データ管理ツールの使い方 #CW2017
 
How to go into production your machine learning models? #CWT2017
How to go into production your machine learning models? #CWT2017How to go into production your machine learning models? #CWT2017
How to go into production your machine learning models? #CWT2017
 
Apache Kudu - Updatable Analytical Storage #rakutentech
Apache Kudu - Updatable Analytical Storage #rakutentechApache Kudu - Updatable Analytical Storage #rakutentech
Apache Kudu - Updatable Analytical Storage #rakutentech
 
Hue 4.0 / Hue Meetup Tokyo #huejp
Hue 4.0 / Hue Meetup Tokyo #huejpHue 4.0 / Hue Meetup Tokyo #huejp
Hue 4.0 / Hue Meetup Tokyo #huejp
 
Apache Kuduは何がそんなに「速い」DBなのか? #dbts2017
Apache Kuduは何がそんなに「速い」DBなのか? #dbts2017Apache Kuduは何がそんなに「速い」DBなのか? #dbts2017
Apache Kuduは何がそんなに「速い」DBなのか? #dbts2017
 
Cloudera Data Science WorkbenchとPySparkで 好きなPythonライブラリを 分散で使う #cadeda
Cloudera Data Science WorkbenchとPySparkで 好きなPythonライブラリを 分散で使う #cadedaCloudera Data Science WorkbenchとPySparkで 好きなPythonライブラリを 分散で使う #cadeda
Cloudera Data Science WorkbenchとPySparkで 好きなPythonライブラリを 分散で使う #cadeda
 
Cloudera + MicrosoftでHadoopするのがイイらしい。 #CWT2016
Cloudera + MicrosoftでHadoopするのがイイらしい。 #CWT2016Cloudera + MicrosoftでHadoopするのがイイらしい。 #CWT2016
Cloudera + MicrosoftでHadoopするのがイイらしい。 #CWT2016
 
Cloud Native Hadoop #cwt2016
Cloud Native Hadoop #cwt2016Cloud Native Hadoop #cwt2016
Cloud Native Hadoop #cwt2016
 
大規模データに対するデータサイエンスの進め方 #CWT2016
大規模データに対するデータサイエンスの進め方 #CWT2016大規模データに対するデータサイエンスの進め方 #CWT2016
大規模データに対するデータサイエンスの進め方 #CWT2016
 
#cwt2016 Apache Kudu 構成とテーブル設計
#cwt2016 Apache Kudu 構成とテーブル設計#cwt2016 Apache Kudu 構成とテーブル設計
#cwt2016 Apache Kudu 構成とテーブル設計
 

Último

Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGMANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGSIVASHANKAR N
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxfenichawla
 

Último (20)

Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGMANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 

分散DB Apache Kuduのアーキテクチャ DBの性能と一貫性を両立させる仕組み 「HybridTime」とは

  • 2. 2 © Cloudera, Inc. All rights reserved. • ( ) / takahiko at cloudera.com • • Cloudera • • Internet & Network • RDBMS 1 • NoSQL 2 • Hadoop 3 ←Now!
  • 3. 3 © Cloudera, Inc. All rights reserved. • Apache Kudu • Kudu OLTP OLAP HTAP DB #dbts2017 Kudu • BI/DWH DB Kudu Google Spanner https://www.slideshare.net/Cloudera_jp/apache-kududb-dbts2017 • HybridTime DB HybridTime Kudu
  • 4. © Cloudera, Inc. All rights reserved. Apache Kudu
  • 5. 5 © Cloudera, Inc. All rights reserved. • 275 3PB • 1000 PB • / • 1 GB/ • DB • BLOB • • 1000 Kudu 1 ...
  • 6. 6 © Cloudera, Inc. All rights reserved. Kudu Kudu (Impala) (Kudu) (S3)(HDFS) (Impala) (Spark) (Hive) (MapReduce) (ADLS) SQL ( DB ) HMS
  • 7. 7 © Cloudera, Inc. All rights reserved. SQL Kudu Impala + Kudu (Impala) (Kudu) (S3)(HDFS) (Impala) (Spark) (Hive) (MapReduce) (ADLS) HMS • Kudu SQL • Impala SQL
  • 8. 8 © Cloudera, Inc. All rights reserved. • Impala SQL Impala Kudu • Impala Kudu predicate push down • Kudu SCAN Impala aggregation SQL SQL Impala 3 90
  • 9. 9 © Cloudera, Inc. All rights reserved. Spark Kudu Spark + Kudu (Impala) (Kudu) (S3)(HDFS) (Impala) (Spark) (Hive) (MapReduce) (ADLS) HMS • Spark SQL Kudu API • SparkSQL
  • 10. 10 © Cloudera, Inc. All rights reserved. • Kudu 1 Kudu Tablet Kudu TabletServer
  • 11. 11 © Cloudera, Inc. All rights reserved. • 1 3 • 3 • Raft • • Tablet TabletServer
  • 12. 12 © Cloudera, Inc. All rights reserved. • • INSERT/UPDATE/UPSERT/DELETE • DECIMAL • • • Kerberos • • • • Kudu
  • 13. © Cloudera, Inc. All rights reserved. HTAP: OLTP OLAP Kudu
  • 14. 14 © Cloudera, Inc. All rights reserved. • OLTP 1TB RAM • OLTP OLAP • 2 DB DB insert/update/delete OLTP OLAP DWH BI select ETL
  • 15. 15 © Cloudera, Inc. All rights reserved. Hadoop • (PB) HDFS OLAP • Impala/Hive SQL • HDFS OLTP HBase HBase Hadoop DB put / delete OLTP OLAP BIselect HBase ImpalaHadoop data ingestion HDFS ETL
  • 16. 16 © Cloudera, Inc. All rights reserved. • OLTP? OLAP? • • • OLTP OLAP • HTAP(Hybrid Transactional/Analytic Processing) • • ... • • Kudu HTAP OLTP OLAP HTAP HTAP
  • 17. 17 © Cloudera, Inc. All rights reserved. • OLTP OLAP 1 DB • HTAP DB Kudu insert/update/delete HTAP DWH BI select Kudu
  • 18. 18 © Cloudera, Inc. All rights reserved. (HDFS) SQL Impala (Spark Streaming) (Flume) ETL SQL Hive/Spark DB DB (Kudu) IoT (Flume) BI BI BI ETL MQTT BrokerIoT BI DB DB / DB DB (Kafka) ( )
  • 19. © Cloudera, Inc. All rights reserved. DB
  • 20. 20 © Cloudera, Inc. All rights reserved. • DB • • ! f • • 12:30:00 < 12:30:03 • 2 < 3 • Log Sequence Number LSN • LSN • DB • DB LSN • DB DB 12:30:00 12:30:03 2 3 ! f
  • 21. 21 © Cloudera, Inc. All rights reserved. • Physical Clock • • • • • Logical Clock) • • ... • • • DB B A A B
  • 22. 22 © Cloudera, Inc. All rights reserved. • • • • +1 - Lamport Clock 2 3 6 24 16 61 54 69 70 12 24 48423630 8 32 40 48 50 703020
  • 23. 23 © Cloudera, Inc. All rights reserved. • • +1 • • • - Vector Clock 2 3 {1,0,0} {1,1,0} {1,2,0} {1,2,1} {1,2,2} {1,4,2} {1,3,0} {2,3,0} {3,3,0} {3,3,3} {1,5,2} {5,5,4} {5,5,2}{4,5,2}
  • 24. 24 © Cloudera, Inc. All rights reserved. • •
  • 25. 25 © Cloudera, Inc. All rights reserved. • • • 12:30:00 12:29:59 A B B ! f
  • 26. 26 © Cloudera, Inc. All rights reserved. • Spanner: Google’s Globally Distributed Database • DB ACID • GPS • TrueTime API error bound Google Spanner
  • 27. 27 © Cloudera, Inc. All rights reserved. • API • GPS • • TrueTime API TT.now() TTinterval • TT.now() • Google DC 1 7ms 4ms Google Spanner TrueTime API earliest latest TT"#$%&'(): %(&)"%+$, )($%+$ TT,now() --
  • 28. 28 © Cloudera, Inc. All rights reserved. • commit wait • TrueTime API • e f 2" • External Consistency • f e T $ < T & Google Spanner commit-wait $ " & " 2" & $ & 2" 2" & → $
  • 29. © Cloudera, Inc. All rights reserved. Technical Report: HybridTime - Accessible Global Consistency with High Clock Uncertainty
  • 30. 30 © Cloudera, Inc. All rights reserved. • Technical Report: HybridTime - Accessible Global Consistency with High Clock Uncertainty • • Google DC • HybridTime NTP DB • Kudu Kudu HybridTime • • 2014 ( ) Kudu
  • 31. 31 © Cloudera, Inc. All rights reserved. • Google Spanner DC • Amazon Dynamo Cassandra DB Eventual Consistency • • • [ ] DC DB
  • 32. 32 © Cloudera, Inc. All rights reserved. • Consistency • • • CAP Consistency ACID Consistency/Isolation • Consistency • (Anomaly) • Lost Update, Dirty Read, Non-Repeatable, Phantom Read, Read Skew, Write Skew, etc... • • Lost Update SELECT FOR UPDATE Consistency
  • 33. 33 © Cloudera, Inc. All rights reserved. • Lamport Clocks Vector Clocks • • • • RDB Point-in-Time • Vector Clocks [ ]
  • 34. 34 © Cloudera, Inc. All rights reserved. • Spinnaker Paxos • • • Spanner commit-wait • • GPS • [ ]
  • 35. 35 © Cloudera, Inc. All rights reserved. • HybridTime • • • Pint-in-time • Lamport Clock • HybridTime • Vector Clocks Lamport Clocks 2 ( commit-wait ) [ ] HybridTime HTC: { , }
  • 36. 36 © Cloudera, Inc. All rights reserved. • • NTP • NTP • commit-wait • • NTP • commit-wait • • • Kudu DB HybridTime
  • 37. 37 © Cloudera, Inc. All rights reserved. • !"# $ i e • !"'() $ e • *# $ i e • 1: • 2: [ ] HybridTime
  • 38. 38 © Cloudera, Inc. All rights reserved. • HybridTime HTC • (error) • Spanner TrueTime API • HybridTime • NTP HybridTime
  • 39. 39 © Cloudera, Inc. All rights reserved. • ntp_adjtime • timex • maxerror HybridTime
  • 40. 40 © Cloudera, Inc. All rights reserved. • Kudu macOS • macOS OS macOS
  • 41. 41 © Cloudera, Inc. All rights reserved. Kudu
  • 42. 42 © Cloudera, Inc. All rights reserved. • 1 2 • 2 1 • !" − !$ ... • • %$ • & = !" − !$ − %$ NTP 2 1 100*+ 160*+ T1 100ms 160ms 160-100 = 60ms T2 %$ NG
  • 43. 43 © Cloudera, Inc. All rights reserved. • !" • RTT: ! # $ • ! = !" + !$ = '( − '" − ('+ − '$) # $ = !" = -./-0 /(-1/-2) $ • 3 • 3 = '$ − '" − -./-0 / -1/-2 $ • 3 = $ -2/-0 $ − -./-0 / -1/-2 $ • 3 = $-2/$-0/-.4-04-1/-2 $ • 3 = -2/-0/-.4-1 $ • 3 = -2/-0 4 -1/-. $ NTP NTP T2 T3 T4T1 NTP 10078 15078 16078 11078 16578 11578 12578 17578 7078 8078 8578 9578 +1078 +1078 +578 !" !$ 1) 50ms 2) -30ms RTT20ms
  • 44. 44 © Cloudera, Inc. All rights reserved. • • • • ! = #$#%#&& '(#)$%#$*) ) = 41./ • 50ms -9ms • RTT 0 ) • ± 0 ) NTP T2 T3 T4T1 NTP 100./ 101./ 106./ 125./ +1./ +19./ +5./ 8# 8) 150./ 151./ 156./ 175./) 50ms RTT20ms
  • 45. 45 © Cloudera, Inc. All rights reserved. • • 1 • • • • NTP ! • • ! + # $ NTP
  • 46. 46 © Cloudera, Inc. All rights reserved. • NTP • DC NTP or Google Public NTP • AWS Amazon Time Sync Service • Azure Hyper-V time synchronization Google Public NTP • GCE Google Public NTP • NTP
  • 47. 47 © Cloudera, Inc. All rights reserved. • UPDATE • 2 • HybridTime +1 [ ] HybridTime
  • 48. 48 © Cloudera, Inc. All rights reserved. • !" #, % !" HTC #, % [ ] HybridTime
  • 49. 49 © Cloudera, Inc. All rights reserved. • HTC • HTC • ) ! → # • −%& ! < !(()( # < %* # [ ] Kudu HybridTime j i # −%& ! %* #−%& ! ! %& !
  • 50. 50 © Cloudera, Inc. All rights reserved. • KUDU-146 Deal with leap seconds • leap second • Stratum 0 NTP Leap Indicator OS • 23:59:59 -> 23:59:60 -> 00:00:00 • 23:59:59 -> 23:59:59-> 00:00:00 • 2 1 • HybridTime propagate • Kudu commit-wait • wait ms 1 • NTP TIME_INS/TIME_OOP max error • Leap Smearing https://issues.apache.org/jira/browse/KUDU-430
  • 51. 51 © Cloudera, Inc. All rights reserved. • HybridTime • • Kudu RDB • ACID • Commit-wait • NTP • CLIENT_PROPAGETED • HybridTime • HybridTime propagate Kudu
  • 52. 52 © Cloudera, Inc. All rights reserved. • Kudu MVCC Multi-version Concurrency Control • • WAL REDO UNDO • • READ_LATEST • • READ_AT_SNAPSHOT • MVCC • Repeatable Read Kudu
  • 53. 53 © Cloudera, Inc. All rights reserved. Kudu https://blog.cloudera.co.jp/11c3a749a81b
  • 54. 54 © Cloudera, Inc. All rights reserved. • YCSB • • 3 8 • insert 60%, update 20%, single-row read 20% • • GCE: nl-standard-8 x10 • RAM 30GB • Disk 350GB • NTP • GCE [ ] NTP
  • 55. 55 © Cloudera, Inc. All rights reserved. [ ] HybridTime Commit Wait Commit Wait Clock Error
  • 56. © Cloudera, Inc. All rights reserved.
  • 57. 57 © Cloudera, Inc. All rights reserved. • OLTP OLAP 1 DB Kudu • HybridTime DB • HybridTime → P.36 • Kudu HybridTime Serializable • OLAP Kudu #dbts2017 • 11 6 Cloudera World Tokyo 2018 Kudu Kudu
  • 58. 58 © Cloudera, Inc. All rights reserved. Cloudera World Tokyo 2018 http://www.clouderaworldtokyo.com/