SlideShare a Scribd company logo
1 of 227
RAD
How We Replicate Terabytes of Data Around the World Every Day
Jason Koppe
System Administrator
Indeed is the #1
external source of hire
64% of US job searchers search
on indeed each month
Unique Visitors (millions)
Million unique visitors
2009 2011 2012 2013 2014 2015
0
20
40
60
80
100
120
140
160
180
2010
180M
180 million
unique users
80.2M
unique US visitors per month
16M
jobs
50+
countries
28
languages
How We Build Systems
fast simple resilient scalable
fast
Fast
Job Search Browser Rendering
median ~0.5 seconds
Feb 24 Feb 25 Feb 26 Feb 27 Feb 28 Feb 29 Mar 1 Mar 2 Mar 3 Mar 4 Mar 5 Mar 6 Mar 7 Mar 8
0
100
200
300
400
500
600
700
800
milliseconds
simple
2004 launch: a few servers, 1.8m US jobs
2004
Aggregation
MySQL
Job Search
Every job on
the web
relational database,
accessed across the network
NOT fast at full text search
NOT a search engine
2004
Indeed
1999
Lucene
LuceneTM
a high-performance, full featured
text search engine library
LuceneTM
NOT a remote database,
files must be on local disk
MySQL
Database Server Lucene Index Server
Index Builder
/data/jobindex
Index Builder Index Builder Index Builder Index Builder
/data/jobindex /data/jobindex /data/jobindex /data/jobindex
MySQL
MySQL
Database Server Indexer Server
Index Builder
/data/jobindex
Search Engine
/data/jobindex
4 Search Servers
any combination of data, not just lucene
lucene +
model
lucene +
model
bitset
lucene +
model
bitset
lucene +
custom
binary
lucene +
model
bitset
lucene +
custom
binary
json +
csv
MySQL
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
MySQL
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
Artifact
is read-optimized data stored in a directory on the file system
Producer
creates and updates a data artifact
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
MySQL
Consumer
reads a data artifact
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
MySQL
produce once, consume many times
MySQL
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
Benefit: minimize database access
MySQL
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
Benefit: compute artifact once
MySQL
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
Benefit: scale consumers independently
MySQL
Expensive
Index Builder
Producer
Artifact Artifact
Commodity
Search Engine
Benefit: scale consumers independently
MySQL
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
Benefit: separate code deployables
fast resilient scalable
Producer
artifact
Search Engine
Consumers
artifact
Index Builder
Producer
artifact
Search Engine
Consumers
artifact
Index Builder
rsync
efficient point-to-point file transfer utility
1
consumers should
reload data regularly
1
consumers should
reload data regularly
2
roll back
consumers should
reload data regularly
2
roll back
3
data reload should
not interrupt requests
1
artifact versioning
$ ls -d jobindex.*
jobindex.1
jobindex.2
jobindex.3
new directory for new version
$ ls -d jobindex.*
jobindex.1
jobindex.2
jobindex.3
jobindex.latest -> jobindex.3
symlink to know current version
$ ls -d jobindex.*
jobindex.1
jobindex.2
jobindex.3
jobindex.4
jobindex.latest -> jobindex.4
load new data
$ ls -d jobindex.*
jobindex.1
jobindex.2
jobindex.3
jobindex.4
jobindex.latest -> jobindex.3
roll back
each new version takes disk space & time
versions
total bytes on disk
normal disk copy
versions
disk
latency
total bytes on disk
normal disk copy
versions
version
create time
disk
latency
total bytes on disk
normal disk copy
1.8m jobs, change <2% per hour
all jobs
00:00 AM
all jobs
00:00 AM
all jobs
04:00 AM
new jobs
changed jobs
all jobs
00:00 AM
all jobs
04:00 AM
new jobs
changed jobs
unchanged
incremental updates
save disk space & time
share data between versions
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
jobindex.2
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
file4.bin
4GB
jobindex.2
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
file4.bin
4GB
jobindex.2
file1.bin
file2.bin
file3.bin
file4.bin
file5.bin
5GB
jobindex.3
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
file4.bin
4GB
jobindex.2
file1.bin
file2.bin
file3.bin
file4.bin
file5.bin
5GB
jobindex.3
= 12GB+ +
5GB
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
file4.bin
1GB
jobindex.2
file1.bin
file2.bin
file3.bin
file4.bin
file5.bin
1GB
jobindex.3
=+ +
file1.bin
file2.bin
file3.bin
file4.bin
jobindex.2
file1.bin
file2.bin
file3.bin
file5.bin
jobindex.3
deleted
1GB 1GB = 5GB+ 2GB
file4.bin
remove referenced file of symlink, data is gone
hardlink
additional name for an existing file
hardlink != symlink
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
file4.bin
1GB
jobindex.2
file1.bin
file2.bin
file3.bin
file4.bin
file5.bin
1GB
jobindex.3
= 5GB+ +
file1.bin
file2.bin
file3.bin
file4.bin
4GB
jobindex.2
file1.bin
file2.bin
file3.bin
file4.bin
file5.bin
1GB
jobindex.3
= 5GB+
file1.bin
file2.bin
file3.bin
file4.bin
file5.bin
5GB
jobindex.3
= 5GB
remove last hardlink, data is gone
artifact versions: symlinks + hardlinks + rsync
scale: single producer, many consumers
Job Search Browser Rendering
median ~0.5 seconds
Feb 24 Feb 25 Feb 26 Feb 27 Feb 28 Feb 29 Mar 1 Mar 2 Mar 3 Mar 4 Mar 5 Mar 6 Mar 7 Mar 8
0
100
200
300
400
500
600
700
800
milliseconds
fast simple resilient scalable
How We Build Systems
2004
Indeed
1999
Lucene
2008
6 countries
2004
Indeed
1999
Lucene
2008
6 countries
2009
23 countries
2004 2008 200920062005
22.5 M5.2 M 7.1 M4.0 M1.8 M
jobs added or modified each month
2004
Indeed
1999
Lucene
2008
6 countries
2009
23 countries
2nd
datacenter
Producer
Consumers
artifacts
DC1
Staging
Consumers
artifacts
DC2
multi-dc rsync
Staging
Consumers
artifacts
DC3
Producer
Consumers
artifacts
DC1
Staging
Consumers
artifacts
DC2
Staging
Consumers
artifacts
DC3
minimize
Internet
bandwidth
2011
52 countries
4 datacenters
2004
Indeed
1999
Lucene
2008
6 countries
2009
23 countries
2004 2008 200920062005
22.5 M5.2 M 7.1 M4.0 M1.8 M
jobs added or modified each month
2011
32.5 M
rsync system growing pains
Simple: serially copy one artifact at a time
DC1
Producer Artifacts
DC2
Staging Artifacts
Problem: serially can cause delays
Producer
Staging
New
New
New
Old
DC1
DC2
smalllarge2large1
smalllarge2large1
Workaround: copy separately in “streams”
DC1
DC2
Staging
Producer
Simple: point-to-point datacenter rsync paths
DC4
DC3
DC2
DC1
Problem: Internet, why did you do that?
Down
DC4
DC3
DC2
DC1
Workaround: shift replication path
DC4
DC3
DC2
DC1
Scale: few consumers with rsync
Producer
Artifacts Consumers
Consumers
Producer
Grow: many consumers with rsync
Artifacts
Consumers
Consumers
Producer
Problem: too many consumers with rsync
Artifacts
Consumers
network
100%
used
Workaround: add more network bandwidth
Consumers
Producer
Artifacts
Consumers
Workaround: add staging tiers
Consumers
Producer
Artifacts
Staging
Artifacts Artifacts
Staging
Artifacts
Staging
Artifacts
Consumers Consumers Consumers Consumers Consumers Consumers Consumers
Staging
rsync growth required sysad intervention
2011
52 countries
2004
Indeed
1999
Lucene
2008
6 countries
2009
23 countries
2014
rsync growth
100 artifacts, adding +1 producer each month
producing 1,761 TB per month
over 200 consumers, +2 each month
replicating over 21,931 TB per month
staging tiers or network bandwidth, quarterly
modify replication path, monthly
requiring too much intervention from system
administrators
sysad
dev
sysad
dev
+50%
+100%
2014
January December
2011
52 countries
2004
Indeed
1999
Lucene
2008
6 countries
2009
23 countries
2014
rsync limits
Julie Scully
Software Engineer
Jobsearch backend team produces a lot of data
RAD
“Resilient Artifact Distribution”
Design GoalsDesign Goals
Minimize network bottlenecks
Loose coupling
Automatic recovery
Developer empowerment
System-wide visibility
1
2
3
4
5
Design Goals
Minimize network bottlenecks
Loose coupling
Automatic recovery
Developer empowerment
System-wide visibility
3
4
5
1
2
Design Goals
Minimize network bottlenecks
Loose coupling
Automatic recovery
Developer empowerment
System-wide visibility
1
2
5
4
3
Design Goals
Minimize network bottlenecks
Loose coupling
Automatic recovery
Developer empowerment
System-wide visibility
1
2
3
5
4
Design Goals
Minimize network bottlenecks
Loose coupling
Automatic recovery
Developer empowerment
System-wide visibility
1
2
3
4
5
Design GoalsDesign Goals
Minimize network bottlenecks
Loose coupling
Automatic recovery
Developer empowerment
System-wide visibility
1
2
3
4
5
No more point-to-point
Measure time and
network traffic
Bittorrent: Would it work?
Sample replication to
3 consumers
https://github.com/shevek/ttorrent
Network Test
Total MB received + transmitted for 700MB artifact
Producer 2,240
Consumer 1 746
Consumer 2 747
Consumer 3 747
machine RSYNC
Network Test
Total MB received + transmitted for 700MB artifact
Producer 2,240 782
Consumer 1 746 1,226
Consumer 2 747 1,225
Consumer 3 747 1,245
machine BITTORRENTRSYNC
Network Test
Total MB received + transmitted for 700MB artifact
Producer 2,240 782
Consumer 1 746 1,226
Consumer 2 747 1,225
Consumer 3 747 1,245
Total 4,481 4,480
machine BITTORRENTRSYNC
24 minutes
rsync
5.5 minutes
bittorrent
Timing Test
How does bittorrent work?
Data split into small pieces of equal size
Hash computed for each piece
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
Piece 1: 75 MB
Piece 2: 75 MB
Piece 3: 75 MB
Piece 4: 75 MB
Piece 5: 25 MB
torrent metadata file
{ files:file1.bin,100MB;
file2.bin,200MB;
file3.bin,50MB }
{ piecelength:75MB }
{
infohash:XSDJSK;JDISJLD;DJKJDB;KDJB
OP;FJEIODK; }
.torrent metadata file:
{ files:file1.bin,100MB;
file2.bin,200MB;
file3.bin,50MB }
{ piecelength:75MB }
{
infohash:XSDJSK;JDISJLD;DJKJDB;KDJB
OP;FJEIODK; }
.torrent metadata file:
{ files:file1.bin,100MB;
file2.bin,200MB;
file3.bin,50MB }
{ piecelength:75MB }
{
infohash:XSDJSK;JDISJLD;DJKJDB;KDJB
OP;FJEIODK; }
.torrent metadata file:
Tracker
Coordinator of the download
Seeder
Any client providing data
Seeder
Data
I have pieces for info hash
Tracker
.torrent
Info Hash
File manifest
Data .torrent
Info Hash
File manifest
Seeder Tracker
Info hash peer
Map
Ok!
I have pieces for info hash
Consumer
Any client downloading data
Peers for infohash
Consumer Tracker
.torrent
Info Hash
File manifest
Tracker URL
Map
Info hash peer
How a consumer gets the first piece
Peers for infohash
Peerlist
Consumer Tracker
.torrent
Info Hash
File manifest
Tracker URL
Map
Info hash peer
How a consumer gets the first piece
Data .torrent
Info Hash
File manifest
Consumer/
Seeder
I have pieces for infohash
Tracker
Info hash peer
Map
It is also a seeder
Consumer 1
Seeding as it downloads
Consumer 2
Seeding as it downloads
Consumer 3
Seeding as it downloads
Seeder
SWARM
Didn’t quite meet our needs
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH5
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
jobindex.2
File4.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
jobindex.2
File4.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH6
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH5
Piece 6: HASH7
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
jobindex.2
File4.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH6
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH5
Piece 6: HASH7
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1 jobindex.2
File4.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.2
File0.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
Piece 1: HASH6
Piece 2: HASH7
Piece 3: HASH8
Piece 4: HASH9
Piece 5: HASH10
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH5
Piece 6: HASH11
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1 jobindex.2
File4.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.2
File0.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
Control sort order?
jobindex.2
File3.bin
(50MB)
File1.bin
(150MB)
File2.bin
(200MB)
Piece 1: HASH6
Piece 2: HASH7
Piece 3: HASH8
Piece 4: HASH9
Piece 5: HASH10
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH5
Piece 6: HASH11
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
Piece 1: HASH6
Piece 2: HASH7
Piece 3: HASH8
Piece 4: HASH9
Piece 5: HASH10
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH5
Piece 6: HASH11
File3.bin
(50MB)
File1.bin
(150MB)
File2.bin
(200MB)
jobindex.2
hash each file?
Compare files not pieces
{ files:file1.bin,100MB,DATETIME;
file2.bin,200MB,DATETIME;
file3.bin,50MB,DATETIME }
{ piecelength:75MB }
...
.torrent metadata file contents:
File1.bin
(100MB)
File2.bin
(200MB)
File3.bin
(50MB)
jobindex.1
Piece 1: File 0, File1
Piece 2: File 1
Piece 3: File 1, File 2
Piece 4: File 2
Piece 5: File 2, File 3
Piece 6: File 3
File1.bin
(100MB)
File2.bin
(200MB)
File3.bin
(50MB)
jobindex.2
File0.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
File3.bin
(50MB)
jobindex.1
File1.bin
(100MB)
File2.bin
(200MB)
File3.bin
(50MB)
jobindex.2
File0.bin
(50MB) Piece 1: File 0, File1
Piece 2: File 1
Piece 3: File 1, File 2
Piece 4: File 2
Piece 5: File 2, File 3
Piece 6: File 3
Bittorrent Evaluation Result
substantially faster drastically reduces
network load on the
producer machine
horizontally scalable
Design GoalsDesign Goals
Automatic recovery
Developer empowerment
System-wide visibility
3
4
5
Loose coupling2
Minimize network bottlenecks1
Service-oriented architecture
Headwater
The beginning of a river
Headwater
Host
Data
Producer Data
Publish
my data
Headwater takes ownership of the data
(hardlink + read-only)
Headwater
Host
Data
Producer Data
Publish
my data
Will do!
Headwater
Host
Data
Producer Data
create the .torrent metadata file
Headwater
The beginning of a river
River
Course the water carves
across the landscape
Rhone
RhoneRhone
Zookeeper
Rhone: multi-master coordinator service
Rhone
Headwater
Host
Data
Producer Data
Rhone
Headwater
Host
Data
Producer Datadata.version
torrent metadata
Rhone
Headwater
Host
Data
Producer Datadata.version
torrent metadata
Rhone
Headwater
Host
Data
Producer Data
Tracker
.torrent metadata
can be retrieved
data.version
torrent metadata
Headwater
The beginning of a river
River
Course the water carves
across the landscape
Delta
The end of the river
Subscribe
to data!
Delta
Host
Data
Consumer
Make all subscribed artifacts available
RhoneDelta
Host
Data
Consumer
Headwater
Host
Data
Producer Data
Delta
Data
Consumer
Rhone
Host
Tracker
Delta
Host
Data
ConsumerData
/rad/data
Delta
Host
Data
ConsumerData
Where’s
the latest
data?
/rad/data
It’s at
/rad/data
Delta
Host
Data
ConsumerData
Where’s the
latest data?
/rad/data
Delta
Host
Data
ConsumerData
/rad/data
Keep all subscribed artifacts current
Delta
Data
Consumer
Rhone
Host
Rhone
Data
Host
Artifact Availability Flow
Delta Headwater
Host
Data
Consumer
Data
Producer Data
Design GoalsDesign Goals
Automatic recovery
Developer empowerment
System-wide visibility
4
5
Minimize network bottlenecks1
Loose coupling2
3
Rhone
Headwater
Host
Data
Producer Data
Crash!
Rhone
Headwater
Data
Producer Datadata.version
torrent metadata
Tracker
Crash!
Host
Development philosophy:
Make recovery the common case
Durable state with atomic filesystem operations
All service calls are idempotent
RAD handles network recovery
DC4
DC3
DC2
DC1
rsync is point-to-point
DC1
DC4
DC3
DC2
bittorrent peer-to-peer
Down
DC1
DC4
DC3
DC2
No problem with bittorrent swarm
RAD treats artifact independently
Design GoalsDesign Goals
Developer empowerment
System-wide visibility5
Minimize network bottlenecks1
Loose coupling2
Automatic recovery3
4
Adding a new artifact in the rsync system
Ask System Administrators
Adding a new artifact in the RAD system
Declare it in the code
REST API is language agnostic
Design GoalsDesign Goals
System-wide visibility
Minimize network bottlenecks1
Loose coupling2
Automatic recovery3
Developer empowerment4
5
Rhone already knows all artifacts
Rhone stores list of versions by artifact.
version 4
version 5
version 6
artifactA
version 221
version 226
version 227
version 228
artifactB
version 1artifactC
Heartbeats from Delta and Headwater
Rhone has system-wide view
RADAR: Developers can easily see where their data is
RADAR: Developers can easily see where their data is
RADAR: Developers can easily see where their data is
RADAR: Developers can easily see where their data is
start simple and iterate
2011
52 countries
2004
Indeed
2008
6 countries
2009
23 countries
2014
rsync limits
1st artifact
migrated to RAD
Lesson learned: prevent people from
using the system incorrectly
We made configuration TOO easy
New Requirement: protect the disks
Delta
Prevent downloading artifacts that will fill the disk (and alarm)
2011
52 countries
2004
Indeed
2008
6 countries
2009
23 countries
2014
rsync limits
1st artifact
migrated to RAD
2015
critical artifacts
migrated
2011
52 countries
2004
Indeed
2008
6 countries
2009
23 countries
2014
rsync limits
1st artifact
migrated to RAD
2015
critical artifacts
migrated
2016
80 RAD
artifacts
2011
52 countries
2004
Indeed
2008
6 countries
2009
23 countries
2014
rsync limits
1st artifact
migrated to RAD
2015
critical artifacts
migrated
2016
80 RAD
artifacts
100 artifacts in 10 years
100 artifacts in 10 years
2011
52 countries
2004
Indeed
2008
6 countries
2009
23 countries
2014
rsync limits
1st artifact
migrated to RAD
2015
critical artifacts
migrated
2016
80 RAD
artifacts
80 new
artifacts
in 1 year
7,666
versions published
Producer
Consumer
56
unique producers
52,357
versions downloaded
670
unique consumers
RAD Stats
March 23, 2016
Duration of JobIndex replication in RAD v. Rsync
Jan 18 6 AM 12 PM 6 PM Jan 19 6 AM 12 PM 6 PM
1,000
2,000
3,000
RAD rsync
time
replicating over 65,193 TB per month
Learn More
Engineering blog & talks http://indeed.tech
Open Source http://opensource.indeedeng.io
Careers http://indeed.jobs
Twitter @IndeedEng

More Related Content

What's hot

(BDT403) Best Practices for Building Real-time Streaming Applications with Am...
(BDT403) Best Practices for Building Real-time Streaming Applications with Am...(BDT403) Best Practices for Building Real-time Streaming Applications with Am...
(BDT403) Best Practices for Building Real-time Streaming Applications with Am...Amazon Web Services
 
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...confluent
 
Synchronous Commands over Apache Kafka (Neil Buesing, Object Partners, Inc) K...
Synchronous Commands over Apache Kafka (Neil Buesing, Object Partners, Inc) K...Synchronous Commands over Apache Kafka (Neil Buesing, Object Partners, Inc) K...
Synchronous Commands over Apache Kafka (Neil Buesing, Object Partners, Inc) K...confluent
 
HDFSネームノードのHAについて #hcj13w
HDFSネームノードのHAについて #hcj13wHDFSネームノードのHAについて #hcj13w
HDFSネームノードのHAについて #hcj13wCloudera Japan
 
Advanced Streaming Analytics with Apache Flink and Apache Kafka, Stephan Ewen
Advanced Streaming Analytics with Apache Flink and Apache Kafka, Stephan EwenAdvanced Streaming Analytics with Apache Flink and Apache Kafka, Stephan Ewen
Advanced Streaming Analytics with Apache Flink and Apache Kafka, Stephan Ewenconfluent
 
카프카 기반의 대규모 모니터링 플랫폼 개발이야기
카프카 기반의 대규모 모니터링 플랫폼 개발이야기카프카 기반의 대규모 모니터링 플랫폼 개발이야기
카프카 기반의 대규모 모니터링 플랫폼 개발이야기if kakao
 
Iceberg + Alluxio for Fast Data Analytics
Iceberg + Alluxio for Fast Data AnalyticsIceberg + Alluxio for Fast Data Analytics
Iceberg + Alluxio for Fast Data AnalyticsAlluxio, Inc.
 
講演資料: コスト最適なプライベートCDNを「NGINX」で実現するWeb最適化セミナー
講演資料: コスト最適なプライベートCDNを「NGINX」で実現するWeb最適化セミナー講演資料: コスト最適なプライベートCDNを「NGINX」で実現するWeb最適化セミナー
講演資料: コスト最適なプライベートCDNを「NGINX」で実現するWeb最適化セミナーNGINX, Inc.
 
Presto on YARNの導入・運用
Presto on YARNの導入・運用Presto on YARNの導入・運用
Presto on YARNの導入・運用cyberagent
 
Step-by-Step Introduction to Apache Flink
Step-by-Step Introduction to Apache Flink Step-by-Step Introduction to Apache Flink
Step-by-Step Introduction to Apache Flink Slim Baltagi
 
地理分散DBについて
地理分散DBについて地理分散DBについて
地理分散DBについてKumazaki Hiroki
 
Apache kafka 모니터링을 위한 Metrics 이해 및 최적화 방안
Apache kafka 모니터링을 위한 Metrics 이해 및 최적화 방안Apache kafka 모니터링을 위한 Metrics 이해 및 최적화 방안
Apache kafka 모니터링을 위한 Metrics 이해 및 최적화 방안SANG WON PARK
 
ksqlDB: A Stream-Relational Database System
ksqlDB: A Stream-Relational Database SystemksqlDB: A Stream-Relational Database System
ksqlDB: A Stream-Relational Database Systemconfluent
 
Apache Kafka 0.11 の Exactly Once Semantics
Apache Kafka 0.11 の Exactly Once SemanticsApache Kafka 0.11 の Exactly Once Semantics
Apache Kafka 0.11 の Exactly Once SemanticsYoshiyasu SAEKI
 
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Using Apache Arrow, Calcite, and Parquet to Build a Relational CacheUsing Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Using Apache Arrow, Calcite, and Parquet to Build a Relational CacheDremio Corporation
 
Apache Hadoop YARNとマルチテナントにおけるリソース管理
Apache Hadoop YARNとマルチテナントにおけるリソース管理Apache Hadoop YARNとマルチテナントにおけるリソース管理
Apache Hadoop YARNとマルチテナントにおけるリソース管理Cloudera Japan
 

What's hot (20)

Apache Helix presentation at Vmware
Apache Helix presentation at VmwareApache Helix presentation at Vmware
Apache Helix presentation at Vmware
 
(BDT403) Best Practices for Building Real-time Streaming Applications with Am...
(BDT403) Best Practices for Building Real-time Streaming Applications with Am...(BDT403) Best Practices for Building Real-time Streaming Applications with Am...
(BDT403) Best Practices for Building Real-time Streaming Applications with Am...
 
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
 
Synchronous Commands over Apache Kafka (Neil Buesing, Object Partners, Inc) K...
Synchronous Commands over Apache Kafka (Neil Buesing, Object Partners, Inc) K...Synchronous Commands over Apache Kafka (Neil Buesing, Object Partners, Inc) K...
Synchronous Commands over Apache Kafka (Neil Buesing, Object Partners, Inc) K...
 
HDFSネームノードのHAについて #hcj13w
HDFSネームノードのHAについて #hcj13wHDFSネームノードのHAについて #hcj13w
HDFSネームノードのHAについて #hcj13w
 
Advanced Streaming Analytics with Apache Flink and Apache Kafka, Stephan Ewen
Advanced Streaming Analytics with Apache Flink and Apache Kafka, Stephan EwenAdvanced Streaming Analytics with Apache Flink and Apache Kafka, Stephan Ewen
Advanced Streaming Analytics with Apache Flink and Apache Kafka, Stephan Ewen
 
카프카 기반의 대규모 모니터링 플랫폼 개발이야기
카프카 기반의 대규모 모니터링 플랫폼 개발이야기카프카 기반의 대규모 모니터링 플랫폼 개발이야기
카프카 기반의 대규모 모니터링 플랫폼 개발이야기
 
Iceberg + Alluxio for Fast Data Analytics
Iceberg + Alluxio for Fast Data AnalyticsIceberg + Alluxio for Fast Data Analytics
Iceberg + Alluxio for Fast Data Analytics
 
講演資料: コスト最適なプライベートCDNを「NGINX」で実現するWeb最適化セミナー
講演資料: コスト最適なプライベートCDNを「NGINX」で実現するWeb最適化セミナー講演資料: コスト最適なプライベートCDNを「NGINX」で実現するWeb最適化セミナー
講演資料: コスト最適なプライベートCDNを「NGINX」で実現するWeb最適化セミナー
 
Presto on YARNの導入・運用
Presto on YARNの導入・運用Presto on YARNの導入・運用
Presto on YARNの導入・運用
 
Step-by-Step Introduction to Apache Flink
Step-by-Step Introduction to Apache Flink Step-by-Step Introduction to Apache Flink
Step-by-Step Introduction to Apache Flink
 
地理分散DBについて
地理分散DBについて地理分散DBについて
地理分散DBについて
 
Apache kafka 모니터링을 위한 Metrics 이해 및 최적화 방안
Apache kafka 모니터링을 위한 Metrics 이해 및 최적화 방안Apache kafka 모니터링을 위한 Metrics 이해 및 최적화 방안
Apache kafka 모니터링을 위한 Metrics 이해 및 최적화 방안
 
ksqlDB: A Stream-Relational Database System
ksqlDB: A Stream-Relational Database SystemksqlDB: A Stream-Relational Database System
ksqlDB: A Stream-Relational Database System
 
Apache Kafka 0.11 の Exactly Once Semantics
Apache Kafka 0.11 の Exactly Once SemanticsApache Kafka 0.11 の Exactly Once Semantics
Apache Kafka 0.11 の Exactly Once Semantics
 
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Using Apache Arrow, Calcite, and Parquet to Build a Relational CacheUsing Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
 
The Impala Cookbook
The Impala CookbookThe Impala Cookbook
The Impala Cookbook
 
Hiveを高速化するLLAP
Hiveを高速化するLLAPHiveを高速化するLLAP
Hiveを高速化するLLAP
 
Fast analytics kudu to druid
Fast analytics  kudu to druidFast analytics  kudu to druid
Fast analytics kudu to druid
 
Apache Hadoop YARNとマルチテナントにおけるリソース管理
Apache Hadoop YARNとマルチテナントにおけるリソース管理Apache Hadoop YARNとマルチテナントにおけるリソース管理
Apache Hadoop YARNとマルチテナントにおけるリソース管理
 

Similar to @Indeedeng: RAD - How We Replicate Terabytes of Data Around the World Every Day

How companies use NoSQL & Couchbase - NoSQL Now 2014
How companies use NoSQL & Couchbase - NoSQL Now 2014How companies use NoSQL & Couchbase - NoSQL Now 2014
How companies use NoSQL & Couchbase - NoSQL Now 2014Dipti Borkar
 
Anton Lytunenko "Data Lake. Make data pleasant to swim in"
Anton Lytunenko "Data Lake. Make data pleasant to swim in"Anton Lytunenko "Data Lake. Make data pleasant to swim in"
Anton Lytunenko "Data Lake. Make data pleasant to swim in"Lviv Startup Club
 
A Call for Sanity in NoSQL
A Call for Sanity in NoSQLA Call for Sanity in NoSQL
A Call for Sanity in NoSQLC4Media
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist SoftServe
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon RedshiftAmazon Web Services
 
Gruter TECHDAY 2014 Realtime Processing in Telco
Gruter TECHDAY 2014 Realtime Processing in TelcoGruter TECHDAY 2014 Realtime Processing in Telco
Gruter TECHDAY 2014 Realtime Processing in TelcoGruter
 
Move a successful onpremise oltp application to the cloud
Move a successful onpremise oltp application to the cloudMove a successful onpremise oltp application to the cloud
Move a successful onpremise oltp application to the cloudIke Ellis
 
Innovation dank DevOps (DevOpsCon Berlin 2015)
Innovation dank DevOps (DevOpsCon Berlin 2015)Innovation dank DevOps (DevOpsCon Berlin 2015)
Innovation dank DevOps (DevOpsCon Berlin 2015)Wooga
 
Data as a Service
Data as a Service Data as a Service
Data as a Service Kyle Hailey
 
Softshake 2013: Introduction to NoSQL with Couchbase
Softshake 2013: Introduction to NoSQL with CouchbaseSoftshake 2013: Introduction to NoSQL with Couchbase
Softshake 2013: Introduction to NoSQL with CouchbaseTugdual Grall
 
Immutable Infrastructure: Rise of the Machine Images
Immutable Infrastructure: Rise of the Machine ImagesImmutable Infrastructure: Rise of the Machine Images
Immutable Infrastructure: Rise of the Machine ImagesC4Media
 
Puppetcamp Melbourne - puppetdb
Puppetcamp Melbourne - puppetdbPuppetcamp Melbourne - puppetdb
Puppetcamp Melbourne - puppetdbm_richardson
 
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDBPuppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDBPuppet
 
Elasticsearch : petit déjeuner du 13 mars 2014
Elasticsearch : petit déjeuner du 13 mars 2014Elasticsearch : petit déjeuner du 13 mars 2014
Elasticsearch : petit déjeuner du 13 mars 2014ALTER WAY
 
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB Puppet
 
Ensuring Quality in Data Lakes (D&D Meetup Feb 22)
Ensuring Quality in Data Lakes  (D&D Meetup Feb 22)Ensuring Quality in Data Lakes  (D&D Meetup Feb 22)
Ensuring Quality in Data Lakes (D&D Meetup Feb 22)lakeFS
 
Patterns of Streaming Applications
Patterns of Streaming ApplicationsPatterns of Streaming Applications
Patterns of Streaming ApplicationsC4Media
 
Scaling Hadoop at LinkedIn
Scaling Hadoop at LinkedInScaling Hadoop at LinkedIn
Scaling Hadoop at LinkedInDataWorks Summit
 
Scaling Slack - The Good, the Unexpected, and the Road Ahead
Scaling Slack - The Good, the Unexpected, and the Road AheadScaling Slack - The Good, the Unexpected, and the Road Ahead
Scaling Slack - The Good, the Unexpected, and the Road AheadC4Media
 

Similar to @Indeedeng: RAD - How We Replicate Terabytes of Data Around the World Every Day (20)

How companies use NoSQL & Couchbase - NoSQL Now 2014
How companies use NoSQL & Couchbase - NoSQL Now 2014How companies use NoSQL & Couchbase - NoSQL Now 2014
How companies use NoSQL & Couchbase - NoSQL Now 2014
 
Anton Lytunenko "Data Lake. Make data pleasant to swim in"
Anton Lytunenko "Data Lake. Make data pleasant to swim in"Anton Lytunenko "Data Lake. Make data pleasant to swim in"
Anton Lytunenko "Data Lake. Make data pleasant to swim in"
 
A Call for Sanity in NoSQL
A Call for Sanity in NoSQLA Call for Sanity in NoSQL
A Call for Sanity in NoSQL
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Gruter TECHDAY 2014 Realtime Processing in Telco
Gruter TECHDAY 2014 Realtime Processing in TelcoGruter TECHDAY 2014 Realtime Processing in Telco
Gruter TECHDAY 2014 Realtime Processing in Telco
 
WSDM09-keynote
WSDM09-keynoteWSDM09-keynote
WSDM09-keynote
 
Move a successful onpremise oltp application to the cloud
Move a successful onpremise oltp application to the cloudMove a successful onpremise oltp application to the cloud
Move a successful onpremise oltp application to the cloud
 
Innovation dank DevOps (DevOpsCon Berlin 2015)
Innovation dank DevOps (DevOpsCon Berlin 2015)Innovation dank DevOps (DevOpsCon Berlin 2015)
Innovation dank DevOps (DevOpsCon Berlin 2015)
 
Data as a Service
Data as a Service Data as a Service
Data as a Service
 
Softshake 2013: Introduction to NoSQL with Couchbase
Softshake 2013: Introduction to NoSQL with CouchbaseSoftshake 2013: Introduction to NoSQL with Couchbase
Softshake 2013: Introduction to NoSQL with Couchbase
 
Immutable Infrastructure: Rise of the Machine Images
Immutable Infrastructure: Rise of the Machine ImagesImmutable Infrastructure: Rise of the Machine Images
Immutable Infrastructure: Rise of the Machine Images
 
Puppetcamp Melbourne - puppetdb
Puppetcamp Melbourne - puppetdbPuppetcamp Melbourne - puppetdb
Puppetcamp Melbourne - puppetdb
 
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDBPuppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
 
Elasticsearch : petit déjeuner du 13 mars 2014
Elasticsearch : petit déjeuner du 13 mars 2014Elasticsearch : petit déjeuner du 13 mars 2014
Elasticsearch : petit déjeuner du 13 mars 2014
 
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
 
Ensuring Quality in Data Lakes (D&D Meetup Feb 22)
Ensuring Quality in Data Lakes  (D&D Meetup Feb 22)Ensuring Quality in Data Lakes  (D&D Meetup Feb 22)
Ensuring Quality in Data Lakes (D&D Meetup Feb 22)
 
Patterns of Streaming Applications
Patterns of Streaming ApplicationsPatterns of Streaming Applications
Patterns of Streaming Applications
 
Scaling Hadoop at LinkedIn
Scaling Hadoop at LinkedInScaling Hadoop at LinkedIn
Scaling Hadoop at LinkedIn
 
Scaling Slack - The Good, the Unexpected, and the Road Ahead
Scaling Slack - The Good, the Unexpected, and the Road AheadScaling Slack - The Good, the Unexpected, and the Road Ahead
Scaling Slack - The Good, the Unexpected, and the Road Ahead
 

More from indeedeng

Weapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
Weapons of Math Instruction: Evolving from Data0-Driven to Science-DrivenWeapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
Weapons of Math Instruction: Evolving from Data0-Driven to Science-Drivenindeedeng
 
Alchemy and Science: Choosing Metrics That Work
Alchemy and Science: Choosing Metrics That WorkAlchemy and Science: Choosing Metrics That Work
Alchemy and Science: Choosing Metrics That Workindeedeng
 
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...indeedeng
 
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...indeedeng
 
Improving the development process with metrics driven insights presentation
Improving the development process with metrics driven insights presentationImproving the development process with metrics driven insights presentation
Improving the development process with metrics driven insights presentationindeedeng
 
Data-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision Making
Data-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision MakingData-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision Making
Data-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision Makingindeedeng
 
Automation and Developer Infrastructure — Empowering Engineers to Move from I...
Automation and Developer Infrastructure — Empowering Engineers to Move from I...Automation and Developer Infrastructure — Empowering Engineers to Move from I...
Automation and Developer Infrastructure — Empowering Engineers to Move from I...indeedeng
 
Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)
Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)
Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)indeedeng
 
Data Day Texas - Recommendations
Data Day Texas - RecommendationsData Day Texas - Recommendations
Data Day Texas - Recommendationsindeedeng
 
Vectorized VByte Decoding
Vectorized VByte DecodingVectorized VByte Decoding
Vectorized VByte Decodingindeedeng
 
[@IndeedEng] Imhotep Workshop
[@IndeedEng] Imhotep Workshop[@IndeedEng] Imhotep Workshop
[@IndeedEng] Imhotep Workshopindeedeng
 
@IndeedEng: Tokens and Millicents - technical challenges in launching Indeed...
@IndeedEng:  Tokens and Millicents - technical challenges in launching Indeed...@IndeedEng:  Tokens and Millicents - technical challenges in launching Indeed...
@IndeedEng: Tokens and Millicents - technical challenges in launching Indeed...indeedeng
 
[@IndeedEng] Large scale interactive analytics with Imhotep
[@IndeedEng] Large scale interactive analytics with Imhotep[@IndeedEng] Large scale interactive analytics with Imhotep
[@IndeedEng] Large scale interactive analytics with Imhotepindeedeng
 
[@IndeedEng] Logrepo: Enabling Data-Driven Decisions
[@IndeedEng] Logrepo: Enabling Data-Driven Decisions[@IndeedEng] Logrepo: Enabling Data-Driven Decisions
[@IndeedEng] Logrepo: Enabling Data-Driven Decisionsindeedeng
 
[@IndeedEng] Boxcar: A self-balancing distributed services protocol
[@IndeedEng] Boxcar: A self-balancing distributed services protocol [@IndeedEng] Boxcar: A self-balancing distributed services protocol
[@IndeedEng] Boxcar: A self-balancing distributed services protocol indeedeng
 
[@IndeedEng Talk] Diving deeper into data-driven product design
[@IndeedEng Talk] Diving deeper into data-driven product design[@IndeedEng Talk] Diving deeper into data-driven product design
[@IndeedEng Talk] Diving deeper into data-driven product designindeedeng
 
[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor
[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor
[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctorindeedeng
 
[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...
[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...
[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...indeedeng
 
[@IndeedEng] Redundant Array of Inexpensive Datacenters
[@IndeedEng] Redundant Array of Inexpensive Datacenters[@IndeedEng] Redundant Array of Inexpensive Datacenters
[@IndeedEng] Redundant Array of Inexpensive Datacentersindeedeng
 
[@IndeedEng] Building Indeed Resume Search
[@IndeedEng] Building Indeed Resume Search[@IndeedEng] Building Indeed Resume Search
[@IndeedEng] Building Indeed Resume Searchindeedeng
 

More from indeedeng (20)

Weapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
Weapons of Math Instruction: Evolving from Data0-Driven to Science-DrivenWeapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
Weapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
 
Alchemy and Science: Choosing Metrics That Work
Alchemy and Science: Choosing Metrics That WorkAlchemy and Science: Choosing Metrics That Work
Alchemy and Science: Choosing Metrics That Work
 
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
 
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
 
Improving the development process with metrics driven insights presentation
Improving the development process with metrics driven insights presentationImproving the development process with metrics driven insights presentation
Improving the development process with metrics driven insights presentation
 
Data-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision Making
Data-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision MakingData-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision Making
Data-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision Making
 
Automation and Developer Infrastructure — Empowering Engineers to Move from I...
Automation and Developer Infrastructure — Empowering Engineers to Move from I...Automation and Developer Infrastructure — Empowering Engineers to Move from I...
Automation and Developer Infrastructure — Empowering Engineers to Move from I...
 
Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)
Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)
Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)
 
Data Day Texas - Recommendations
Data Day Texas - RecommendationsData Day Texas - Recommendations
Data Day Texas - Recommendations
 
Vectorized VByte Decoding
Vectorized VByte DecodingVectorized VByte Decoding
Vectorized VByte Decoding
 
[@IndeedEng] Imhotep Workshop
[@IndeedEng] Imhotep Workshop[@IndeedEng] Imhotep Workshop
[@IndeedEng] Imhotep Workshop
 
@IndeedEng: Tokens and Millicents - technical challenges in launching Indeed...
@IndeedEng:  Tokens and Millicents - technical challenges in launching Indeed...@IndeedEng:  Tokens and Millicents - technical challenges in launching Indeed...
@IndeedEng: Tokens and Millicents - technical challenges in launching Indeed...
 
[@IndeedEng] Large scale interactive analytics with Imhotep
[@IndeedEng] Large scale interactive analytics with Imhotep[@IndeedEng] Large scale interactive analytics with Imhotep
[@IndeedEng] Large scale interactive analytics with Imhotep
 
[@IndeedEng] Logrepo: Enabling Data-Driven Decisions
[@IndeedEng] Logrepo: Enabling Data-Driven Decisions[@IndeedEng] Logrepo: Enabling Data-Driven Decisions
[@IndeedEng] Logrepo: Enabling Data-Driven Decisions
 
[@IndeedEng] Boxcar: A self-balancing distributed services protocol
[@IndeedEng] Boxcar: A self-balancing distributed services protocol [@IndeedEng] Boxcar: A self-balancing distributed services protocol
[@IndeedEng] Boxcar: A self-balancing distributed services protocol
 
[@IndeedEng Talk] Diving deeper into data-driven product design
[@IndeedEng Talk] Diving deeper into data-driven product design[@IndeedEng Talk] Diving deeper into data-driven product design
[@IndeedEng Talk] Diving deeper into data-driven product design
 
[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor
[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor
[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor
 
[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...
[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...
[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...
 
[@IndeedEng] Redundant Array of Inexpensive Datacenters
[@IndeedEng] Redundant Array of Inexpensive Datacenters[@IndeedEng] Redundant Array of Inexpensive Datacenters
[@IndeedEng] Redundant Array of Inexpensive Datacenters
 
[@IndeedEng] Building Indeed Resume Search
[@IndeedEng] Building Indeed Resume Search[@IndeedEng] Building Indeed Resume Search
[@IndeedEng] Building Indeed Resume Search
 

Recently uploaded

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 

Recently uploaded (20)

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 

@Indeedeng: RAD - How We Replicate Terabytes of Data Around the World Every Day