SlideShare uma empresa Scribd logo
1 de 51
1
Kai Waehner | Technology Evangelist, Confluent
contact@kai-waehner.de | LinkedIn | @KaiWaehner | www.confluent.io | www.kai-waehner.de
Architecture patterns for
distributed, hybrid, edge and global
Apache Kafka deployments
2Abstract
Architecture patterns for distributed, hybrid, edge and global Apache Kafka deployments
Multi-cluster and cross-data center deployments of Apache Kafka have become the norm rather than an exception. This
session gives an overview of several scenarios that may require multi-cluster solutions and discusses real-world examples
with their specific requirements and trade-offs, including disaster recovery, aggregation for analytics, cloud migration,
mission-critical stretched deployments and global Kafka.
Key takeaways:
• In many scenarios, one Kafka cluster is not enough. Understand different architectures and alternatives for multi-cluster
deployments.
• Zero data loss and high availability are two key requirements. Understand how to realize this, including trade-offs.
• Learn about features and limitations of Kafka for multi cluster deployments
• Global Kafka and mission-critical multi-cluster deployments with zero data loss and high availability became the normal,
not an exception.
www.kai-waehner.de | @KaiWaehner
3
Agenda
1) Definition ‘Kafka Cluster’
2) One Kafka Cluster
3) Multiple Kafka Clusters
4) Multi-Region / Global Kafka Cluster
5) Infrastructure Options
www.kai-waehner.de | @KaiWaehner
4
Agenda
1) Definition ‘Kafka Cluster’
2) One Kafka Cluster
3) Multiple Kafka Clusters
4) Multi-Region / Global Kafka Cluster
5) Infrastructure Options
www.kai-waehner.de | @KaiWaehner
5
www.kai-waehner.de | @KaiWaehner
6
The Beginning of a New Era
www.kai-waehner.de | @KaiWaehner
https://engineering.linkedin.com/distributed-systems/log-what-every-software-engineer-should-know-about-real-time-datas-unifying
The first use case: Log Analytics. This is why Kafka was created!
7
Event Streaming Platform –
The Commit Log
www.kai-waehner.de | @KaiWaehner
Time
P
C1 C2
C3
8
Event Streaming Platform –
A Distributed System
www.kai-waehner.de | @KaiWaehner
Broker 1
Topic1
partition1
Broker 2 Broker 3 Broker 4
Topic1
partition1
Topic1
partition1
Leader Follower
Topic1
partition2
Topic1
partition2
Topic1
partition2
Topic1
partition3
Topic1
partition4
Topic1
partition3
Topic1
partition3
Topic1
partition4
Topic1
partition4
9Apache Kafka (kafka.apache.org) includes
Kafka Connect and Kafka Streams
www.kai-waehner.de | @KaiWaehner
Kafka Streams
Your
app
sinksource
KafkaConnect
KafkaConnect
Kafka Cluster
10A Streaming Platform
is the Underpinning of an Event-driven Architecture
www.kai-waehner.de | @KaiWaehner
Microservices
DBs
SaaS apps
Customer 360
Real-time fraud
detection
Data warehouse
Producers
Consumers
Database
change
Microservices
events
SaaS
data
Customer
experiences
Streams of real time events
Stream processing apps
Connectors
Connectors
Stream processing apps
11
Apache Kafka at Scale at Tech Giants
www.kai-waehner.de | @KaiWaehner
> 7 trillion messages / day > 6 Petabytes / day
“You name it”
* Kafka Is not just used by tech giants
** Kafka is not just used for big data
12
www.kai-waehner.de | @KaiWaehner
Improve
Customer
Experience
(CX)
Increase
Revenue
(make money)
Business
Value
Decrease
Costs
(save
money)
Core Business
Platform
Increase
Operational
Efficiency
Migrate to
Cloud
Mitigate Risk
(protect money)
Key Drivers
Strategic Objectives
(sample)
Fraud
Detection
IoT sensor
ingestion
Digital
replatforming/
Mainframe Offload
Connected Car: Navigation & improved
in-car experience: Audi
Customer 360
Simplifying Omni-channel Retail at
Scale: Target
Faster transactional
processing / analysis
incl. Machine Learning / AI
Mainframe Offload: RBC
Microservices
Architecture
Online Fraud Detection
Online Security
(syslog, log aggregation,
Splunk replacement)
Middleware
replacement
Regulatory
Digital
Transformation
Application Modernization: Multiple
Examples
Website / Core
Operations
(Central Nervous System)
The [Silicon Valley] Digital Natives;
LinkedIn, Netflix, Uber, Yelp...
Predictive Maintenance: Audi
Streaming Platform in a regulated
environment (e.g. Electronic Medical
Records): Celmatix
Real-time app
updates
Real Time Streaming Platform for
Communications and Beyond: Capital One
Developer Velocity - Building Stateful
Financial Applications with Kafka
Streams: Funding Circle
Detect Fraud & Prevent Fraud in Real
Time: PayPal
Kafka as a Service - A Tale of Security
and Multi-Tenancy: Apple
Example Use Cases
$↑
$↓
$
Example Case Studies
(of many)
13
A Kafka Cluster
www.kai-waehner.de | @KaiWaehner
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
14
A Kafka Cluster
www.kai-waehner.de | @KaiWaehner
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
“Server Side” of a Kafka Cluster
1 - * Kafka Broker
1 - * ZooKeeper
0 - * Schema Registry
0 - * Kafka Connect
0 - * REST Proxy
Security, Ops, Monitoring, …
“Client Side” of a Kafka Cluster
Kafka Clients (Java, C, C++, Python, Go, JavaScript, …)
Kafka Stream Processing Apps (Kafka Streams, ksqlDB)
External Producers / Consumers (Oracle, Hadoop, Flink, …)
15
Why Multiple Kafka Clusters?
www.kai-waehner.de | @KaiWaehner
* Not a representative survey J
** Many DCs does NOT necessarily mean more than one Kafka Cluster
16
Disaster Recovery – RPO and RTO
www.kai-waehner.de | @KaiWaehner
RPO = Recovery Point Objective
RTO = Recovery Time Objective
17
Agenda
1) Definition ‘Kafka Cluster’
2) One Kafka Cluster
3) Multiple Kafka Clusters
4) Multi-Region / Global Kafka Cluster
5) Infrastructure Options
www.kai-waehner.de | @KaiWaehner
18
A Kafka Cluster
www.kai-waehner.de | @KaiWaehner
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
19
A Kafka Cluster for High Availability
www.kai-waehner.de | @KaiWaehner
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
PROD / Pre-PROD / TEST
3 - * Kafka Broker
3 / 5 / 7 ZooKeeper
2 Schema Registry
2 - * Kafka Connect
2 - * REST Proxy
Security, Ops, Monitoring, …
DEV / Functional TEST
1 Kafka Broker
1 ZooKeeper
0 - 1 Schema Registry
0 - 1 Kafka Connect
0 - 1 REST Proxy
Security, Ops, Monitoring, …
20
A Stretched Kafka Cluster over 3DC
www.kai-waehner.de | @KaiWaehner
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Kafka Connect
Schema Registry
Producer
Consumer
Producer
Consumer
Kafka Connect
Schema Registry
DC1 DC2 DC3
21
A Stretched Kafka Cluster over 3DC
www.kai-waehner.de | @KaiWaehner
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Kafka Connect
Schema Registry
Producer
Consumer
Producer
Consumer
Kafka Connect
Schema Registry
DC1 DC2 DC3
High availability (Survives DC outage)
Zero data loss and zero downtime
Automatic client fail-over
Works well in cloud (3 AZs in 1 region)
Requires “good” latency (à DCs ”close” to each other)
Requires three DCs (Quorum / split brain)
Complex to configure and operate
22
A Stretched Kafka Cluster over 2DC
www.kai-waehner.de | @KaiWaehner
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Kafka Connect
Schema Registry
Producer
Consumer
Producer
Consumer
Kafka Connect
Schema Registry
DC1 DC2
Kafka Broker
23
A Stretched Kafka Cluster over 2DC
www.kai-waehner.de | @KaiWaehner
High availability (Survives DC outage)
Zero data loss or zero downtime
Automatic client fail-over
Stopgap solution for on premise (if only 2 DCs available)
à 2.5 DC deployment as workaround
Requires “good” latency (à DCs ”close” to each other)
Quorum in 2 DCs not possible
Complex to configure and operate
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Kafka Connect
Schema Registry
Producer
Consumer
Producer
Consumer
Kafka Connect
Schema Registry
DC1 DC2
Kafka Broker
24
A Single Kafka Cluster
www.kai-waehner.de | @KaiWaehner
Zookeeper
Kafka Broker
Schema Registry
OPC-UA
MQTT
PLC4X
KSQL
Grafana
Postgres
Kafka Connect
Simple setup
Works
Often used “at the Edge”
No high availability
25
Agenda
1) Definition ‘Kafka Cluster’
2) One Kafka Cluster
3) Multiple Kafka Clusters
4) Multi-Region / Global Kafka Cluster
5) Infrastructure Options
www.kai-waehner.de | @KaiWaehner
26
Independent Kafka Clusters
www.kai-waehner.de | @KaiWaehner
Total Independence
Owned by the project teams, central ops or SaaS
Different sizing, security, infrastructure
Related projects should run on the same Kafka cluster
Independent projects can run on the same Kafka cluster
• similar SLAs and requirements
• e.g. NOT Instant payment vs. log analytics vs. file transfer vs. video streaming
• ACLs / RBAC for fine-grained authentication and authorization
• throughput typically no issue (Confluent Cloud processes 1 Gigabyte / sec and more in one cluster)
• reduce overhead (operations, hardware, …)
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
27
Hybrid Integration of 2 Kafka Clusters
www.kai-waehner.de | @KaiWaehner
Hybrid integration
On premise and cloud or multi-cloud scenarios (due to technical, business or legal reasons)
Uni- or bi-directional
Know the best practices or get help
Know your SLAs and timelines
Choose the right (battle-tested?) tool
Works
Relatively easy to setup (some tools are complex / not up-to-date / not mature / not documented well)
Example: Replicate data from production to analytics cluster
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
MirrorMaker 1 / 2
Confluent Replicator
uReplicator (Uber)
Mirus (Salesforce)
Brooklin (LinkedIn)
Custom Replication
DC1 DC2
Streaming Replication
28
Migration of Kafka Clusters
www.kai-waehner.de | @KaiWaehner
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
MirrorMaker 1 / 2
Confluent Replicator
uReplicator (Uber)
Mirus (Salesforce)
Brooklin (LinkedIn)
Custom Replication
DC1 DC2
Streaming Replication
Common migration scenarios
On premise à Cloud
Cloud A à Cloud B
Vendor 1 à Vendor 2
Self-Managed à SaaS
Migration steps:
1) Create new Kafka cluster
2) Producer / Consumer re-configuration
3) Shutdown of old Cluster
Know the best practices or get help
Know your SLAs and timelines
Choose the right (battle-tested?) tool
29
Disaster Recovery with 2 Kafka Clusters
www.kai-waehner.de | @KaiWaehner
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
MirrorMaker 1 / 2
Confluent Replicator
uReplicator (Uber)
Mirus (Salesforce)
Brooklin (LinkedIn)
Custom Replication
DC1 DC2
Streaming Replication
Know the trade-offs!
If Kafka Cluster 1 is down, Kafka Cluster 2 is still live and running
Timestamp preservation
Offset translation
Manual client-failover / custom client code
Data loss in case of DC outage (asynchronous replication)
30
Disaster Recovery @ JPMorgan
www.kai-waehner.de | @KaiWaehner
https://www.confluent.io/kafka-summit-san-francisco-2019/secure-kafka-at-scale-in-true-multi-tenant-environment
31
Aggregation of Kafka Clusters
www.kai-waehner.de | @KaiWaehner
Local smaller Kafka Clusters in each site for critical real time applications (high SLAs)
Central bigger Kafka Cluster for analytics use cases (often less critical SLAs)
Works
Relatively easy to setup (some tools are complex / not up-to-date / not mature / not documented well)
Some tools do not support same topic name in each DC
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Kafka Connect
Kafka Connect
Kafka Broker Kafka Broker Kafka Broker
32
Aggregation Cluster @ Royal Caribbean
www.kai-waehner.de | @KaiWaehner
https://www.confluent.io/kafka-summit-lon19/seamless-guest-experience-with-kafka-streams/
33
Aggregation of Edge Kafka Clusters
www.kai-waehner.de | @KaiWaehner
Small Kafka clusters in each site for data collection (often low SLAs, sometimes single Kafka broker)
Kafka at the edge sometimes OEM / hardware appliance
Central big Kafka cluster for critical use cases and edge integration (high SLAs)
Zookeeper
Kafka Broker
Schema Registry
OPC-UA
MQTT
PLC4X
KSQL
Grafana
Postgres
Kafka Connect
Zookeeper
Kafka Broker
Schema Registry
OPC-UA
MQTT
PLC4X
KSQL
Grafana
Postgres
Kafka Connect
Zookeeper
Kafka Broker
Schema Registry
OPC-UA
MQTT
PLC4X
KSQL
Grafana
Postgres
Kafka Connect
Zookeeper
Kafka Broker
Schema Registry
OPC-UA
MQTT
PLC4X
KSQL
Grafana
Postgres
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Kafka Connect
Kafka Connect
Kafka Broker Kafka Broker Kafka Broker
34
Real Time Streaming ML at the Edge @ Severstal
www.kai-waehner.de | @KaiWaehner
https://www.confluent.io/customers/severstal/
35
Cross-Company Kafka Integration
(Special Case of Hybrid Integration)
www.kai-waehner.de | @KaiWaehner
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
MirrorMaker 1 / 2
Confluent Replicator
uReplicator (Uber)
Mirus (Salesforce)
Brooklin (LinkedIn)
Custom Replication
Company A Company B
Streaming Replication
Streaming integration between companies
API Management (REST et al) not appropriate for streaming data
Infosec and politics are your biggest enemy
36
Agenda
1) Definition ‘Kafka Cluster’
2) One Kafka Cluster
3) Multiple Kafka Clusters
4) Multi-Region / Global Kafka Cluster
5) Infrastructure Options
www.kai-waehner.de | @KaiWaehner
37
A Stretched Kafka Cluster over 3 Regions?
www.kai-waehner.de | @KaiWaehner
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Kafka Connect
Schema Registry
Producer
Consumer
Producer
Consumer
Kafka Connect
Schema Registry
US-East US-Central US-West
No!
Fail!
Error!
Sorry!
38Replication Between Kafka Clusters over
Multiple Regions or Continents
www.kai-waehner.de | @KaiWaehner
Streaming replication works (MirrorMaker 2, Confluent Replicator)
Same challenges as in one region (data loss, custom code for fail-over, offset translation, etc.)
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Schema Registry
Schema Registry
Producer Consumer
Kafka Connect
Kafka Connect
China USA
Europe
39
A Single Kafka Cluster over 3 Regions
with Multi-Region Replication
www.kai-waehner.de | @KaiWaehner
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Kafka Connect
Schema Registry
Producer
Consumer
Producer
Consumer
Kafka Connect
Schema Registry
US-East US-Central US-West
40A Single Multi Region Kafka Cluster (MRC)
www.kai-waehner.de | @KaiWaehner
High availability (Survives region outage)
Zero data loss and zero downtime
Automatic client fail-over over regions
Works well in cloud and on premise
No external tools (like MirrorMaker) needed
Not part of Open Source Kafka à Build vs. Buy
Zookeeper Zookeeper Zookeeper
Kafka Broker Kafka Broker Kafka Broker
Kafka Connect
Schema Registry
Producer
Consumer
Producer
Consumer
Kafka Connect
Schema Registry
US-East US-Central US-West
How does this work?
Region-awareness
Synchronous or asynchronous replication per Topic
Follower-fetching
Regional topic locality
Replication rules
…
(Confluent Platform)
41A Single Multi Region Kafka Cluster (MRC)
www.kai-waehner.de | @KaiWaehner
Broker
1
Broker
2
Broker
3
ZK1
Broker
4
Broker
5
Broker
6
Broker
1
Broker
2
ZK2
Client D Client F Client G
Failover site
ZK3
Broker
3
Broker
4
Broker
5
Broker
6
Client A Client B
us-central-1
Client A Client B
automated
client failover
Observer
replicas
us-west-1 us-east-1
Site failure!
“tie-breaker”
datacenter
Single Kafka Cluster
(Confluent Platform)
$ bin/kafka-topics.sh --bootstrap-servers localhost:2181
--create
--topic trades-west
--partitions 3
--config replication-factor={us-west: 2}
--config min.insync.replicas=2
--config async.replication-factor={us-east: 2}
--config max.async.time.behind.min=5
--config replay.truncated.messages=true
42
Vision: One Global Kafka Cluster
www.kai-waehner.de | @KaiWaehner
Topic‘pos_payments’
43
ZooKeeper Removal (KIP-500)
www.kai-waehner.de | @KaiWaehner
https://cwiki.apache.org/confluence/display/KAFKA/KIP-500%3A+Replace+ZooKeeper+with+a+Self-Managed+Metadata+Quorum
44
Architecture patterns for
distributed, hybrid, edge and global
Apache Kafka deployments…
www.kai-waehner.de | @KaiWaehner
à There is no best solution. It depends!
45
Agenda
1) Definition ‘Kafka Cluster’
2) One Kafka Cluster
3) Multiple Kafka Clusters
4) Multi-Region / Global Kafka Cluster
5) Infrastructure Options
www.kai-waehner.de | @KaiWaehner
46
Infrastructure Options
www.kai-waehner.de | @KaiWaehner
Infrastructure is your choice!
à Bare metal vs. VM vs. container vs. cloud…
Software is your choice!
à Open source vs. commercial vs. SaaS
Ops and management are your choice!
à Self-Managed vs. PaaS vs. fully-managed
Integration is your choice!
à Kafka-native vs. other tools / services
Find the right solution for your business case and for your SLAs…
4747
CONFLUENT PLATFORM
EFFICIENT
OPERATIONS AT SCALE
PRODUCTION-STAGE
PREREQUISITES
UNRESTRICTED
DEVELOPER PRODUCTIVITY
Multi-language Development
Rich Pre-built Ecosystem
SQL-based Stream Processing
GUI-driven Mgmt & Monitoring
Flexible DevOps Automation
Dynamic Performance & Elasticity
Enterprise-grade Security
Data Compatibility
Global Availability
APACHE KAFKA
Fully Managed Cloud ServiceSelf Managed Software
FREEDOM OF CHOICE
COMMITTER-LED EXPERTISE PartnersTraining
Professional
Services
Enterprise
Support
DEVELOPER OPERATOR ARCHITECT
Hybrid Infrastructure
48
Kafka as a Service – Fully Managed?
Infrastructure
management
(commodity)
Scaling
● Upgrades (latest stable version of Kafka)
● Patching
● Maintenance
● Sizing (retention, latency, throughput, storage, etc.)
● Data balancing for optimal performance
● Performance tuning for real-time and latency requirements
● Fixing Kafka bugs
● Uptime monitoring and proactive remediation of issues
● Recovery support from data corruption
● Scaling the cluster as needed
● Data balancing the cluster as nodes are added
● Support for any Kafka issue with less than X minutes response time
Infra-as-a-Service
Harness full power of Kafka
Kafka-specific
management
Platform-as-a-Service
Evolve as you need
Future-proof
Mission-critical reliability
Most Kafka-as-a-Service offerings are partially-managed
Kafka as a Service should be a serverless experience with consumption-based pricing!
4949
I N V E S T M E N T & T I M E
VALUE
3
4
5
1
2
Event Streaming Maturity Model
49
Initial Awareness /
Pilot (1 Kafka Cluster)
Start to Build Pipeline /
Deliver 1 New Outcome
(1 Kafka Cluster)
Mission-Critical
Deployment
(Stretched, Hybrid,
Multi-Region)
Build Contextual
Event-Driven Apps
(Stretched, Hybrid,
Multi-Region)
Central Nervous
System
(Global Kafka)
Product, Support, Training, Partners, Technical Account Management...
50End-to-End Integration with 24/7 Uptime and Zero Data Loss?
www.kai-waehner.de | @KaiWaehner
…. more components, clusters, technologies means more risks, conflicts, incompatibilities, operations burden!
ETL MQ
Storage Streaming
Messaging: Kafka Core
Storage: Kafka Core
Caching: Kafka Core
Real-Time, Batch: Kafka Clients
Integration: Kafka Connect
Stream Processing: Kafka Streams / KSQL
Request-Response: REST Proxy
Replication between Kafka Clusters
(Edge, On Premises, Hybrid, Cloud)
Multi-Region Kafka Cluster
”Eat your own dog food”
vs.
http://www.kai-waehner.de/blog/2019/03/07/apache-kafka-middleware-mq-etl-esb-comparison/
51
Questions?
Let’s connect...
Kai Waehner
Technology Evangelist
kai.waehner@confluent.io
@KaiWaehner
www.confluent.io
www.kai-waehner.de
LinkedIn

Mais conteúdo relacionado

Mais procurados

Confluent Operator as Cloud-Native Kafka Operator for Kubernetes
Confluent Operator as Cloud-Native Kafka Operator for KubernetesConfluent Operator as Cloud-Native Kafka Operator for Kubernetes
Confluent Operator as Cloud-Native Kafka Operator for KubernetesKai Wähner
 
Kafka At Scale in the Cloud
Kafka At Scale in the CloudKafka At Scale in the Cloud
Kafka At Scale in the Cloudconfluent
 
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Real-Life Use Cases & Architectures for Event Streaming with Apache KafkaReal-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Real-Life Use Cases & Architectures for Event Streaming with Apache KafkaKai Wähner
 
ksqlDB - Stream Processing simplified!
ksqlDB - Stream Processing simplified!ksqlDB - Stream Processing simplified!
ksqlDB - Stream Processing simplified!Guido Schmutz
 
Introduction to Kafka Streams
Introduction to Kafka StreamsIntroduction to Kafka Streams
Introduction to Kafka StreamsGuozhang Wang
 
Fundamentals of Apache Kafka
Fundamentals of Apache KafkaFundamentals of Apache Kafka
Fundamentals of Apache KafkaChhavi Parasher
 
Kafka 101 and Developer Best Practices
Kafka 101 and Developer Best PracticesKafka 101 and Developer Best Practices
Kafka 101 and Developer Best Practicesconfluent
 
Kafka Streams: What it is, and how to use it?
Kafka Streams: What it is, and how to use it?Kafka Streams: What it is, and how to use it?
Kafka Streams: What it is, and how to use it?confluent
 
Stream processing using Kafka
Stream processing using KafkaStream processing using Kafka
Stream processing using KafkaKnoldus Inc.
 
The Top 5 Apache Kafka Use Cases and Architectures in 2022
The Top 5 Apache Kafka Use Cases and Architectures in 2022The Top 5 Apache Kafka Use Cases and Architectures in 2022
The Top 5 Apache Kafka Use Cases and Architectures in 2022Kai Wähner
 
Can Apache Kafka Replace a Database?
Can Apache Kafka Replace a Database?Can Apache Kafka Replace a Database?
Can Apache Kafka Replace a Database?Kai Wähner
 
Schema Registry 101 with Bill Bejeck | Kafka Summit London 2022
Schema Registry 101 with Bill Bejeck | Kafka Summit London 2022Schema Registry 101 with Bill Bejeck | Kafka Summit London 2022
Schema Registry 101 with Bill Bejeck | Kafka Summit London 2022HostedbyConfluent
 
Introduction to Apache Kafka
Introduction to Apache KafkaIntroduction to Apache Kafka
Introduction to Apache KafkaJeff Holoman
 
Apache Kafka Introduction
Apache Kafka IntroductionApache Kafka Introduction
Apache Kafka IntroductionAmita Mirajkar
 
Stream Processing with Apache Kafka and .NET
Stream Processing with Apache Kafka and .NETStream Processing with Apache Kafka and .NET
Stream Processing with Apache Kafka and .NETconfluent
 
Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Araf Karsh Hamid
 
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
Top 5 Event Streaming Use Cases for 2021 with Apache KafkaTop 5 Event Streaming Use Cases for 2021 with Apache Kafka
Top 5 Event Streaming Use Cases for 2021 with Apache KafkaKai Wähner
 

Mais procurados (20)

Confluent Operator as Cloud-Native Kafka Operator for Kubernetes
Confluent Operator as Cloud-Native Kafka Operator for KubernetesConfluent Operator as Cloud-Native Kafka Operator for Kubernetes
Confluent Operator as Cloud-Native Kafka Operator for Kubernetes
 
Kafka At Scale in the Cloud
Kafka At Scale in the CloudKafka At Scale in the Cloud
Kafka At Scale in the Cloud
 
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Real-Life Use Cases & Architectures for Event Streaming with Apache KafkaReal-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
 
ksqlDB - Stream Processing simplified!
ksqlDB - Stream Processing simplified!ksqlDB - Stream Processing simplified!
ksqlDB - Stream Processing simplified!
 
Introduction to Kafka Streams
Introduction to Kafka StreamsIntroduction to Kafka Streams
Introduction to Kafka Streams
 
Fundamentals of Apache Kafka
Fundamentals of Apache KafkaFundamentals of Apache Kafka
Fundamentals of Apache Kafka
 
Kafka 101 and Developer Best Practices
Kafka 101 and Developer Best PracticesKafka 101 and Developer Best Practices
Kafka 101 and Developer Best Practices
 
Introduction to Apache Kafka
Introduction to Apache KafkaIntroduction to Apache Kafka
Introduction to Apache Kafka
 
Kafka presentation
Kafka presentationKafka presentation
Kafka presentation
 
Kafka Streams: What it is, and how to use it?
Kafka Streams: What it is, and how to use it?Kafka Streams: What it is, and how to use it?
Kafka Streams: What it is, and how to use it?
 
Stream processing using Kafka
Stream processing using KafkaStream processing using Kafka
Stream processing using Kafka
 
The Top 5 Apache Kafka Use Cases and Architectures in 2022
The Top 5 Apache Kafka Use Cases and Architectures in 2022The Top 5 Apache Kafka Use Cases and Architectures in 2022
The Top 5 Apache Kafka Use Cases and Architectures in 2022
 
Apache Kafka at LinkedIn
Apache Kafka at LinkedInApache Kafka at LinkedIn
Apache Kafka at LinkedIn
 
Can Apache Kafka Replace a Database?
Can Apache Kafka Replace a Database?Can Apache Kafka Replace a Database?
Can Apache Kafka Replace a Database?
 
Schema Registry 101 with Bill Bejeck | Kafka Summit London 2022
Schema Registry 101 with Bill Bejeck | Kafka Summit London 2022Schema Registry 101 with Bill Bejeck | Kafka Summit London 2022
Schema Registry 101 with Bill Bejeck | Kafka Summit London 2022
 
Introduction to Apache Kafka
Introduction to Apache KafkaIntroduction to Apache Kafka
Introduction to Apache Kafka
 
Apache Kafka Introduction
Apache Kafka IntroductionApache Kafka Introduction
Apache Kafka Introduction
 
Stream Processing with Apache Kafka and .NET
Stream Processing with Apache Kafka and .NETStream Processing with Apache Kafka and .NET
Stream Processing with Apache Kafka and .NET
 
Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics
 
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
Top 5 Event Streaming Use Cases for 2021 with Apache KafkaTop 5 Event Streaming Use Cases for 2021 with Apache Kafka
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
 

Semelhante a Architecture patterns for distributed, hybrid, edge and global Apache Kafka deployments

Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...HostedbyConfluent
 
Kafka at the Edge: an IoT scenario with OpenShift Streams for Apache Kafka | ...
Kafka at the Edge: an IoT scenario with OpenShift Streams for Apache Kafka | ...Kafka at the Edge: an IoT scenario with OpenShift Streams for Apache Kafka | ...
Kafka at the Edge: an IoT scenario with OpenShift Streams for Apache Kafka | ...Red Hat Developers
 
JHipster conf 2019 - Kafka Ecosystem
JHipster conf 2019 - Kafka EcosystemJHipster conf 2019 - Kafka Ecosystem
JHipster conf 2019 - Kafka EcosystemFlorent Ramiere
 
New Features in Confluent Platform 6.0 / Apache Kafka 2.6
New Features in Confluent Platform 6.0 / Apache Kafka 2.6New Features in Confluent Platform 6.0 / Apache Kafka 2.6
New Features in Confluent Platform 6.0 / Apache Kafka 2.6Kai Wähner
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...confluent
 
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...Kai Wähner
 
Kafka On YARN (KOYA): An Open Source Initiative to integrate Kafka & YARN
Kafka On YARN (KOYA): An Open Source Initiative to integrate Kafka & YARNKafka On YARN (KOYA): An Open Source Initiative to integrate Kafka & YARN
Kafka On YARN (KOYA): An Open Source Initiative to integrate Kafka & YARNDataWorks Summit
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with K...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with K...Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with K...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with K...confluent
 
The Top 5 Event Streaming Use Cases & Architectures in 2021
The Top 5 Event Streaming Use Cases & Architectures in 2021The Top 5 Event Streaming Use Cases & Architectures in 2021
The Top 5 Event Streaming Use Cases & Architectures in 2021confluent
 
Beyond the Brokers: A Tour of the Kafka Ecosystem
Beyond the Brokers: A Tour of the Kafka EcosystemBeyond the Brokers: A Tour of the Kafka Ecosystem
Beyond the Brokers: A Tour of the Kafka Ecosystemconfluent
 
Beyond the brokers - A tour of the Kafka ecosystem
Beyond the brokers - A tour of the Kafka ecosystemBeyond the brokers - A tour of the Kafka ecosystem
Beyond the brokers - A tour of the Kafka ecosystemDamien Gasparina
 
Kappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKai Wähner
 
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)Kai Wähner
 
Service Mesh with Apache Kafka, Kubernetes, Envoy, Istio and Linkerd
Service Mesh with Apache Kafka, Kubernetes, Envoy, Istio and LinkerdService Mesh with Apache Kafka, Kubernetes, Envoy, Istio and Linkerd
Service Mesh with Apache Kafka, Kubernetes, Envoy, Istio and LinkerdKai Wähner
 
BBL KAPPA Lesfurets.com
BBL KAPPA Lesfurets.comBBL KAPPA Lesfurets.com
BBL KAPPA Lesfurets.comCedric Vidal
 
Beyond the brokers - Un tour de l'écosystème Kafka
Beyond the brokers - Un tour de l'écosystème KafkaBeyond the brokers - Un tour de l'écosystème Kafka
Beyond the brokers - Un tour de l'écosystème KafkaFlorent Ramiere
 
Introducing Kafka's Streams API
Introducing Kafka's Streams APIIntroducing Kafka's Streams API
Introducing Kafka's Streams APIconfluent
 
Au delà des brokers, un tour de l’environnement Kafka | Florent Ramière
Au delà des brokers, un tour de l’environnement Kafka | Florent RamièreAu delà des brokers, un tour de l’environnement Kafka | Florent Ramière
Au delà des brokers, un tour de l’environnement Kafka | Florent Ramièreconfluent
 
Apache Kafka vs. Traditional Middleware (Kai Waehner, Confluent) Frankfurt 20...
Apache Kafka vs. Traditional Middleware (Kai Waehner, Confluent) Frankfurt 20...Apache Kafka vs. Traditional Middleware (Kai Waehner, Confluent) Frankfurt 20...
Apache Kafka vs. Traditional Middleware (Kai Waehner, Confluent) Frankfurt 20...confluent
 
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...confluent
 

Semelhante a Architecture patterns for distributed, hybrid, edge and global Apache Kafka deployments (20)

Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
 
Kafka at the Edge: an IoT scenario with OpenShift Streams for Apache Kafka | ...
Kafka at the Edge: an IoT scenario with OpenShift Streams for Apache Kafka | ...Kafka at the Edge: an IoT scenario with OpenShift Streams for Apache Kafka | ...
Kafka at the Edge: an IoT scenario with OpenShift Streams for Apache Kafka | ...
 
JHipster conf 2019 - Kafka Ecosystem
JHipster conf 2019 - Kafka EcosystemJHipster conf 2019 - Kafka Ecosystem
JHipster conf 2019 - Kafka Ecosystem
 
New Features in Confluent Platform 6.0 / Apache Kafka 2.6
New Features in Confluent Platform 6.0 / Apache Kafka 2.6New Features in Confluent Platform 6.0 / Apache Kafka 2.6
New Features in Confluent Platform 6.0 / Apache Kafka 2.6
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
 
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...
 
Kafka On YARN (KOYA): An Open Source Initiative to integrate Kafka & YARN
Kafka On YARN (KOYA): An Open Source Initiative to integrate Kafka & YARNKafka On YARN (KOYA): An Open Source Initiative to integrate Kafka & YARN
Kafka On YARN (KOYA): An Open Source Initiative to integrate Kafka & YARN
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with K...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with K...Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with K...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with K...
 
The Top 5 Event Streaming Use Cases & Architectures in 2021
The Top 5 Event Streaming Use Cases & Architectures in 2021The Top 5 Event Streaming Use Cases & Architectures in 2021
The Top 5 Event Streaming Use Cases & Architectures in 2021
 
Beyond the Brokers: A Tour of the Kafka Ecosystem
Beyond the Brokers: A Tour of the Kafka EcosystemBeyond the Brokers: A Tour of the Kafka Ecosystem
Beyond the Brokers: A Tour of the Kafka Ecosystem
 
Beyond the brokers - A tour of the Kafka ecosystem
Beyond the brokers - A tour of the Kafka ecosystemBeyond the brokers - A tour of the Kafka ecosystem
Beyond the brokers - A tour of the Kafka ecosystem
 
Kappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology Comparison
 
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)
 
Service Mesh with Apache Kafka, Kubernetes, Envoy, Istio and Linkerd
Service Mesh with Apache Kafka, Kubernetes, Envoy, Istio and LinkerdService Mesh with Apache Kafka, Kubernetes, Envoy, Istio and Linkerd
Service Mesh with Apache Kafka, Kubernetes, Envoy, Istio and Linkerd
 
BBL KAPPA Lesfurets.com
BBL KAPPA Lesfurets.comBBL KAPPA Lesfurets.com
BBL KAPPA Lesfurets.com
 
Beyond the brokers - Un tour de l'écosystème Kafka
Beyond the brokers - Un tour de l'écosystème KafkaBeyond the brokers - Un tour de l'écosystème Kafka
Beyond the brokers - Un tour de l'écosystème Kafka
 
Introducing Kafka's Streams API
Introducing Kafka's Streams APIIntroducing Kafka's Streams API
Introducing Kafka's Streams API
 
Au delà des brokers, un tour de l’environnement Kafka | Florent Ramière
Au delà des brokers, un tour de l’environnement Kafka | Florent RamièreAu delà des brokers, un tour de l’environnement Kafka | Florent Ramière
Au delà des brokers, un tour de l’environnement Kafka | Florent Ramière
 
Apache Kafka vs. Traditional Middleware (Kai Waehner, Confluent) Frankfurt 20...
Apache Kafka vs. Traditional Middleware (Kai Waehner, Confluent) Frankfurt 20...Apache Kafka vs. Traditional Middleware (Kai Waehner, Confluent) Frankfurt 20...
Apache Kafka vs. Traditional Middleware (Kai Waehner, Confluent) Frankfurt 20...
 
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
 

Mais de Kai Wähner

Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)Kai Wähner
 
When NOT to use Apache Kafka?
When NOT to use Apache Kafka?When NOT to use Apache Kafka?
When NOT to use Apache Kafka?Kai Wähner
 
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kafka for Live Commerce to Transform the Retail and Shopping MetaverseKafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kafka for Live Commerce to Transform the Retail and Shopping MetaverseKai Wähner
 
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaThe Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaKai Wähner
 
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Apache Kafka vs. Cloud-native iPaaS Integration Platform MiddlewareApache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Apache Kafka vs. Cloud-native iPaaS Integration Platform MiddlewareKai Wähner
 
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?Kai Wähner
 
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse ArchitectureServerless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse ArchitectureKai Wähner
 
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...Kai Wähner
 
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Data Streaming with Apache Kafka in the Defence and Cybersecurity IndustryData Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Data Streaming with Apache Kafka in the Defence and Cybersecurity IndustryKai Wähner
 
Apache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryApache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryKai Wähner
 
Apache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryApache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryKai Wähner
 
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
Apache Kafka for Real-time Supply Chainin the Food and Retail IndustryApache Kafka for Real-time Supply Chainin the Food and Retail Industry
Apache Kafka for Real-time Supply Chain in the Food and Retail IndustryKai Wähner
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKai Wähner
 
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0Kai Wähner
 
Apache Kafka Landscape for Automotive and Manufacturing
Apache Kafka Landscape for Automotive and ManufacturingApache Kafka Landscape for Automotive and Manufacturing
Apache Kafka Landscape for Automotive and ManufacturingKai Wähner
 
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Event Streaming CTO Roundtable for Cloud-native Kafka ArchitecturesEvent Streaming CTO Roundtable for Cloud-native Kafka Architectures
Event Streaming CTO Roundtable for Cloud-native Kafka ArchitecturesKai Wähner
 
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...Kai Wähner
 
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...Kai Wähner
 
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR ModernizationApache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR ModernizationKai Wähner
 
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....Kai Wähner
 

Mais de Kai Wähner (20)

Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
 
When NOT to use Apache Kafka?
When NOT to use Apache Kafka?When NOT to use Apache Kafka?
When NOT to use Apache Kafka?
 
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kafka for Live Commerce to Transform the Retail and Shopping MetaverseKafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
 
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache KafkaThe Heart of the Data Mesh Beats in Real-Time with Apache Kafka
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
 
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Apache Kafka vs. Cloud-native iPaaS Integration Platform MiddlewareApache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
 
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
 
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse ArchitectureServerless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
 
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
 
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Data Streaming with Apache Kafka in the Defence and Cybersecurity IndustryData Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
 
Apache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryApache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare Industry
 
Apache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryApache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare Industry
 
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
Apache Kafka for Real-time Supply Chainin the Food and Retail IndustryApache Kafka for Real-time Supply Chainin the Food and Retail Industry
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid Cloud
 
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
 
Apache Kafka Landscape for Automotive and Manufacturing
Apache Kafka Landscape for Automotive and ManufacturingApache Kafka Landscape for Automotive and Manufacturing
Apache Kafka Landscape for Automotive and Manufacturing
 
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Event Streaming CTO Roundtable for Cloud-native Kafka ArchitecturesEvent Streaming CTO Roundtable for Cloud-native Kafka Architectures
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
 
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
 
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
 
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR ModernizationApache Kafka for Cybersecurity and SIEM / SOAR Modernization
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
 
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
 

Último

Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Mater
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...Akihiro Suda
 
How To Manage Restaurant Staff -BTRESTRO
How To Manage Restaurant Staff -BTRESTROHow To Manage Restaurant Staff -BTRESTRO
How To Manage Restaurant Staff -BTRESTROmotivationalword821
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfStefano Stabellini
 
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdfInnovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdfYashikaSharma391629
 
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxUI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxAndreas Kunz
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsSafe Software
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsChristian Birchler
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 

Último (20)

Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
 
How To Manage Restaurant Staff -BTRESTRO
How To Manage Restaurant Staff -BTRESTROHow To Manage Restaurant Staff -BTRESTRO
How To Manage Restaurant Staff -BTRESTRO
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdf
 
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdfInnovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdf
 
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxUI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 

Architecture patterns for distributed, hybrid, edge and global Apache Kafka deployments

  • 1. 1 Kai Waehner | Technology Evangelist, Confluent contact@kai-waehner.de | LinkedIn | @KaiWaehner | www.confluent.io | www.kai-waehner.de Architecture patterns for distributed, hybrid, edge and global Apache Kafka deployments
  • 2. 2Abstract Architecture patterns for distributed, hybrid, edge and global Apache Kafka deployments Multi-cluster and cross-data center deployments of Apache Kafka have become the norm rather than an exception. This session gives an overview of several scenarios that may require multi-cluster solutions and discusses real-world examples with their specific requirements and trade-offs, including disaster recovery, aggregation for analytics, cloud migration, mission-critical stretched deployments and global Kafka. Key takeaways: • In many scenarios, one Kafka cluster is not enough. Understand different architectures and alternatives for multi-cluster deployments. • Zero data loss and high availability are two key requirements. Understand how to realize this, including trade-offs. • Learn about features and limitations of Kafka for multi cluster deployments • Global Kafka and mission-critical multi-cluster deployments with zero data loss and high availability became the normal, not an exception. www.kai-waehner.de | @KaiWaehner
  • 3. 3 Agenda 1) Definition ‘Kafka Cluster’ 2) One Kafka Cluster 3) Multiple Kafka Clusters 4) Multi-Region / Global Kafka Cluster 5) Infrastructure Options www.kai-waehner.de | @KaiWaehner
  • 4. 4 Agenda 1) Definition ‘Kafka Cluster’ 2) One Kafka Cluster 3) Multiple Kafka Clusters 4) Multi-Region / Global Kafka Cluster 5) Infrastructure Options www.kai-waehner.de | @KaiWaehner
  • 6. 6 The Beginning of a New Era www.kai-waehner.de | @KaiWaehner https://engineering.linkedin.com/distributed-systems/log-what-every-software-engineer-should-know-about-real-time-datas-unifying The first use case: Log Analytics. This is why Kafka was created!
  • 7. 7 Event Streaming Platform – The Commit Log www.kai-waehner.de | @KaiWaehner Time P C1 C2 C3
  • 8. 8 Event Streaming Platform – A Distributed System www.kai-waehner.de | @KaiWaehner Broker 1 Topic1 partition1 Broker 2 Broker 3 Broker 4 Topic1 partition1 Topic1 partition1 Leader Follower Topic1 partition2 Topic1 partition2 Topic1 partition2 Topic1 partition3 Topic1 partition4 Topic1 partition3 Topic1 partition3 Topic1 partition4 Topic1 partition4
  • 9. 9Apache Kafka (kafka.apache.org) includes Kafka Connect and Kafka Streams www.kai-waehner.de | @KaiWaehner Kafka Streams Your app sinksource KafkaConnect KafkaConnect Kafka Cluster
  • 10. 10A Streaming Platform is the Underpinning of an Event-driven Architecture www.kai-waehner.de | @KaiWaehner Microservices DBs SaaS apps Customer 360 Real-time fraud detection Data warehouse Producers Consumers Database change Microservices events SaaS data Customer experiences Streams of real time events Stream processing apps Connectors Connectors Stream processing apps
  • 11. 11 Apache Kafka at Scale at Tech Giants www.kai-waehner.de | @KaiWaehner > 7 trillion messages / day > 6 Petabytes / day “You name it” * Kafka Is not just used by tech giants ** Kafka is not just used for big data
  • 12. 12 www.kai-waehner.de | @KaiWaehner Improve Customer Experience (CX) Increase Revenue (make money) Business Value Decrease Costs (save money) Core Business Platform Increase Operational Efficiency Migrate to Cloud Mitigate Risk (protect money) Key Drivers Strategic Objectives (sample) Fraud Detection IoT sensor ingestion Digital replatforming/ Mainframe Offload Connected Car: Navigation & improved in-car experience: Audi Customer 360 Simplifying Omni-channel Retail at Scale: Target Faster transactional processing / analysis incl. Machine Learning / AI Mainframe Offload: RBC Microservices Architecture Online Fraud Detection Online Security (syslog, log aggregation, Splunk replacement) Middleware replacement Regulatory Digital Transformation Application Modernization: Multiple Examples Website / Core Operations (Central Nervous System) The [Silicon Valley] Digital Natives; LinkedIn, Netflix, Uber, Yelp... Predictive Maintenance: Audi Streaming Platform in a regulated environment (e.g. Electronic Medical Records): Celmatix Real-time app updates Real Time Streaming Platform for Communications and Beyond: Capital One Developer Velocity - Building Stateful Financial Applications with Kafka Streams: Funding Circle Detect Fraud & Prevent Fraud in Real Time: PayPal Kafka as a Service - A Tale of Security and Multi-Tenancy: Apple Example Use Cases $↑ $↓ $ Example Case Studies (of many)
  • 13. 13 A Kafka Cluster www.kai-waehner.de | @KaiWaehner Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect
  • 14. 14 A Kafka Cluster www.kai-waehner.de | @KaiWaehner Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect “Server Side” of a Kafka Cluster 1 - * Kafka Broker 1 - * ZooKeeper 0 - * Schema Registry 0 - * Kafka Connect 0 - * REST Proxy Security, Ops, Monitoring, … “Client Side” of a Kafka Cluster Kafka Clients (Java, C, C++, Python, Go, JavaScript, …) Kafka Stream Processing Apps (Kafka Streams, ksqlDB) External Producers / Consumers (Oracle, Hadoop, Flink, …)
  • 15. 15 Why Multiple Kafka Clusters? www.kai-waehner.de | @KaiWaehner * Not a representative survey J ** Many DCs does NOT necessarily mean more than one Kafka Cluster
  • 16. 16 Disaster Recovery – RPO and RTO www.kai-waehner.de | @KaiWaehner RPO = Recovery Point Objective RTO = Recovery Time Objective
  • 17. 17 Agenda 1) Definition ‘Kafka Cluster’ 2) One Kafka Cluster 3) Multiple Kafka Clusters 4) Multi-Region / Global Kafka Cluster 5) Infrastructure Options www.kai-waehner.de | @KaiWaehner
  • 18. 18 A Kafka Cluster www.kai-waehner.de | @KaiWaehner Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect
  • 19. 19 A Kafka Cluster for High Availability www.kai-waehner.de | @KaiWaehner Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect PROD / Pre-PROD / TEST 3 - * Kafka Broker 3 / 5 / 7 ZooKeeper 2 Schema Registry 2 - * Kafka Connect 2 - * REST Proxy Security, Ops, Monitoring, … DEV / Functional TEST 1 Kafka Broker 1 ZooKeeper 0 - 1 Schema Registry 0 - 1 Kafka Connect 0 - 1 REST Proxy Security, Ops, Monitoring, …
  • 20. 20 A Stretched Kafka Cluster over 3DC www.kai-waehner.de | @KaiWaehner Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Kafka Connect Schema Registry Producer Consumer Producer Consumer Kafka Connect Schema Registry DC1 DC2 DC3
  • 21. 21 A Stretched Kafka Cluster over 3DC www.kai-waehner.de | @KaiWaehner Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Kafka Connect Schema Registry Producer Consumer Producer Consumer Kafka Connect Schema Registry DC1 DC2 DC3 High availability (Survives DC outage) Zero data loss and zero downtime Automatic client fail-over Works well in cloud (3 AZs in 1 region) Requires “good” latency (à DCs ”close” to each other) Requires three DCs (Quorum / split brain) Complex to configure and operate
  • 22. 22 A Stretched Kafka Cluster over 2DC www.kai-waehner.de | @KaiWaehner Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Kafka Connect Schema Registry Producer Consumer Producer Consumer Kafka Connect Schema Registry DC1 DC2 Kafka Broker
  • 23. 23 A Stretched Kafka Cluster over 2DC www.kai-waehner.de | @KaiWaehner High availability (Survives DC outage) Zero data loss or zero downtime Automatic client fail-over Stopgap solution for on premise (if only 2 DCs available) à 2.5 DC deployment as workaround Requires “good” latency (à DCs ”close” to each other) Quorum in 2 DCs not possible Complex to configure and operate Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Kafka Connect Schema Registry Producer Consumer Producer Consumer Kafka Connect Schema Registry DC1 DC2 Kafka Broker
  • 24. 24 A Single Kafka Cluster www.kai-waehner.de | @KaiWaehner Zookeeper Kafka Broker Schema Registry OPC-UA MQTT PLC4X KSQL Grafana Postgres Kafka Connect Simple setup Works Often used “at the Edge” No high availability
  • 25. 25 Agenda 1) Definition ‘Kafka Cluster’ 2) One Kafka Cluster 3) Multiple Kafka Clusters 4) Multi-Region / Global Kafka Cluster 5) Infrastructure Options www.kai-waehner.de | @KaiWaehner
  • 26. 26 Independent Kafka Clusters www.kai-waehner.de | @KaiWaehner Total Independence Owned by the project teams, central ops or SaaS Different sizing, security, infrastructure Related projects should run on the same Kafka cluster Independent projects can run on the same Kafka cluster • similar SLAs and requirements • e.g. NOT Instant payment vs. log analytics vs. file transfer vs. video streaming • ACLs / RBAC for fine-grained authentication and authorization • throughput typically no issue (Confluent Cloud processes 1 Gigabyte / sec and more in one cluster) • reduce overhead (operations, hardware, …) Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect
  • 27. 27 Hybrid Integration of 2 Kafka Clusters www.kai-waehner.de | @KaiWaehner Hybrid integration On premise and cloud or multi-cloud scenarios (due to technical, business or legal reasons) Uni- or bi-directional Know the best practices or get help Know your SLAs and timelines Choose the right (battle-tested?) tool Works Relatively easy to setup (some tools are complex / not up-to-date / not mature / not documented well) Example: Replicate data from production to analytics cluster Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect MirrorMaker 1 / 2 Confluent Replicator uReplicator (Uber) Mirus (Salesforce) Brooklin (LinkedIn) Custom Replication DC1 DC2 Streaming Replication
  • 28. 28 Migration of Kafka Clusters www.kai-waehner.de | @KaiWaehner Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect MirrorMaker 1 / 2 Confluent Replicator uReplicator (Uber) Mirus (Salesforce) Brooklin (LinkedIn) Custom Replication DC1 DC2 Streaming Replication Common migration scenarios On premise à Cloud Cloud A à Cloud B Vendor 1 à Vendor 2 Self-Managed à SaaS Migration steps: 1) Create new Kafka cluster 2) Producer / Consumer re-configuration 3) Shutdown of old Cluster Know the best practices or get help Know your SLAs and timelines Choose the right (battle-tested?) tool
  • 29. 29 Disaster Recovery with 2 Kafka Clusters www.kai-waehner.de | @KaiWaehner Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect MirrorMaker 1 / 2 Confluent Replicator uReplicator (Uber) Mirus (Salesforce) Brooklin (LinkedIn) Custom Replication DC1 DC2 Streaming Replication Know the trade-offs! If Kafka Cluster 1 is down, Kafka Cluster 2 is still live and running Timestamp preservation Offset translation Manual client-failover / custom client code Data loss in case of DC outage (asynchronous replication)
  • 30. 30 Disaster Recovery @ JPMorgan www.kai-waehner.de | @KaiWaehner https://www.confluent.io/kafka-summit-san-francisco-2019/secure-kafka-at-scale-in-true-multi-tenant-environment
  • 31. 31 Aggregation of Kafka Clusters www.kai-waehner.de | @KaiWaehner Local smaller Kafka Clusters in each site for critical real time applications (high SLAs) Central bigger Kafka Cluster for analytics use cases (often less critical SLAs) Works Relatively easy to setup (some tools are complex / not up-to-date / not mature / not documented well) Some tools do not support same topic name in each DC Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Kafka Connect Kafka Connect Kafka Broker Kafka Broker Kafka Broker
  • 32. 32 Aggregation Cluster @ Royal Caribbean www.kai-waehner.de | @KaiWaehner https://www.confluent.io/kafka-summit-lon19/seamless-guest-experience-with-kafka-streams/
  • 33. 33 Aggregation of Edge Kafka Clusters www.kai-waehner.de | @KaiWaehner Small Kafka clusters in each site for data collection (often low SLAs, sometimes single Kafka broker) Kafka at the edge sometimes OEM / hardware appliance Central big Kafka cluster for critical use cases and edge integration (high SLAs) Zookeeper Kafka Broker Schema Registry OPC-UA MQTT PLC4X KSQL Grafana Postgres Kafka Connect Zookeeper Kafka Broker Schema Registry OPC-UA MQTT PLC4X KSQL Grafana Postgres Kafka Connect Zookeeper Kafka Broker Schema Registry OPC-UA MQTT PLC4X KSQL Grafana Postgres Kafka Connect Zookeeper Kafka Broker Schema Registry OPC-UA MQTT PLC4X KSQL Grafana Postgres Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Kafka Connect Kafka Connect Kafka Broker Kafka Broker Kafka Broker
  • 34. 34 Real Time Streaming ML at the Edge @ Severstal www.kai-waehner.de | @KaiWaehner https://www.confluent.io/customers/severstal/
  • 35. 35 Cross-Company Kafka Integration (Special Case of Hybrid Integration) www.kai-waehner.de | @KaiWaehner Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect MirrorMaker 1 / 2 Confluent Replicator uReplicator (Uber) Mirus (Salesforce) Brooklin (LinkedIn) Custom Replication Company A Company B Streaming Replication Streaming integration between companies API Management (REST et al) not appropriate for streaming data Infosec and politics are your biggest enemy
  • 36. 36 Agenda 1) Definition ‘Kafka Cluster’ 2) One Kafka Cluster 3) Multiple Kafka Clusters 4) Multi-Region / Global Kafka Cluster 5) Infrastructure Options www.kai-waehner.de | @KaiWaehner
  • 37. 37 A Stretched Kafka Cluster over 3 Regions? www.kai-waehner.de | @KaiWaehner Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Kafka Connect Schema Registry Producer Consumer Producer Consumer Kafka Connect Schema Registry US-East US-Central US-West No! Fail! Error! Sorry!
  • 38. 38Replication Between Kafka Clusters over Multiple Regions or Continents www.kai-waehner.de | @KaiWaehner Streaming replication works (MirrorMaker 2, Confluent Replicator) Same challenges as in one region (data loss, custom code for fail-over, offset translation, etc.) Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Schema Registry Schema Registry Producer Consumer Kafka Connect Kafka Connect China USA Europe
  • 39. 39 A Single Kafka Cluster over 3 Regions with Multi-Region Replication www.kai-waehner.de | @KaiWaehner Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Kafka Connect Schema Registry Producer Consumer Producer Consumer Kafka Connect Schema Registry US-East US-Central US-West
  • 40. 40A Single Multi Region Kafka Cluster (MRC) www.kai-waehner.de | @KaiWaehner High availability (Survives region outage) Zero data loss and zero downtime Automatic client fail-over over regions Works well in cloud and on premise No external tools (like MirrorMaker) needed Not part of Open Source Kafka à Build vs. Buy Zookeeper Zookeeper Zookeeper Kafka Broker Kafka Broker Kafka Broker Kafka Connect Schema Registry Producer Consumer Producer Consumer Kafka Connect Schema Registry US-East US-Central US-West How does this work? Region-awareness Synchronous or asynchronous replication per Topic Follower-fetching Regional topic locality Replication rules … (Confluent Platform)
  • 41. 41A Single Multi Region Kafka Cluster (MRC) www.kai-waehner.de | @KaiWaehner Broker 1 Broker 2 Broker 3 ZK1 Broker 4 Broker 5 Broker 6 Broker 1 Broker 2 ZK2 Client D Client F Client G Failover site ZK3 Broker 3 Broker 4 Broker 5 Broker 6 Client A Client B us-central-1 Client A Client B automated client failover Observer replicas us-west-1 us-east-1 Site failure! “tie-breaker” datacenter Single Kafka Cluster (Confluent Platform) $ bin/kafka-topics.sh --bootstrap-servers localhost:2181 --create --topic trades-west --partitions 3 --config replication-factor={us-west: 2} --config min.insync.replicas=2 --config async.replication-factor={us-east: 2} --config max.async.time.behind.min=5 --config replay.truncated.messages=true
  • 42. 42 Vision: One Global Kafka Cluster www.kai-waehner.de | @KaiWaehner Topic‘pos_payments’
  • 43. 43 ZooKeeper Removal (KIP-500) www.kai-waehner.de | @KaiWaehner https://cwiki.apache.org/confluence/display/KAFKA/KIP-500%3A+Replace+ZooKeeper+with+a+Self-Managed+Metadata+Quorum
  • 44. 44 Architecture patterns for distributed, hybrid, edge and global Apache Kafka deployments… www.kai-waehner.de | @KaiWaehner à There is no best solution. It depends!
  • 45. 45 Agenda 1) Definition ‘Kafka Cluster’ 2) One Kafka Cluster 3) Multiple Kafka Clusters 4) Multi-Region / Global Kafka Cluster 5) Infrastructure Options www.kai-waehner.de | @KaiWaehner
  • 46. 46 Infrastructure Options www.kai-waehner.de | @KaiWaehner Infrastructure is your choice! à Bare metal vs. VM vs. container vs. cloud… Software is your choice! à Open source vs. commercial vs. SaaS Ops and management are your choice! à Self-Managed vs. PaaS vs. fully-managed Integration is your choice! à Kafka-native vs. other tools / services Find the right solution for your business case and for your SLAs…
  • 47. 4747 CONFLUENT PLATFORM EFFICIENT OPERATIONS AT SCALE PRODUCTION-STAGE PREREQUISITES UNRESTRICTED DEVELOPER PRODUCTIVITY Multi-language Development Rich Pre-built Ecosystem SQL-based Stream Processing GUI-driven Mgmt & Monitoring Flexible DevOps Automation Dynamic Performance & Elasticity Enterprise-grade Security Data Compatibility Global Availability APACHE KAFKA Fully Managed Cloud ServiceSelf Managed Software FREEDOM OF CHOICE COMMITTER-LED EXPERTISE PartnersTraining Professional Services Enterprise Support DEVELOPER OPERATOR ARCHITECT Hybrid Infrastructure
  • 48. 48 Kafka as a Service – Fully Managed? Infrastructure management (commodity) Scaling ● Upgrades (latest stable version of Kafka) ● Patching ● Maintenance ● Sizing (retention, latency, throughput, storage, etc.) ● Data balancing for optimal performance ● Performance tuning for real-time and latency requirements ● Fixing Kafka bugs ● Uptime monitoring and proactive remediation of issues ● Recovery support from data corruption ● Scaling the cluster as needed ● Data balancing the cluster as nodes are added ● Support for any Kafka issue with less than X minutes response time Infra-as-a-Service Harness full power of Kafka Kafka-specific management Platform-as-a-Service Evolve as you need Future-proof Mission-critical reliability Most Kafka-as-a-Service offerings are partially-managed Kafka as a Service should be a serverless experience with consumption-based pricing!
  • 49. 4949 I N V E S T M E N T & T I M E VALUE 3 4 5 1 2 Event Streaming Maturity Model 49 Initial Awareness / Pilot (1 Kafka Cluster) Start to Build Pipeline / Deliver 1 New Outcome (1 Kafka Cluster) Mission-Critical Deployment (Stretched, Hybrid, Multi-Region) Build Contextual Event-Driven Apps (Stretched, Hybrid, Multi-Region) Central Nervous System (Global Kafka) Product, Support, Training, Partners, Technical Account Management...
  • 50. 50End-to-End Integration with 24/7 Uptime and Zero Data Loss? www.kai-waehner.de | @KaiWaehner …. more components, clusters, technologies means more risks, conflicts, incompatibilities, operations burden! ETL MQ Storage Streaming Messaging: Kafka Core Storage: Kafka Core Caching: Kafka Core Real-Time, Batch: Kafka Clients Integration: Kafka Connect Stream Processing: Kafka Streams / KSQL Request-Response: REST Proxy Replication between Kafka Clusters (Edge, On Premises, Hybrid, Cloud) Multi-Region Kafka Cluster ”Eat your own dog food” vs. http://www.kai-waehner.de/blog/2019/03/07/apache-kafka-middleware-mq-etl-esb-comparison/
  • 51. 51 Questions? Let’s connect... Kai Waehner Technology Evangelist kai.waehner@confluent.io @KaiWaehner www.confluent.io www.kai-waehner.de LinkedIn