SlideShare uma empresa Scribd logo
1 de 31
Baixar para ler offline
Strategies For Migrating
From SQL to NoSQL — The
Apache Kafka Way
Geetha Anne, Sr Solutions Engineer
Geetha Anne
■ Silicon Valley
■ 2 daughters
■ Cloudera, Servicenow, Hawaiian Airlines prior to
joining Confluent
■ 10 years in the space
■ Software Development, Automation
Engineering/Presales are key areas of expertise
■ Cooking, Singing, Hiking
■ The Problem - Migrating to a modern NoSQL Database is a complex
process
■ Why Confluent - Database and data modernization with Confluent
■ The Solution - Proposed architecture and action plan
■ Takeaways - Food for thought and Next Steps
Agenda
The Problem
Modern, cloud-native databases power business
critical applications with lower operational
overhead
Self-Managed Databases
● Rigid architecture that makes it
hard to integrate with other
systems
● Expensive in both upfront and
ongoing maintenance costs
● Slower to scale to meet evolving
demands
Cloud Databases
● Lower TCO by decoupling storage
from compute and leveraging
consumption- based pricing
● Increased overall flexibility and
business agility
● Worry free operations with built
into auto-scaling and
maintenance cycles
Integrating multiple legacy system to the cloud
could be a complex, multi-year process
Time and resource intensive
Replacing or refactoring legacy data systems
across environments is not easy. During
which, data visibility can be limited.
Insight blind spots
Getting actionable data from disparate data
sources is cumbersome. Most data insight
comes from nightly loads, merges, and batch
updates to create a complete view.
Data silos across environments
Difficulties with integrating multiple data silos
and data formats.
On-Prem
Legacy
Database
Cloud
Cloud
Database
CRM
SaaS
App
Nightly
Reporting
Applications ETL App
Batch Jobs
ETL & Database Syncs
Easily modernize your database by integrating
legacy with the cloud using Confluent
1. Simplify and accelerate migration
Link on-prem and cloud for easy data movement across
environments and process data in flight with ksqlDB stream
processing
2. Stay synchronized in real-time
Move from batch to real-time streaming and access change
data capture technology using Confluent and our CDC
connectors
3. Reduce total cost of ownership
Leverage fully managed services and avoid prohibitive
licensing costs from existing solutions offered by legacy
vendors
Why Confluent
Real-time &
Historical
Data
A sale
A shipment
A trade
A customer
interaction
A new paradigm is required for Data in Motion
Continuously process streams of data in real time
“We need to shift our thinking from everything
at rest, to everything in motion.”
Real-Time Stream Processing
Rich, front-end
customer experiences
Real-time, software-driven
business operations
Operationalizing Kafka on your own is difficult
Kafka is hard in experimentation. It gets harder (and riskier) as you add
mission-critical data and use cases.
● Architecture planning
● Cluster sizing
● Cluster provisioning
● Broker settings
● Zookeeper management
● Partition placement & data
durability
● Source/sink connectors
development & maintenance
● Monitoring & reporting tools
setup
● Software patches and upgrades
● Security controls and
integrations
● Failover design & planning
● Mirroring & geo-replication
● Streaming data governance
● Load rebalancing & monitoring
● Expansion planning & execution
● Utilization optimization &
visibility
● Cluster migrations
● Infrastructure & performance
upgrades / enhancements
V
A
L
U
E
1
2
3
4
5
Experimentation
/ Early Interest
Central Nervous
System
Mission critical,
disparate LOBs
Identify a
Project
Mission-critical,
connected LOBs
Key challenges:
Operational burden & resources
Manage and scale platform to support
ever-growing demand
Security & governance
Ensure streaming data is as safe & secure as
data-at-rest as Kafka usage scales
Real-time connectivity &
processing
Leverage valuable legacy data to power
modern, cloud-based apps & experiences
Global availability
Maintain high availability across environments
with minimal downtime
Cloud-native
Infinite
Store unlimited
data on Confluent
to enhance your
real-time apps
and use cases
with a broader set
of data
Global
Create a consistent
data fabric
throughout your
organization by
linking clusters
across your
different
environments
Elastic
Scale up instantly
to meet any
demand and scale
back down to
avoid
over-provisioning
infrastructure
Everywhere
Confluent provides deployment flexibility to span all of your
environments
SELF-MANAGED SOFTWARE
Confluent Platform
The Enterprise Distribution of Apache Kafka
Deploy on-premises or in your private cloud
VM
FULLY MANAGED SERVICE
Confluent Cloud
Cloud-native service for Apache Kafka
Available on the leading public clouds
The Solution
Three Phase Plan
Modernize your Databases with Confluent
1. Migrate
● Choose the workloads that
you’d like to migrate to the
cloud
● Seamlessly integrate your data
source via managed
Confluent source connectors
2. Optimize
● Perform real-time data
transformations using ksqlDB
● Find the most useful queries
for your cloud data
● Work with our ecosystem of
partners to find the best use of
your data
3. Modernize
● Use our managed sink
connectors to send data
into your cloud database
of choice
● Continue migrating workloads
into the cloud
as chances arise
Migrate:
Source Connectors
18
Instantly Connect Popular Data Sources & Sinks
130+
pre-built
connectors
100+ Confluent Supported 30+ Partner Supported, Confluent Verified
AWS
Lambda
Modernize and bridge your entire data architecture
with Confluent robust connector portfolio
Modern, cloud-based data
systems
Legacy data
systems
Oracle
Database
ksqlDB
Mainframes
Applications
Cloud-native / SaaS apps
Azure Synapse
Analytics
Expensive,
custom-built
integrations
Expensive,
custom-built
integrations
Expensive,
custom-built
integrations
Source
Connectors
Expensive,
custom-built
integrations
Expensive,
custom-built
integrations
Sink
Connectors
Modernize:
Sink Connectors
3 Modalities of Stream Processing with Confluent
Kafka clients
21
Kafka Streams ksqlDB
ConsumerRecords<String, String> records = consumer.poll(100);
Map<String, Integer> counts = new DefaultMap<String,
Integer>();
for (ConsumerRecord<String, Integer> record : records) {
String key = record.key();
int c = counts.get(key)
c += record.value()
counts.put(key, c)
}
for (Map.Entry<String, Integer> entry : counts.entrySet()) {
int stateCount;
int attempts;
while (attempts++ < MAX_RETRIES) {
try {
stateCount = stateStore.getValue(entry.getKey())
stateStore.setValue(entry.getKey(), entry.getValue() +
stateCount)
break;
} catch (StateStoreException e) {
RetryUtils.backoff(attempts);
}
}
}
builder
.stream("input-stream",
Consumed.with(Serdes.String(), Serdes.String()))
.groupBy((key, value) -> value)
.count()
.toStream()
.to("counts", Produced.with(Serdes.String(),
Serdes.Long()));
SELECT x, count(*) FROM stream GROUP BY x EMIT CHANGES;
Flexibility Simplicity
ksqlDB at a Glance
What is it?
ksqlDB is an event streaming database for working with streams and tables of data.
All the key features of a modern
streaming solution.
Aggregations Joins
Windowing
Event-Time
Dual Query Support
Exactly-Once
Semantics
Out-of-Order
Handling
User-Defined
Functions
Compute Storage
CREATE TABLE activePromotions AS
SELECT rideId,
qualifyPromotion(distanceToDst) AS promotion
FROM locations
GROUP BY rideId
EMIT CHANGES
How does it work?
It separates compute from storage, and scales
elastically in a fault-tolerant manner.
It remains highly available during disruption,
even in the face of failure to a quorum of its
servers.
ksqlDB Kafka
22
Built on the Best Technology,
Available as a Fully-Managed Service
Kafka is the backbone of ksqlDB
ksqlDB is built on top of Kafka’s battle-tested streaming
foundation. Its design re-uses Kafka to achieve elasticity,
fault-tolerance, and scalability for stream processing &
analytics..
Use a fully-managed service
With Confluent Cloud ksqlDB, you need
not worry about any of the details of
running it. You can forget about:
● Clusters
● Brokers
● Scaling
● Upgrading
● Monitoring
Pay only for what you use.
ksqlDB server Kafka
topic
topic
changelog topic
Push & Pull
Queries
Kafka Streams
Engine
Local State
(transient)
topic
Compute Storage
23
Accelerate your migration from legacy on-prem systems
to modern, cloud-based technologies
24
Modern, cloud-based data systems
Legacy data systems
Oracle
Database
ksqlDB
Mainframes
Applications
Cloud-native / SaaS apps
Azure Synapse
Analytics
Expensive,
custom-built
integrations
Expensive,
custom-built
integrations
Expensive,
custom-built
integrations
Source
Connectors
Expensive,
custom-built
integrations
Expensive,
custom-built
integrations
Sink
Connectors
Confluent the central nervous system of data
25
Confluent Cloud
Fully Managed Connectors
● Limited set of the larger Connector Catalogue
● Elastic scaling with no infrastructure to manage
● Connector networking configuration dependent
on your clusters networking
● Limited configuration options
● Stable Source IPs are Available for certain
connectors
Proposed Architecture
NOSQL DB
Three Phase Plan
Modernize your Database with Confluent
28
1. Migrate
● Choose the workloads that
you’d like to migrate to the
cloud
● Seamlessly integrate your data
source via managed
Confluent source connectors
2. Optimize
● Perform real-time data
transformations using ksqlDB
● Find the most useful queries
for your cloud data
● Work with our ecosystem of
partners to find the best use of
your data
3. Modernize
● Use our managed sink
connectors to send data
into your cloud database
of choice
● Continue migrating workloads
into the cloud
as chances arise
Cloud-native, Complete, Everywhere
with Kafka at its core
Infinite Storage
Security &
Data Governance
ksqlDB & Stream
Processing, Analytics
Connectors
APIs, UIs, CLIs
Fully Managed ‘NoOps’
on AWS, Azure, GCP
29
Resources
https://github.com/confluentinc/demo-database-modernization
https://www.confluent.io/blog/real-time-cdc-pipelines-with-oracle-on-gke-using-co
nfluent-connector/?utm_source=linkedin&utm_medium=organicsocial&utm_campa
ign=tm.devx_ch.bp_building-a-real-time-data-pipeline-with-oracle-cdc-and-marklogi
c-using-cfk-and-cloud_content.pipelines
Thank You
Stay in Touch
Geetha Anne
geethaanne.sjsu@gmail.com
Geethaay
github.com/GeethaAnne
www.linkedin.com/in/geetha-anne-8646011a/

Mais conteúdo relacionado

Semelhante a Strategies For Migrating From SQL to NoSQL — The Apache Kafka Way

Luciano Moreira_Jacob Bogie-BRSP005-10.3_22_FINAL.pdf
Luciano Moreira_Jacob Bogie-BRSP005-10.3_22_FINAL.pdfLuciano Moreira_Jacob Bogie-BRSP005-10.3_22_FINAL.pdf
Luciano Moreira_Jacob Bogie-BRSP005-10.3_22_FINAL.pdfHostedbyConfluent
 
Unleash the Power of Open Networking
Unleash the Power of Open NetworkingUnleash the Power of Open Networking
Unleash the Power of Open NetworkingCumulus Networks
 
Reinventing Kafka in the Data Streaming Era - Jun Rao
Reinventing Kafka in the Data Streaming Era - Jun RaoReinventing Kafka in the Data Streaming Era - Jun Rao
Reinventing Kafka in the Data Streaming Era - Jun Raoconfluent
 
Why Cloud-Native Kafka Matters: 4 Reasons to Stop Managing it Yourself
Why Cloud-Native Kafka Matters: 4 Reasons to Stop Managing it YourselfWhy Cloud-Native Kafka Matters: 4 Reasons to Stop Managing it Yourself
Why Cloud-Native Kafka Matters: 4 Reasons to Stop Managing it YourselfDATAVERSITY
 
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
Bridge to Cloud: Using Apache Kafka to Migrate to AWSBridge to Cloud: Using Apache Kafka to Migrate to AWS
Bridge to Cloud: Using Apache Kafka to Migrate to AWSconfluent
 
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
Best Practices for Building Hybrid-Cloud Architectures | Hans JespersenBest Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersenconfluent
 
Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...
Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...
Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...HostedbyConfluent
 
Seamless, Real-Time Data Integration with Connect
Seamless, Real-Time Data Integration with ConnectSeamless, Real-Time Data Integration with Connect
Seamless, Real-Time Data Integration with ConnectPrecisely
 
APAC Kafka Summit - Best Of
APAC Kafka Summit - Best Of APAC Kafka Summit - Best Of
APAC Kafka Summit - Best Of confluent
 
Modernizing your Application Architecture with Microservices
Modernizing your Application Architecture with MicroservicesModernizing your Application Architecture with Microservices
Modernizing your Application Architecture with Microservicesconfluent
 
An Introduction to Confluent Cloud: Apache Kafka as a Service
An Introduction to Confluent Cloud: Apache Kafka as a ServiceAn Introduction to Confluent Cloud: Apache Kafka as a Service
An Introduction to Confluent Cloud: Apache Kafka as a Serviceconfluent
 
SQL Server 2019 hotlap - WARDY IT Solutions
SQL Server 2019 hotlap - WARDY IT SolutionsSQL Server 2019 hotlap - WARDY IT Solutions
SQL Server 2019 hotlap - WARDY IT SolutionsMichaela Murray
 
ACDKOCHI19 - Journey from a traditional on-prem Datacenter to AWS: Challenges...
ACDKOCHI19 - Journey from a traditional on-prem Datacenter to AWS: Challenges...ACDKOCHI19 - Journey from a traditional on-prem Datacenter to AWS: Challenges...
ACDKOCHI19 - Journey from a traditional on-prem Datacenter to AWS: Challenges...AWS User Group Kochi
 
Migrating to Amazon RDS with Database Migration Service
Migrating to Amazon RDS with Database Migration ServiceMigrating to Amazon RDS with Database Migration Service
Migrating to Amazon RDS with Database Migration ServiceAmazon Web Services
 
Scylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi KivityScylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi KivityScyllaDB
 
Confluent & Attunity: Mainframe Data Modern Analytics
Confluent & Attunity: Mainframe Data Modern AnalyticsConfluent & Attunity: Mainframe Data Modern Analytics
Confluent & Attunity: Mainframe Data Modern Analyticsconfluent
 
High Performance Computing Pitch Deck
High Performance Computing Pitch DeckHigh Performance Computing Pitch Deck
High Performance Computing Pitch DeckNicholas Vossburg
 
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...Kai Wähner
 
Enterprise Backup & Recovery to the Cloud by CommVault
Enterprise Backup & Recovery to the Cloud by CommVaultEnterprise Backup & Recovery to the Cloud by CommVault
Enterprise Backup & Recovery to the Cloud by CommVaultAmazon Web Services
 

Semelhante a Strategies For Migrating From SQL to NoSQL — The Apache Kafka Way (20)

Luciano Moreira_Jacob Bogie-BRSP005-10.3_22_FINAL.pdf
Luciano Moreira_Jacob Bogie-BRSP005-10.3_22_FINAL.pdfLuciano Moreira_Jacob Bogie-BRSP005-10.3_22_FINAL.pdf
Luciano Moreira_Jacob Bogie-BRSP005-10.3_22_FINAL.pdf
 
Unleash the Power of Open Networking
Unleash the Power of Open NetworkingUnleash the Power of Open Networking
Unleash the Power of Open Networking
 
Reinventing Kafka in the Data Streaming Era - Jun Rao
Reinventing Kafka in the Data Streaming Era - Jun RaoReinventing Kafka in the Data Streaming Era - Jun Rao
Reinventing Kafka in the Data Streaming Era - Jun Rao
 
Why Cloud-Native Kafka Matters: 4 Reasons to Stop Managing it Yourself
Why Cloud-Native Kafka Matters: 4 Reasons to Stop Managing it YourselfWhy Cloud-Native Kafka Matters: 4 Reasons to Stop Managing it Yourself
Why Cloud-Native Kafka Matters: 4 Reasons to Stop Managing it Yourself
 
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
Bridge to Cloud: Using Apache Kafka to Migrate to AWSBridge to Cloud: Using Apache Kafka to Migrate to AWS
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
 
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
Best Practices for Building Hybrid-Cloud Architectures | Hans JespersenBest Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
 
Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...
Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...
Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...
 
Seamless, Real-Time Data Integration with Connect
Seamless, Real-Time Data Integration with ConnectSeamless, Real-Time Data Integration with Connect
Seamless, Real-Time Data Integration with Connect
 
APAC Kafka Summit - Best Of
APAC Kafka Summit - Best Of APAC Kafka Summit - Best Of
APAC Kafka Summit - Best Of
 
Modernizing your Application Architecture with Microservices
Modernizing your Application Architecture with MicroservicesModernizing your Application Architecture with Microservices
Modernizing your Application Architecture with Microservices
 
An Introduction to Confluent Cloud: Apache Kafka as a Service
An Introduction to Confluent Cloud: Apache Kafka as a ServiceAn Introduction to Confluent Cloud: Apache Kafka as a Service
An Introduction to Confluent Cloud: Apache Kafka as a Service
 
Enterprise Cloud Transformation
Enterprise Cloud TransformationEnterprise Cloud Transformation
Enterprise Cloud Transformation
 
SQL Server 2019 hotlap - WARDY IT Solutions
SQL Server 2019 hotlap - WARDY IT SolutionsSQL Server 2019 hotlap - WARDY IT Solutions
SQL Server 2019 hotlap - WARDY IT Solutions
 
ACDKOCHI19 - Journey from a traditional on-prem Datacenter to AWS: Challenges...
ACDKOCHI19 - Journey from a traditional on-prem Datacenter to AWS: Challenges...ACDKOCHI19 - Journey from a traditional on-prem Datacenter to AWS: Challenges...
ACDKOCHI19 - Journey from a traditional on-prem Datacenter to AWS: Challenges...
 
Migrating to Amazon RDS with Database Migration Service
Migrating to Amazon RDS with Database Migration ServiceMigrating to Amazon RDS with Database Migration Service
Migrating to Amazon RDS with Database Migration Service
 
Scylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi KivityScylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2019 Keynote - Avi Kivity
 
Confluent & Attunity: Mainframe Data Modern Analytics
Confluent & Attunity: Mainframe Data Modern AnalyticsConfluent & Attunity: Mainframe Data Modern Analytics
Confluent & Attunity: Mainframe Data Modern Analytics
 
High Performance Computing Pitch Deck
High Performance Computing Pitch DeckHigh Performance Computing Pitch Deck
High Performance Computing Pitch Deck
 
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...
 
Enterprise Backup & Recovery to the Cloud by CommVault
Enterprise Backup & Recovery to the Cloud by CommVaultEnterprise Backup & Recovery to the Cloud by CommVault
Enterprise Backup & Recovery to the Cloud by CommVault
 

Mais de ScyllaDB

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
What Developers Need to Unlearn for High Performance NoSQL
What Developers Need to Unlearn for High Performance NoSQLWhat Developers Need to Unlearn for High Performance NoSQL
What Developers Need to Unlearn for High Performance NoSQLScyllaDB
 
Low Latency at Extreme Scale: Proven Practices & Pitfalls
Low Latency at Extreme Scale: Proven Practices & PitfallsLow Latency at Extreme Scale: Proven Practices & Pitfalls
Low Latency at Extreme Scale: Proven Practices & PitfallsScyllaDB
 
Dissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance DilemmasDissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance DilemmasScyllaDB
 
Beyond Linear Scaling: A New Path for Performance with ScyllaDB
Beyond Linear Scaling: A New Path for Performance with ScyllaDBBeyond Linear Scaling: A New Path for Performance with ScyllaDB
Beyond Linear Scaling: A New Path for Performance with ScyllaDBScyllaDB
 
Dissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance DilemmasDissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance DilemmasScyllaDB
 
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...Database Performance at Scale Masterclass: Workload Characteristics by Felipe...
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...ScyllaDB
 
Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...
Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...
Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...ScyllaDB
 
Database Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
Database Performance at Scale Masterclass: Driver Strategies by Piotr SarnaDatabase Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
Database Performance at Scale Masterclass: Driver Strategies by Piotr SarnaScyllaDB
 
Replacing Your Cache with ScyllaDB
Replacing Your Cache with ScyllaDBReplacing Your Cache with ScyllaDB
Replacing Your Cache with ScyllaDBScyllaDB
 
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear ScalabilityPowering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear ScalabilityScyllaDB
 
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptx7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptxScyllaDB
 
Getting the most out of ScyllaDB
Getting the most out of ScyllaDBGetting the most out of ScyllaDB
Getting the most out of ScyllaDBScyllaDB
 
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a MigrationNoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a MigrationScyllaDB
 
NoSQL Database Migration Masterclass - Session 3: Migration Logistics
NoSQL Database Migration Masterclass - Session 3: Migration LogisticsNoSQL Database Migration Masterclass - Session 3: Migration Logistics
NoSQL Database Migration Masterclass - Session 3: Migration LogisticsScyllaDB
 
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and ChallengesNoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and ChallengesScyllaDB
 
ScyllaDB Virtual Workshop
ScyllaDB Virtual WorkshopScyllaDB Virtual Workshop
ScyllaDB Virtual WorkshopScyllaDB
 
DBaaS in the Real World: Risks, Rewards & Tradeoffs
DBaaS in the Real World: Risks, Rewards & TradeoffsDBaaS in the Real World: Risks, Rewards & Tradeoffs
DBaaS in the Real World: Risks, Rewards & TradeoffsScyllaDB
 
Build Low-Latency Applications in Rust on ScyllaDB
Build Low-Latency Applications in Rust on ScyllaDBBuild Low-Latency Applications in Rust on ScyllaDB
Build Low-Latency Applications in Rust on ScyllaDBScyllaDB
 
NoSQL Data Modeling 101
NoSQL Data Modeling 101NoSQL Data Modeling 101
NoSQL Data Modeling 101ScyllaDB
 

Mais de ScyllaDB (20)

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
What Developers Need to Unlearn for High Performance NoSQL
What Developers Need to Unlearn for High Performance NoSQLWhat Developers Need to Unlearn for High Performance NoSQL
What Developers Need to Unlearn for High Performance NoSQL
 
Low Latency at Extreme Scale: Proven Practices & Pitfalls
Low Latency at Extreme Scale: Proven Practices & PitfallsLow Latency at Extreme Scale: Proven Practices & Pitfalls
Low Latency at Extreme Scale: Proven Practices & Pitfalls
 
Dissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance DilemmasDissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance Dilemmas
 
Beyond Linear Scaling: A New Path for Performance with ScyllaDB
Beyond Linear Scaling: A New Path for Performance with ScyllaDBBeyond Linear Scaling: A New Path for Performance with ScyllaDB
Beyond Linear Scaling: A New Path for Performance with ScyllaDB
 
Dissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance DilemmasDissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance Dilemmas
 
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...Database Performance at Scale Masterclass: Workload Characteristics by Felipe...
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...
 
Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...
Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...
Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...
 
Database Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
Database Performance at Scale Masterclass: Driver Strategies by Piotr SarnaDatabase Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
Database Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
 
Replacing Your Cache with ScyllaDB
Replacing Your Cache with ScyllaDBReplacing Your Cache with ScyllaDB
Replacing Your Cache with ScyllaDB
 
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear ScalabilityPowering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
 
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptx7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
 
Getting the most out of ScyllaDB
Getting the most out of ScyllaDBGetting the most out of ScyllaDB
Getting the most out of ScyllaDB
 
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a MigrationNoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
 
NoSQL Database Migration Masterclass - Session 3: Migration Logistics
NoSQL Database Migration Masterclass - Session 3: Migration LogisticsNoSQL Database Migration Masterclass - Session 3: Migration Logistics
NoSQL Database Migration Masterclass - Session 3: Migration Logistics
 
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and ChallengesNoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
 
ScyllaDB Virtual Workshop
ScyllaDB Virtual WorkshopScyllaDB Virtual Workshop
ScyllaDB Virtual Workshop
 
DBaaS in the Real World: Risks, Rewards & Tradeoffs
DBaaS in the Real World: Risks, Rewards & TradeoffsDBaaS in the Real World: Risks, Rewards & Tradeoffs
DBaaS in the Real World: Risks, Rewards & Tradeoffs
 
Build Low-Latency Applications in Rust on ScyllaDB
Build Low-Latency Applications in Rust on ScyllaDBBuild Low-Latency Applications in Rust on ScyllaDB
Build Low-Latency Applications in Rust on ScyllaDB
 
NoSQL Data Modeling 101
NoSQL Data Modeling 101NoSQL Data Modeling 101
NoSQL Data Modeling 101
 

Último

Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 

Último (20)

Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 

Strategies For Migrating From SQL to NoSQL — The Apache Kafka Way

  • 1. Strategies For Migrating From SQL to NoSQL — The Apache Kafka Way Geetha Anne, Sr Solutions Engineer
  • 2. Geetha Anne ■ Silicon Valley ■ 2 daughters ■ Cloudera, Servicenow, Hawaiian Airlines prior to joining Confluent ■ 10 years in the space ■ Software Development, Automation Engineering/Presales are key areas of expertise ■ Cooking, Singing, Hiking
  • 3. ■ The Problem - Migrating to a modern NoSQL Database is a complex process ■ Why Confluent - Database and data modernization with Confluent ■ The Solution - Proposed architecture and action plan ■ Takeaways - Food for thought and Next Steps Agenda
  • 5.
  • 6. Modern, cloud-native databases power business critical applications with lower operational overhead Self-Managed Databases ● Rigid architecture that makes it hard to integrate with other systems ● Expensive in both upfront and ongoing maintenance costs ● Slower to scale to meet evolving demands Cloud Databases ● Lower TCO by decoupling storage from compute and leveraging consumption- based pricing ● Increased overall flexibility and business agility ● Worry free operations with built into auto-scaling and maintenance cycles
  • 7. Integrating multiple legacy system to the cloud could be a complex, multi-year process Time and resource intensive Replacing or refactoring legacy data systems across environments is not easy. During which, data visibility can be limited. Insight blind spots Getting actionable data from disparate data sources is cumbersome. Most data insight comes from nightly loads, merges, and batch updates to create a complete view. Data silos across environments Difficulties with integrating multiple data silos and data formats. On-Prem Legacy Database Cloud Cloud Database CRM SaaS App Nightly Reporting Applications ETL App Batch Jobs ETL & Database Syncs
  • 8. Easily modernize your database by integrating legacy with the cloud using Confluent 1. Simplify and accelerate migration Link on-prem and cloud for easy data movement across environments and process data in flight with ksqlDB stream processing 2. Stay synchronized in real-time Move from batch to real-time streaming and access change data capture technology using Confluent and our CDC connectors 3. Reduce total cost of ownership Leverage fully managed services and avoid prohibitive licensing costs from existing solutions offered by legacy vendors
  • 10. Real-time & Historical Data A sale A shipment A trade A customer interaction A new paradigm is required for Data in Motion Continuously process streams of data in real time “We need to shift our thinking from everything at rest, to everything in motion.” Real-Time Stream Processing Rich, front-end customer experiences Real-time, software-driven business operations
  • 11. Operationalizing Kafka on your own is difficult Kafka is hard in experimentation. It gets harder (and riskier) as you add mission-critical data and use cases. ● Architecture planning ● Cluster sizing ● Cluster provisioning ● Broker settings ● Zookeeper management ● Partition placement & data durability ● Source/sink connectors development & maintenance ● Monitoring & reporting tools setup ● Software patches and upgrades ● Security controls and integrations ● Failover design & planning ● Mirroring & geo-replication ● Streaming data governance ● Load rebalancing & monitoring ● Expansion planning & execution ● Utilization optimization & visibility ● Cluster migrations ● Infrastructure & performance upgrades / enhancements V A L U E 1 2 3 4 5 Experimentation / Early Interest Central Nervous System Mission critical, disparate LOBs Identify a Project Mission-critical, connected LOBs Key challenges: Operational burden & resources Manage and scale platform to support ever-growing demand Security & governance Ensure streaming data is as safe & secure as data-at-rest as Kafka usage scales Real-time connectivity & processing Leverage valuable legacy data to power modern, cloud-based apps & experiences Global availability Maintain high availability across environments with minimal downtime
  • 12. Cloud-native Infinite Store unlimited data on Confluent to enhance your real-time apps and use cases with a broader set of data Global Create a consistent data fabric throughout your organization by linking clusters across your different environments Elastic Scale up instantly to meet any demand and scale back down to avoid over-provisioning infrastructure
  • 13. Everywhere Confluent provides deployment flexibility to span all of your environments SELF-MANAGED SOFTWARE Confluent Platform The Enterprise Distribution of Apache Kafka Deploy on-premises or in your private cloud VM FULLY MANAGED SERVICE Confluent Cloud Cloud-native service for Apache Kafka Available on the leading public clouds
  • 15.
  • 16. Three Phase Plan Modernize your Databases with Confluent 1. Migrate ● Choose the workloads that you’d like to migrate to the cloud ● Seamlessly integrate your data source via managed Confluent source connectors 2. Optimize ● Perform real-time data transformations using ksqlDB ● Find the most useful queries for your cloud data ● Work with our ecosystem of partners to find the best use of your data 3. Modernize ● Use our managed sink connectors to send data into your cloud database of choice ● Continue migrating workloads into the cloud as chances arise
  • 18. 18 Instantly Connect Popular Data Sources & Sinks 130+ pre-built connectors 100+ Confluent Supported 30+ Partner Supported, Confluent Verified AWS Lambda
  • 19. Modernize and bridge your entire data architecture with Confluent robust connector portfolio Modern, cloud-based data systems Legacy data systems Oracle Database ksqlDB Mainframes Applications Cloud-native / SaaS apps Azure Synapse Analytics Expensive, custom-built integrations Expensive, custom-built integrations Expensive, custom-built integrations Source Connectors Expensive, custom-built integrations Expensive, custom-built integrations Sink Connectors
  • 21. 3 Modalities of Stream Processing with Confluent Kafka clients 21 Kafka Streams ksqlDB ConsumerRecords<String, String> records = consumer.poll(100); Map<String, Integer> counts = new DefaultMap<String, Integer>(); for (ConsumerRecord<String, Integer> record : records) { String key = record.key(); int c = counts.get(key) c += record.value() counts.put(key, c) } for (Map.Entry<String, Integer> entry : counts.entrySet()) { int stateCount; int attempts; while (attempts++ < MAX_RETRIES) { try { stateCount = stateStore.getValue(entry.getKey()) stateStore.setValue(entry.getKey(), entry.getValue() + stateCount) break; } catch (StateStoreException e) { RetryUtils.backoff(attempts); } } } builder .stream("input-stream", Consumed.with(Serdes.String(), Serdes.String())) .groupBy((key, value) -> value) .count() .toStream() .to("counts", Produced.with(Serdes.String(), Serdes.Long())); SELECT x, count(*) FROM stream GROUP BY x EMIT CHANGES; Flexibility Simplicity
  • 22. ksqlDB at a Glance What is it? ksqlDB is an event streaming database for working with streams and tables of data. All the key features of a modern streaming solution. Aggregations Joins Windowing Event-Time Dual Query Support Exactly-Once Semantics Out-of-Order Handling User-Defined Functions Compute Storage CREATE TABLE activePromotions AS SELECT rideId, qualifyPromotion(distanceToDst) AS promotion FROM locations GROUP BY rideId EMIT CHANGES How does it work? It separates compute from storage, and scales elastically in a fault-tolerant manner. It remains highly available during disruption, even in the face of failure to a quorum of its servers. ksqlDB Kafka 22
  • 23. Built on the Best Technology, Available as a Fully-Managed Service Kafka is the backbone of ksqlDB ksqlDB is built on top of Kafka’s battle-tested streaming foundation. Its design re-uses Kafka to achieve elasticity, fault-tolerance, and scalability for stream processing & analytics.. Use a fully-managed service With Confluent Cloud ksqlDB, you need not worry about any of the details of running it. You can forget about: ● Clusters ● Brokers ● Scaling ● Upgrading ● Monitoring Pay only for what you use. ksqlDB server Kafka topic topic changelog topic Push & Pull Queries Kafka Streams Engine Local State (transient) topic Compute Storage 23
  • 24. Accelerate your migration from legacy on-prem systems to modern, cloud-based technologies 24 Modern, cloud-based data systems Legacy data systems Oracle Database ksqlDB Mainframes Applications Cloud-native / SaaS apps Azure Synapse Analytics Expensive, custom-built integrations Expensive, custom-built integrations Expensive, custom-built integrations Source Connectors Expensive, custom-built integrations Expensive, custom-built integrations Sink Connectors
  • 25. Confluent the central nervous system of data 25
  • 26. Confluent Cloud Fully Managed Connectors ● Limited set of the larger Connector Catalogue ● Elastic scaling with no infrastructure to manage ● Connector networking configuration dependent on your clusters networking ● Limited configuration options ● Stable Source IPs are Available for certain connectors
  • 28. Three Phase Plan Modernize your Database with Confluent 28 1. Migrate ● Choose the workloads that you’d like to migrate to the cloud ● Seamlessly integrate your data source via managed Confluent source connectors 2. Optimize ● Perform real-time data transformations using ksqlDB ● Find the most useful queries for your cloud data ● Work with our ecosystem of partners to find the best use of your data 3. Modernize ● Use our managed sink connectors to send data into your cloud database of choice ● Continue migrating workloads into the cloud as chances arise
  • 29. Cloud-native, Complete, Everywhere with Kafka at its core Infinite Storage Security & Data Governance ksqlDB & Stream Processing, Analytics Connectors APIs, UIs, CLIs Fully Managed ‘NoOps’ on AWS, Azure, GCP 29
  • 31. Thank You Stay in Touch Geetha Anne geethaanne.sjsu@gmail.com Geethaay github.com/GeethaAnne www.linkedin.com/in/geetha-anne-8646011a/