Using Machine Learning to Govern Kafka Clients
UmaMahesh Sistu & Shu Wang
Event Streaming Platform Team, Fidelity Investments
Objectives
PART 1 - Introduction
• Streaming platform
• Application structure
• Initial ideas
• Application governance
PART 2 - Implementation
• Implementation
• Challenges
• Kafka linting tool
• Rules engine
• Demo
PART 3 - Machine Learning
• Machine learning
• Continuous learning
• Optimization
Part 1
Introduction
Streaming Platform Ecosystem of Services
Fidelity’s Cloud Journey
• 1946: Fidelity Management & Research Company founded
• 1965: Fidelity purchases first computer
• 1995: Fidelity becomes an Internet pioneer with the first mutual fund to create a home page
• 2016: Fidelity’s first product application deployed to the cloud
• 2019: Fidelity launches multi-cloud hybrid strategy
• Today: 6,500+ applications on public cloud
Event Streaming Platform
4
Years in
public cloud
6B+
Events per day
72+
16k+
Producer and consumer
Applications
Self-service APIs
300+
Observability Metrics
Event Streaming Platform
[Diagram labels. Data integration sources: tech stack telemetry, database, CRM, data warehouse, other. Events: app, log, mobile, IoT, and business events. Platform: management plane, control plane, data plane. Core and common capabilities: events and streams, schema registry, connectors, query and analyze streaming data. Real-time applications: analytics & ML, inventory management, fraud detection, transactional apps, other. Also shown: producer, consumer, downstream apps, large data sets, multi-cloud.]
Platform capabilities and standards
• Enhance developer experience
• FinOps management
• Consistent observability
• Security and governance
Streaming Applications
• Labyrinth of streaming paths
• Many producers and consumers
• Multi-cloud service providers
• Polyglot of programming languages
• Complex event mesh
Governance Challenges
• Increase throughput
• Avoid common pitfalls
Producer:
1. buffer.memory
2. batch.size
3. compression.type
4. linger.ms
5. retries
Consumer:
1. fetch.min.bytes
2. fetch.max.wait.ms
3. max.poll.interval.ms
4. max.poll.records
5. enable.auto.commit
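To make the two lists above concrete, here is a minimal sketch of what such client configurations might look like when kept as data. The parameter names come from the slide; every value is an illustrative placeholder rather than a recommendation from the talk, and the `to_properties` helper is a hypothetical convenience for rendering Java-style properties text.

```python
# Illustrative values only: placeholders, not tuning recommendations.
PRODUCER_CONFIG = {
    "buffer.memory": 33554432,   # bytes the producer may use to buffer unsent records
    "batch.size": 16384,         # upper bound, in bytes, on a batch for one partition
    "compression.type": "lz4",   # compression codec applied to batches
    "linger.ms": 5,              # how long to wait for a batch to fill before sending
    "retries": 3,                # resend attempts after a transient send error
}

CONSUMER_CONFIG = {
    "fetch.min.bytes": 1,            # minimum data the broker returns for a fetch
    "fetch.max.wait.ms": 500,        # longest the broker waits to reach fetch.min.bytes
    "max.poll.interval.ms": 300000,  # maximum allowed gap between poll() calls
    "max.poll.records": 500,         # maximum records returned by a single poll()
    "enable.auto.commit": False,     # False means the application commits offsets itself
}


def to_properties(config):
    """Render a config dict as key=value lines (hypothetical helper)."""
    lines = []
    for key, value in config.items():
        if isinstance(value, bool):
            # Properties files use lowercase true/false.
            value = str(value).lower()
        lines.append(f"{key}={value}")
    return "\n".join(lines)


print(to_properties(PRODUCER_CONFIG))
print(to_properties(CONSUMER_CONFIG))
```

Keeping the settings as plain data like this is what makes them easy to lint and compare across many applications.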
Initial Approach
Documentation
Best practices
Training
Workshops
Scale and efficiency?
Application Governance
Introduction of Application / Client Governance Gates
• Application Observability
• Performance Testing
• Resiliency Testing
• Client Config Linting
• Client Optimization
Release to Prod
Part 2
Implementation
Challenges and Opportunities
Challenges
• Thousands of producers/consumers
• 170+ client configurations
How can we help producer/consumer teams configure client apps properly?
How can we turn our experience into a tool that helps users?
Analyze
Rule Engine & Linting API
What feedback can we provide?
• Rule/Policy engine
• Decouple decision-making logic from business code
• Maintain the rules/policies centrally as code
• Serve as a linting API that gives client applications feedback
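The bullets above can be sketched as a small rule engine. This is a minimal illustration, not the tool shown in the talk: the `Rule` class, the `lint` function, and all three policies are hypothetical, chosen only to show rules living as data in one place, separate from the code that evaluates them.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Rule:
    key: str                        # client config parameter the rule applies to
    check: Callable[[object], bool] # returns True when the value is acceptable
    message: str                    # feedback returned to the application team


# Hypothetical policies, maintained centrally as code.
RULES = [
    Rule("compression.type",
         lambda v: v in {"gzip", "snappy", "lz4", "zstd"},
         "enable compression"),
    Rule("linger.ms",
         lambda v: isinstance(v, int) and 0 <= v <= 100,
         "keep linger.ms between 0 and 100"),
    Rule("enable.auto.commit",
         lambda v: v is False,
         "disable auto commit and commit offsets explicitly"),
]


def lint(config, rules=RULES):
    """Return one feedback message per violated rule."""
    findings = []
    for rule in rules:
        if rule.key not in config:
            continue  # a rule only applies when the parameter is set
        if not rule.check(config[rule.key]):
            findings.append(f"{rule.key}: {rule.message}")
    return findings


print(lint({"compression.type": "none", "linger.ms": 5}))
```

Because `lint` takes a plain dictionary and returns plain strings, it could sit behind an HTTP endpoint or a build-pipeline step without changes to the rules themselves.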
How do we integrate with the client applications?
Demo
What’s next?
What about parameters that take a range of values?
What’s the best value for them?
Integration Details
Looking Forward
How do we find the relationship between the parameters and throughput?
Performance Testing
• Expensive to run
• Too many teams
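One way to picture learning the relationship between a parameter and throughput from a handful of performance tests is a fitted curve. The slides do not say which model was used, so the sketch below swaps in the simplest possible stand-in: a one-variable least-squares line. The data points are synthetic, made up purely for illustration, and `fit_line` and `predict` are hypothetical names.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Synthetic measurements (not real results): linger.ms settings and the
# throughput observed in each hypothetical performance test run.
linger_ms = [0, 5, 10, 20]
throughput = [100.0, 150.0, 200.0, 300.0]

slope, intercept = fit_line(linger_ms, throughput)


def predict(value):
    """Estimate throughput for an untested linger.ms value."""
    return intercept + slope * value


print(slope, intercept, predict(15))
```

The point of the sketch is the workflow: a few measured runs produce a model, and the model then estimates untested settings so that not every team has to run every test. A real model would take many parameters as input and would not assume a straight line.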
Part 3
Machine Learning
Continuous Learning and Optimization
How can we use the model to find the best value for each parameter?
Inferring Insights using Machine Learning Models
Where should we start?
Use the Model to Find a Better Result
How do we find the best possible performance and the corresponding parameters?
Determine the Starting Point
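Finding the best predicted performance and its parameters can be sketched as a search over candidate values. Everything here is an assumption for illustration: `predict_throughput` is a toy formula standing in for a trained model, the candidate values are arbitrary, and exhaustive grid search is just one simple search strategy.

```python
from itertools import product


def predict_throughput(config):
    """Toy stand-in for a trained model (hypothetical formula)."""
    linger = config["linger.ms"]
    batch_kib = config["batch.size"] / 1024
    return 1000.0 - (linger - 20) ** 2 - ((batch_kib - 64) / 8) ** 2


# Candidate values; in practice these would already be limited to what
# the linting rules allow.
CANDIDATES = {
    "linger.ms": [0, 5, 10, 20, 50],
    "batch.size": [16384, 32768, 65536, 131072],
}


def best_config(candidates, predict):
    """Score every combination and return the best one with its score."""
    keys = list(candidates)
    best, best_score = None, float("-inf")
    for values in product(*(candidates[k] for k in keys)):
        config = dict(zip(keys, values))
        score = predict(config)
        if score > best_score:
            best, best_score = config, score
    return best, best_score


start, predicted = best_config(CANDIDATES, predict_throughput)
print(start, predicted)
```

The returned configuration is a starting point for real performance testing, not a final answer, since the prediction is only as good as the model behind it.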
Is the model’s prediction perfect?
Inference
Determine the optimal values based on performance testing
Performance Testing
How do we handle changes and improve the model?
Default allowed values
Rule base, security/best practice
Get a starting value for further tuning
Inference to find better values
Far less performance testing
The optimal values of client configurations
Fine-Tuning Stages
Self-Learning Cycle
Keep adding new data
Keep removing obsolete data
Keep identifying the new factors
Keep improving the ML model
ML Model Optimization
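The self-learning cycle above can be sketched as a loop that adds new observations, drops obsolete ones, and retrains. This is a minimal sketch under stated assumptions: the `Observation` and `LearningCycle` names are hypothetical, "obsolete" is assumed to mean older than a fixed number of days, and the "model" is simply the mean throughput per `linger.ms` value. Identifying new factors is not covered here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Observation:
    day: int            # day number on which the performance test ran
    linger_ms: int      # the setting under test
    throughput: float   # measured result (synthetic in this example)


class LearningCycle:
    def __init__(self, max_age_days):
        self.max_age_days = max_age_days
        self.observations = []

    def add(self, observation):
        """Keep adding new data."""
        self.observations.append(observation)

    def prune(self, today):
        """Keep removing obsolete data (older than max_age_days)."""
        self.observations = [
            o for o in self.observations
            if today - o.day <= self.max_age_days
        ]

    def retrain(self):
        """Rebuild the model: mean throughput per linger.ms value."""
        totals = {}
        for o in self.observations:
            count, total = totals.get(o.linger_ms, (0, 0.0))
            totals[o.linger_ms] = (count + 1, total + o.throughput)
        return {k: total / count for k, (count, total) in totals.items()}


cycle = LearningCycle(max_age_days=90)
cycle.add(Observation(day=1, linger_ms=5, throughput=100.0))
cycle.add(Observation(day=200, linger_ms=5, throughput=140.0))
cycle.add(Observation(day=210, linger_ms=5, throughput=160.0))
cycle.add(Observation(day=210, linger_ms=20, throughput=180.0))
cycle.prune(today=220)
model = cycle.retrain()
print(model)
```

Running the prune step before every retrain keeps stale measurements, for example from an older client version, from skewing the model.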
Summary
Application Observability
Performance Testing
Resiliency Testing
Client Config Linting
Client Optimization
Thank You!
Kafka Summit London 2023
