SlideShare uma empresa Scribd logo
1 de 49
Baixar para ler offline
1
Using Machine
Learning to
Understand Kafka
Runtime Behavior
Shivnath Babu
Cofounder/CTO, Unravel Data
Adjunct Professor, Duke University
shivnath@unraveldata.com
Nate Snapp
Big Data Engineering
Adobe, Palo Alto Networks, Omniture
LinkedIn or nate.snapp@gmail.com
2
Meet the speakers
• Cofounder/CTO at Unravel
• Adjunct Professor of Computer
Science at Duke University
• Focusing on ease-of-use and
manageability of data apps & platforms
• Recipient of US National Science
Foundation CAREER Award, three
IBM Faculty Awards, HP Labs
Innovation Research Award
Shivnath BabuNate Snapp
• Senior SRE from Adobe, Palo Alto
Networks, and Omniture
• 12 years experience in streaming
• First 6 years on proprietary
streaming analytics for 9/10 Fortune
500, 20B events daily, 10K+ servers
• Last 2 years have moved to Kafka
• Blogging on SRE, Hadoop, and data
streaming space at natesnapp.com
3
MODERN DATA APPLICATIONS
Machine Learning Predictive Analytics
AI loT
ENVIRONMENTS
On-Premises HybridCloud
PLATFORMS & TECHNOLOGIES
NoSQL SQL MPP API
01
uncover
ADAPTIVE DATA COLLECTION
understand
02
DATA MODEL & CORRELATION
ANALYTICS
ENGINE
AUTOMATION
ENGINE
TUNING
ENGINE
INFERENCE
ENGINE
unravel
03
DASHBOARDS
AUTO-ACTIONS
SMART ALERTS
REPORTING
RECOMMENDATIONS
4
• Clusters with 6-29 brokers
• Confluent Kafka 5.2.1, Apache Kafka 2.2.0-cp2
• 1700 topics across all clusters
• Largest topics top out with over 20K+ messages/sec
• Smaller topics are 300-500 messages/sec
• Large self-service components
• Ingress is a mix of separate Kafka, Java client API as well as load balanced
REST API frontends; some clusters have use of Schema Registry
• Egress is a mix of custom endpoints, and in many cases, HDFS sink
What Kafka setups?
5
Nature of streaming behavior
6
Practical challenge #1
Variance in flow
How do we decide if there is an anomaly?
7
Variance in flow (contd.)
Partition number
Count
8
Practical challenge #2
Negative effects of really slow data
9
Practical challenge #3
Event Sourcing
Time
Purchase
Topic
Cancellation
Topic
10
• Runtime schema changes
• “Flexible-Rigid Schema”
• Timeouts causing rebalance storms
• Leader affinity and poor assignment
• Poor partition assignment
Elements of surprise!
11
Anomaly
Detection
Most enterprises now have mission-critical
streaming apps
Predictive
Maintenance
Threat
Monitoring
Recommendation
Engines
Real-time customer
sentiment analysis
12
Streaming data architecture must be reliable
STREAM STORE
Kafka HBase Spark Flink
REAL-TIME PROCESSORIoT Sensors
Database
Other Data
Dashboard
Result Store
Other Output
13
Many problems can cause unreliable performance
STREAM STORE
Kafka HBase Spark Flink
REAL-TIME PROCESSORIoT Sensors
Database
Other Data
Dashboard
Result Store
Other Output
Untimely results
14
Many problems can cause unreliable performance
STREAM STORE
Kafka HBase Spark Flink
REAL-TIME PROCESSORIoT Sensors
Database
Other Data
Dashboard
Result Store
Other Output
Untimely resultsPoor partitioning Inefficient Configuration Resource contention
+ + =
15
DevOps face many challenges today
STREAM STORE
Kafka HBase Spark Flink
REAL-TIME PROCESSORIoT Sensors
Database
Other Data
Dashboard
Result Store
Other Output
Poor partitioning Inefficient Configuration Resource contention
+ +
• No single tool
• No correlation across the stack
• No application view
• No insights
• No recommendations
• No automated actions
=
Untimely results
16
How we can empower DevOps teams
IoT Sensors
Database
Other Data
Dashboard
Result Store
Other Output
Platform Metrics App Metrics
STREAM STORE
Kafka HBase Spark Flink
REAL-TIME PROCESSOR
App Platform
Interaction Metrics
Bring all performance data into
one complete & correlated view
17
How we can empower DevOps teams
IoT Sensors
Database
Other Data
Dashboard
Result Store
Other Output
Platform Metrics App Metrics
STREAM STORE
Kafka HBase Spark Flink
REAL-TIME PROCESSOR
App Platform
Interaction Metrics
Provide out-of-the-box
intelligence with
Machine Learning (ML)
18
How we can empower DevOps teams
IoT Sensors
Database
Other Data
Dashboard
Result Store
Other Output
Platform Metrics App Metrics
STREAM STORE
Kafka HBase Spark Flink
REAL-TIME PROCESSOR
App Platform
Interaction Metrics
Automate actions
smartly with
Artificial Intelligence (AI)
19
20
ALGORITHM
GOAL
21
Goals of Streaming App and Kafka DevOps
Teams
Throughput SLA
Latency SLA
Data loss tolerance
Stability/resiliency
Resource usage/cost
Planning/growth
App-level Goals
Platform-level Goals
AI/ML Algorithms
Outlier Detection
Forecasting
Anomaly Detection
Correlation Analysis
Model Learning
22
Outlier Detection
23
1. Detecting load imbalance among Kafka brokers
2. Detecting load imbalance among Kafka partitions
Use Cases
24
Detecting load imbalance among Kafka brokers
Brokers kabo2 and
kabo3 have much
higher number of
incoming messages
than broker kabo1
25
Algorithms for Outlier Detection
Picture credit: http://historum.com/asian-history/128081-aryan-migration-theory-update-128.html
• Based on one feature Vs.
multiple features
• Is the distribution of data
assumed?
1. Z-score: How many standard
deviations is a data point from the
mean
26
• Based on one feature Vs.
multiple features
• Is the distribution of data
assumed?
1. Z-score: How many standard
deviations is a data point from the
mean
2. DBScan: Density-based clustering
3. Isolation forests
4. Deep learning (e.g., Autoencoders)
Algorithms for Outlier Detection
Picture credit: http://en.proft.me/2017/02/3/density-based-clustering-r/
27
Forecasting
28
1. Predicting when SLAs are in danger of being missed
2. Predicting when system may run out of headroom or
capacity
Use Cases
29
A Real-life Application: RealTimeSentimentMonitor
TWEETS
Partitions
1-N
TWEETS
TWEETS
30
Predicting when SLAs are in danger of being missed
Latency SLA is
3 minutes
Latency SLA can be
missed by this time
Current time is here
31
• Many standard time-series forecasting
techniques: ARIMA, Holt-Winters
• Deep-learning techniques (e.g., LSTM)
Algorithms for Forecasting
32
• Many standard time-series forecasting
techniques: ARIMA, Holt-Winters
• Deep-learning techniques (e.g., LSTM)
• Facebook’s Prophet Algorithm: Mixes stats
methods & judgment from domain experts
• Uses Generative Additive Model (GAM)
• Decomposed time-series model: trend,
seasonality, holidays, and error term
Algorithms for Forecasting
y(t) = trend(t) + periodic(t) + shock(t) + error
33
• Many standard time-series forecasting
techniques: ARIMA, Holt-Winters
• Deep-learning techniques (e.g., LSTM)
• Facebook’s Prophet Algorithm: Mixes stats
methods & judgment from domain experts
• Uses Generative Additive Model (GAM)
• Decomposed time-series model: trend,
seasonality, holidays, and error term
• Advantages:
• Fits faster than ARIMA
• Models various growth trends
• Can handle unevenly spaced data
• Defaults often produce accurate forecasts
Algorithms for Forecasting
34
Anomaly Detection
35
1. An unexpected change that needs your attention
2. Smart alerts:
• False negatives should be minimal
• False positives should be minimal
Use Cases
36
Detecting anomalies is tricky
Is this an unexpected lag
worth alerting on?
37
Algorithms for Anomaly Detection
Picture credit: https://blog.statsbot.co/time-series-anomaly-detection-algorithms-1cef5519aef2
• Deviation from forecasts
38
Algorithms for Anomaly Detection
Picture credit: https://blog.statsbot.co/time-series-anomaly-detection-algorithms-1cef5519aef2
• Deviation from forecasts
• ARIMA
• Regression trees
• Prophet
• STL: Seasonal and Trend
Decomposition using Loess
• Topic of intensive
research
• Deep learning (e.g., LSTM)
39
Correlation Analysis
40
• Fast root-causing of problems
• What lower-level cause led to the change in the
streaming application’s performance?
Use Cases
41
What caused the unexpected change in performance?
Anomaly
What caused it?
100s of time series from every level of the stack!
LATENCY is 421.07% WORSE THAN THE BASELINE
42
• Be aware of the many pitfalls
• E.g., trends can make arbitrary time
series look correlated!
• Pick robust time-series
similarity metrics
• E.g., Euclidean distance Vs. Dynamic
Time Warping
Algorithms for Correlation Analysis
Picture credit: https://izbicki.me/blog/converting-images-into-time-series-for-data-mining.html
Euclidean
Distance
Dynamic
Time
Warping
43
• Be aware of the many pitfalls
• E.g., trends can make arbitrary time
series look correlated!
• Pick robust time-series
similarity metrics
• E.g., Euclidean distance Vs. Dynamic
Time Warping
• Carefully incorporate domain
knowledge
• E.g., what caused latency SLA miss?
• Application-level problem?
• Resource allocation problem?
• Platform-level problem?
• Data-level problem?
Algorithms for Correlation Analysis
Picture credit: https://izbicki.me/blog/converting-images-into-time-series-for-data-mining.html
Euclidean
Distance
Dynamic
Time
Warping
44
Model Learning
45
1. Helps answer what-if and optimization questions
• What is the best number of partitions?
• What is the best setting of timeouts to avoid rebalance storms?
• What is the best partition rebalancing action to take?
• What will the impact of adding a new broker be?
2. Enables Auto Actions for resource/cost efficiency & SLA management
Use Cases
46
Automated tuning suggestions to meet SLA
Precise recommendation to meet SLA
47
• Performance = Func(Input Features)
• Have to find the best set of input features
• Supervised learning is often possible: Training data is available
or easy to generate
Algorithms for Learning Models
Picture credit: https://myslide.cn/slides/8328#
48
Summary: Meeting Kafka DevOps Goals with AI/ML
Throughput goal
Stability goal
Latency goal
Resource usage/cost goal
Data loss tolerance goal
App-level Goals
Platform-level Goals
Planning/growth goal
AI/ML Algorithms
Outlier Detection
Forecasting
Anomaly Detection
Correlation Analysis
Model Learning
49
AIOps: Rich opportunities to address
distributed application performance
management as AI/ML problems
Start your free trial: unraveldata.com/free-trial
Visit us at the Unravel booth
And yes, we are hiring!
shivnath@unraveldata.com

Mais conteúdo relacionado

Mais procurados

Fundamentals of Stream Processing with Apache Beam, Tyler Akidau, Frances Perry
Fundamentals of Stream Processing with Apache Beam, Tyler Akidau, Frances Perry Fundamentals of Stream Processing with Apache Beam, Tyler Akidau, Frances Perry
Fundamentals of Stream Processing with Apache Beam, Tyler Akidau, Frances Perry confluent
 
Akka at Enterprise Scale: Performance Tuning Distributed Applications
Akka at Enterprise Scale: Performance Tuning Distributed ApplicationsAkka at Enterprise Scale: Performance Tuning Distributed Applications
Akka at Enterprise Scale: Performance Tuning Distributed ApplicationsLightbend
 
Operationalizing Machine Learning: Serving ML Models
Operationalizing Machine Learning: Serving ML ModelsOperationalizing Machine Learning: Serving ML Models
Operationalizing Machine Learning: Serving ML ModelsLightbend
 
Decoupling Decisions with Apache Kafka
Decoupling Decisions with Apache KafkaDecoupling Decisions with Apache Kafka
Decoupling Decisions with Apache KafkaGrant Henke
 
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...confluent
 
Being Ready for Apache Kafka - Apache: Big Data Europe 2015
Being Ready for Apache Kafka - Apache: Big Data Europe 2015Being Ready for Apache Kafka - Apache: Big Data Europe 2015
Being Ready for Apache Kafka - Apache: Big Data Europe 2015Michael Noll
 
Apache Kafka, Tiered Storage and TensorFlow for Streaming Machine Learning wi...
Apache Kafka, Tiered Storage and TensorFlow for Streaming Machine Learning wi...Apache Kafka, Tiered Storage and TensorFlow for Streaming Machine Learning wi...
Apache Kafka, Tiered Storage and TensorFlow for Streaming Machine Learning wi...confluent
 
Malo Denielou - No shard left behind: Dynamic work rebalancing in Apache Beam
Malo Denielou - No shard left behind: Dynamic work rebalancing in Apache BeamMalo Denielou - No shard left behind: Dynamic work rebalancing in Apache Beam
Malo Denielou - No shard left behind: Dynamic work rebalancing in Apache BeamFlink Forward
 
Production Ready Kafka on Kubernetes (Devandra Tagare, Lyft) Kafka Summit SF ...
Production Ready Kafka on Kubernetes (Devandra Tagare, Lyft) Kafka Summit SF ...Production Ready Kafka on Kubernetes (Devandra Tagare, Lyft) Kafka Summit SF ...
Production Ready Kafka on Kubernetes (Devandra Tagare, Lyft) Kafka Summit SF ...confluent
 
Kafka and Storm - event processing in realtime
Kafka and Storm - event processing in realtimeKafka and Storm - event processing in realtime
Kafka and Storm - event processing in realtimeGuido Schmutz
 
Abstractions for managed stream processing platform (Arya Ketan - Flipkart)
Abstractions for managed stream processing platform (Arya Ketan - Flipkart)Abstractions for managed stream processing platform (Arya Ketan - Flipkart)
Abstractions for managed stream processing platform (Arya Ketan - Flipkart)KafkaZone
 
Five Fabulous Sinks for Your Kafka Data. #3 will surprise you! (Rachel Pedres...
Five Fabulous Sinks for Your Kafka Data. #3 will surprise you! (Rachel Pedres...Five Fabulous Sinks for Your Kafka Data. #3 will surprise you! (Rachel Pedres...
Five Fabulous Sinks for Your Kafka Data. #3 will surprise you! (Rachel Pedres...confluent
 
Machine Learning At Speed: Operationalizing ML For Real-Time Data Streams
Machine Learning At Speed: Operationalizing ML For Real-Time Data StreamsMachine Learning At Speed: Operationalizing ML For Real-Time Data Streams
Machine Learning At Speed: Operationalizing ML For Real-Time Data StreamsLightbend
 
Kafka Summit NYC 2017 - Apache Kafka in the Enterprise: What if it Fails?
Kafka Summit NYC 2017 - Apache Kafka in the Enterprise: What if it Fails? Kafka Summit NYC 2017 - Apache Kafka in the Enterprise: What if it Fails?
Kafka Summit NYC 2017 - Apache Kafka in the Enterprise: What if it Fails? confluent
 
Shattering The Monolith(s) (Martin Kess, Namely) Kafka Summit SF 2019
Shattering The Monolith(s) (Martin Kess, Namely) Kafka Summit SF 2019 Shattering The Monolith(s) (Martin Kess, Namely) Kafka Summit SF 2019
Shattering The Monolith(s) (Martin Kess, Namely) Kafka Summit SF 2019 confluent
 
user Behavior Analysis with Session Windows and Apache Kafka's Streams API
user Behavior Analysis with Session Windows and Apache Kafka's Streams APIuser Behavior Analysis with Session Windows and Apache Kafka's Streams API
user Behavior Analysis with Session Windows and Apache Kafka's Streams APIconfluent
 
Event Detection Pipelines with Apache Kafka
Event Detection Pipelines with Apache KafkaEvent Detection Pipelines with Apache Kafka
Event Detection Pipelines with Apache KafkaDataWorks Summit
 
Building High-Throughput, Low-Latency Pipelines in Kafka
Building High-Throughput, Low-Latency Pipelines in KafkaBuilding High-Throughput, Low-Latency Pipelines in Kafka
Building High-Throughput, Low-Latency Pipelines in Kafkaconfluent
 

Mais procurados (20)

Fundamentals of Stream Processing with Apache Beam, Tyler Akidau, Frances Perry
Fundamentals of Stream Processing with Apache Beam, Tyler Akidau, Frances Perry Fundamentals of Stream Processing with Apache Beam, Tyler Akidau, Frances Perry
Fundamentals of Stream Processing with Apache Beam, Tyler Akidau, Frances Perry
 
Akka at Enterprise Scale: Performance Tuning Distributed Applications
Akka at Enterprise Scale: Performance Tuning Distributed ApplicationsAkka at Enterprise Scale: Performance Tuning Distributed Applications
Akka at Enterprise Scale: Performance Tuning Distributed Applications
 
Operationalizing Machine Learning: Serving ML Models
Operationalizing Machine Learning: Serving ML ModelsOperationalizing Machine Learning: Serving ML Models
Operationalizing Machine Learning: Serving ML Models
 
Decoupling Decisions with Apache Kafka
Decoupling Decisions with Apache KafkaDecoupling Decisions with Apache Kafka
Decoupling Decisions with Apache Kafka
 
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...
 
Being Ready for Apache Kafka - Apache: Big Data Europe 2015
Being Ready for Apache Kafka - Apache: Big Data Europe 2015Being Ready for Apache Kafka - Apache: Big Data Europe 2015
Being Ready for Apache Kafka - Apache: Big Data Europe 2015
 
Apache Kafka, Tiered Storage and TensorFlow for Streaming Machine Learning wi...
Apache Kafka, Tiered Storage and TensorFlow for Streaming Machine Learning wi...Apache Kafka, Tiered Storage and TensorFlow for Streaming Machine Learning wi...
Apache Kafka, Tiered Storage and TensorFlow for Streaming Machine Learning wi...
 
Malo Denielou - No shard left behind: Dynamic work rebalancing in Apache Beam
Malo Denielou - No shard left behind: Dynamic work rebalancing in Apache BeamMalo Denielou - No shard left behind: Dynamic work rebalancing in Apache Beam
Malo Denielou - No shard left behind: Dynamic work rebalancing in Apache Beam
 
Kafka aws
Kafka awsKafka aws
Kafka aws
 
Production Ready Kafka on Kubernetes (Devandra Tagare, Lyft) Kafka Summit SF ...
Production Ready Kafka on Kubernetes (Devandra Tagare, Lyft) Kafka Summit SF ...Production Ready Kafka on Kubernetes (Devandra Tagare, Lyft) Kafka Summit SF ...
Production Ready Kafka on Kubernetes (Devandra Tagare, Lyft) Kafka Summit SF ...
 
Kafka and Storm - event processing in realtime
Kafka and Storm - event processing in realtimeKafka and Storm - event processing in realtime
Kafka and Storm - event processing in realtime
 
Abstractions for managed stream processing platform (Arya Ketan - Flipkart)
Abstractions for managed stream processing platform (Arya Ketan - Flipkart)Abstractions for managed stream processing platform (Arya Ketan - Flipkart)
Abstractions for managed stream processing platform (Arya Ketan - Flipkart)
 
Five Fabulous Sinks for Your Kafka Data. #3 will surprise you! (Rachel Pedres...
Five Fabulous Sinks for Your Kafka Data. #3 will surprise you! (Rachel Pedres...Five Fabulous Sinks for Your Kafka Data. #3 will surprise you! (Rachel Pedres...
Five Fabulous Sinks for Your Kafka Data. #3 will surprise you! (Rachel Pedres...
 
Event Driven Architecture
Event Driven ArchitectureEvent Driven Architecture
Event Driven Architecture
 
Machine Learning At Speed: Operationalizing ML For Real-Time Data Streams
Machine Learning At Speed: Operationalizing ML For Real-Time Data StreamsMachine Learning At Speed: Operationalizing ML For Real-Time Data Streams
Machine Learning At Speed: Operationalizing ML For Real-Time Data Streams
 
Kafka Summit NYC 2017 - Apache Kafka in the Enterprise: What if it Fails?
Kafka Summit NYC 2017 - Apache Kafka in the Enterprise: What if it Fails? Kafka Summit NYC 2017 - Apache Kafka in the Enterprise: What if it Fails?
Kafka Summit NYC 2017 - Apache Kafka in the Enterprise: What if it Fails?
 
Shattering The Monolith(s) (Martin Kess, Namely) Kafka Summit SF 2019
Shattering The Monolith(s) (Martin Kess, Namely) Kafka Summit SF 2019 Shattering The Monolith(s) (Martin Kess, Namely) Kafka Summit SF 2019
Shattering The Monolith(s) (Martin Kess, Namely) Kafka Summit SF 2019
 
user Behavior Analysis with Session Windows and Apache Kafka's Streams API
user Behavior Analysis with Session Windows and Apache Kafka's Streams APIuser Behavior Analysis with Session Windows and Apache Kafka's Streams API
user Behavior Analysis with Session Windows and Apache Kafka's Streams API
 
Event Detection Pipelines with Apache Kafka
Event Detection Pipelines with Apache KafkaEvent Detection Pipelines with Apache Kafka
Event Detection Pipelines with Apache Kafka
 
Building High-Throughput, Low-Latency Pipelines in Kafka
Building High-Throughput, Low-Latency Pipelines in KafkaBuilding High-Throughput, Low-Latency Pipelines in Kafka
Building High-Throughput, Low-Latency Pipelines in Kafka
 

Semelhante a Machine Learning Insights for Kafka Runtime Behavior

Huawei Advanced Data Science With Spark Streaming
Huawei Advanced Data Science With Spark StreamingHuawei Advanced Data Science With Spark Streaming
Huawei Advanced Data Science With Spark StreamingJen Aman
 
Software Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringSoftware Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringTao Xie
 
20160000 Cloud Discovery Event - Cloud Access Security Brokers
20160000 Cloud Discovery Event - Cloud Access Security Brokers20160000 Cloud Discovery Event - Cloud Access Security Brokers
20160000 Cloud Discovery Event - Cloud Access Security BrokersRobin Vermeirsch
 
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Precisely
 
Unbundling the Modern Streaming Stack With Dunith Dhanushka | Current 2022
Unbundling the Modern Streaming Stack With Dunith Dhanushka | Current 2022Unbundling the Modern Streaming Stack With Dunith Dhanushka | Current 2022
Unbundling the Modern Streaming Stack With Dunith Dhanushka | Current 2022HostedbyConfluent
 
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...Spark Summit
 
C19013010 the tutorial to build shared ai services session 2
C19013010 the tutorial to build shared ai services session 2C19013010 the tutorial to build shared ai services session 2
C19013010 the tutorial to build shared ai services session 2Bill Liu
 
Ncku csie talk about Spark
Ncku csie talk about SparkNcku csie talk about Spark
Ncku csie talk about SparkGiivee The
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceeRic Choo
 
Challenges in Practicing High Frequency Releases in Cloud Environments
Challenges in Practicing High Frequency Releases in Cloud Environments Challenges in Practicing High Frequency Releases in Cloud Environments
Challenges in Practicing High Frequency Releases in Cloud Environments Liming Zhu
 
Building data intensive applications
Building data intensive applicationsBuilding data intensive applications
Building data intensive applicationsAmit Kejriwal
 
Design Patterns for Large-Scale Real-Time Learning
Design Patterns for Large-Scale Real-Time LearningDesign Patterns for Large-Scale Real-Time Learning
Design Patterns for Large-Scale Real-Time LearningSwiss Big Data User Group
 
IBM Strategy for Spark
IBM Strategy for SparkIBM Strategy for Spark
IBM Strategy for SparkMark Kerzner
 
Interpreting Performance Test Results
Interpreting Performance Test ResultsInterpreting Performance Test Results
Interpreting Performance Test ResultsEric Proegler
 
Cloudera Federal Forum 2014: The Evolution of Machine Learning from Science t...
Cloudera Federal Forum 2014: The Evolution of Machine Learning from Science t...Cloudera Federal Forum 2014: The Evolution of Machine Learning from Science t...
Cloudera Federal Forum 2014: The Evolution of Machine Learning from Science t...Cloudera, Inc.
 
Bringing Deep Learning into production
Bringing Deep Learning into production Bringing Deep Learning into production
Bringing Deep Learning into production Paolo Platter
 
Databricks Meetup @ Los Angeles Apache Spark User Group
Databricks Meetup @ Los Angeles Apache Spark User GroupDatabricks Meetup @ Los Angeles Apache Spark User Group
Databricks Meetup @ Los Angeles Apache Spark User GroupPaco Nathan
 
10 Big Data Technologies you Didn't Know About
10 Big Data Technologies you Didn't Know About 10 Big Data Technologies you Didn't Know About
10 Big Data Technologies you Didn't Know About Jesus Rodriguez
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data EngineeringDurga Gadiraju
 
Hadoop Application Architectures - Fraud Detection
Hadoop Application Architectures - Fraud  DetectionHadoop Application Architectures - Fraud  Detection
Hadoop Application Architectures - Fraud Detectionhadooparchbook
 

Semelhante a Machine Learning Insights for Kafka Runtime Behavior (20)

Huawei Advanced Data Science With Spark Streaming
Huawei Advanced Data Science With Spark StreamingHuawei Advanced Data Science With Spark Streaming
Huawei Advanced Data Science With Spark Streaming
 
Software Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringSoftware Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software Engineering
 
20160000 Cloud Discovery Event - Cloud Access Security Brokers
20160000 Cloud Discovery Event - Cloud Access Security Brokers20160000 Cloud Discovery Event - Cloud Access Security Brokers
20160000 Cloud Discovery Event - Cloud Access Security Brokers
 
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
 
Unbundling the Modern Streaming Stack With Dunith Dhanushka | Current 2022
Unbundling the Modern Streaming Stack With Dunith Dhanushka | Current 2022Unbundling the Modern Streaming Stack With Dunith Dhanushka | Current 2022
Unbundling the Modern Streaming Stack With Dunith Dhanushka | Current 2022
 
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
 
C19013010 the tutorial to build shared ai services session 2
C19013010 the tutorial to build shared ai services session 2C19013010 the tutorial to build shared ai services session 2
C19013010 the tutorial to build shared ai services session 2
 
Ncku csie talk about Spark
Ncku csie talk about SparkNcku csie talk about Spark
Ncku csie talk about Spark
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data Science
 
Challenges in Practicing High Frequency Releases in Cloud Environments
Challenges in Practicing High Frequency Releases in Cloud Environments Challenges in Practicing High Frequency Releases in Cloud Environments
Challenges in Practicing High Frequency Releases in Cloud Environments
 
Building data intensive applications
Building data intensive applicationsBuilding data intensive applications
Building data intensive applications
 
Design Patterns for Large-Scale Real-Time Learning
Design Patterns for Large-Scale Real-Time LearningDesign Patterns for Large-Scale Real-Time Learning
Design Patterns for Large-Scale Real-Time Learning
 
IBM Strategy for Spark
IBM Strategy for SparkIBM Strategy for Spark
IBM Strategy for Spark
 
Interpreting Performance Test Results
Interpreting Performance Test ResultsInterpreting Performance Test Results
Interpreting Performance Test Results
 
Cloudera Federal Forum 2014: The Evolution of Machine Learning from Science t...
Cloudera Federal Forum 2014: The Evolution of Machine Learning from Science t...Cloudera Federal Forum 2014: The Evolution of Machine Learning from Science t...
Cloudera Federal Forum 2014: The Evolution of Machine Learning from Science t...
 
Bringing Deep Learning into production
Bringing Deep Learning into production Bringing Deep Learning into production
Bringing Deep Learning into production
 
Databricks Meetup @ Los Angeles Apache Spark User Group
Databricks Meetup @ Los Angeles Apache Spark User GroupDatabricks Meetup @ Los Angeles Apache Spark User Group
Databricks Meetup @ Los Angeles Apache Spark User Group
 
10 Big Data Technologies you Didn't Know About
10 Big Data Technologies you Didn't Know About 10 Big Data Technologies you Didn't Know About
10 Big Data Technologies you Didn't Know About
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
Hadoop Application Architectures - Fraud Detection
Hadoop Application Architectures - Fraud  DetectionHadoop Application Architectures - Fraud  Detection
Hadoop Application Architectures - Fraud Detection
 

Mais de confluent

Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
Santander Stream Processing with Apache Flink
Santander Stream Processing with Apache FlinkSantander Stream Processing with Apache Flink
Santander Stream Processing with Apache Flinkconfluent
 
Unlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insightsUnlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insightsconfluent
 
Workshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con FlinkWorkshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con Flinkconfluent
 
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...confluent
 
AWS Immersion Day Mapfre - Confluent
AWS Immersion Day Mapfre   -   ConfluentAWS Immersion Day Mapfre   -   Confluent
AWS Immersion Day Mapfre - Confluentconfluent
 
Eventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalkEventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalkconfluent
 
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent CloudQ&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent Cloudconfluent
 
Citi TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep DiveCiti TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep Diveconfluent
 
Build real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with ConfluentBuild real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with Confluentconfluent
 
Q&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service MeshQ&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service Meshconfluent
 
Citi Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka MicroservicesCiti Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka Microservicesconfluent
 
Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3confluent
 
Citi Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging ModernizationCiti Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging Modernizationconfluent
 
Citi Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time dataCiti Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time dataconfluent
 
Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2confluent
 
Data In Motion Paris 2023
Data In Motion Paris 2023Data In Motion Paris 2023
Data In Motion Paris 2023confluent
 
Confluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with SynthesisConfluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with Synthesisconfluent
 
The Future of Application Development - API Days - Melbourne 2023
The Future of Application Development - API Days - Melbourne 2023The Future of Application Development - API Days - Melbourne 2023
The Future of Application Development - API Days - Melbourne 2023confluent
 
The Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data StreamsThe Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data Streamsconfluent
 

Mais de confluent (20)

Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Santander Stream Processing with Apache Flink
Santander Stream Processing with Apache FlinkSantander Stream Processing with Apache Flink
Santander Stream Processing with Apache Flink
 
Unlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insightsUnlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insights
 
Workshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con FlinkWorkshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con Flink
 
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
 
AWS Immersion Day Mapfre - Confluent
AWS Immersion Day Mapfre   -   ConfluentAWS Immersion Day Mapfre   -   Confluent
AWS Immersion Day Mapfre - Confluent
 
Eventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalkEventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalk
 
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent CloudQ&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
 
Citi TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep DiveCiti TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep Dive
 
Build real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with ConfluentBuild real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with Confluent
 
Q&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service MeshQ&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service Mesh
 
Citi Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka MicroservicesCiti Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka Microservices
 
Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3
 
Citi Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging ModernizationCiti Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging Modernization
 
Citi Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time dataCiti Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time data
 
Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2
 
Data In Motion Paris 2023
Data In Motion Paris 2023Data In Motion Paris 2023
Data In Motion Paris 2023
 
Confluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with SynthesisConfluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with Synthesis
 
The Future of Application Development - API Days - Melbourne 2023
The Future of Application Development - API Days - Melbourne 2023The Future of Application Development - API Days - Melbourne 2023
The Future of Application Development - API Days - Melbourne 2023
 
The Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data StreamsThe Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data Streams
 

Último

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 

Último (20)

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 

Machine Learning Insights for Kafka Runtime Behavior

  • 1. 1 Using Machine Learning to Understand Kafka Runtime Behavior Shivnath Babu Cofounder/CTO, Unravel Data Adjunct Professor, Duke University shivnath@unraveldata.com Nate Snapp Big Data Engineering Adobe, Palo Alto Networks, Omniture LinkedIn or nate.snapp@gmail.com
  • 2. 2 Meet the speakers • Cofounder/CTO at Unravel • Adjunct Professor of Computer Science at Duke University • Focusing on ease-of-use and manageability of data apps & platforms • Recipient of US National Science Foundation CAREER Award, three IBM Faculty Awards, HP Labs Innovation Research Award Shivnath BabuNate Snapp • Senior SRE from Adobe, Palo Alto Networks, and Omniture • 12 years experience in streaming • First 6 years on proprietary streaming analytics for 9/10 Fortune 500, 20B events daily, 10K+ servers • Last 2 years have moved to Kafka • Blogging on SRE, Hadoop, and data streaming space at natesnapp.com
  • 3. 3 MODERN DATA APPLICATIONS Machine Learning Predictive Analytics AI loT ENVIRONMENTS On-Premises HybridCloud PLATFORMS & TECHNOLOGIES NoSQL SQL MPP API 01 uncover ADAPTIVE DATA COLLECTION understand 02 DATA MODEL & CORRELATION ANALYTICS ENGINE AUTOMATION ENGINE TUNING ENGINE INFERENCE ENGINE unravel 03 DASHBOARDS AUTO-ACTIONS SMART ALERTS REPORTING RECOMMENDATIONS
  • 4. 4 • Clusters with 6-29 brokers • Confluent Kafka 5.2.1, Apache Kafka 2.2.0-cp2 • 1700 topics across all clusters • Largest topics top out with over 20K+ messages/sec • Smaller topics are 300-500 messages/sec • Large self-service components • Ingress is a mix of separate Kafka, Java client API as well as load balanced REST API frontends; some clusters have use of Schema Registry • Egress is a mix of custom endpoints, and in many cases, HDFS sink What Kafka setups?
  • 6. 6 Practical challenge #1 Variance in flow How do we decide if there is an anomaly?
  • 7. 7 Variance in flow (contd.) Partition number Count
  • 8. 8 Practical challenge #2 Negative effects of really slow data
  • 9. 9 Practical challenge #3 Event Sourcing Time Purchase Topic Cancellation Topic
  • 10. 10 • Runtime schema changes • “Flexible-Rigid Schema” • Timeouts causing rebalance storms • Leader affinity and poor assignment • Poor partition assignment Elements of surprise!
  • 11. 11 Anomaly Detection Most enterprises now have mission-critical streaming apps Predictive Maintenance Threat Monitoring Recommendation Engines Real-time customer sentiment analysis
  • 12. 12 Streaming data architecture must be reliable STREAM STORE Kafka HBase Spark Flink REAL-TIME PROCESSORIoT Sensors Database Other Data Dashboard Result Store Other Output
  • 13. 13 Many problems can cause unreliable performance STREAM STORE Kafka HBase Spark Flink REAL-TIME PROCESSORIoT Sensors Database Other Data Dashboard Result Store Other Output Untimely results
  • 14. 14 Many problems can cause unreliable performance STREAM STORE Kafka HBase Spark Flink REAL-TIME PROCESSORIoT Sensors Database Other Data Dashboard Result Store Other Output Untimely resultsPoor partitioning Inefficient Configuration Resource contention + + =
  • 15. 15 DevOps face many challenges today STREAM STORE Kafka HBase Spark Flink REAL-TIME PROCESSORIoT Sensors Database Other Data Dashboard Result Store Other Output Poor partitioning Inefficient Configuration Resource contention + + • No single tool • No correlation across the stack • No application view • No insights • No recommendations • No automated actions = Untimely results
  • 16. 16 How we can empower DevOps teams IoT Sensors Database Other Data Dashboard Result Store Other Output Platform Metrics App Metrics STREAM STORE Kafka HBase Spark Flink REAL-TIME PROCESSOR App Platform Interaction Metrics Bring all performance data into one complete & correlated view
  • 17. 17 How we can empower DevOps teams IoT Sensors Database Other Data Dashboard Result Store Other Output Platform Metrics App Metrics STREAM STORE Kafka HBase Spark Flink REAL-TIME PROCESSOR App Platform Interaction Metrics Provide out-of-the-box intelligence with Machine Learning (ML)
  • 18. 18 How we can empower DevOps teams IoT Sensors Database Other Data Dashboard Result Store Other Output Platform Metrics App Metrics STREAM STORE Kafka HBase Spark Flink REAL-TIME PROCESSOR App Platform Interaction Metrics Automate actions smartly with Artificial Intelligence (AI)
  • 19. 19
  • 21. 21 Goals of Streaming App and Kafka DevOps Teams Throughput SLA Latency SLA Data loss tolerance Stability/resiliency Resource usage/cost Planning/growth App-level Goals Platform-level Goals AI/ML Algorithms Outlier Detection Forecasting Anomaly Detection Correlation Analysis Model Learning
  • 23. 23 1. Detecting load imbalance among Kafka brokers 2. Detecting load imbalance among Kafka partitions Use Cases
  • 24. 24 Detecting load imbalance among Kafka brokers Brokers kabo2 and kabo3 have much higher number of incoming messages than broker kabo1
  • 25. 25 Algorithms for Outlier Detection Picture credit: http://historum.com/asian-history/128081-aryan-migration-theory-update-128.html • Based on one feature Vs. multiple features • Is the distribution of data assumed? 1. Z-score: How many standard deviations is a data point from the mean
  • 26. 26 • Based on one feature Vs. multiple features • Is the distribution of data assumed? 1. Z-score: How many standard deviations is a data point from the mean 2. DBScan: Density-based clustering 3. Isolation forests 4. Deep learning (e.g., Autoencoders) Algorithms for Outlier Detection Picture credit: http://en.proft.me/2017/02/3/density-based-clustering-r/
  • 28. 28 1. Predicting when SLAs are in danger of being missed 2. Predicting when system may run out of headroom or capacity Use Cases
  • 29. 29 A Real-life Application: RealTimeSentimentMonitor TWEETS Partitions 1-N TWEETS TWEETS
  • 30. 30 Predicting when SLAs are in danger of being missed Latency SLA is 3 minutes Latency SLA can be missed by this time Current time is here
  • 31. 31 • Many standard time-series forecasting techniques: ARIMA, Holt-Winters • Deep-learning techniques (e.g., LSTM) Algorithms for Forecasting
  • 32. 32 • Many standard time-series forecasting techniques: ARIMA, Holt-Winters • Deep-learning techniques (e.g., LSTM) • Facebook’s Prophet Algorithm: Mixes stats methods & judgment from domain experts • Uses Generative Additive Model (GAM) • Decomposed time-series model: trend, seasonality, holidays, and error term Algorithms for Forecasting y(t) = trend(t) + periodic(t) + shock(t) + error
  • 33. 33 • Many standard time-series forecasting techniques: ARIMA, Holt-Winters • Deep-learning techniques (e.g., LSTM) • Facebook’s Prophet Algorithm: Mixes stats methods & judgment from domain experts • Uses Generative Additive Model (GAM) • Decomposed time-series model: trend, seasonality, holidays, and error term • Advantages: • Fits faster than ARIMA • Models various growth trends • Can handle unevenly spaced data • Defaults often produce accurate forecasts Algorithms for Forecasting
  • 35. 35 1. An unexpected change that needs your attention 2. Smart alerts: • False negatives should be minimal • False positives should be minimal Use Cases
  • 36. 36 Detecting anomalies is tricky Is this an unexpected lag worth alerting on?
  • 37. 37 Algorithms for Anomaly Detection Picture credit: https://blog.statsbot.co/time-series-anomaly-detection-algorithms-1cef5519aef2 • Deviation from forecasts
  • 38. 38 Algorithms for Anomaly Detection Picture credit: https://blog.statsbot.co/time-series-anomaly-detection-algorithms-1cef5519aef2 • Deviation from forecasts • ARIMA • Regression trees • Prophet • STL: Seasonal and Trend Decomposition using Loess • Topic of intensive research • Deep learning (e.g., LSTM)
  • 40. 40 • Fast root-causing of problems • What lower-level cause led to the change in the streaming application’s performance? Use Cases
  • 41. 41 What caused the unexpected change in performance? Anomaly What caused it? 100s of time series from every level of the stack! LATENCY is 421.07% WORSE THAN THE BASELINE
  • 42. 42 • Be aware of the many pitfalls • E.g., trends can make arbitrary time series look correlated! • Pick robust time-series similarity metrics • E.g., Euclidean distance Vs. Dynamic Time Warping Algorithms for Correlation Analysis Picture credit: https://izbicki.me/blog/converting-images-into-time-series-for-data-mining.html Euclidean Distance Dynamic Time Warping
  • 43. 43 • Be aware of the many pitfalls • E.g., trends can make arbitrary time series look correlated! • Pick robust time-series similarity metrics • E.g., Euclidean distance Vs. Dynamic Time Warping • Carefully incorporate domain knowledge • E.g., what caused latency SLA miss? • Application-level problem? • Resource allocation problem? • Platform-level problem? • Data-level problem? Algorithms for Correlation Analysis Picture credit: https://izbicki.me/blog/converting-images-into-time-series-for-data-mining.html Euclidean Distance Dynamic Time Warping
  • 45. 45 1. Helps answer what-if and optimization questions • What is the best number of partitions? • What is the best setting of timeouts to avoid rebalance storms? • What is the best partition rebalancing action to take? • What will the impact of adding a new broker be? 2. Enables Auto Actions for resource/cost efficiency & SLA management Use Cases
  • 46. 46 Automated tuning suggestions to meet SLA Precise recommendation to meet SLA
  • 47. 47 • Performance = Func(Input Features) • Have to find the best set of input features • Supervised learning is often possible: Training data is available or easy to generate Algorithms for Learning Models Picture credit: https://myslide.cn/slides/8328#
  • 48. 48 Summary: Meeting Kafka DevOps Goals with AI/ML Throughput goal Stability goal Latency goal Resource usage/cost goal Data loss tolerance goal App-level Goals Platform-level Goals Planning/growth goal AI/ML Algorithms Outlier Detection Forecasting Anomaly Detection Correlation Analysis Model Learning
  • 49. 49 AIOps: Rich opportunities to address distributed application performance management as AI/ML problems Start your free trial: unraveldata.com/free-trial Visit us at the Unravel booth And yes, we are hiring! shivnath@unraveldata.com