How to Move from Monitoring to Observability, On-Premises and in a Multi-Cloud Environment

Splunk
SplunkSplunk
© 2018 SPLUNK INC.© 2018 SPLUNK INC.
How to Move From Monitoring
to Observability
Observability: the disingenuous rebranding of monitoring?
Dr. Siyka Andreeva | IT Operations Analytics Specialist
Marc Serieys | Staff Sales Engineer
June 2019
© 2018 SPLUNK INC.
Forward Looking Statements
During the course of this presentation, we may make forward-looking statements regarding future events or
the expected performance of the company. We caution you that such statements reflect our current
expectations and estimates based on factors currently known to us and that actual events or results could
differ materially. For important factors that may cause actual results to differ from those contained in our
forward-looking statements, please review our filings with the SEC.
The forward-looking statements made in this presentation are being made as of the time and date of its live
presentation. If reviewed after its live presentation, this presentation may not contain current or accurate
information. We do not assume any obligation to update any forward-looking statements we may make. In
addition, any information about our roadmap outlines our general product direction and is subject to change
at any time without notice. It is for informational purposes only and shall not be incorporated into any contract
or other commitment. Splunk undertakes no obligation either to develop the features or functionality
described or to include any such feature or functionality in a future release.
Splunk, Splunk>, Listen to Your Data, The Engine for Machine Data, Splunk Cloud, Splunk Light and SPL are trademarks and registered trademarks of Splunk Inc. in the United States and other countries. All other
brand names, product names, or trademarks belong to their respective owners. © 2017 Splunk Inc. All rights reserved.
© 2018 SPLUNK INC.
Agenda
What is observability ? And how it differs from monitoring?
Why is observability even a bigger challenge in a multi-cloud and containerized world?
How Splunk can help?
© 2018 SPLUNK INC.
What is
Observability?
the disingenuous rebranding of monitoring ?
monitoring on steroids?
DevOpsifying monitoring?
© 2018 SPLUNK INC.
Observability…the word starts spreading
because failure is shifting to application code and in production system behavior
© 2018 SPLUNK INC.
Why the word starts spreading ?
IT Operations monitoring challenges are getting worth in a distributed world:
• IT teams know that something is not working -- but not exactly why it’s not working
• Repetitive, manual processes for reactive troubleshooting
• Inability to get to root cause quickly
• Siloed analysis of logs, traces, and metrics
Management Expectations:
• Avoid financial impact from fewer system outages
• Accelerate investigation of application performance and system incidents with real-time log and metric analysis
• Consolidate operational tools and/or external services into one observability tool
• Improve collaboration across teams with targeted alerting and tailored visualization increases collaboration across teams
Same for Dev teams:
• Gap between perception and the reality
• Dev teams spending too much time observing the dev and pre prod env and not prod
© 2018 SPLUNK INC.
Why observability (in IT) ?
Source Wikipedia
Survivorship bias or survival bias is the logical error of concentrating on the people or things that made it
past some selection process and overlooking those that did not, typically because of their lack of visibility. This
can lead to false conclusions in several different ways.
Shot down aircraft don’t
externalize their state
© 2018 SPLUNK INC.
in Software Systems
Input Output
What is observability?
Flow Valv Purity
Velocity Direction Quality
Physical Telemetry
in Industrial Systems
Customer ID Success/Fail $ Spend
Add to cart Checkout Bill/Ship
Logging, Metrics Functions
© 2018 SPLUNK INC.
From monitoring to the three (only three?) pillars of Observability
Inspired from © @copyconstruct
Symptoms
(what’s broken?)
Monitoring
Alerting
Service health Overview
Investigation
Allthetime
Passive
Ops
Causes
(why?)
Debugging
Profiling
(system behavior)
Dependency analysis
(distributed systems tracing infrastructure)
Observability
Onthefly
Reactive
Dev
Events ProfilesPillar
A
Pillar
B
Pillar
C
Pillar
D
LOGS METRICS TRACES
© 2018 SPLUNK INC.
Why is that important in a multi-cloud environment?
2019 trends
Business Logic
Monolithic
Architecture
Billing
Driver mgntUser mgnt
PaymentNotification
User
API
Driver
Trip mgnt
Microservices Architecture
User
API Gateway
Driver
Container
User mgnt
Container
Billing
Container
Notification
Container
Payment
Container
Driver mgnt
Container
Trip mgnt
Microservices
Business Intelligence
Legacy systems
Frontend
Storage
Compute
Security
?
Multi-Cloud
Hardware
OS
Libraries
App.
Bare metal
Hardware
Hypervisor
OS
Lib
App
OS
Lib
App
OS
Lib
App
Virtual
Machines
Hardware
OS
Container Mgr
Lib
App
Lib
App
Lib
App
Containers
Lib
App
Lib
App
Lib
App
Hardware
OS
Libraries
App Mgr
App AppApp
Serverless
(functions)
App AppApp
App AppApp
App AppApp
App AppApp
Containers / Kubernetes / Serverless
Observability in the distributed (and ephemeral)
systems/cloud space is non-negotiable
Distributed location / responsibilities Distributed systems/code
© 2018 SPLUNK INC.
Customer experience???
SAAS
What happens when we stack them? How does this
apply to you and your Ops teams?
ON PREMISES
Legacy systems
(Mainframe…)
Facilities
Dev/PreProd
Storage
Backup
Archive
DR
Security
VMs
Containers Micro
services
AWS (Application 1)Access / Security
Database
StorageDev
Compute
Containers
App engine
GCP
(Big Data project 1)
Dataflow
AWS
(Archive) Azure (Application 1)
VMs
Database
VM sets
Traffic mger
© 2018 SPLUNK INC.
Customer experience???
SAAS
The consequence: only green lights in the war room
ON PREMISES
Legacy systems
(Mainframe…)
Facilities
Dev/PreProd
Storage
Backup
Archive
DR
Security
VMs
Containers Micro
services
AWS (Application 1)Access / Security
Database
StorageDev
Compute
Containers
App engine
GCP
(Big Data project 1)
Dataflow
AWS
(Archive) Azure (Application 1)
VMs
Database
VM sets
Traffic mger
Cx
O
BLO
SAAS
CISO
DevSysAdmin
MKT
??
?
? ?
© 2018 SPLUNK INC.
Splunk for IT
Operations
How do we help with
Observability everywhere?
© 2018 SPLUNK INC.
A market leader
ITOM IT Operations Management
Tools to manage provisioning, capacity,
performance and availability of IT
OBSERVE
ITOA IT operations analytics
DECIDE
Practice of monitoring systems, and
gathering, processing, analyzing &
interpreting data from ITOps sources to
guide decisions & predict issues
AIOps
ACCELERATE
AIOps platforms enhance IT operations
through greater insights by combining
big data, machine learning and
visualization.
SIEM
PROTECT
security event information management)
#1
#2#1
SECURITY IT OPERATIONS
Sources: IDC and/or Gartner
#2
© 2018 SPLUNK INC.
We reached the limits of the traditional approach
Traditional Data Types
Not future proof
Complex
Never Change!
Untapped IT-generated
machine data
(logs, metrics, wired data…)
Machine data is messy and unpredictable
Requires massive scale
You don’t always know which questions to ask
80%
© 2018 SPLUNK INC.
NotconsumablebyhumansConsumablebyhumans
Industry Leading Platform For Machine Data
Online
ServicesNetworks
Security
Call Detail
Records
Web
Services
Telecoms
Web
Clickstreams
Tracing
Online
Shopping Cart
Smartphones
and Devices
Custom
Applications
Energy Meters
Storage
Public
Cloud Private
Cloud
Containers
On-Premises
Servers
GPS
Location
RFID
Packaged
ApplicationsDatabases MessagingFirewall
Logs Wired DB Mobile IoT APIMetrics
DATA
Any Amount
Any Location
Any Source
No need to “adapt or
structure” the data
No database
No need to filter data
SPLUNKBASE 1600+ Free Apps/add-ons
SPLUNK PLATFORM Custom
dashboards
Report &
analyze
Monitor
and alert
Developer
Platform
Ad hoc
search
On-prem or cloud
PREMIUM APPS “data scientist in a box”
IT Ops, DevOps Security Business Analytics, IoT
Different people asking different questions on the same data, in real time
3rd Party
Phantom Orchestration
VictorOps Collaboration
CMDB,
SNOW…
Data lake
APM
Traces
APM
Tracing
© 2018 SPLUNK INC.
Structure Machine data
= fighting a losing battle
© 2018 SPLUNK INC.
How to find a needle in multiple haystacks?
(choose your tool)
Network?
Database?
Middleware?
Hardware?
Wrong
command?
Connection?
Apache?
VM?
Mainframe?
Load
balancer?Wrong code
released?
Collect ALL data
• Collect from all silos
• Data in original raw format
• Add open sources apps to
ingest data on the fly
• Schema on the fly
• Dynamic thresholding
• Realtime correlation
Clustering & aggregation
• Real time event
clustering/correlation
• Reduce alert noise
• Behavioural analytics
• Deduplication
Add context
• Measure / report on
indicators that matters
• Add service / business
context
• Add actionable
information to detection
Salessso
Claims
Anomaly detection
• Catch issues that thresholds
cannot
• Reduce event clutter
• Deviation from past
behaviour
• Deviation from peers
• Unusual change in features
Assisted deep dive
investigation
• Root cause analysis
• Powerful & easy to use
search & investigate
language
?
Predictive
Analytics
• Predict service health
• Predict events
• Trend forecasting
• Detect influencing
entities
• Early warning of
failure
70% to 90%
Reduction in investigation time
15% to 45%
Reduction in high priority incidents
67% to 82%
Reduction in business
impact
© 2018 SPLUNK INC.
UnknownKnown
Awareness/DataAvailable
Knowns Unknowns
Understanding
Observability with Splunk
Known Knowns
(Known problem & solution)
Unknown Knowns
(didn’t realize but clear solution)
Known Unknowns
(we see the problem, not the solution)
Unknown Unknowns
(no idea it’ll happen)
Improve the Known-
Knowns
Dynamic thresholding,
automation, schema on fly,
real time dashboards…
Provide auto correlations, real
time search’s, analytics,
business process mining…
See the Known-
Unknowns
Discover the
Unknown-Knowns
Anomaly detection, predictive IT… Ingest any data, ask any question,
get answers in real time…
Explore the
Unknown-Unknowns
© 2018 SPLUNK INC.
Answer new questions, find new unknowns
Observe | Monitor | Analyze | Act
© 2018 SPLUNK INC.
It’s a journey
Search & Monitor
(Any) Data collection
Real time
monitoring/observability
Centralized Machine Data
Search
Business Insights
Business KPIs
Insights to drive experienceOperational visibility
Service Oriented View
Root Cause analysis
Stabilize IT
Predict & Improve
Predict issues
Recommend actions based
on prior behaviors
Increase MTBF
© 2018 SPLUNK INC.© 2018 SPLUNK INC.
Thank you
1 de 22

Recomendados

Cloud-Native Observability por
Cloud-Native ObservabilityCloud-Native Observability
Cloud-Native ObservabilityTyler Treat
885 visualizações95 slides
Monitoring & Observability por
Monitoring & ObservabilityMonitoring & Observability
Monitoring & ObservabilityLumban Sopian
305 visualizações20 slides
Observability For Modern Applications por
Observability For Modern ApplicationsObservability For Modern Applications
Observability For Modern ApplicationsAmazon Web Services
4.6K visualizações53 slides
More Than Monitoring: How Observability Takes You From Firefighting to Fire P... por
More Than Monitoring: How Observability Takes You From Firefighting to Fire P...More Than Monitoring: How Observability Takes You From Firefighting to Fire P...
More Than Monitoring: How Observability Takes You From Firefighting to Fire P...DevOps.com
383 visualizações45 slides
Observability for modern applications por
Observability for modern applications  Observability for modern applications
Observability for modern applications MoovingON
28.5K visualizações30 slides
Observability por
ObservabilityObservability
ObservabilityMaganathin Veeraragaloo
1.2K visualizações46 slides

Mais conteúdo relacionado

Mais procurados

Observability por
ObservabilityObservability
ObservabilityMartin Gross
258 visualizações21 slides
Observability vs APM vs Monitoring Comparison por
Observability vs APM vs  Monitoring ComparisonObservability vs APM vs  Monitoring Comparison
Observability vs APM vs Monitoring Comparisonjeetendra mandal
441 visualizações19 slides
Observability por
ObservabilityObservability
ObservabilityEbru Cucen Çüçen
193 visualizações39 slides
Observability & Datadog por
Observability & DatadogObservability & Datadog
Observability & DatadogJamesAnderson599331
345 visualizações14 slides
Observability por
Observability Observability
Observability Enes Altınok
362 visualizações37 slides
.conf Go 2022 - Observability Session por
.conf Go 2022 - Observability Session.conf Go 2022 - Observability Session
.conf Go 2022 - Observability SessionSplunk
302 visualizações26 slides

Mais procurados(20)

Observability por Martin Gross
ObservabilityObservability
Observability
Martin Gross258 visualizações
Observability vs APM vs Monitoring Comparison por jeetendra mandal
Observability vs APM vs  Monitoring ComparisonObservability vs APM vs  Monitoring Comparison
Observability vs APM vs Monitoring Comparison
jeetendra mandal441 visualizações
Observability & Datadog por JamesAnderson599331
Observability & DatadogObservability & Datadog
Observability & Datadog
JamesAnderson599331345 visualizações
Observability por Enes Altınok
Observability Observability
Observability
Enes Altınok362 visualizações
.conf Go 2022 - Observability Session por Splunk
.conf Go 2022 - Observability Session.conf Go 2022 - Observability Session
.conf Go 2022 - Observability Session
Splunk302 visualizações
Observability – the good, the bad, and the ugly por Timetrix
Observability – the good, the bad, and the uglyObservability – the good, the bad, and the ugly
Observability – the good, the bad, and the ugly
Timetrix199 visualizações
Monitoring and observability por Theo Schlossnagle
Monitoring and observabilityMonitoring and observability
Monitoring and observability
Theo Schlossnagle4K visualizações
Elastic Observability keynote por Elasticsearch
Elastic Observability keynoteElastic Observability keynote
Elastic Observability keynote
Elasticsearch624 visualizações
Monitoring and observability por Theo Schlossnagle
Monitoring and observabilityMonitoring and observability
Monitoring and observability
Theo Schlossnagle7.9K visualizações
Combining Logs, Metrics, and Traces for Unified Observability por Elasticsearch
Combining Logs, Metrics, and Traces for Unified ObservabilityCombining Logs, Metrics, and Traces for Unified Observability
Combining Logs, Metrics, and Traces for Unified Observability
Elasticsearch578 visualizações
Observability in the world of microservices por Chandresh Pancholi
Observability in the world of microservicesObservability in the world of microservices
Observability in the world of microservices
Chandresh Pancholi344 visualizações
Application Performance Monitoring (APM) por Site24x7
Application Performance Monitoring (APM)Application Performance Monitoring (APM)
Application Performance Monitoring (APM)
Site24x74.7K visualizações
Building an SRE Organization @ Squarespace por Franklin Angulo
Building an SRE Organization @ SquarespaceBuilding an SRE Organization @ Squarespace
Building an SRE Organization @ Squarespace
Franklin Angulo2.1K visualizações
Observability driven development por Geert van der Cruijsen
Observability driven developmentObservability driven development
Observability driven development
Geert van der Cruijsen640 visualizações
Using AIOps to reduce incidents volume por Amazon Web Services
Using AIOps to reduce incidents volumeUsing AIOps to reduce incidents volume
Using AIOps to reduce incidents volume
Amazon Web Services607 visualizações
Dynatrace por Purnima Kurella
DynatraceDynatrace
Dynatrace
Purnima Kurella20.8K visualizações
Observability por Diego Pacheco
ObservabilityObservability
Observability
Diego Pacheco474 visualizações
Getting started with Site Reliability Engineering (SRE) por Abeer R
Getting started with Site Reliability Engineering (SRE)Getting started with Site Reliability Engineering (SRE)
Getting started with Site Reliability Engineering (SRE)
Abeer R14.2K visualizações

Similar a How to Move from Monitoring to Observability, On-Premises and in a Multi-Cloud Environment

SplunkLive! Paris 2018: Integrating Metrics and Logs por
SplunkLive! Paris 2018: Integrating Metrics and LogsSplunkLive! Paris 2018: Integrating Metrics and Logs
SplunkLive! Paris 2018: Integrating Metrics and LogsSplunk
238 visualizações25 slides
SplunkLive! Zurich 2018: Monitoring the End User Experience with Splunk por
SplunkLive! Zurich 2018: Monitoring the End User Experience with SplunkSplunkLive! Zurich 2018: Monitoring the End User Experience with Splunk
SplunkLive! Zurich 2018: Monitoring the End User Experience with SplunkSplunk
525 visualizações25 slides
SplunkLive! Munich 2018: Monitoring the End-User Experience with Splunk por
SplunkLive! Munich 2018: Monitoring the End-User Experience with SplunkSplunkLive! Munich 2018: Monitoring the End-User Experience with Splunk
SplunkLive! Munich 2018: Monitoring the End-User Experience with SplunkSplunk
480 visualizações24 slides
Still Suffering from IT Outages? Accept Failure, Learn from Failure and Get R... por
Still Suffering from IT Outages? Accept Failure, Learn from Failure and Get R...Still Suffering from IT Outages? Accept Failure, Learn from Failure and Get R...
Still Suffering from IT Outages? Accept Failure, Learn from Failure and Get R...Splunk
256 visualizações32 slides
SplunkLive! Frankfurt 2018 - Monitoring the End User Experience with Splunk por
SplunkLive! Frankfurt 2018 - Monitoring the End User Experience with SplunkSplunkLive! Frankfurt 2018 - Monitoring the End User Experience with Splunk
SplunkLive! Frankfurt 2018 - Monitoring the End User Experience with SplunkSplunk
251 visualizações24 slides
Splunk for IT Operations Breakout Session por
Splunk for IT Operations Breakout SessionSplunk for IT Operations Breakout Session
Splunk for IT Operations Breakout SessionSplunk
860 visualizações23 slides

Similar a How to Move from Monitoring to Observability, On-Premises and in a Multi-Cloud Environment(20)

SplunkLive! Paris 2018: Integrating Metrics and Logs por Splunk
SplunkLive! Paris 2018: Integrating Metrics and LogsSplunkLive! Paris 2018: Integrating Metrics and Logs
SplunkLive! Paris 2018: Integrating Metrics and Logs
Splunk238 visualizações
SplunkLive! Zurich 2018: Monitoring the End User Experience with Splunk por Splunk
SplunkLive! Zurich 2018: Monitoring the End User Experience with SplunkSplunkLive! Zurich 2018: Monitoring the End User Experience with Splunk
SplunkLive! Zurich 2018: Monitoring the End User Experience with Splunk
Splunk525 visualizações
SplunkLive! Munich 2018: Monitoring the End-User Experience with Splunk por Splunk
SplunkLive! Munich 2018: Monitoring the End-User Experience with SplunkSplunkLive! Munich 2018: Monitoring the End-User Experience with Splunk
SplunkLive! Munich 2018: Monitoring the End-User Experience with Splunk
Splunk480 visualizações
Still Suffering from IT Outages? Accept Failure, Learn from Failure and Get R... por Splunk
Still Suffering from IT Outages? Accept Failure, Learn from Failure and Get R...Still Suffering from IT Outages? Accept Failure, Learn from Failure and Get R...
Still Suffering from IT Outages? Accept Failure, Learn from Failure and Get R...
Splunk256 visualizações
SplunkLive! Frankfurt 2018 - Monitoring the End User Experience with Splunk por Splunk
SplunkLive! Frankfurt 2018 - Monitoring the End User Experience with SplunkSplunkLive! Frankfurt 2018 - Monitoring the End User Experience with Splunk
SplunkLive! Frankfurt 2018 - Monitoring the End User Experience with Splunk
Splunk251 visualizações
Splunk for IT Operations Breakout Session por Splunk
Splunk for IT Operations Breakout SessionSplunk for IT Operations Breakout Session
Splunk for IT Operations Breakout Session
Splunk860 visualizações
Splunk IT Service Intelligence por Georg Knon
Splunk IT Service IntelligenceSplunk IT Service Intelligence
Splunk IT Service Intelligence
Georg Knon1.4K visualizações
Webinar: Neuigkeiten zu Splunk Enterprise 6.3 por Splunk
Webinar: Neuigkeiten zu Splunk Enterprise 6.3Webinar: Neuigkeiten zu Splunk Enterprise 6.3
Webinar: Neuigkeiten zu Splunk Enterprise 6.3
Splunk483 visualizações
Splunk for Industrial Data and the Internet of Things por aliciasyc
Splunk for Industrial Data and the Internet of ThingsSplunk for Industrial Data and the Internet of Things
Splunk for Industrial Data and the Internet of Things
aliciasyc458 visualizações
Splunk - Splunk for Industrial Data and the Internet of Things por Aruj Thirawat
Splunk - Splunk for Industrial Data and the Internet of ThingsSplunk - Splunk for Industrial Data and the Internet of Things
Splunk - Splunk for Industrial Data and the Internet of Things
Aruj Thirawat2.1K visualizações
SplunkLive! Munich 2018: Predictive, Proactive, and Collaborative ML with IT ... por Splunk
SplunkLive! Munich 2018: Predictive, Proactive, and Collaborative ML with IT ...SplunkLive! Munich 2018: Predictive, Proactive, and Collaborative ML with IT ...
SplunkLive! Munich 2018: Predictive, Proactive, and Collaborative ML with IT ...
Splunk681 visualizações
Adventures in Monitoring and Troubleshooting por Splunk
Adventures in Monitoring and Troubleshooting Adventures in Monitoring and Troubleshooting
Adventures in Monitoring and Troubleshooting
Splunk395 visualizações
Adventures in Monitoring and Troubleshooting por Splunk
Adventures in Monitoring and Troubleshooting Adventures in Monitoring and Troubleshooting
Adventures in Monitoring and Troubleshooting
Splunk151 visualizações
SplunkLive! Frankfurt 2018 - Predictive, Proactive, and Collaborative ML with... por Splunk
SplunkLive! Frankfurt 2018 - Predictive, Proactive, and Collaborative ML with...SplunkLive! Frankfurt 2018 - Predictive, Proactive, and Collaborative ML with...
SplunkLive! Frankfurt 2018 - Predictive, Proactive, and Collaborative ML with...
Splunk266 visualizações
SplunkLive! Paris 2018: Splunk Overview por Splunk
SplunkLive! Paris 2018: Splunk OverviewSplunkLive! Paris 2018: Splunk Overview
SplunkLive! Paris 2018: Splunk Overview
Splunk2.3K visualizações
AIOps Roundtable Munich 2018 por Splunk
AIOps Roundtable Munich 2018AIOps Roundtable Munich 2018
AIOps Roundtable Munich 2018
Splunk1.4K visualizações
2019 Performance Monitoring and Management Trends and Insights por OpsRamp
2019 Performance Monitoring and Management Trends and Insights2019 Performance Monitoring and Management Trends and Insights
2019 Performance Monitoring and Management Trends and Insights
OpsRamp1.6K visualizações
Virtual SplunkLive! for Higher Education Overview/Customers por Splunk
Virtual SplunkLive! for Higher Education Overview/CustomersVirtual SplunkLive! for Higher Education Overview/Customers
Virtual SplunkLive! for Higher Education Overview/Customers
Splunk1.7K visualizações
Getting Started with Splunk Enterprise por Splunk
Getting Started with Splunk EnterpriseGetting Started with Splunk Enterprise
Getting Started with Splunk Enterprise
Splunk1.4K visualizações
SplunkLive! Paris 2018: Splunk And AI 101 por Splunk
SplunkLive! Paris 2018: Splunk And AI 101SplunkLive! Paris 2018: Splunk And AI 101
SplunkLive! Paris 2018: Splunk And AI 101
Splunk392 visualizações

Mais de Splunk

.conf Go 2023 - Data analysis as a routine por
.conf Go 2023 - Data analysis as a routine.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routineSplunk
96 visualizações12 slides
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV por
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTVSplunk
91 visualizações20 slides
.conf Go 2023 - Comment Engie France Retail supervise ses activités critiques... por
.conf Go 2023 - Comment Engie France Retail supervise ses activités critiques....conf Go 2023 - Comment Engie France Retail supervise ses activités critiques...
.conf Go 2023 - Comment Engie France Retail supervise ses activités critiques...Splunk
93 visualizações28 slides
.conf Go 2023 - Navegando la normativa SOX (Telefónica) por
.conf Go 2023 - Navegando la normativa SOX (Telefónica).conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Navegando la normativa SOX (Telefónica)Splunk
193 visualizações31 slides
.conf Go 2023 - SIEM project @ SNF por
.conf Go 2023 - SIEM project @ SNF.conf Go 2023 - SIEM project @ SNF
.conf Go 2023 - SIEM project @ SNFSplunk
209 visualizações18 slides
.conf Go 2023 - Raiffeisen Bank International por
.conf Go 2023 - Raiffeisen Bank International.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - Raiffeisen Bank InternationalSplunk
217 visualizações16 slides

Mais de Splunk(20)

.conf Go 2023 - Data analysis as a routine por Splunk
.conf Go 2023 - Data analysis as a routine.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - Data analysis as a routine
Splunk96 visualizações
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV por Splunk
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
Splunk91 visualizações
.conf Go 2023 - Comment Engie France Retail supervise ses activités critiques... por Splunk
.conf Go 2023 - Comment Engie France Retail supervise ses activités critiques....conf Go 2023 - Comment Engie France Retail supervise ses activités critiques...
.conf Go 2023 - Comment Engie France Retail supervise ses activités critiques...
Splunk93 visualizações
.conf Go 2023 - Navegando la normativa SOX (Telefónica) por Splunk
.conf Go 2023 - Navegando la normativa SOX (Telefónica).conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
Splunk193 visualizações
.conf Go 2023 - SIEM project @ SNF por Splunk
.conf Go 2023 - SIEM project @ SNF.conf Go 2023 - SIEM project @ SNF
.conf Go 2023 - SIEM project @ SNF
Splunk209 visualizações
.conf Go 2023 - Raiffeisen Bank International por Splunk
.conf Go 2023 - Raiffeisen Bank International.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - Raiffeisen Bank International
Splunk217 visualizações
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett por Splunk
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett .conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
Splunk182 visualizações
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär) por Splunk
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär).conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
Splunk221 visualizações
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu... por Splunk
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu....conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
Splunk193 visualizações
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever... por Splunk
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever....conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
Splunk197 visualizações
.conf go 2023 - De NOC a CSIRT (Cellnex) por Splunk
.conf go 2023 - De NOC a CSIRT (Cellnex).conf go 2023 - De NOC a CSIRT (Cellnex)
.conf go 2023 - De NOC a CSIRT (Cellnex)
Splunk196 visualizações
conf go 2023 - El camino hacia la ciberseguridad (ABANCA) por Splunk
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
Splunk195 visualizações
Splunk - BMW connects business and IT with data driven operations SRE and O11y por Splunk
Splunk - BMW connects business and IT with data driven operations SRE and O11ySplunk - BMW connects business and IT with data driven operations SRE and O11y
Splunk - BMW connects business and IT with data driven operations SRE and O11y
Splunk16 visualizações
Splunk x Freenet - .conf Go Köln por Splunk
Splunk x Freenet - .conf Go KölnSplunk x Freenet - .conf Go Köln
Splunk x Freenet - .conf Go Köln
Splunk97 visualizações
Splunk Security Session - .conf Go Köln por Splunk
Splunk Security Session - .conf Go KölnSplunk Security Session - .conf Go Köln
Splunk Security Session - .conf Go Köln
Splunk204 visualizações
Data foundations building success, at city scale – Imperial College London por Splunk
 Data foundations building success, at city scale – Imperial College London Data foundations building success, at city scale – Imperial College London
Data foundations building success, at city scale – Imperial College London
Splunk83 visualizações
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen... por Splunk
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
Splunk: How Vodafone established Operational Analytics in a Hybrid Environmen...
Splunk153 visualizações
SOC, Amore Mio! | Security Webinar por Splunk
SOC, Amore Mio! | Security WebinarSOC, Amore Mio! | Security Webinar
SOC, Amore Mio! | Security Webinar
Splunk508 visualizações
.conf Go Zurich 2022 - Keynote por Splunk
.conf Go Zurich 2022 - Keynote.conf Go Zurich 2022 - Keynote
.conf Go Zurich 2022 - Keynote
Splunk48 visualizações
.conf Go Zurich 2022 - Platform Session por Splunk
.conf Go Zurich 2022 - Platform Session.conf Go Zurich 2022 - Platform Session
.conf Go Zurich 2022 - Platform Session
Splunk97 visualizações

Último

HTTP headers that make your website go faster - devs.gent November 2023 por
HTTP headers that make your website go faster - devs.gent November 2023HTTP headers that make your website go faster - devs.gent November 2023
HTTP headers that make your website go faster - devs.gent November 2023Thijs Feryn
22 visualizações151 slides
Business Analyst Series 2023 - Week 3 Session 5 por
Business Analyst Series 2023 -  Week 3 Session 5Business Analyst Series 2023 -  Week 3 Session 5
Business Analyst Series 2023 - Week 3 Session 5DianaGray10
248 visualizações20 slides
PRODUCT PRESENTATION.pptx por
PRODUCT PRESENTATION.pptxPRODUCT PRESENTATION.pptx
PRODUCT PRESENTATION.pptxangelicacueva6
14 visualizações1 slide
Special_edition_innovator_2023.pdf por
Special_edition_innovator_2023.pdfSpecial_edition_innovator_2023.pdf
Special_edition_innovator_2023.pdfWillDavies22
17 visualizações6 slides
Tunable Laser (1).pptx por
Tunable Laser (1).pptxTunable Laser (1).pptx
Tunable Laser (1).pptxHajira Mahmood
24 visualizações37 slides
Ransomware is Knocking your Door_Final.pdf por
Ransomware is Knocking your Door_Final.pdfRansomware is Knocking your Door_Final.pdf
Ransomware is Knocking your Door_Final.pdfSecurity Bootcamp
55 visualizações46 slides

Último(20)

HTTP headers that make your website go faster - devs.gent November 2023 por Thijs Feryn
HTTP headers that make your website go faster - devs.gent November 2023HTTP headers that make your website go faster - devs.gent November 2023
HTTP headers that make your website go faster - devs.gent November 2023
Thijs Feryn22 visualizações
Business Analyst Series 2023 - Week 3 Session 5 por DianaGray10
Business Analyst Series 2023 -  Week 3 Session 5Business Analyst Series 2023 -  Week 3 Session 5
Business Analyst Series 2023 - Week 3 Session 5
DianaGray10248 visualizações
PRODUCT PRESENTATION.pptx por angelicacueva6
PRODUCT PRESENTATION.pptxPRODUCT PRESENTATION.pptx
PRODUCT PRESENTATION.pptx
angelicacueva614 visualizações
Special_edition_innovator_2023.pdf por WillDavies22
Special_edition_innovator_2023.pdfSpecial_edition_innovator_2023.pdf
Special_edition_innovator_2023.pdf
WillDavies2217 visualizações
Tunable Laser (1).pptx por Hajira Mahmood
Tunable Laser (1).pptxTunable Laser (1).pptx
Tunable Laser (1).pptx
Hajira Mahmood24 visualizações
Ransomware is Knocking your Door_Final.pdf por Security Bootcamp
Ransomware is Knocking your Door_Final.pdfRansomware is Knocking your Door_Final.pdf
Ransomware is Knocking your Door_Final.pdf
Security Bootcamp55 visualizações
STKI Israeli Market Study 2023 corrected forecast 2023_24 v3.pdf por Dr. Jimmy Schwarzkopf
STKI Israeli Market Study 2023   corrected forecast 2023_24 v3.pdfSTKI Israeli Market Study 2023   corrected forecast 2023_24 v3.pdf
STKI Israeli Market Study 2023 corrected forecast 2023_24 v3.pdf
Dr. Jimmy Schwarzkopf19 visualizações
Case Study Copenhagen Energy and Business Central.pdf por Aitana
Case Study Copenhagen Energy and Business Central.pdfCase Study Copenhagen Energy and Business Central.pdf
Case Study Copenhagen Energy and Business Central.pdf
Aitana16 visualizações
AMAZON PRODUCT RESEARCH.pdf por JerikkLaureta
AMAZON PRODUCT RESEARCH.pdfAMAZON PRODUCT RESEARCH.pdf
AMAZON PRODUCT RESEARCH.pdf
JerikkLaureta26 visualizações
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors por sugiuralab
TouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective SensorsTouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective Sensors
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors
sugiuralab19 visualizações
Attacking IoT Devices from a Web Perspective - Linux Day por Simone Onofri
Attacking IoT Devices from a Web Perspective - Linux Day Attacking IoT Devices from a Web Perspective - Linux Day
Attacking IoT Devices from a Web Perspective - Linux Day
Simone Onofri16 visualizações
Scaling Knowledge Graph Architectures with AI por Enterprise Knowledge
Scaling Knowledge Graph Architectures with AIScaling Knowledge Graph Architectures with AI
Scaling Knowledge Graph Architectures with AI
Enterprise Knowledge30 visualizações
Empathic Computing: Delivering the Potential of the Metaverse por Mark Billinghurst
Empathic Computing: Delivering  the Potential of the MetaverseEmpathic Computing: Delivering  the Potential of the Metaverse
Empathic Computing: Delivering the Potential of the Metaverse
Mark Billinghurst478 visualizações
PRODUCT LISTING.pptx por angelicacueva6
PRODUCT LISTING.pptxPRODUCT LISTING.pptx
PRODUCT LISTING.pptx
angelicacueva614 visualizações
Microsoft Power Platform.pptx por Uni Systems S.M.S.A.
Microsoft Power Platform.pptxMicrosoft Power Platform.pptx
Microsoft Power Platform.pptx
Uni Systems S.M.S.A.53 visualizações
handbook for web 3 adoption.pdf por Liveplex
handbook for web 3 adoption.pdfhandbook for web 3 adoption.pdf
handbook for web 3 adoption.pdf
Liveplex22 visualizações
Voice Logger - Telephony Integration Solution at Aegis por Nirmal Sharma
Voice Logger - Telephony Integration Solution at AegisVoice Logger - Telephony Integration Solution at Aegis
Voice Logger - Telephony Integration Solution at Aegis
Nirmal Sharma39 visualizações
Info Session November 2023.pdf por AleksandraKoprivica4
Info Session November 2023.pdfInfo Session November 2023.pdf
Info Session November 2023.pdf
AleksandraKoprivica412 visualizações

How to Move from Monitoring to Observability, On-Premises and in a Multi-Cloud Environment

  • 1. © 2018 SPLUNK INC.© 2018 SPLUNK INC. How to Move From Monitoring to Observability Observability: the disingenuous rebranding of monitoring? Dr. Siyka Andreeva | IT Operations Analytics Specialist Marc Serieys | Staff Sales Engineer June 2019
  • 2. © 2018 SPLUNK INC. Forward Looking Statements During the course of this presentation, we may make forward-looking statements regarding future events or the expected performance of the company. We caution you that such statements reflect our current expectations and estimates based on factors currently known to us and that actual events or results could differ materially. For important factors that may cause actual results to differ from those contained in our forward-looking statements, please review our filings with the SEC. The forward-looking statements made in this presentation are being made as of the time and date of its live presentation. If reviewed after its live presentation, this presentation may not contain current or accurate information. We do not assume any obligation to update any forward-looking statements we may make. In addition, any information about our roadmap outlines our general product direction and is subject to change at any time without notice. It is for informational purposes only and shall not be incorporated into any contract or other commitment. Splunk undertakes no obligation either to develop the features or functionality described or to include any such feature or functionality in a future release. Splunk, Splunk>, Listen to Your Data, The Engine for Machine Data, Splunk Cloud, Splunk Light and SPL are trademarks and registered trademarks of Splunk Inc. in the United States and other countries. All other brand names, product names, or trademarks belong to their respective owners. © 2017 Splunk Inc. All rights reserved.
  • 3. © 2018 SPLUNK INC. Agenda What is observability ? And how it differs from monitoring? Why is observability even a bigger challenge in a multi-cloud and containerized world? How Splunk can help?
  • 4. © 2018 SPLUNK INC. What is Observability? the disingenuous rebranding of monitoring ? monitoring on steroids? DevOpsifying monitoring?
  • 5. © 2018 SPLUNK INC. Observability…the word starts spreading because failure is shifting to application code and in production system behavior
  • 6. © 2018 SPLUNK INC. Why the word starts spreading ? IT Operations monitoring challenges are getting worth in a distributed world: • IT teams know that something is not working -- but not exactly why it’s not working • Repetitive, manual processes for reactive troubleshooting • Inability to get to root cause quickly • Siloed analysis of logs, traces, and metrics Management Expectations: • Avoid financial impact from fewer system outages • Accelerate investigation of application performance and system incidents with real-time log and metric analysis • Consolidate operational tools and/or external services into one observability tool • Improve collaboration across teams with targeted alerting and tailored visualization increases collaboration across teams Same for Dev teams: • Gap between perception and the reality • Dev teams spending too much time observing the dev and pre prod env and not prod
  • 7. © 2018 SPLUNK INC. Why observability (in IT) ? Source Wikipedia Survivorship bias or survival bias is the logical error of concentrating on the people or things that made it past some selection process and overlooking those that did not, typically because of their lack of visibility. This can lead to false conclusions in several different ways. Shot down aircraft don’t externalize their state
  • 8. © 2018 SPLUNK INC. in Software Systems Input Output What is observability? Flow Valv Purity Velocity Direction Quality Physical Telemetry in Industrial Systems Customer ID Success/Fail $ Spend Add to cart Checkout Bill/Ship Logging, Metrics Functions
  • 9. © 2018 SPLUNK INC. From monitoring to the three (only three?) pillars of Observability Inspired from © @copyconstruct Symptoms (what’s broken?) Monitoring Alerting Service health Overview Investigation Allthetime Passive Ops Causes (why?) Debugging Profiling (system behavior) Dependency analysis (distributed systems tracing infrastructure) Observability Onthefly Reactive Dev Events ProfilesPillar A Pillar B Pillar C Pillar D LOGS METRICS TRACES
  • 10. © 2018 SPLUNK INC. Why is that important in a multi-cloud environment? 2019 trends Business Logic Monolithic Architecture Billing Driver mgntUser mgnt PaymentNotification User API Driver Trip mgnt Microservices Architecture User API Gateway Driver Container User mgnt Container Billing Container Notification Container Payment Container Driver mgnt Container Trip mgnt Microservices Business Intelligence Legacy systems Frontend Storage Compute Security ? Multi-Cloud Hardware OS Libraries App. Bare metal Hardware Hypervisor OS Lib App OS Lib App OS Lib App Virtual Machines Hardware OS Container Mgr Lib App Lib App Lib App Containers Lib App Lib App Lib App Hardware OS Libraries App Mgr App AppApp Serverless (functions) App AppApp App AppApp App AppApp App AppApp Containers / Kubernetes / Serverless Observability in the distributed (and ephemeral) systems/cloud space is non-negotiable Distributed location / responsibilities Distributed systems/code
  • 11. © 2018 SPLUNK INC. Customer experience??? SAAS What happens when we stack them? How does this apply to you and your Ops teams? ON PREMISES Legacy systems (Mainframe…) Facilities Dev/PreProd Storage Backup Archive DR Security VMs Containers Micro services AWS (Application 1)Access / Security Database StorageDev Compute Containers App engine GCP (Big Data project 1) Dataflow AWS (Archive) Azure (Application 1) VMs Database VM sets Traffic mger
  • 12. © 2018 SPLUNK INC. Customer experience??? SAAS The consequence: only green lights in the war room ON PREMISES Legacy systems (Mainframe…) Facilities Dev/PreProd Storage Backup Archive DR Security VMs Containers Micro services AWS (Application 1)Access / Security Database StorageDev Compute Containers App engine GCP (Big Data project 1) Dataflow AWS (Archive) Azure (Application 1) VMs Database VM sets Traffic mger Cx O BLO SAAS CISO DevSysAdmin MKT ?? ? ? ?
  • 13. © 2018 SPLUNK INC. Splunk for IT Operations How do we help with Observability everywhere?
  • 14. © 2018 SPLUNK INC. A market leader ITOM IT Operations Management Tools to manage provisioning, capacity, performance and availability of IT OBSERVE ITOA IT operations analytics DECIDE Practice of monitoring systems, and gathering, processing, analyzing & interpreting data from ITOps sources to guide decisions & predict issues AIOps ACCELERATE AIOps platforms enhance IT operations through greater insights by combining big data, machine learning and visualization. SIEM PROTECT security event information management) #1 #2#1 SECURITY IT OPERATIONS Sources: IDC and/or Gartner #2
  • 15. © 2018 SPLUNK INC. We reached the limits of the traditional approach Traditional Data Types Not future proof Complex Never Change! Untapped IT-generated machine data (logs, metrics, wired data…) Machine data is messy and unpredictable Requires massive scale You don’t always know which questions to ask 80%
  • 16. © 2018 SPLUNK INC. NotconsumablebyhumansConsumablebyhumans Industry Leading Platform For Machine Data Online ServicesNetworks Security Call Detail Records Web Services Telecoms Web Clickstreams Tracing Online Shopping Cart Smartphones and Devices Custom Applications Energy Meters Storage Public Cloud Private Cloud Containers On-Premises Servers GPS Location RFID Packaged ApplicationsDatabases MessagingFirewall Logs Wired DB Mobile IoT APIMetrics DATA Any Amount Any Location Any Source No need to “adapt or structure” the data No database No need to filter data SPLUNKBASE 1600+ Free Apps/add-ons SPLUNK PLATFORM Custom dashboards Report & analyze Monitor and alert Developer Platform Ad hoc search On-prem or cloud PREMIUM APPS “data scientist in a box” IT Ops, DevOps Security Business Analytics, IoT Different people asking different questions on the same data, in real time 3rd Party Phantom Orchestration VictorOps Collaboration CMDB, SNOW… Data lake APM Traces APM Tracing
  • 17. © 2018 SPLUNK INC. Structure Machine data = fighting a losing battle
  • 18. © 2018 SPLUNK INC. How to find a needle in multiple haystacks? (choose your tool) Network? Database? Middleware? Hardware? Wrong command? Connection? Apache? VM? Mainframe? Load balancer?Wrong code released? Collect ALL data • Collect from all silos • Data in original raw format • Add open sources apps to ingest data on the fly • Schema on the fly • Dynamic thresholding • Realtime correlation Clustering & aggregation • Real time event clustering/correlation • Reduce alert noise • Behavioural analytics • Deduplication Add context • Measure / report on indicators that matters • Add service / business context • Add actionable information to detection Salessso Claims Anomaly detection • Catch issues that thresholds cannot • Reduce event clutter • Deviation from past behaviour • Deviation from peers • Unusual change in features Assisted deep dive investigation • Root cause analysis • Powerful & easy to use search & investigate language ? Predictive Analytics • Predict service health • Predict events • Trend forecasting • Detect influencing entities • Early warning of failure 70% to 90% Reduction in investigation time 15% to 45% Reduction in high priority incidents 67% to 82% Reduction in business impact
  • 19. © 2018 SPLUNK INC. UnknownKnown Awareness/DataAvailable Knowns Unknowns Understanding Observability with Splunk Known Knowns (Known problem & solution) Unknown Knowns (didn’t realize but clear solution) Known Unknowns (we see the problem, not the solution) Unknown Unknowns (no idea it’ll happen) Improve the Known- Knowns Dynamic thresholding, automation, schema on fly, real time dashboards… Provide auto correlations, real time search’s, analytics, business process mining… See the Known- Unknowns Discover the Unknown-Knowns Anomaly detection, predictive IT… Ingest any data, ask any question, get answers in real time… Explore the Unknown-Unknowns
  • 20. © 2018 SPLUNK INC. Answer new questions, find new unknowns Observe | Monitor | Analyze | Act
  • 21. © 2018 SPLUNK INC. It’s a journey Search & Monitor (Any) Data collection Real time monitoring/observability Centralized Machine Data Search Business Insights Business KPIs Insights to drive experienceOperational visibility Service Oriented View Root Cause analysis Stabilize IT Predict & Improve Predict issues Recommend actions based on prior behaviors Increase MTBF
  • 22. © 2018 SPLUNK INC.© 2018 SPLUNK INC. Thank you