More Than Monitoring: How Observability Takes You From Firefighting to Fire Prevention

DevOps.com
DevOps.comDevOps.com
© 2020 SPLUNK INC.
More Than Monitoring:
How Observability Take
You From Firefighting
to Fire Prevention
© 2 0 1 9 S P L U N K I N C .
Stephane Estevez
EMEA Product Marketing Director, IT Markets, Splunk
During the course of this presentation, we may make forward-looking statements
regarding future events or plans of the company. We caution you that such statements
reflect our current expectations and estimates based on factors currently known to us
and that actual events or results may differ materially. The forward-looking statements
made in the this presentation are being made as of the time and date of its live
presentation. If reviewed after its live presentation, it may not contain current or
accurate information. We do not assume any obligation to update
any forward-looking statements made herein.
In addition, any information about our roadmap outlines our general product direction
and is subject to change at any time without notice. It is for informational purposes only,
and shall not be incorporated into any contract or other commitment. Splunk undertakes
no obligation either to develop the features or functionalities described or to include any
such feature or functionality in a future release.
Splunk, Splunk>, Turn Data Into Doing, The Engine for Machine Data, Splunk Cloud,
Splunk Light and SPL are trademarks and registered trademarks of Splunk Inc. in the
United States and other countries. All other brand names, product names, or
trademarks belong to their respective owners. © 2020 Splunk Inc. All rights reserved.
Forward-
Looking
Statements
© 2020 SPLUNK INC.
© 2020 SPLUNK INC.
• Observability in a nutshell
• Key Observability use cases
• Adding Observability to Monitoring
• Demo
• Adding AIOps
• About Splunk
Agenda
© 2020 SPLUNK INC.
Observability in a
Nutshell
© 2020 SPLUNK INC.
Distributed Services with High-Velocity Releases
= New Organizational Challenges
Investment in new observability and incident management tools becomes critical
© 2020 SPLUNK INC.
Understanding Observability Mindset
Source: Wikipedia
Survivorship bias or survival bias is the logical error of concentrating on the people or things that made it
past some selection process, and overlooking those that did not, typically because of their lack of visibility.
This can lead to false conclusions in several different ways.
“Gentlemen, you need
to put more armour-
plate where the holes
aren’t because that’s
where the holes
where on the airplane
that didn’t return”
–(Abraham Wald 1942)
A shot down aircraft
doesn’t externalize
its state
© 2020 SPLUNK INC.
Analyze
Monitoring
Observability
A Noun
A thing you have –
a property of a system
A Verb
Something you do to determine the state
of an application, a system, a service…
Act
If you are observable
I can monitor you
and take actions
find patterns
Turning Observability into Action
© 2020 SPLUNK INC.
Cloud-Native Journey Increases Operating
Complexity
Retain & Optimize Lift & Shift Re-Factor Re-Architect/
Cloud-Native
DEV OPS DEV OPS DEV OPS DEV OPS
Cloud Managed e.g. RDS,
DynamoDB, SaaS
Cloud First Architecture
Tightly Coupled Apps,
Slow Deployment Cycles
Primarily using
Cloud IaaS
More Modular, but
Dependent App
Components
Loosely Coupled
Microservices, and
Serverless Functions
VM VM VMVM VM VM VM VM VM
Private Public
VM VM VM VM VM VM
Private Public Private Public
© 2020 SPLUNK INC.
Adding Observability to Support Cloud-
Native Environments
Observability helps detect, investigate and resolve the unknown unknowns
Monitoring
Keep an eye on things
we know can
go wrong
Observability
Find the unexpected
and explain why
it happened
© 2 0 1 9 S P L U N K I N C .
“Focus on what you can’t see, the
unknowns. If the root cause of a
failure stays invisible (the bullet
holes) your IT-plane will be shot
down again”
So what is
Observability?
METRICS
TRACES
LOGS
© 2 0 1 9 S P L U N K I N C .
WHAT’S HAPPENING?
Observability The Three Pillars
WHY IS IT HAPPENING?
WHERE IS IT HAPPENING?
METRICS
EVENTS / LOGS
TRACES
© 2020 SPLUNK INC.
Enhancing Incident / Problem Management
Correlation / Investigation
Monitoring / Alerting
AIOps
Incident Response
Automation
VM VM VM VM VM VM
Private Public
LOGS METRICS TRACESImonitoryou
Observability
Private Public
Iamobservable
© 2020 SPLUNK INC.
All the
data
Real-time and
scalable
Analytics
/ML
What’s required for
Observability
Customer experience
Release quality and velocity
Developer efficiency
Business Adaptability
© 2020 SPLUNK INC.
Key Use Cases
© 2020 SPLUNK INC.
Frequent Use Cases
• Hybrid cloud monitoring
• Cloud cost management
• Cloud capacity planning
• Public cloud monitoring
• Kubernetes & container
monitoring
• Serverless monitoring
• KPIs monitoring using
custom metrics
• Observability-as-a-Service
• Application modernization
• Microservices monitoring &
troubleshooting
• Business SLx monitoring
• DevOps application lifecycle
monitoring
Cloud
Migration
Multi-Cloud
Monitoring
Application Performance
Monitoring
• Reduce remediation
time & Improve on-call
(“Incident Response”)
Incident
Response
© 2020 SPLUNK INC.
Observability
with Splunk
© 2020 SPLUNK INC.
“Observability means that you have the
data that you need (logs, metrics and
traces) for every single unit of work that is of
interest to the business.”
© 2020 SPLUNK INC.
Complexity is everywhere
even when you only have one public cloud
EVENTS
LOGS &
REPORTS
Elastic Load Balancing Access
Logs
Amazon CloudFront Access
logs
Amazon CloudTrail logs
Billing Reports
Application Logs Application S3 access
Logs
Other service logs AWS configs snapshots & history
files
METRICS
EMR Cluster Auto Scaling
EVENTS
LOGS
RULES/EVENTS
Events
Logs
Push path (via Splunk HEC)
Your IT team
© 2 0 1 9 S P L U N K I N C .
CISO
DevSysAdmin
MKT
??
?
? ?
Storage
Admin
DBA
GREEN
OUTSIDE
RED
INSIDE
SILOED
TEAMS
SILOED
TOOLS+ =
WATERMELON
EFFECT
CONSEQUENCE:
THE WAR ROOM WATERMELON EFFECT
© 2020 SPLUNK INC.
ENTERPRISE MANAGEMENT AND USABILITY
Infra Agent
Metrics for Host
Containers
VM, etc.
App Libraries
Custom
Metrics
Cloud Services
Integrations
Multi Region
Multi Cloud
Tracing / APM
APM Agent
Library
Event Collector
DATACOLLECTORS
DEPLOYMENT QUOTA / TEAMS SELF-SERVICE DATA ACCESS API
AGGREGATION
METRICS PIPELINE
TRACES PIPELINE
EVENTS PIPELINE
Metrics Dashboard
Grafana / Chronograph
Traces Dashboard
Alerts
DS / ML
SPARK
CI / CD
Automation
TRACES DB
TSDB
EVENTS DB
Replicated / Clustered
Replicated / Clustered
Replicated / Clustered
Long-Term Data Retention
CLOUD STORAGE
COLLECTION PIPELINE STORAGE VISUALIZATION
ALTERING
The DIY Approach is Too Complex
© 2020 SPLUNK INC.
Configs,
Tickets,
Changes…
DATA
VOLUME
FORMAT
LOCATION
Metrics
Logs
Clouds
WHAT’S
HAPPENING
?
WHY IT IS
HAPPENING
?
WHY IT IS
HAPPENING
?
Traces
Real User
Monitoring
(new)
WHO IS
IMPACTED ?
WHERE IT IS
HAPPENING
?
ANY :
On-call
WHO
SHOULD I
CALL
?
AutomationRELAX
© 2020 SPLUNK INC.
Configs,
Tickets,
Changes…
DATA
VOLUME
FORMAT
LOCATION
Metrics
Logs
Clouds
Traces
Real User
Monitoring
(new)
ANY :
On-call
Automation
Sources
2000+ apps available on
splunkbase.splunk.com
Logs
industry-leading solution to
consolidate and index any
log and machine data
(structured, unstructured,
complex multi-line
application logs…)
regardless of volume, format
or location
Metrics
Infrastructure Metrics:
massively scalable streaming
architecture
Traces
NoSampleTM Full-
Fidelity Tracing
& Open Standards
Events
unified operational console
of all your events and
service-impacting issues
RUM
leveraging our NoSample
Full-fidelity Tracing that
ingests ALL front-end traces
and connects them with their
corresponding backend
traces
On-call
Mobile- first incident
response using AI,
ChatOps, virtual war
rooms, Incident
timelines for
blameless incident
management
Orchestration
& Automation
Codify your workflows into
automated playbooks using our
visual editor (no coding
required) or the integrated
Python development
environment.
© 2020 SPLUNK INC.
Observability Suite
Single, tightly
integrated user
experience
NoSample™
Full-Fidelity
Real-Time
Streaming
Massively
Scalable
AI/ML-Driven
Analytics
OpenTelemetry
Logs | Metrics | Traces
Digital
Experience
Monitoring
Infrastructure MonitoringApplication
Performance
Monitoring
Log Investigation
Incident
Response
© 2020 SPLUNK INC.
Observability Suite
Single, tightly
integrated user
experience
NoSample™
Full-Fidelity
Real-Time
Streaming
Massively
Scalable
AI/ML-Driven
Analytics
OpenTelemetry
Logs | Metrics | Traces
Digital
Experience
Monitoring
Infrastructure MonitoringApplication
Performance
Monitoring
Log Investigation
Incident
Response
DEMO
© 2 0 1 9 S P L U N K I N C .
© 2020 SPLUNK INC.
Adding
AIOps and
Business
insights
© 2 0 2 0 S P L U N K I N C .
Keyword:
visibility
Correlatingbusiness
outcomes from all
‘altitudes’ is nowa must
have
INFRASTRUCTURE
APP
Cloud
Networks
Security
API
WEB Smartphones
and Devices
Custom
Applications
Storage
Servers
DB
APM
Containers /
microservices
APP logs
Syslogs
TraditionalITOps
Monitoring
BIZ / SERVICE
Call center
Revenue NPS
Customer
retention
Funnel
Exec
MBO’s
Business-value
Monitoring
Digital
Online
© 2 0 2 0 S P L U N K I N C .
Business &
IT service
monitoring
See across silos
DeepDive whenneeded
Metrics,traces andlogs
inone place for you
INFRASTRUCTURE
APP
Cloud
Networks
Security
API
WEB Smartphones
and Devices
Custom
Applications
Storage
Servers
DB
APM
Containers /
microservices
APP logs
Syslogs
TraditionalITOps
Monitoring
BIZ / SERVICE
Call center
Revenue NPS
Customer
retention
Funnel
Exec
MBO’s
Business-value
Monitoring
Digital
Online
© 2 0 2 0 S P L U N K I N C .
Business &
IT service
monitoring
See across silos
DeepDive whenneeded
Metrics,traces andlogs
inone place for you
INFRASTRUCTURE
APP
Cloud
Networks
Security
API
WEB Smartphones
and Devices
Custom
Applications
Storage
Servers
DB
APM
Containers /
microservices
APP logs
Syslogs
TraditionalITOps
Monitoring
BIZ / SERVICE
Call center
Revenue NPS
Customer
retention
Funnel
Exec
MBO’s
Business-value
Monitoring
Digital
Online
© 2020 SPLUNK INC.
Enhancing Incident / Problem Management
Correlation / Investigation
Monitoring / Alerting
AIOps
Incident Response
Automation
VM VM VM VM VM VM
Private Public
LOGS METRICS TRACESImonitoryou
Observability
Private Public
Iamobservable
AIOps
© 2020 SPLUNK INC.
Machine Learning:
Overview
© 2018 SPLUNK INC.
How to find a needle in multiple haystacks?
(choose your tool)
Network?
Database?
Middleware?
Hardware?
Wrong
command?
Connection?
Apache?
VM?
Mainframe?
Load
balancer?Wrong code
released?
Collect ALL data
• Collect from all silos
• Data in original raw format
• Add open sources apps to
ingest data on the fly
• Schema on the fly
• Dynamic thresholding
• Realtime correlation
Clustering & aggregation
• Real time event
clustering/correlation
• Reduce alert noise
• Behavioural analytics
• Deduplication
Add context
• Measure / report on
indicators that matters
• Add service / business
context
• Add actionable
information to detection
Salessso
Claims
Anomaly detection
• Catch issues that thresholds
cannot
• Reduce event clutter
• Deviation from past
behaviour
• Deviation from peers
• Unusual change in features
Assisted deep dive
investigation
• Root cause analysis
• Powerful & easy to use
search & investigate
language
?
Predictive
Analytics
• Predict service health
• Predict events
• Trend forecasting
• Detect influencing
entities
• Early warning of
failure
70% to 90%
Reduction in investigation time
15% to 45%
Reduction in high priority incidents
67% to 82%
Reduction in business
impact
© 2020 SPLUNK INC.
Machine Learning:
Predictive Analytics
© 2020 SPLUNK INC.
Predictive
Analytics
WHAT IT IS
Applying machine learning
to predict issues up
to 30 minutes before
they happen
WHY IT MATTERS
Find and fix issues
before they impact
your end users
KPIPredictions
Servicehealth
Predictions
© 2020 SPLUNK INC.
Machine Learning:
Event Analytics
© 2020 SPLUNK INC.
Event Analytics
Applications
Servers
Databases
We can extend the grouping across siloed monitoring tools, and across layers of the stack. What if I told
you that all the events in orange were associated with machines that run the Ecommerce Store.
Silo views
Silo views
Silo views
War room
Fatigue + Noise
eCommerce
store incident
Mobile app
incident
© 2020 SPLUNK INC.
Event Analytics
WHAT IT IS
Bring together events from Splunk or
any other tool to analyze events
together, reduce noise and
enhance triage
WHY IT MATTERS
A holistic view of your events can
provide better insights into the root
cause of issues and reduce
Operations Center workload
© 2020 SPLUNK INC.
Working with episodes
Machine Learning supported investigation
SMART IMPACT EVALUATION
• Blast radius
• Impacted entities
• Impacted business services
• Impact on KPIs and service health
• Service topology context
• Related tickets in ServiceNow
ROOT CAUSE ANALYSIS
• Auto identification of probable root cause
• Use of future alert prediction to score
episodes
• Contextual access to advanced diagnostic
data and tools
KNOWLEDGE REUSE
• Auto identifies similar episodes
• Allows operator to jump into solved
episodes for faster resolution
• Contextual access to full diagnostic
data
• Access to past episodes’ resolution
activities and people
© 2020 SPLUNK INC.
Your virtual War Room
Deep Dive Episode investigation
Deep dives is a powerful investigation tool
that allows users to drill down into the
collective behavior of multiple elements
related to an episode.
• View KPIs, metrics, events… in context
• Direct access to raw data for full
investigation visibility
• Navigate through service trees to bring
additional elements to the investigation,
easily
• Compare observed episode with past
behavior and quickly find differences
• One click creation of new multi
dimensional alerts when suspected
correlation of KPI behavior is identified
© 2020 SPLUNK INC.
About Splunk
© 2 0 2 0 S P L U N K I N C . © 2 0 1 9 S P L U N K I N C .
A Market LeaderSources: IDC ww Security Information & event management
Share 2018, IDC worldwide IT Operations Management
Software Market Share 2019 (May 2020), IDC WW IT
Operations Analytics Software Market Shares 2017 and/or
Gartner 2018 & 2019, Research In Action AIOps top 15
global vendors 2019. Gartner, Market Share: Enterprise
Infrastructure Software, Worldwide, 2019 (April 2020).
ITOM
IT Operations Management : tools
to manage provisioning, capacity,
performance and availability of IT
OBSERVE
ITOA
IT Operations Analytics : the
practice of monitoring systems,
and gathering, processing,
analyzing & interpreting data from
ITOps sources to guide decisions
& predict issues
DECIDE
AIOps
Artificial Intelligence Operations :
AIOps platforms enhance IT
operations through greater insights
by combining big data, machine
learning and visualization.
>>
>>
ACCELERATE
SIEM
Security Event Information Management
PROTECT
Splunk among AIOps
market leaders (top 5)
By Research in Action &
#1 - Gartner
Marketshare: Gartner's
Performance Analysis:
AIOps, ITIM and ITOM
APM
Application Performance
management : tools to monitor and
optimize applications
OBSERVE
Splunk named a Visionary in
our first-ever placement in
the Gartner MQ
Splunk #1 in Worldwide
+32.3% YoY
#2 IBM, #3 Microsoft
Splunk #1 in Worldwide
+32.6% YoY
#2 VMware, #3 IBM
Splunk #1 in Worldwide
+37.6% YoY, #2 IBM, #3 MicroFocus
© 2 0 2 0 S P L U N K I N C . © 2 0 1 9 S P L U N K I N C .
A Market Leader
ITOM
IT Operations Management : tools
to manage provisioning, capacity,
performance and availability of IT
ITOA
IT Operations Analytics : the
practice of monitoring systems,
and gathering, processing,
analyzing & interpreting data from
ITOps sources to guide decisions
& predict issues
SIEM
Security Event Information Management
AIOps
Artificial Intelligence Operations :
AIOps platforms enhance IT
operations through greater insights
by combining big data, machine
learning and visualization.
>>
>>
Sources: IDC ww Security Information & event management
Share 2018, IDC worldwide IT Operations Management
Software Market Share 2018, IDC WW IT Operations
Analytics Software Market Shares 2017 and/or Gartner 2018
& 2019, Research In Action AIOps top 15 global vendors
2019. Gartner, Market Share: Enterprise Infrastructure
Software, Worldwide, 2019 (April 2020).
Splunk #1 in Worldwide
+32.3% YoY
#2 IBM, #3 Microsoft
Splunk #1 in Worldwide
+32.6% YoY
#2 VMware, #3 IBM
Splunk #1 in Worldwide
+37.6% YoY, #2 IBM, #3 MicroFocus
OBSERVE
DECIDE
ACCELERATEPROTECT
Splunk among AIOps
market leaders (top 5)
By Research in Action &
#1 - Gartner
Marketshare: Gartner's
Performance Analysis:
AIOps, ITIM and ITOM
APM
Application Performance
management : tools to monitor and
optimize applications
OBSERVE
Splunk named a Visionary in
our first-ever placement in
the Gartner MQ
Splunk ranked #1 in Gartner’s 2019
Market Share for Performance
Analysis: AIOps, ITIM and Other
Monitoring Tools category
#1 Splunk 16.5% market share (+30.4%)
#2 IBM : 13.2% (-6.5%)
#3 Microsoft : 8.4% (+9.1%)
© 2020 SPLUNK INC.
Disjointed data sets
Siloed views
High MTTR
Negative customer experience
Zero downtime
Record sales
Record high customer satisfaction
« Best black Friday ever »
Sr. Director of SRE, Dell EMC
Thank You
© 2020 SPLUNK INC.
1 de 45

Recomendados

Do You Really Need to Evolve From Monitoring to Observability? por
Do You Really Need to Evolve From Monitoring to Observability?Do You Really Need to Evolve From Monitoring to Observability?
Do You Really Need to Evolve From Monitoring to Observability?Splunk
503 visualizações28 slides
Observability por
Observability Observability
Observability Enes Altınok
362 visualizações37 slides
Observability at Scale por
Observability at Scale Observability at Scale
Observability at Scale Knoldus Inc.
327 visualizações12 slides
Observability por
ObservabilityObservability
ObservabilityDiego Pacheco
474 visualizações80 slides
.conf Go 2022 - Observability Session por
.conf Go 2022 - Observability Session.conf Go 2022 - Observability Session
.conf Go 2022 - Observability SessionSplunk
303 visualizações26 slides
How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou... por
How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...
How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...Splunk
1.6K visualizações22 slides

Mais conteúdo relacionado

Mais procurados

Monitoring & Observability por
Monitoring & ObservabilityMonitoring & Observability
Monitoring & ObservabilityLumban Sopian
307 visualizações20 slides
Observability por
ObservabilityObservability
ObservabilityMartin Gross
260 visualizações21 slides
Observability por
ObservabilityObservability
ObservabilityMaganathin Veeraragaloo
1.2K visualizações46 slides
Splunk Overview por
Splunk OverviewSplunk Overview
Splunk OverviewSplunk
1.8K visualizações57 slides
Observability vs APM vs Monitoring Comparison por
Observability vs APM vs  Monitoring ComparisonObservability vs APM vs  Monitoring Comparison
Observability vs APM vs Monitoring Comparisonjeetendra mandal
442 visualizações19 slides
Monitoring and observability por
Monitoring and observabilityMonitoring and observability
Monitoring and observabilityTheo Schlossnagle
7.9K visualizações73 slides

Mais procurados(20)

Monitoring & Observability por Lumban Sopian
Monitoring & ObservabilityMonitoring & Observability
Monitoring & Observability
Lumban Sopian307 visualizações
Observability por Martin Gross
ObservabilityObservability
Observability
Martin Gross260 visualizações
Splunk Overview por Splunk
Splunk OverviewSplunk Overview
Splunk Overview
Splunk1.8K visualizações
Observability vs APM vs Monitoring Comparison por jeetendra mandal
Observability vs APM vs  Monitoring ComparisonObservability vs APM vs  Monitoring Comparison
Observability vs APM vs Monitoring Comparison
jeetendra mandal442 visualizações
Monitoring and observability por Theo Schlossnagle
Monitoring and observabilityMonitoring and observability
Monitoring and observability
Theo Schlossnagle7.9K visualizações
Monitoring and observability por Theo Schlossnagle
Monitoring and observabilityMonitoring and observability
Monitoring and observability
Theo Schlossnagle4K visualizações
Observability – the good, the bad, and the ugly por Timetrix
Observability – the good, the bad, and the uglyObservability – the good, the bad, and the ugly
Observability – the good, the bad, and the ugly
Timetrix199 visualizações
Cloud-Native Observability por Tyler Treat
Cloud-Native ObservabilityCloud-Native Observability
Cloud-Native Observability
Tyler Treat886 visualizações
Observability, what, why and how por Neeraj Bagga
Observability, what, why and howObservability, what, why and how
Observability, what, why and how
Neeraj Bagga215 visualizações
Logging and observability por Anton Drukh
Logging and observabilityLogging and observability
Logging and observability
Anton Drukh390 visualizações
Observability & Datadog por JamesAnderson599331
Observability & DatadogObservability & Datadog
Observability & Datadog
JamesAnderson599331347 visualizações
Road to (Enterprise) Observability por Christoph Engelbert
Road to (Enterprise) ObservabilityRoad to (Enterprise) Observability
Road to (Enterprise) Observability
Christoph Engelbert204 visualizações
Improve monitoring and observability for kubernetes with oss tools por Nilesh Gule
Improve monitoring and observability for kubernetes with oss toolsImprove monitoring and observability for kubernetes with oss tools
Improve monitoring and observability for kubernetes with oss tools
Nilesh Gule82 visualizações
Observability driven development por Geert van der Cruijsen
Observability driven developmentObservability driven development
Observability driven development
Geert van der Cruijsen640 visualizações
Splunk Architecture por Kishore Chaganti
Splunk ArchitectureSplunk Architecture
Splunk Architecture
Kishore Chaganti7.3K visualizações
Demystifying observability por Abigail Bangser
Demystifying observability Demystifying observability
Demystifying observability
Abigail Bangser525 visualizações
Combining Logs, Metrics, and Traces for Unified Observability por Elasticsearch
Combining Logs, Metrics, and Traces for Unified ObservabilityCombining Logs, Metrics, and Traces for Unified Observability
Combining Logs, Metrics, and Traces for Unified Observability
Elasticsearch579 visualizações
Elastic Observability por FaithWestdorp
Elastic Observability Elastic Observability
Elastic Observability
FaithWestdorp237 visualizações
Observability in the world of microservices por Chandresh Pancholi
Observability in the world of microservicesObservability in the world of microservices
Observability in the world of microservices
Chandresh Pancholi344 visualizações

Similar a More Than Monitoring: How Observability Takes You From Firefighting to Fire Prevention

December Bengaluru Splunk User Group Meetup por
December Bengaluru Splunk User Group MeetupDecember Bengaluru Splunk User Group Meetup
December Bengaluru Splunk User Group Meetupkamlesh2410
130 visualizações60 slides
Splunk Discovery Köln - 17-01-2020 - Splunk for ITOps por
Splunk Discovery Köln - 17-01-2020 - Splunk for ITOpsSplunk Discovery Köln - 17-01-2020 - Splunk for ITOps
Splunk Discovery Köln - 17-01-2020 - Splunk for ITOpsSplunk
123 visualizações43 slides
SplunkLive! Paris 2018: Delivering New Visibility And Analytics For IT Operat... por
SplunkLive! Paris 2018: Delivering New Visibility And Analytics For IT Operat...SplunkLive! Paris 2018: Delivering New Visibility And Analytics For IT Operat...
SplunkLive! Paris 2018: Delivering New Visibility And Analytics For IT Operat...Splunk
256 visualizações21 slides
SplunkLive! Paris 2018: Integrating Metrics and Logs por
SplunkLive! Paris 2018: Integrating Metrics and LogsSplunkLive! Paris 2018: Integrating Metrics and Logs
SplunkLive! Paris 2018: Integrating Metrics and LogsSplunk
238 visualizações25 slides
Splunk-Presentation por
Splunk-Presentation Splunk-Presentation
Splunk-Presentation PrasadThorat23
2.5K visualizações35 slides
Splunk for IT Operations Breakout Session por
Splunk for IT Operations Breakout SessionSplunk for IT Operations Breakout Session
Splunk for IT Operations Breakout SessionSplunk
860 visualizações23 slides

Similar a More Than Monitoring: How Observability Takes You From Firefighting to Fire Prevention(20)

December Bengaluru Splunk User Group Meetup por kamlesh2410
December Bengaluru Splunk User Group MeetupDecember Bengaluru Splunk User Group Meetup
December Bengaluru Splunk User Group Meetup
kamlesh2410130 visualizações
Splunk Discovery Köln - 17-01-2020 - Splunk for ITOps por Splunk
Splunk Discovery Köln - 17-01-2020 - Splunk for ITOpsSplunk Discovery Köln - 17-01-2020 - Splunk for ITOps
Splunk Discovery Köln - 17-01-2020 - Splunk for ITOps
Splunk123 visualizações
SplunkLive! Paris 2018: Delivering New Visibility And Analytics For IT Operat... por Splunk
SplunkLive! Paris 2018: Delivering New Visibility And Analytics For IT Operat...SplunkLive! Paris 2018: Delivering New Visibility And Analytics For IT Operat...
SplunkLive! Paris 2018: Delivering New Visibility And Analytics For IT Operat...
Splunk256 visualizações
SplunkLive! Paris 2018: Integrating Metrics and Logs por Splunk
SplunkLive! Paris 2018: Integrating Metrics and LogsSplunkLive! Paris 2018: Integrating Metrics and Logs
SplunkLive! Paris 2018: Integrating Metrics and Logs
Splunk238 visualizações
Splunk-Presentation por PrasadThorat23
Splunk-Presentation Splunk-Presentation
Splunk-Presentation
PrasadThorat232.5K visualizações
Splunk for IT Operations Breakout Session por Splunk
Splunk for IT Operations Breakout SessionSplunk for IT Operations Breakout Session
Splunk for IT Operations Breakout Session
Splunk860 visualizações
Webinar: Neuigkeiten zu Splunk Enterprise 6.3 por Splunk
Webinar: Neuigkeiten zu Splunk Enterprise 6.3Webinar: Neuigkeiten zu Splunk Enterprise 6.3
Webinar: Neuigkeiten zu Splunk Enterprise 6.3
Splunk483 visualizações
IoT Analytics @ splunk por Splunk
IoT Analytics @ splunkIoT Analytics @ splunk
IoT Analytics @ splunk
Splunk597 visualizações
Splunk und Multi-Cloud por Splunk
Splunk und Multi-CloudSplunk und Multi-Cloud
Splunk und Multi-Cloud
Splunk336 visualizações
Wie erkenne ich die Auswirkungen von IT Ausfallen auf meine Produktion? por Splunk
Wie erkenne ich die Auswirkungen von IT Ausfallen auf meine Produktion?Wie erkenne ich die Auswirkungen von IT Ausfallen auf meine Produktion?
Wie erkenne ich die Auswirkungen von IT Ausfallen auf meine Produktion?
Splunk440 visualizações
Splunk and Multicloud por Splunk
Splunk and MulticloudSplunk and Multicloud
Splunk and Multicloud
Splunk369 visualizações
Splunk and Multicloud por Splunk
Splunk and Multicloud Splunk and Multicloud
Splunk and Multicloud
Splunk92 visualizações
Splunk bangalore user group 2020-06-01 por NiketNilay
Splunk bangalore user group   2020-06-01Splunk bangalore user group   2020-06-01
Splunk bangalore user group 2020-06-01
NiketNilay309 visualizações
Splunk conf2014 - Getting Deeper Insights into your Virtualization and Storag... por Splunk
Splunk conf2014 - Getting Deeper Insights into your Virtualization and Storag...Splunk conf2014 - Getting Deeper Insights into your Virtualization and Storag...
Splunk conf2014 - Getting Deeper Insights into your Virtualization and Storag...
Splunk1.5K visualizações
Splunk Discovery Köln - 17-01-2020 - Turning Data Into Business Outcomes por Splunk
Splunk Discovery Köln - 17-01-2020 - Turning Data Into Business OutcomesSplunk Discovery Köln - 17-01-2020 - Turning Data Into Business Outcomes
Splunk Discovery Köln - 17-01-2020 - Turning Data Into Business Outcomes
Splunk242 visualizações
Splunk 4 Ninja ITSI Workshop por Marc Serieys
Splunk 4 Ninja ITSI WorkshopSplunk 4 Ninja ITSI Workshop
Splunk 4 Ninja ITSI Workshop
Marc Serieys281 visualizações
Splunk Webinar: IT Operations Demo für Troubleshooting & Dashboarding por Georg Knon
Splunk Webinar: IT Operations Demo für Troubleshooting & DashboardingSplunk Webinar: IT Operations Demo für Troubleshooting & Dashboarding
Splunk Webinar: IT Operations Demo für Troubleshooting & Dashboarding
Georg Knon536 visualizações
Virtual SplunkLive! for Higher Education Overview/Customers por Splunk
Virtual SplunkLive! for Higher Education Overview/CustomersVirtual SplunkLive! for Higher Education Overview/Customers
Virtual SplunkLive! for Higher Education Overview/Customers
Splunk1.7K visualizações
Still Suffering from IT Outages? Accept Failure, Learn from Failure and Get R... por Splunk
Still Suffering from IT Outages? Accept Failure, Learn from Failure and Get R...Still Suffering from IT Outages? Accept Failure, Learn from Failure and Get R...
Still Suffering from IT Outages? Accept Failure, Learn from Failure and Get R...
Splunk256 visualizações
SplunkLive! London 2017 - Splunk Enterprise for IT Troubleshooting por Splunk
SplunkLive! London 2017 - Splunk Enterprise for IT TroubleshootingSplunkLive! London 2017 - Splunk Enterprise for IT Troubleshooting
SplunkLive! London 2017 - Splunk Enterprise for IT Troubleshooting
Splunk558 visualizações

Mais de DevOps.com

Modernizing on IBM Z Made Easier With Open Source Software por
Modernizing on IBM Z Made Easier With Open Source SoftwareModernizing on IBM Z Made Easier With Open Source Software
Modernizing on IBM Z Made Easier With Open Source SoftwareDevOps.com
841 visualizações37 slides
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla... por
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...DevOps.com
254 visualizações29 slides
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla... por
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...DevOps.com
145 visualizações31 slides
Next Generation Vulnerability Assessment Using Datadog and Snyk por
Next Generation Vulnerability Assessment Using Datadog and SnykNext Generation Vulnerability Assessment Using Datadog and Snyk
Next Generation Vulnerability Assessment Using Datadog and SnykDevOps.com
246 visualizações37 slides
Vulnerability Discovery in the Cloud por
Vulnerability Discovery in the CloudVulnerability Discovery in the Cloud
Vulnerability Discovery in the CloudDevOps.com
310 visualizações45 slides
2021 Open Source Governance: Top Ten Trends and Predictions por
2021 Open Source Governance: Top Ten Trends and Predictions2021 Open Source Governance: Top Ten Trends and Predictions
2021 Open Source Governance: Top Ten Trends and PredictionsDevOps.com
206 visualizações15 slides

Mais de DevOps.com(20)

Modernizing on IBM Z Made Easier With Open Source Software por DevOps.com
Modernizing on IBM Z Made Easier With Open Source SoftwareModernizing on IBM Z Made Easier With Open Source Software
Modernizing on IBM Z Made Easier With Open Source Software
DevOps.com841 visualizações
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla... por DevOps.com
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
DevOps.com254 visualizações
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla... por DevOps.com
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
DevOps.com145 visualizações
Next Generation Vulnerability Assessment Using Datadog and Snyk por DevOps.com
Next Generation Vulnerability Assessment Using Datadog and SnykNext Generation Vulnerability Assessment Using Datadog and Snyk
Next Generation Vulnerability Assessment Using Datadog and Snyk
DevOps.com246 visualizações
Vulnerability Discovery in the Cloud por DevOps.com
Vulnerability Discovery in the CloudVulnerability Discovery in the Cloud
Vulnerability Discovery in the Cloud
DevOps.com310 visualizações
2021 Open Source Governance: Top Ten Trends and Predictions por DevOps.com
2021 Open Source Governance: Top Ten Trends and Predictions2021 Open Source Governance: Top Ten Trends and Predictions
2021 Open Source Governance: Top Ten Trends and Predictions
DevOps.com206 visualizações
A New Year’s Ransomware Resolution por DevOps.com
A New Year’s Ransomware ResolutionA New Year’s Ransomware Resolution
A New Year’s Ransomware Resolution
DevOps.com187 visualizações
Getting Started with Runtime Security on Azure Kubernetes Service (AKS) por DevOps.com
Getting Started with Runtime Security on Azure Kubernetes Service (AKS)Getting Started with Runtime Security on Azure Kubernetes Service (AKS)
Getting Started with Runtime Security on Azure Kubernetes Service (AKS)
DevOps.com309 visualizações
Don't Panic! Effective Incident Response por DevOps.com
Don't Panic! Effective Incident ResponseDon't Panic! Effective Incident Response
Don't Panic! Effective Incident Response
DevOps.com217 visualizações
Creating a Culture of Chaos: Chaos Engineering Is Not Just Tools, It's Culture por DevOps.com
Creating a Culture of Chaos: Chaos Engineering Is Not Just Tools, It's CultureCreating a Culture of Chaos: Chaos Engineering Is Not Just Tools, It's Culture
Creating a Culture of Chaos: Chaos Engineering Is Not Just Tools, It's Culture
DevOps.com166 visualizações
Role Based Access Controls (RBAC) for SSH and Kubernetes Access with Teleport por DevOps.com
Role Based Access Controls (RBAC) for SSH and Kubernetes Access with TeleportRole Based Access Controls (RBAC) for SSH and Kubernetes Access with Teleport
Role Based Access Controls (RBAC) for SSH and Kubernetes Access with Teleport
DevOps.com448 visualizações
Monitoring Serverless Applications with Datadog por DevOps.com
Monitoring Serverless Applications with DatadogMonitoring Serverless Applications with Datadog
Monitoring Serverless Applications with Datadog
DevOps.com246 visualizações
Deliver your App Anywhere … Publicly or Privately por DevOps.com
Deliver your App Anywhere … Publicly or PrivatelyDeliver your App Anywhere … Publicly or Privately
Deliver your App Anywhere … Publicly or Privately
DevOps.com254 visualizações
Securing medical apps in the age of covid final por DevOps.com
Securing medical apps in the age of covid finalSecuring medical apps in the age of covid final
Securing medical apps in the age of covid final
DevOps.com132 visualizações
How to Build a Healthy On-Call Culture por DevOps.com
How to Build a Healthy On-Call CultureHow to Build a Healthy On-Call Culture
How to Build a Healthy On-Call Culture
DevOps.com113 visualizações
The Evolving Role of the Developer in 2021 por DevOps.com
The Evolving Role of the Developer in 2021The Evolving Role of the Developer in 2021
The Evolving Role of the Developer in 2021
DevOps.com155 visualizações
Service Mesh: Two Big Words But Do You Need It? por DevOps.com
Service Mesh: Two Big Words But Do You Need It?Service Mesh: Two Big Words But Do You Need It?
Service Mesh: Two Big Words But Do You Need It?
DevOps.com189 visualizações
Secure Data Sharing in OpenShift Environments por DevOps.com
Secure Data Sharing in OpenShift EnvironmentsSecure Data Sharing in OpenShift Environments
Secure Data Sharing in OpenShift Environments
DevOps.com156 visualizações
How to Govern Identities and Access in Cloud Infrastructure: AppsFlyer Case S... por DevOps.com
How to Govern Identities and Access in Cloud Infrastructure: AppsFlyer Case S...How to Govern Identities and Access in Cloud Infrastructure: AppsFlyer Case S...
How to Govern Identities and Access in Cloud Infrastructure: AppsFlyer Case S...
DevOps.com179 visualizações
Elevate Your Enterprise Python and R AI, ML Software Strategy with Anaconda T... por DevOps.com
Elevate Your Enterprise Python and R AI, ML Software Strategy with Anaconda T...Elevate Your Enterprise Python and R AI, ML Software Strategy with Anaconda T...
Elevate Your Enterprise Python and R AI, ML Software Strategy with Anaconda T...
DevOps.com75 visualizações

Último

Business Analyst Series 2023 - Week 4 Session 7 por
Business Analyst Series 2023 -  Week 4 Session 7Business Analyst Series 2023 -  Week 4 Session 7
Business Analyst Series 2023 - Week 4 Session 7DianaGray10
42 visualizações31 slides
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda... por
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...ShapeBlue
44 visualizações13 slides
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue por
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlueWhat’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlueShapeBlue
89 visualizações23 slides
Microsoft Power Platform.pptx por
Microsoft Power Platform.pptxMicrosoft Power Platform.pptx
Microsoft Power Platform.pptxUni Systems S.M.S.A.
61 visualizações38 slides
HTTP headers that make your website go faster - devs.gent November 2023 por
HTTP headers that make your website go faster - devs.gent November 2023HTTP headers that make your website go faster - devs.gent November 2023
HTTP headers that make your website go faster - devs.gent November 2023Thijs Feryn
26 visualizações151 slides
Igniting Next Level Productivity with AI-Infused Data Integration Workflows por
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Safe Software
317 visualizações86 slides

Último(20)

Business Analyst Series 2023 - Week 4 Session 7 por DianaGray10
Business Analyst Series 2023 -  Week 4 Session 7Business Analyst Series 2023 -  Week 4 Session 7
Business Analyst Series 2023 - Week 4 Session 7
DianaGray1042 visualizações
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda... por ShapeBlue
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...
ShapeBlue44 visualizações
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue por ShapeBlue
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlueWhat’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue
What’s New in CloudStack 4.19 - Abhishek Kumar - ShapeBlue
ShapeBlue89 visualizações
Microsoft Power Platform.pptx por Uni Systems S.M.S.A.
Microsoft Power Platform.pptxMicrosoft Power Platform.pptx
Microsoft Power Platform.pptx
Uni Systems S.M.S.A.61 visualizações
HTTP headers that make your website go faster - devs.gent November 2023 por Thijs Feryn
HTTP headers that make your website go faster - devs.gent November 2023HTTP headers that make your website go faster - devs.gent November 2023
HTTP headers that make your website go faster - devs.gent November 2023
Thijs Feryn26 visualizações
Igniting Next Level Productivity with AI-Infused Data Integration Workflows por Safe Software
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software317 visualizações
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online por ShapeBlue
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online
ShapeBlue75 visualizações
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... por ShapeBlue
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
ShapeBlue54 visualizações
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T por ShapeBlue
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&TCloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T
CloudStack and GitOps at Enterprise Scale - Alex Dometrius, Rene Glover - AT&T
ShapeBlue38 visualizações
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院 por IttrainingIttraining
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
IttrainingIttraining69 visualizações
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ... por ShapeBlue
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...
ShapeBlue46 visualizações
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue por ShapeBlue
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlueCloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue
CloudStack Managed User Data and Demo - Harikrishna Patnala - ShapeBlue
ShapeBlue25 visualizações
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive por Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Network Automation Forum43 visualizações
Business Analyst Series 2023 - Week 3 Session 5 por DianaGray10
Business Analyst Series 2023 -  Week 3 Session 5Business Analyst Series 2023 -  Week 3 Session 5
Business Analyst Series 2023 - Week 3 Session 5
DianaGray10345 visualizações
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ... por ShapeBlue
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...
ShapeBlue55 visualizações
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... por TrustArc
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc72 visualizações
DRBD Deep Dive - Philipp Reisner - LINBIT por ShapeBlue
DRBD Deep Dive - Philipp Reisner - LINBITDRBD Deep Dive - Philipp Reisner - LINBIT
DRBD Deep Dive - Philipp Reisner - LINBIT
ShapeBlue44 visualizações
Network Source of Truth and Infrastructure as Code revisited por Network Automation Forum
Network Source of Truth and Infrastructure as Code revisitedNetwork Source of Truth and Infrastructure as Code revisited
Network Source of Truth and Infrastructure as Code revisited
Network Automation Forum32 visualizações

More Than Monitoring: How Observability Takes You From Firefighting to Fire Prevention

  • 1. © 2020 SPLUNK INC. More Than Monitoring: How Observability Take You From Firefighting to Fire Prevention
  • 2. © 2 0 1 9 S P L U N K I N C . Stephane Estevez EMEA Product Marketing Director, IT Markets, Splunk
  • 3. During the course of this presentation, we may make forward-looking statements regarding future events or plans of the company. We caution you that such statements reflect our current expectations and estimates based on factors currently known to us and that actual events or results may differ materially. The forward-looking statements made in the this presentation are being made as of the time and date of its live presentation. If reviewed after its live presentation, it may not contain current or accurate information. We do not assume any obligation to update any forward-looking statements made herein. In addition, any information about our roadmap outlines our general product direction and is subject to change at any time without notice. It is for informational purposes only, and shall not be incorporated into any contract or other commitment. Splunk undertakes no obligation either to develop the features or functionalities described or to include any such feature or functionality in a future release. Splunk, Splunk>, Turn Data Into Doing, The Engine for Machine Data, Splunk Cloud, Splunk Light and SPL are trademarks and registered trademarks of Splunk Inc. in the United States and other countries. All other brand names, product names, or trademarks belong to their respective owners. © 2020 Splunk Inc. All rights reserved. Forward- Looking Statements © 2020 SPLUNK INC.
  • 4. © 2020 SPLUNK INC. • Observability in a nutshell • Key Observability use cases • Adding Observability to Monitoring • Demo • Adding AIOps • About Splunk Agenda
  • 5. © 2020 SPLUNK INC. Observability in a Nutshell
  • 6. © 2020 SPLUNK INC. Distributed Services with High-Velocity Releases = New Organizational Challenges Investment in new observability and incident management tools becomes critical
  • 7. © 2020 SPLUNK INC. Understanding Observability Mindset Source: Wikipedia Survivorship bias or survival bias is the logical error of concentrating on the people or things that made it past some selection process, and overlooking those that did not, typically because of their lack of visibility. This can lead to false conclusions in several different ways. “Gentlemen, you need to put more armour- plate where the holes aren’t because that’s where the holes where on the airplane that didn’t return” –(Abraham Wald 1942) A shot down aircraft doesn’t externalize its state
  • 8. © 2020 SPLUNK INC. Analyze Monitoring Observability A Noun A thing you have – a property of a system A Verb Something you do to determine the state of an application, a system, a service… Act If you are observable I can monitor you and take actions find patterns Turning Observability into Action
  • 9. © 2020 SPLUNK INC. Cloud-Native Journey Increases Operating Complexity Retain & Optimize Lift & Shift Re-Factor Re-Architect/ Cloud-Native DEV OPS DEV OPS DEV OPS DEV OPS Cloud Managed e.g. RDS, DynamoDB, SaaS Cloud First Architecture Tightly Coupled Apps, Slow Deployment Cycles Primarily using Cloud IaaS More Modular, but Dependent App Components Loosely Coupled Microservices, and Serverless Functions VM VM VMVM VM VM VM VM VM Private Public VM VM VM VM VM VM Private Public Private Public
  • 10. © 2020 SPLUNK INC. Adding Observability to Support Cloud- Native Environments Observability helps detect, investigate and resolve the unknown unknowns Monitoring Keep an eye on things we know can go wrong Observability Find the unexpected and explain why it happened
  • 11. © 2 0 1 9 S P L U N K I N C . “Focus on what you can’t see, the unknowns. If the root cause of a failure stays invisible (the bullet holes) your IT-plane will be shot down again” So what is Observability? METRICS TRACES LOGS
  • 12. © 2 0 1 9 S P L U N K I N C . WHAT’S HAPPENING? Observability The Three Pillars WHY IS IT HAPPENING? WHERE IS IT HAPPENING? METRICS EVENTS / LOGS TRACES
  • 13. © 2020 SPLUNK INC. Enhancing Incident / Problem Management Correlation / Investigation Monitoring / Alerting AIOps Incident Response Automation VM VM VM VM VM VM Private Public LOGS METRICS TRACESImonitoryou Observability Private Public Iamobservable
  • 14. © 2020 SPLUNK INC. All the data Real-time and scalable Analytics /ML What’s required for Observability Customer experience Release quality and velocity Developer efficiency Business Adaptability
  • 15. © 2020 SPLUNK INC. Key Use Cases
  • 16. © 2020 SPLUNK INC. Frequent Use Cases • Hybrid cloud monitoring • Cloud cost management • Cloud capacity planning • Public cloud monitoring • Kubernetes & container monitoring • Serverless monitoring • KPIs monitoring using custom metrics • Observability-as-a-Service • Application modernization • Microservices monitoring & troubleshooting • Business SLx monitoring • DevOps application lifecycle monitoring Cloud Migration Multi-Cloud Monitoring Application Performance Monitoring • Reduce remediation time & Improve on-call (“Incident Response”) Incident Response
  • 17. © 2020 SPLUNK INC. Observability with Splunk
  • 18. © 2020 SPLUNK INC. “Observability means that you have the data that you need (logs, metrics and traces) for every single unit of work that is of interest to the business.”
  • 19. © 2020 SPLUNK INC. Complexity is everywhere even when you only have one public cloud EVENTS LOGS & REPORTS Elastic Load Balancing Access Logs Amazon CloudFront Access logs Amazon CloudTrail logs Billing Reports Application Logs Application S3 access Logs Other service logs AWS configs snapshots & history files METRICS EMR Cluster Auto Scaling EVENTS LOGS RULES/EVENTS Events Logs Push path (via Splunk HEC) Your IT team
  • 20. © 2 0 1 9 S P L U N K I N C . CISO DevSysAdmin MKT ?? ? ? ? Storage Admin DBA GREEN OUTSIDE RED INSIDE SILOED TEAMS SILOED TOOLS+ = WATERMELON EFFECT CONSEQUENCE: THE WAR ROOM WATERMELON EFFECT
  • 21. © 2020 SPLUNK INC. ENTERPRISE MANAGEMENT AND USABILITY Infra Agent Metrics for Host Containers VM, etc. App Libraries Custom Metrics Cloud Services Integrations Multi Region Multi Cloud Tracing / APM APM Agent Library Event Collector DATACOLLECTORS DEPLOYMENT QUOTA / TEAMS SELF-SERVICE DATA ACCESS API AGGREGATION METRICS PIPELINE TRACES PIPELINE EVENTS PIPELINE Metrics Dashboard Grafana / Chronograph Traces Dashboard Alerts DS / ML SPARK CI / CD Automation TRACES DB TSDB EVENTS DB Replicated / Clustered Replicated / Clustered Replicated / Clustered Long-Term Data Retention CLOUD STORAGE COLLECTION PIPELINE STORAGE VISUALIZATION ALTERING The DIY Approach is Too Complex
  • 22. © 2020 SPLUNK INC. Configs, Tickets, Changes… DATA VOLUME FORMAT LOCATION Metrics Logs Clouds WHAT’S HAPPENING ? WHY IT IS HAPPENING ? WHY IT IS HAPPENING ? Traces Real User Monitoring (new) WHO IS IMPACTED ? WHERE IT IS HAPPENING ? ANY : On-call WHO SHOULD I CALL ? AutomationRELAX
  • 23. © 2020 SPLUNK INC. Configs, Tickets, Changes… DATA VOLUME FORMAT LOCATION Metrics Logs Clouds Traces Real User Monitoring (new) ANY : On-call Automation Sources 2000+ apps available on splunkbase.splunk.com Logs industry-leading solution to consolidate and index any log and machine data (structured, unstructured, complex multi-line application logs…) regardless of volume, format or location Metrics Infrastructure Metrics: massively scalable streaming architecture Traces NoSampleTM Full- Fidelity Tracing & Open Standards Events unified operational console of all your events and service-impacting issues RUM leveraging our NoSample Full-fidelity Tracing that ingests ALL front-end traces and connects them with their corresponding backend traces On-call Mobile- first incident response using AI, ChatOps, virtual war rooms, Incident timelines for blameless incident management Orchestration & Automation Codify your workflows into automated playbooks using our visual editor (no coding required) or the integrated Python development environment.
  • 24. © 2020 SPLUNK INC. Observability Suite Single, tightly integrated user experience NoSample™ Full-Fidelity Real-Time Streaming Massively Scalable AI/ML-Driven Analytics OpenTelemetry Logs | Metrics | Traces Digital Experience Monitoring Infrastructure MonitoringApplication Performance Monitoring Log Investigation Incident Response
  • 25. © 2020 SPLUNK INC. Observability Suite Single, tightly integrated user experience NoSample™ Full-Fidelity Real-Time Streaming Massively Scalable AI/ML-Driven Analytics OpenTelemetry Logs | Metrics | Traces Digital Experience Monitoring Infrastructure MonitoringApplication Performance Monitoring Log Investigation Incident Response DEMO
  • 26. © 2 0 1 9 S P L U N K I N C .
  • 27. © 2020 SPLUNK INC. Adding AIOps and Business insights
  • 28. © 2 0 2 0 S P L U N K I N C . Keyword: visibility Correlatingbusiness outcomes from all ‘altitudes’ is nowa must have INFRASTRUCTURE APP Cloud Networks Security API WEB Smartphones and Devices Custom Applications Storage Servers DB APM Containers / microservices APP logs Syslogs TraditionalITOps Monitoring BIZ / SERVICE Call center Revenue NPS Customer retention Funnel Exec MBO’s Business-value Monitoring Digital Online
  • 29. © 2 0 2 0 S P L U N K I N C . Business & IT service monitoring See across silos DeepDive whenneeded Metrics,traces andlogs inone place for you INFRASTRUCTURE APP Cloud Networks Security API WEB Smartphones and Devices Custom Applications Storage Servers DB APM Containers / microservices APP logs Syslogs TraditionalITOps Monitoring BIZ / SERVICE Call center Revenue NPS Customer retention Funnel Exec MBO’s Business-value Monitoring Digital Online
  • 30. © 2 0 2 0 S P L U N K I N C . Business & IT service monitoring See across silos DeepDive whenneeded Metrics,traces andlogs inone place for you INFRASTRUCTURE APP Cloud Networks Security API WEB Smartphones and Devices Custom Applications Storage Servers DB APM Containers / microservices APP logs Syslogs TraditionalITOps Monitoring BIZ / SERVICE Call center Revenue NPS Customer retention Funnel Exec MBO’s Business-value Monitoring Digital Online
  • 31. © 2020 SPLUNK INC. Enhancing Incident / Problem Management Correlation / Investigation Monitoring / Alerting AIOps Incident Response Automation VM VM VM VM VM VM Private Public LOGS METRICS TRACESImonitoryou Observability Private Public Iamobservable AIOps
  • 32. © 2020 SPLUNK INC. Machine Learning: Overview
  • 33. © 2018 SPLUNK INC. How to find a needle in multiple haystacks? (choose your tool) Network? Database? Middleware? Hardware? Wrong command? Connection? Apache? VM? Mainframe? Load balancer?Wrong code released? Collect ALL data • Collect from all silos • Data in original raw format • Add open sources apps to ingest data on the fly • Schema on the fly • Dynamic thresholding • Realtime correlation Clustering & aggregation • Real time event clustering/correlation • Reduce alert noise • Behavioural analytics • Deduplication Add context • Measure / report on indicators that matters • Add service / business context • Add actionable information to detection Salessso Claims Anomaly detection • Catch issues that thresholds cannot • Reduce event clutter • Deviation from past behaviour • Deviation from peers • Unusual change in features Assisted deep dive investigation • Root cause analysis • Powerful & easy to use search & investigate language ? Predictive Analytics • Predict service health • Predict events • Trend forecasting • Detect influencing entities • Early warning of failure 70% to 90% Reduction in investigation time 15% to 45% Reduction in high priority incidents 67% to 82% Reduction in business impact
  • 34. © 2020 SPLUNK INC. Machine Learning: Predictive Analytics
  • 35. © 2020 SPLUNK INC. Predictive Analytics WHAT IT IS Applying machine learning to predict issues up to 30 minutes before they happen WHY IT MATTERS Find and fix issues before they impact your end users KPIPredictions Servicehealth Predictions
  • 36. © 2020 SPLUNK INC. Machine Learning: Event Analytics
  • 37. © 2020 SPLUNK INC. Event Analytics Applications Servers Databases We can extend the grouping across siloed monitoring tools, and across layers of the stack. What if I told you that all the events in orange were associated with machines that run the Ecommerce Store. Silo views Silo views Silo views War room Fatigue + Noise eCommerce store incident Mobile app incident
  • 38. © 2020 SPLUNK INC. Event Analytics WHAT IT IS Bring together events from Splunk or any other tool to analyze events together, reduce noise and enhance triage WHY IT MATTERS A holistic view of your events can provide better insights into the root cause of issues and reduce Operations Center workload
  • 39. © 2020 SPLUNK INC. Working with episodes Machine Learning supported investigation SMART IMPACT EVALUATION • Blast radius • Impacted entities • Impacted business services • Impact on KPIs and service health • Service topology context • Related tickets in ServiceNow ROOT CAUSE ANALYSIS • Auto identification of probable root cause • Use of future alert prediction to score episodes • Contextual access to advanced diagnostic data and tools KNOWLEDGE REUSE • Auto identifies similar episodes • Allows operator to jump into solved episodes for faster resolution • Contextual access to full diagnostic data • Access to past episodes’ resolution activities and people
  • 40. © 2020 SPLUNK INC. Your virtual War Room Deep Dive Episode investigation Deep dives is a powerful investigation tool that allows users to drill down into the collective behavior of multiple elements related to an episode. • View KPIs, metrics, events… in context • Direct access to raw data for full investigation visibility • Navigate through service trees to bring additional elements to the investigation, easily • Compare observed episode with past behavior and quickly find differences • One click creation of new multi dimensional alerts when suspected correlation of KPI behavior is identified
  • 41. © 2020 SPLUNK INC. About Splunk
  • 42. © 2 0 2 0 S P L U N K I N C . © 2 0 1 9 S P L U N K I N C . A Market LeaderSources: IDC ww Security Information & event management Share 2018, IDC worldwide IT Operations Management Software Market Share 2019 (May 2020), IDC WW IT Operations Analytics Software Market Shares 2017 and/or Gartner 2018 & 2019, Research In Action AIOps top 15 global vendors 2019. Gartner, Market Share: Enterprise Infrastructure Software, Worldwide, 2019 (April 2020). ITOM IT Operations Management : tools to manage provisioning, capacity, performance and availability of IT OBSERVE ITOA IT Operations Analytics : the practice of monitoring systems, and gathering, processing, analyzing & interpreting data from ITOps sources to guide decisions & predict issues DECIDE AIOps Artificial Intelligence Operations : AIOps platforms enhance IT operations through greater insights by combining big data, machine learning and visualization. >> >> ACCELERATE SIEM Security Event Information Management PROTECT Splunk among AIOps market leaders (top 5) By Research in Action & #1 - Gartner Marketshare: Gartner's Performance Analysis: AIOps, ITIM and ITOM APM Application Performance management : tools to monitor and optimize applications OBSERVE Splunk named a Visionary in our first-ever placement in the Gartner MQ Splunk #1 in Worldwide +32.3% YoY #2 IBM, #3 Microsoft Splunk #1 in Worldwide +32.6% YoY #2 VMware, #3 IBM Splunk #1 in Worldwide +37.6% YoY, #2 IBM, #3 MicroFocus
  • 43. © 2 0 2 0 S P L U N K I N C . © 2 0 1 9 S P L U N K I N C . A Market Leader ITOM IT Operations Management : tools to manage provisioning, capacity, performance and availability of IT ITOA IT Operations Analytics : the practice of monitoring systems, and gathering, processing, analyzing & interpreting data from ITOps sources to guide decisions & predict issues SIEM Security Event Information Management AIOps Artificial Intelligence Operations : AIOps platforms enhance IT operations through greater insights by combining big data, machine learning and visualization. >> >> Sources: IDC ww Security Information & event management Share 2018, IDC worldwide IT Operations Management Software Market Share 2018, IDC WW IT Operations Analytics Software Market Shares 2017 and/or Gartner 2018 & 2019, Research In Action AIOps top 15 global vendors 2019. Gartner, Market Share: Enterprise Infrastructure Software, Worldwide, 2019 (April 2020). Splunk #1 in Worldwide +32.3% YoY #2 IBM, #3 Microsoft Splunk #1 in Worldwide +32.6% YoY #2 VMware, #3 IBM Splunk #1 in Worldwide +37.6% YoY, #2 IBM, #3 MicroFocus OBSERVE DECIDE ACCELERATEPROTECT Splunk among AIOps market leaders (top 5) By Research in Action & #1 - Gartner Marketshare: Gartner's Performance Analysis: AIOps, ITIM and ITOM APM Application Performance management : tools to monitor and optimize applications OBSERVE Splunk named a Visionary in our first-ever placement in the Gartner MQ Splunk ranked #1 in Gartner’s 2019 Market Share for Performance Analysis: AIOps, ITIM and Other Monitoring Tools category #1 Splunk 16.5% market share (+30.4%) #2 IBM : 13.2% (-6.5%) #3 Microsoft : 8.4% (+9.1%)
  • 44. © 2020 SPLUNK INC. Disjointed data sets Siloed views High MTTR Negative customer experience Zero downtime Record sales Record high customer satisfaction « Best black Friday ever » Sr. Director of SRE, Dell EMC
  • 45. Thank You © 2020 SPLUNK INC.