SlideShare uma empresa Scribd logo
1 de 28
Baixar para ler offline
From Zero To Visibility
Bridget Kromhout
8thbridge.com
small social commerce startup
acquired in the last month by Fluid, Inc.
small devteam
I am the ops team
http://www.thedirtbox.com/wp-content/uploads/2013/01/ping-pongart.jpg
twisty maze of little shell scripts
http://www.pcgameshardware.de/screenshots/1280x1024/2007/07/CA01.jpg
time-consuming to understand
difficult to modify
doesn’t scale
artisanal monitoring?!
http://shop.bespokebacon.com/images/bespoke-logo.final(3).png
New Relic
pros:
nice graphs
application-level view
good error analysis
cons:
slow to update
many false-positive alerts
high prices (better now)
motivating change
http://99designs.
com/illustrations/contests/illustration-
pagerduty-161025/entries
as hideous as you remember
“Horrendous interface”
“Well, it’s more “old” than anything
else. At least everything is in the
same place as you left it because it’s
been the same since 1912.”
https://laur.ie/blog/2014/02/why-ill-be-letting-nagios-live-on-a-bit-longer-thank-you-very-much/
not alone!
“Sensu has so many
moving parts that I
wouldn’t be able to
sleep at night unless
I set up a Nagios
instance to make
sure they were all
running.”
who watches the RabbitMQ?
-- @murphy_slaw (via @lozzd)
http://images.sodahead.com/profiles/0/0/0/5/1/6/6/3/9/Watchmen-trademark-symbol-62141795529.jpeg
http://portertech.ca/images/2011-11-01/sensu-diagram.png
hating on nagios: the middle years
“hadoop does not suffer from a paucity of configuration options”
http://jaganesundar.wordpress.com/2011/12/05/installing-and-configuring-hadoop-0-20-205-using-it-rpm/
monitor all the ports?!
best way to monitor HBase:
hbck: the HBase consistency checker
nagios -> bash script -> parsing output of hbck
http://www.ymc.ch/en/how-to-monitor-hbase-health-by-nagios
http://modiinhub.com/wp-content/uploads/2014/02/logo-mongodb-tagline.png
“Cyber” monday: 1988 called; wants its word back.
wow. such nosql. very webscale.
“a single write operation holds the lock exclusively, and
no other read or write operations may share the lock.”
“If it moves, we track it. Sometimes we’ll
draw a graph of something that isn’t
moving yet, just in case it decides to make
a run for it.” Ian Malpass, Etsy
http://codeascraft.com/2011/02/15/measure-anything-measure-everything/
the (former) state of our graphite & statsd
● Graphite 0.9.9
○ hand-rolled
○ over 2 years old
○ missing new features (Consolidate by!)
● StatsD was newish, but…
○ hand-rolled
○ running in a screen session
○ on a special snowflake box
http://media-cache-ec0.pinimg.com/736x/68/c2/9d/68c29deb72bad94cd4e3c1aa0f3cdcd8.jpg
this is wrong tool. never use this.
Community cookbooks?
● StatsD
○ https://github.com/librato/statsd-cookbook
● Graphite ones good, but…
○ focus on Apache (we use nginx)
○ we haven’t moved to Chef 11 (gasp!)
when in doubt: tcpdump is your friend
http://blog.johngoulah.com/2012/10/looking-under-the-covers-of-statsd/
carbon-aggravator (between 0.9.10 & 0.9.12)
# If set true, metric received will be forwarded to
# DESTINATIONS in addition to
# the output of the aggregation rules. If set false
# the carbon-aggregator will
# only ever send the output of aggregation.
FORWARD_ALL = True
carbonate: A+++ would clone again
whisper-fill.py
backfill datapoints between whisper files
life as a third wheel party
thresholds: because not every outage is abrupt
normal traffic
decision
to turn off
decision
to turn
back on
accidental removal
open-source error reporting
all the things
StatsD
Application-level error
analysis
Alarms for autoscaling
Timers &
counters
Log & host-level
Hadoop & HBase
visualization
MongoDB
Graphs
Time-series
data graphing
client-side
plugins
Threshold-based alarmsDashboard
external checks
What’s next?
http://blog.xebia.fr/wp-content/uploads/2013/12/file-logstash-es-kibana.png
what even is ideal monitoring solution
http://www.quickmeme.com/img/f5/f512ff9bee084263df5571d3c81388019dcb063173e1dbcfa2babac9274576b6.jpg
❏ finds real problems
❏ actionable alerting
❏ usable by all
❏ …?
questions; comments; whatnot
Twitter: @bridgetkromhout
Email: bridget@kromhout.org
In person: DevOps Days Minneapolis
(devopsdays.org)

Mais conteúdo relacionado

Semelhante a From Zero To Visibility

Web challenge cmm
Web challenge cmmWeb challenge cmm
Web challenge cmm
Herman Lee
 

Semelhante a From Zero To Visibility (20)

Leveling up your JavaScipt - DrupalJam 2017
Leveling up your JavaScipt - DrupalJam 2017Leveling up your JavaScipt - DrupalJam 2017
Leveling up your JavaScipt - DrupalJam 2017
 
A Period of Transition
A Period of TransitionA Period of Transition
A Period of Transition
 
Extending Gradle
Extending GradleExtending Gradle
Extending Gradle
 
Workflow Engines + Luigi
Workflow Engines + LuigiWorkflow Engines + Luigi
Workflow Engines + Luigi
 
Running HTML5 Mobile Web Games at 60fps
Running HTML5 Mobile Web Games at 60fpsRunning HTML5 Mobile Web Games at 60fps
Running HTML5 Mobile Web Games at 60fps
 
There is something about serverless
There is something about serverlessThere is something about serverless
There is something about serverless
 
Shifting Gears
Shifting GearsShifting Gears
Shifting Gears
 
iOS Indie Developer Toolkit - CocoaHeads 3city
iOS Indie Developer Toolkit - CocoaHeads 3cityiOS Indie Developer Toolkit - CocoaHeads 3city
iOS Indie Developer Toolkit - CocoaHeads 3city
 
Design ♥ HTML5 – The Future of Mobile Editorial Design
Design ♥ HTML5 – The Future of Mobile Editorial DesignDesign ♥ HTML5 – The Future of Mobile Editorial Design
Design ♥ HTML5 – The Future of Mobile Editorial Design
 
Educate 2017: Quick 'n Lazy: How we keep things speedy while staying out of y...
Educate 2017: Quick 'n Lazy: How we keep things speedy while staying out of y...Educate 2017: Quick 'n Lazy: How we keep things speedy while staying out of y...
Educate 2017: Quick 'n Lazy: How we keep things speedy while staying out of y...
 
Continuous integration is not a solved problem
Continuous integration is not a solved problemContinuous integration is not a solved problem
Continuous integration is not a solved problem
 
Web challenge cmm
Web challenge cmmWeb challenge cmm
Web challenge cmm
 
Y U NO CRAFTSMAN
Y U NO CRAFTSMANY U NO CRAFTSMAN
Y U NO CRAFTSMAN
 
Look ma! No images!
Look ma! No images!Look ma! No images!
Look ma! No images!
 
Reactive datastore demo (2020 03-21)
Reactive datastore demo (2020 03-21)Reactive datastore demo (2020 03-21)
Reactive datastore demo (2020 03-21)
 
Visualizing data fall2011
Visualizing data fall2011Visualizing data fall2011
Visualizing data fall2011
 
Bridging the Gap - Designer Developer Collaboration
Bridging the Gap - Designer Developer CollaborationBridging the Gap - Designer Developer Collaboration
Bridging the Gap - Designer Developer Collaboration
 
Prototyping user interactions in web apps
Prototyping user interactions in web appsPrototyping user interactions in web apps
Prototyping user interactions in web apps
 
Dispensa software per la stampa 3D (in italiano)
Dispensa software per la stampa 3D (in italiano)Dispensa software per la stampa 3D (in italiano)
Dispensa software per la stampa 3D (in italiano)
 
Kazoo billing
Kazoo billingKazoo billing
Kazoo billing
 

Mais de bridgetkromhout

Mais de bridgetkromhout (20)

An introduction to Helm - KubeCon EU 2020
An introduction to Helm - KubeCon EU 2020An introduction to Helm - KubeCon EU 2020
An introduction to Helm - KubeCon EU 2020
 
Join Our Party: The Cloud Native Adventure Brigade (Kubernetes Belgium 2019)
Join Our Party: The Cloud Native Adventure Brigade (Kubernetes Belgium 2019)Join Our Party: The Cloud Native Adventure Brigade (Kubernetes Belgium 2019)
Join Our Party: The Cloud Native Adventure Brigade (Kubernetes Belgium 2019)
 
devops, distributed (devopsdays Ghent 2019)
devops, distributed (devopsdays Ghent 2019)devops, distributed (devopsdays Ghent 2019)
devops, distributed (devopsdays Ghent 2019)
 
Join Our Party: The Cloud Native Adventure Brigade (devopsdays Philly 2019)
Join Our Party: The Cloud Native Adventure Brigade (devopsdays Philly 2019)Join Our Party: The Cloud Native Adventure Brigade (devopsdays Philly 2019)
Join Our Party: The Cloud Native Adventure Brigade (devopsdays Philly 2019)
 
Join Our Party: The Cloud Native Adventure Brigade (TCSW 2019)
Join Our Party: The Cloud Native Adventure Brigade (TCSW 2019)Join Our Party: The Cloud Native Adventure Brigade (TCSW 2019)
Join Our Party: The Cloud Native Adventure Brigade (TCSW 2019)
 
Increasing Reliability via Helm Pre-Release Checks (Helm Summit 2019)
Increasing Reliability via Helm Pre-Release Checks (Helm Summit 2019)Increasing Reliability via Helm Pre-Release Checks (Helm Summit 2019)
Increasing Reliability via Helm Pre-Release Checks (Helm Summit 2019)
 
Kubernetes for the Impatient (devopsdays Cape Town 2019)
Kubernetes for the Impatient (devopsdays Cape Town 2019)Kubernetes for the Impatient (devopsdays Cape Town 2019)
Kubernetes for the Impatient (devopsdays Cape Town 2019)
 
Join Our Party: The Cloud Native Adventure Brigade (OSS 2019)
Join Our Party: The Cloud Native Adventure Brigade (OSS 2019)Join Our Party: The Cloud Native Adventure Brigade (OSS 2019)
Join Our Party: The Cloud Native Adventure Brigade (OSS 2019)
 
Helm 3: Navigating To Distant Shores (OSS NA 2019)
Helm 3: Navigating To Distant Shores (OSS NA 2019)Helm 3: Navigating To Distant Shores (OSS NA 2019)
Helm 3: Navigating To Distant Shores (OSS NA 2019)
 
Helm 3: Navigating to Distant Shores (OSCON 2019)
Helm 3: Navigating to Distant Shores (OSCON 2019)Helm 3: Navigating to Distant Shores (OSCON 2019)
Helm 3: Navigating to Distant Shores (OSCON 2019)
 
Kubernetes for the Impatient (Velocity San Jose 2019)
Kubernetes for the Impatient (Velocity San Jose 2019)Kubernetes for the Impatient (Velocity San Jose 2019)
Kubernetes for the Impatient (Velocity San Jose 2019)
 
Community projects inform enterprise products (Velocity San Jose 2019)
Community projects inform enterprise products (Velocity San Jose 2019)Community projects inform enterprise products (Velocity San Jose 2019)
Community projects inform enterprise products (Velocity San Jose 2019)
 
Helm 3: Navigating to Distant Shores (KubeCon EU 2019)
Helm 3: Navigating to Distant Shores (KubeCon EU 2019)Helm 3: Navigating to Distant Shores (KubeCon EU 2019)
Helm 3: Navigating to Distant Shores (KubeCon EU 2019)
 
Kubernetes Operability Tooling (GOTO Chicago 2019)
Kubernetes Operability Tooling (GOTO Chicago 2019)Kubernetes Operability Tooling (GOTO Chicago 2019)
Kubernetes Operability Tooling (GOTO Chicago 2019)
 
Kubernetes Operability Tooling (Minnebar 2019)
Kubernetes Operability Tooling (Minnebar 2019)Kubernetes Operability Tooling (Minnebar 2019)
Kubernetes Operability Tooling (Minnebar 2019)
 
Livetweeting Tech Conferences - SREcon Americas 2019
Livetweeting Tech Conferences - SREcon Americas 2019Livetweeting Tech Conferences - SREcon Americas 2019
Livetweeting Tech Conferences - SREcon Americas 2019
 
Kubernetes Operability Tooling (devopsdays Seattle 2019)
Kubernetes Operability Tooling (devopsdays Seattle 2019)Kubernetes Operability Tooling (devopsdays Seattle 2019)
Kubernetes Operability Tooling (devopsdays Seattle 2019)
 
Kubernetes Operability Tooling (LEAP 2019)
Kubernetes Operability Tooling (LEAP 2019)Kubernetes Operability Tooling (LEAP 2019)
Kubernetes Operability Tooling (LEAP 2019)
 
Day 2 Kubernetes - Tools for Operability (KubeCon)
Day 2 Kubernetes - Tools for Operability (KubeCon)Day 2 Kubernetes - Tools for Operability (KubeCon)
Day 2 Kubernetes - Tools for Operability (KubeCon)
 
Cloud, Containers, Kubernetes (YOW Melbourne 2018)
Cloud, Containers, Kubernetes (YOW Melbourne 2018)Cloud, Containers, Kubernetes (YOW Melbourne 2018)
Cloud, Containers, Kubernetes (YOW Melbourne 2018)
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 

From Zero To Visibility