SlideShare uma empresa Scribd logo
1 de 34
Baixar para ler offline
How to run a 5 Whys (With Humans, Not Robots)
Dan Milstein
@danmil
Director of Product Development, Wingu
What Is a 5 Whys Anyways?

• Something you do when your company has badly screwed up


• E.g. your CEO demos your cloud storage system to an early prospective
  customer, and, when he runs a search, it shows other customers’ data (I have
  done this, it was not awesome)


• You get a bunch of people into a room and say: “How on earth did that
  happen? And how can we make sure it never, ever happens again?”


• That’s a 5 Whys (aka, a Post-Mortem)


• But, there’s a problem....
Shameful Mistakes: Humans vs Robots
Human Beings Will Eff It Up

• Humans (unlike robots) feel this intense emotion called shame


• Shame will suggest (strongly) “Slow Down, Stop Making So Many Mistakes”


• Aka “Throw overboard everything the Lean Startup tells you is important”


• Has potential to be incredibly damaging to your startup


• And I have some bad news...
You Will Totally Experience Shame (I Still Do)




   F.A.E.
This Emotional Experience Can Not Be Avoided

• I’ve run c. 50 post-mortems, have studied failure... and I still have this
  emotional reaction


• You will, too. And so will your team.


• Much more strongly than you realize right now


• This is the “Fundamental Attribution Error” (FAE), from psychology


• FAE = humans vastly underestimate the power of a situation on our behavior
Big Idea: Adopt Economic, Not Moral Mindset




           $, FTW
What Does That Mean

• Let me tell you a story...
Parable: A Tale of Two Factories
Two Factories

• Both make widgets


• Both are missing their monthly Widget Production goals by 10%


• But for different reasons...
Factory 1... Broken Machine
When The Machine Breaks...

• Belt slips off every once in a while


• Ruins a bunch of widgets


• Gotta replace it, drift a little behind plan


• So... what questions do humans ask in this situation?
Economic Mindset = Broken Machine

• “How much is it costing us?”

• “How much does it cost to repair?”

• “Can we kludge a partial fix?”

• “What are risks if we delay a fix?”
Note the Key Words

• “Cost”, “Partial”, “Risk”


• These are things you hear a lot in an economic discussion


• Okay, meanwhile in Factory 2, also missing by 10%, different reason...
Factory 2... One Employee Is an Axe Murderer
After Every Axe Murdering...

• Have to, like, hire a new guy, train him on the machine, takes forever


• Questions we asked before are now somehow deeply wrong:


• “What if we just cut down on the rate, so there’s less axe murdering?”


• “Hey, we can train a pool of temps on all the machines, when someone gets
  killed, we’ll just swap some new guy in, bang, problem solved!”


• “How much is it really costing us, anyways?”


• These ideas seem obscene, not merely bad
Moral Mindset = Axe Murderer


“Search for villains,
elevation of accusers,
and mobilization of authority to
mete out punishment”
(Pinker, The Blank Slate)
Moral Mindset, Key Words

• “Villains”, “Accusers”, “Authority”, “Punishment”


• I believe that most companies, in investigating outages, act much more
  like they’re looking for an axe murderer, than trying to fix a broken
  machine
Your Challenge, As Person Running 5 Whys




      Get team out of moral mindset.

      Note: this is not, in fact, easy.
Why It’s Hard

• Mindsets control how we interpret the world...


• ...including what people say to us


• So, a team sitting there, fearing moral censure, hears you say “We’re not
  looking to blame anyone”, they just think you’re lying. How could you mean
  that, when the thing that happened was so terrible and wrong?


• The deep trick (and this is the point of this whole presentation, frankly), is that
  you have to take advantage of the thing that separates humans and robots...
Fundamental Tool: Make ‘Em Laugh
Humor == Breaking Frames

• That’s what humor actually is -- something that stretches or breaks the
  mental frame that people are using to interpret a situation


• So, you use humor to break the frame, release people from the blame/fear/
  punishment of the moral mindset, and then refocus them on the economic
  challenges you’re facing


• The humor is, IMHO, not a nice-to-have. It’s absolutely central. I’ve seen
  smart, caring leaders get this one wrong, and finish their post-mortems with a
  room full of tense, closed-up team members (and no good ideas on the table)


• Rest of talk is specific examples of this, but this is the main point
Tip 1: Always Share Worse “Bad Things”
Place The Bad Thing on a Continuum

• Moral mindset is very absolutist: this bad thing is The Worst Thing Ever


• I like to say “Okay, well it’s pretty bad, let’s compare it to some things”


• Did we irretrievably lose customer data? (I’ve done that, not awesome)


• Did we almost get our customer fired by her boss (also, not awesome)


• Did we send hundreds of emails to everyone on our customer’s mailing list...
  but the emails were all question marks? For a customer who was in the
  proofreading business? (done that, very much not awesome)


• People laugh, and then say “Okay, how bad was this, really?” Win.
More Stories of Actual Failures (Just For Fun)

• Did we break our allergies-to-medicines module, and risk having a doctor
  prescribe the wrong medication to someone?


• Did our internet-connected home thermostat system have a server crash,
  causing all the thermostats to set the temp to the default... of 85 degrees?


• Did our high-frequency trading program have flaws that led to our company
  losing 450 million dollars? (that is a tough one to beat, IMHO)


• Collect your own! It’s fun!
Tip 2: Mock Hindsight Bias To Its Face



    “Let’s plan for a future
    where we’re all as stupid
    as we are today.”
How Hindsight Bias Shows up in a 5 Whys

• Someone says “Oh, yeah, I screwed that one up, I knew I had to run the
  deploy in that one order, and I just forgot. I’m really sorry, I won’t make that
  mistake again, totally my bad.”


• You have to utterly reject this. It’s pure hindsight bias (easy to see errors after
  the fact, very difficult in the moment).


• I say “It’s like we’re saying ‘I was stupid, this one time, and we’ll fix that
  problem by never being stupid again.’”


• Hence: “planning for a future where we’re as stupid as we are today”


• aka “Must create a system which is resilient to occasional bouts of really
  intense stupidity”.
Tip 3: Relish Absurdities of Your System
5 Whys Will Highlight That Your Code is a Mess

• E.g. you’ve refactored, and rewritten in python (or node or something), and
  moved to the cloud, but this 5 whys is making clear that your most important
  report is still run by a VisualCron job on a Windows server that never quite
  made it out of the office... and someone just tripped on the power cord


• Team will feel ashamed, you have to give them license to relish absurdity


• I often point out “There are two kinds of startups: the ones that achieve some
  modest traction on top of a pile of code of which they are vaguely ashamed...
  and the ones that go out of business. That’s it. No third kind.”


• Also sometimes it helps to just laugh: “It’s kind of amazing this works at all”
Tip 4: “Broadest Fixes” vs. “Root Causes”
Handling a Fork in the Road

• Example: bad outage at Wingu: was triggered by a mistake in db access
  code. But we couldn’t fix it for three hours, because our error reporting
  system was trying to send us hundreds of emails/minute, so our email
  provider throttled us, and we didn’t get those email until hours later.


• Which is the Root Cause? DB access bug or monitoring failure?


• Answer: don’t care about “root causes”. They don’t exist (multiple things
  conspire for failures to happen). Also, kind of moral/blame-ish.


• Ask instead: if we made an incremental improvement in area A or area B,
  which would prevent the broadest class of problems going ahead?


• Much better conversation. Answer here is clear: monitoring.
Remember, There Is No Axe Murderer (Probably)
Photo Credits

• “Robot de Martillo”, by Luis Perez, http://www.flickr.com/photos/pe5pe/2454661748/

• “Helios-Factory floor”, http://commons.wikimedia.org/wiki/File:Helioshall2.jpg

• “old machine”, by Jun Aoyama, http://www.flickr.com/photos/jam343/1730140/

• “Axe Marks The Spot”, by Alan Levine, http://www.flickr.com/photos/cogdog/4461665810/

• “Failboat Has Arrived”, http://www.rotskyinstitute.com/rotsky/wp-content/uploads/2008/02/
  failboat2.jpg

• “14 plugs but only 6 sockets”, by Jason Rogers, http://www.flickr.com/photos/restlessglobetrotter/
  2661016046/

• “Life is like that… a fork in the road… decision required”, by Roger Price, http://www.flickr.com/
  photos/rwp-roger/6687024883/
Thanks...

Dan Milstein
@danmil

Mais conteúdo relacionado

Mais procurados

PopcornFlow: Continuous Evolution Through Ultra-Rapid Experimentation
PopcornFlow: Continuous Evolution Through Ultra-Rapid ExperimentationPopcornFlow: Continuous Evolution Through Ultra-Rapid Experimentation
PopcornFlow: Continuous Evolution Through Ultra-Rapid ExperimentationClaudio Perrone
 
Agile Performance Metrics
Agile Performance MetricsAgile Performance Metrics
Agile Performance MetricsACM
 
Management 3.0 - Complexity Thinking
Management 3.0 - Complexity ThinkingManagement 3.0 - Complexity Thinking
Management 3.0 - Complexity ThinkingJurgen Appelo
 
Doing Agile Isnt The Same As Being Agile
Doing Agile Isnt The Same As Being AgileDoing Agile Isnt The Same As Being Agile
Doing Agile Isnt The Same As Being Agilelazygolfer
 
Leading With Authenticity, Vulnerability, Inclusivity, Trust, and Reflection
Leading With Authenticity, Vulnerability, Inclusivity, Trust, and ReflectionLeading With Authenticity, Vulnerability, Inclusivity, Trust, and Reflection
Leading With Authenticity, Vulnerability, Inclusivity, Trust, and ReflectionLisa D'Adamo-Weinstein
 
Dealing with Shifting Priorities using Lean/Kanban Flow, WIP Limits and Capac...
Dealing with Shifting Priorities using Lean/Kanban Flow, WIP Limits and Capac...Dealing with Shifting Priorities using Lean/Kanban Flow, WIP Limits and Capac...
Dealing with Shifting Priorities using Lean/Kanban Flow, WIP Limits and Capac...AgileSparks
 
Kanban Metrics in practice at Sky Network Services
Kanban Metrics in practice at Sky Network ServicesKanban Metrics in practice at Sky Network Services
Kanban Metrics in practice at Sky Network ServicesMattia Battiston
 
Site reliability engineering
Site reliability engineeringSite reliability engineering
Site reliability engineeringJason Loeffler
 
Specification-By-Example with Gherkin
Specification-By-Example with GherkinSpecification-By-Example with Gherkin
Specification-By-Example with GherkinChristian Hassa
 
User Story Workshop
User Story WorkshopUser Story Workshop
User Story WorkshopPeter Antman
 
21 Story Splitting Patterns
21 Story Splitting Patterns21 Story Splitting Patterns
21 Story Splitting PatternsKent McDonald
 
Agile leadership for the future
Agile leadership for the futureAgile leadership for the future
Agile leadership for the futureNasima Shafiul
 
Black Box Thinking - The Surprising Truth About Success
Black Box Thinking - The Surprising Truth About SuccessBlack Box Thinking - The Surprising Truth About Success
Black Box Thinking - The Surprising Truth About SuccessMatthew Syed
 
Coaching Case Studies Case Study Number 1   Maria i.docx
Coaching Case Studies Case Study Number 1   Maria i.docxCoaching Case Studies Case Study Number 1   Maria i.docx
Coaching Case Studies Case Study Number 1   Maria i.docxclarebernice
 
Steal this presentation
Steal this presentationSteal this presentation
Steal this presentationScott Hoag
 
Book summary The Squiggly Career
Book summary The Squiggly CareerBook summary The Squiggly Career
Book summary The Squiggly CareerAbhishek Ghosh PMP
 
Test Driven Development (TDD)
Test Driven Development (TDD)Test Driven Development (TDD)
Test Driven Development (TDD)David Ehringer
 

Mais procurados (20)

PopcornFlow: Continuous Evolution Through Ultra-Rapid Experimentation
PopcornFlow: Continuous Evolution Through Ultra-Rapid ExperimentationPopcornFlow: Continuous Evolution Through Ultra-Rapid Experimentation
PopcornFlow: Continuous Evolution Through Ultra-Rapid Experimentation
 
Agile Performance Metrics
Agile Performance MetricsAgile Performance Metrics
Agile Performance Metrics
 
Management 3.0 - Complexity Thinking
Management 3.0 - Complexity ThinkingManagement 3.0 - Complexity Thinking
Management 3.0 - Complexity Thinking
 
Doing Agile Isnt The Same As Being Agile
Doing Agile Isnt The Same As Being AgileDoing Agile Isnt The Same As Being Agile
Doing Agile Isnt The Same As Being Agile
 
Leading With Authenticity, Vulnerability, Inclusivity, Trust, and Reflection
Leading With Authenticity, Vulnerability, Inclusivity, Trust, and ReflectionLeading With Authenticity, Vulnerability, Inclusivity, Trust, and Reflection
Leading With Authenticity, Vulnerability, Inclusivity, Trust, and Reflection
 
Dealing with Shifting Priorities using Lean/Kanban Flow, WIP Limits and Capac...
Dealing with Shifting Priorities using Lean/Kanban Flow, WIP Limits and Capac...Dealing with Shifting Priorities using Lean/Kanban Flow, WIP Limits and Capac...
Dealing with Shifting Priorities using Lean/Kanban Flow, WIP Limits and Capac...
 
Kanban Metrics in practice at Sky Network Services
Kanban Metrics in practice at Sky Network ServicesKanban Metrics in practice at Sky Network Services
Kanban Metrics in practice at Sky Network Services
 
Site reliability engineering
Site reliability engineeringSite reliability engineering
Site reliability engineering
 
Specification-By-Example with Gherkin
Specification-By-Example with GherkinSpecification-By-Example with Gherkin
Specification-By-Example with Gherkin
 
User Story Workshop
User Story WorkshopUser Story Workshop
User Story Workshop
 
A3 & Kaizen: Here's How
A3 & Kaizen: Here's HowA3 & Kaizen: Here's How
A3 & Kaizen: Here's How
 
21 Story Splitting Patterns
21 Story Splitting Patterns21 Story Splitting Patterns
21 Story Splitting Patterns
 
Agile leadership for the future
Agile leadership for the futureAgile leadership for the future
Agile leadership for the future
 
Black Box Thinking - The Surprising Truth About Success
Black Box Thinking - The Surprising Truth About SuccessBlack Box Thinking - The Surprising Truth About Success
Black Box Thinking - The Surprising Truth About Success
 
Coaching Case Studies Case Study Number 1   Maria i.docx
Coaching Case Studies Case Study Number 1   Maria i.docxCoaching Case Studies Case Study Number 1   Maria i.docx
Coaching Case Studies Case Study Number 1   Maria i.docx
 
Agile Leadership
Agile LeadershipAgile Leadership
Agile Leadership
 
Steal this presentation
Steal this presentationSteal this presentation
Steal this presentation
 
DevOps 101
DevOps 101DevOps 101
DevOps 101
 
Book summary The Squiggly Career
Book summary The Squiggly CareerBook summary The Squiggly Career
Book summary The Squiggly Career
 
Test Driven Development (TDD)
Test Driven Development (TDD)Test Driven Development (TDD)
Test Driven Development (TDD)
 

Semelhante a How To Run a 5 Whys (With Humans, Not Robots)

How to Run a Post-Mortem (With Humans, Not Robots), Velocity 2013
How to Run a Post-Mortem (With Humans, Not Robots), Velocity 2013How to Run a Post-Mortem (With Humans, Not Robots), Velocity 2013
How to Run a Post-Mortem (With Humans, Not Robots), Velocity 2013Dan Milstein
 
Get Kudos from customers (without bribing them)
Get Kudos from customers (without bribing them)Get Kudos from customers (without bribing them)
Get Kudos from customers (without bribing them)Clairetalbott
 
Corp Web Risks and Concerns
Corp Web Risks and ConcernsCorp Web Risks and Concerns
Corp Web Risks and ConcernsPINT Inc
 
The basics of e-service
The basics of e-serviceThe basics of e-service
The basics of e-serviceEric Reiss
 
Supercharging your bug reports
Supercharging your bug reportsSupercharging your bug reports
Supercharging your bug reportsNeil Studd
 
Startups and Smalltak - Presented at Smalltalks2014 Córdoba, Argentina
Startups and Smalltak - Presented at Smalltalks2014 Córdoba, ArgentinaStartups and Smalltak - Presented at Smalltalks2014 Córdoba, Argentina
Startups and Smalltak - Presented at Smalltalks2014 Córdoba, Argentinasebastian sastre
 
Cross Functional Teams and the Product Manager
Cross Functional Teams and the Product ManagerCross Functional Teams and the Product Manager
Cross Functional Teams and the Product ManagerSVPMA
 
Blue Start-Ups - Pau Hana
Blue Start-Ups - Pau HanaBlue Start-Ups - Pau Hana
Blue Start-Ups - Pau HanaBen Sykes
 
Seven Classic Startup Failure Modes
Seven Classic Startup Failure ModesSeven Classic Startup Failure Modes
Seven Classic Startup Failure ModesJoe Dunn
 
Conflict Management in Technology
Conflict Management in Technology Conflict Management in Technology
Conflict Management in Technology Denton Farley
 
Architecting a Post Mortem - Velocity 2018 San Jose Tutorial
Architecting a Post Mortem - Velocity 2018 San Jose TutorialArchitecting a Post Mortem - Velocity 2018 San Jose Tutorial
Architecting a Post Mortem - Velocity 2018 San Jose TutorialWill Gallego
 
Griffin Been technology Report
Griffin Been technology ReportGriffin Been technology Report
Griffin Been technology ReportMarq2014
 
Reiss On E Service UX London
Reiss On E Service UX LondonReiss On E Service UX London
Reiss On E Service UX LondonEric Reiss
 
Five Ways to Get Better Data From Our Users
Five Ways to Get Better Data From Our UsersFive Ways to Get Better Data From Our Users
Five Ways to Get Better Data From Our UsersSajid Reshamwala
 
Uncharted lands, or why games are not designed but discovered
Uncharted lands, or why games are not designed but discoveredUncharted lands, or why games are not designed but discovered
Uncharted lands, or why games are not designed but discoveredJakub Stokalski
 
How PBworks Used Lean Startup Techniques
How PBworks Used Lean Startup TechniquesHow PBworks Used Lean Startup Techniques
How PBworks Used Lean Startup TechniquesDavid E. Weekly
 
All That Glitters Is Not Gold: Usability Design for "When Things Go Wrong"
All That Glitters Is Not Gold: Usability Design for "When Things Go Wrong"All That Glitters Is Not Gold: Usability Design for "When Things Go Wrong"
All That Glitters Is Not Gold: Usability Design for "When Things Go Wrong"⌨️ Steven Proctor
 

Semelhante a How To Run a 5 Whys (With Humans, Not Robots) (20)

How to Run a Post-Mortem (With Humans, Not Robots), Velocity 2013
How to Run a Post-Mortem (With Humans, Not Robots), Velocity 2013How to Run a Post-Mortem (With Humans, Not Robots), Velocity 2013
How to Run a Post-Mortem (With Humans, Not Robots), Velocity 2013
 
Get Kudos from customers (without bribing them)
Get Kudos from customers (without bribing them)Get Kudos from customers (without bribing them)
Get Kudos from customers (without bribing them)
 
Corp Web Risks and Concerns
Corp Web Risks and ConcernsCorp Web Risks and Concerns
Corp Web Risks and Concerns
 
The basics of e-service
The basics of e-serviceThe basics of e-service
The basics of e-service
 
Supercharging your bug reports
Supercharging your bug reportsSupercharging your bug reports
Supercharging your bug reports
 
The alignment
The alignmentThe alignment
The alignment
 
Startups and Smalltak - Presented at Smalltalks2014 Córdoba, Argentina
Startups and Smalltak - Presented at Smalltalks2014 Córdoba, ArgentinaStartups and Smalltak - Presented at Smalltalks2014 Córdoba, Argentina
Startups and Smalltak - Presented at Smalltalks2014 Córdoba, Argentina
 
Cross Functional Teams and the Product Manager
Cross Functional Teams and the Product ManagerCross Functional Teams and the Product Manager
Cross Functional Teams and the Product Manager
 
Think epic be epic
Think epic be epicThink epic be epic
Think epic be epic
 
Blue Start-Ups - Pau Hana
Blue Start-Ups - Pau HanaBlue Start-Ups - Pau Hana
Blue Start-Ups - Pau Hana
 
Seven Classic Startup Failure Modes
Seven Classic Startup Failure ModesSeven Classic Startup Failure Modes
Seven Classic Startup Failure Modes
 
Conflict Management in Technology
Conflict Management in Technology Conflict Management in Technology
Conflict Management in Technology
 
Architecting a Post Mortem - Velocity 2018 San Jose Tutorial
Architecting a Post Mortem - Velocity 2018 San Jose TutorialArchitecting a Post Mortem - Velocity 2018 San Jose Tutorial
Architecting a Post Mortem - Velocity 2018 San Jose Tutorial
 
Growth meetup-q4-2014
Growth meetup-q4-2014Growth meetup-q4-2014
Growth meetup-q4-2014
 
Griffin Been technology Report
Griffin Been technology ReportGriffin Been technology Report
Griffin Been technology Report
 
Reiss On E Service UX London
Reiss On E Service UX LondonReiss On E Service UX London
Reiss On E Service UX London
 
Five Ways to Get Better Data From Our Users
Five Ways to Get Better Data From Our UsersFive Ways to Get Better Data From Our Users
Five Ways to Get Better Data From Our Users
 
Uncharted lands, or why games are not designed but discovered
Uncharted lands, or why games are not designed but discoveredUncharted lands, or why games are not designed but discovered
Uncharted lands, or why games are not designed but discovered
 
How PBworks Used Lean Startup Techniques
How PBworks Used Lean Startup TechniquesHow PBworks Used Lean Startup Techniques
How PBworks Used Lean Startup Techniques
 
All That Glitters Is Not Gold: Usability Design for "When Things Go Wrong"
All That Glitters Is Not Gold: Usability Design for "When Things Go Wrong"All That Glitters Is Not Gold: Usability Design for "When Things Go Wrong"
All That Glitters Is Not Gold: Usability Design for "When Things Go Wrong"
 

Último

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 

Último (20)

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 

How To Run a 5 Whys (With Humans, Not Robots)

  • 1. How to run a 5 Whys (With Humans, Not Robots) Dan Milstein @danmil Director of Product Development, Wingu
  • 2. What Is a 5 Whys Anyways? • Something you do when your company has badly screwed up • E.g. your CEO demos your cloud storage system to an early prospective customer, and, when he runs a search, it shows other customers’ data (I have done this, it was not awesome) • You get a bunch of people into a room and say: “How on earth did that happen? And how can we make sure it never, ever happens again?” • That’s a 5 Whys (aka, a Post-Mortem) • But, there’s a problem....
  • 4. Human Beings Will Eff It Up • Humans (unlike robots) feel this intense emotion called shame • Shame will suggest (strongly) “Slow Down, Stop Making So Many Mistakes” • Aka “Throw overboard everything the Lean Startup tells you is important” • Has potential to be incredibly damaging to your startup • And I have some bad news...
  • 5. You Will Totally Experience Shame (I Still Do) F.A.E.
  • 6. This Emotional Experience Can Not Be Avoided • I’ve run c. 50 post-mortems, have studied failure... and I still have this emotional reaction • You will, too. And so will your team. • Much more strongly than you realize right now • This is the “Fundamental Attribution Error” (FAE), from psychology • FAE = humans vastly underestimate the power of a situation on our behavior
  • 7. Big Idea: Adopt Economic, Not Moral Mindset $, FTW
  • 8. What Does That Mean • Let me tell you a story...
  • 9. Parable: A Tale of Two Factories
  • 10. Two Factories • Both make widgets • Both are missing their monthly Widget Production goals by 10% • But for different reasons...
  • 12. When The Machine Breaks... • Belt slips off every once in a while • Ruins a bunch of widgets • Gotta replace it, drift a little behind plan • So... what questions do humans ask in this situation?
  • 13. Economic Mindset = Broken Machine • “How much is it costing us?” • “How much does it cost to repair?” • “Can we kludge a partial fix?” • “What are risks if we delay a fix?”
  • 14. Note the Key Words • “Cost”, “Partial”, “Risk” • These are things you hear a lot in an economic discussion • Okay, meanwhile in Factory 2, also missing by 10%, different reason...
  • 15. Factory 2... One Employee Is an Axe Murderer
  • 16. After Every Axe Murdering... • Have to, like, hire a new guy, train him on the machine, takes forever • Questions we asked before are now somehow deeply wrong: • “What if we just cut down on the rate, so there’s less axe murdering?” • “Hey, we can train a pool of temps on all the machines, when someone gets killed, we’ll just swap some new guy in, bang, problem solved!” • “How much is it really costing us, anyways?” • These ideas seem obscene, not merely bad
  • 17. Moral Mindset = Axe Murderer “Search for villains, elevation of accusers, and mobilization of authority to mete out punishment” (Pinker, The Blank Slate)
  • 18. Moral Mindset, Key Words • “Villains”, “Accusers”, “Authority”, “Punishment” • I believe that most companies, in investigating outages, act much more like they’re looking for an axe murderer, than trying to fix a broken machine
  • 19. Your Challenge, As Person Running 5 Whys Get team out of moral mindset. Note: this is not, in fact, easy.
  • 20. Why It’s Hard • Mindsets control how we interpret the world... • ...including what people say to us • So, a team sitting there, fearing moral censure, hears you say “We’re not looking to blame anyone”, they just think you’re lying. How could you mean that, when the thing that happened was so terrible and wrong? • The deep trick (and this is the point of this whole presentation, frankly), is that you have to take advantage of the thing that separates humans and robots...
  • 21. Fundamental Tool: Make ‘Em Laugh
  • 22. Humor == Breaking Frames • That’s what humor actually is -- something that stretches or breaks the mental frame that people are using to interpret a situation • So, you use humor to break the frame, release people from the blame/fear/ punishment of the moral mindset, and then refocus them on the economic challenges you’re facing • The humor is, IMHO, not a nice-to-have. It’s absolutely central. I’ve seen smart, caring leaders get this one wrong, and finish their post-mortems with a room full of tense, closed-up team members (and no good ideas on the table) • Rest of talk is specific examples of this, but this is the main point
  • 23. Tip 1: Always Share Worse “Bad Things”
  • 24. Place The Bad Thing on a Continuum • Moral mindset is very absolutist: this bad thing is The Worst Thing Ever • I like to say “Okay, well it’s pretty bad, let’s compare it to some things” • Did we irretrievably lose customer data? (I’ve done that, not awesome) • Did we almost get our customer fired by her boss (also, not awesome) • Did we send hundreds of emails to everyone on our customer’s mailing list... but the emails were all question marks? For a customer who was in the proofreading business? (done that, very much not awesome) • People laugh, and then say “Okay, how bad was this, really?” Win.
  • 25. More Stories of Actual Failures (Just For Fun) • Did we break our allergies-to-medicines module, and risk having a doctor prescribe the wrong medication to someone? • Did our internet-connected home thermostat system have a server crash, causing all the thermostats to set the temp to the default... of 85 degrees? • Did our high-frequency trading program have flaws that led to our company losing 450 million dollars? (that is a tough one to beat, IMHO) • Collect your own! It’s fun!
  • 26. Tip 2: Mock Hindsight Bias To Its Face “Let’s plan for a future where we’re all as stupid as we are today.”
  • 27. How Hindsight Bias Shows up in a 5 Whys • Someone says “Oh, yeah, I screwed that one up, I knew I had to run the deploy in that one order, and I just forgot. I’m really sorry, I won’t make that mistake again, totally my bad.” • You have to utterly reject this. It’s pure hindsight bias (easy to see errors after the fact, very difficult in the moment). • I say “It’s like we’re saying ‘I was stupid, this one time, and we’ll fix that problem by never being stupid again.’” • Hence: “planning for a future where we’re as stupid as we are today” • aka “Must create a system which is resilient to occasional bouts of really intense stupidity”.
  • 28. Tip 3: Relish Absurdities of Your System
  • 29. 5 Whys Will Highlight That Your Code is a Mess • E.g. you’ve refactored, and rewritten in python (or node or something), and moved to the cloud, but this 5 whys is making clear that your most important report is still run by a VisualCron job on a Windows server that never quite made it out of the office... and someone just tripped on the power cord • Team will feel ashamed, you have to give them license to relish absurdity • I often point out “There are two kinds of startups: the ones that achieve some modest traction on top of a pile of code of which they are vaguely ashamed... and the ones that go out of business. That’s it. No third kind.” • Also sometimes it helps to just laugh: “It’s kind of amazing this works at all”
  • 30. Tip 4: “Broadest Fixes” vs. “Root Causes”
  • 31. Handling a Fork in the Road • Example: bad outage at Wingu: was triggered by a mistake in db access code. But we couldn’t fix it for three hours, because our error reporting system was trying to send us hundreds of emails/minute, so our email provider throttled us, and we didn’t get those email until hours later. • Which is the Root Cause? DB access bug or monitoring failure? • Answer: don’t care about “root causes”. They don’t exist (multiple things conspire for failures to happen). Also, kind of moral/blame-ish. • Ask instead: if we made an incremental improvement in area A or area B, which would prevent the broadest class of problems going ahead? • Much better conversation. Answer here is clear: monitoring.
  • 32. Remember, There Is No Axe Murderer (Probably)
  • 33. Photo Credits • “Robot de Martillo”, by Luis Perez, http://www.flickr.com/photos/pe5pe/2454661748/ • “Helios-Factory floor”, http://commons.wikimedia.org/wiki/File:Helioshall2.jpg • “old machine”, by Jun Aoyama, http://www.flickr.com/photos/jam343/1730140/ • “Axe Marks The Spot”, by Alan Levine, http://www.flickr.com/photos/cogdog/4461665810/ • “Failboat Has Arrived”, http://www.rotskyinstitute.com/rotsky/wp-content/uploads/2008/02/ failboat2.jpg • “14 plugs but only 6 sockets”, by Jason Rogers, http://www.flickr.com/photos/restlessglobetrotter/ 2661016046/ • “Life is like that… a fork in the road… decision required”, by Roger Price, http://www.flickr.com/ photos/rwp-roger/6687024883/