SlideShare a Scribd company logo
1 of 25
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog
PRINCIPLES OF OBSERVABILITY // DANIEL MAHER, DataDog

More Related Content

More from DevOpsDays Tel Aviv

HOW TO OPTIMIZE NON-CODING TIME, ORI KEREN, LinearB
HOW TO OPTIMIZE NON-CODING TIME, ORI KEREN, LinearBHOW TO OPTIMIZE NON-CODING TIME, ORI KEREN, LinearB
HOW TO OPTIMIZE NON-CODING TIME, ORI KEREN, LinearB
DevOpsDays Tel Aviv
 
DON'T PANIC: GETTING YOUR INFRASTRUCTURE DRIFT UNDER CONTROL, ERAN BIBI, Firefly
DON'T PANIC: GETTING YOUR INFRASTRUCTURE DRIFT UNDER CONTROL, ERAN BIBI, FireflyDON'T PANIC: GETTING YOUR INFRASTRUCTURE DRIFT UNDER CONTROL, ERAN BIBI, Firefly
DON'T PANIC: GETTING YOUR INFRASTRUCTURE DRIFT UNDER CONTROL, ERAN BIBI, Firefly
DevOpsDays Tel Aviv
 
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHubSOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
DevOpsDays Tel Aviv
 

More from DevOpsDays Tel Aviv (20)

THE PLEASURES OF ON-PREM, TOMER GABEL
THE PLEASURES OF ON-PREM, TOMER GABELTHE PLEASURES OF ON-PREM, TOMER GABEL
THE PLEASURES OF ON-PREM, TOMER GABEL
 
CONFIGURATION MANAGEMENT IN THE CLOUD NATIVE ERA, SHAHAR MINTZ, EggPack
CONFIGURATION MANAGEMENT IN THE CLOUD NATIVE ERA, SHAHAR MINTZ, EggPackCONFIGURATION MANAGEMENT IN THE CLOUD NATIVE ERA, SHAHAR MINTZ, EggPack
CONFIGURATION MANAGEMENT IN THE CLOUD NATIVE ERA, SHAHAR MINTZ, EggPack
 
SOLVING THE DEVOPS CRISIS, ONE PERSON AT A TIME, CHRISTINA BABITSKI, Develeap
SOLVING THE DEVOPS CRISIS, ONE PERSON AT A TIME, CHRISTINA BABITSKI, DeveleapSOLVING THE DEVOPS CRISIS, ONE PERSON AT A TIME, CHRISTINA BABITSKI, Develeap
SOLVING THE DEVOPS CRISIS, ONE PERSON AT A TIME, CHRISTINA BABITSKI, Develeap
 
OPTIMIZING PERFORMANCE USING CONTINUOUS PRODUCTION PROFILING ,YONATAN GOLDSCH...
OPTIMIZING PERFORMANCE USING CONTINUOUS PRODUCTION PROFILING ,YONATAN GOLDSCH...OPTIMIZING PERFORMANCE USING CONTINUOUS PRODUCTION PROFILING ,YONATAN GOLDSCH...
OPTIMIZING PERFORMANCE USING CONTINUOUS PRODUCTION PROFILING ,YONATAN GOLDSCH...
 
HOW TO SCALE YOUR ONCALL OPERATION, AND SURVIVE TO TELL, ANTON DRUKH
HOW TO SCALE YOUR ONCALL OPERATION, AND SURVIVE TO TELL, ANTON DRUKHHOW TO SCALE YOUR ONCALL OPERATION, AND SURVIVE TO TELL, ANTON DRUKH
HOW TO SCALE YOUR ONCALL OPERATION, AND SURVIVE TO TELL, ANTON DRUKH
 
HOW TO OPTIMIZE NON-CODING TIME, ORI KEREN, LinearB
HOW TO OPTIMIZE NON-CODING TIME, ORI KEREN, LinearBHOW TO OPTIMIZE NON-CODING TIME, ORI KEREN, LinearB
HOW TO OPTIMIZE NON-CODING TIME, ORI KEREN, LinearB
 
FLYING BLIND - ACCESSIBILITY IN MONITORING, FEU MOUREK, Icinga
FLYING BLIND - ACCESSIBILITY IN MONITORING, FEU MOUREK, IcingaFLYING BLIND - ACCESSIBILITY IN MONITORING, FEU MOUREK, Icinga
FLYING BLIND - ACCESSIBILITY IN MONITORING, FEU MOUREK, Icinga
 
(Ignite) WHAT'S BURNING THROUGH YOUR CLOUD BILL - GIL BAHAT, CIDER SECURITY
(Ignite) WHAT'S BURNING THROUGH YOUR CLOUD BILL - GIL BAHAT, CIDER SECURITY(Ignite) WHAT'S BURNING THROUGH YOUR CLOUD BILL - GIL BAHAT, CIDER SECURITY
(Ignite) WHAT'S BURNING THROUGH YOUR CLOUD BILL - GIL BAHAT, CIDER SECURITY
 
SLO DRIVEN DEVELOPMENT, ALON NATIV, Tomorrow.io
SLO DRIVEN DEVELOPMENT, ALON NATIV, Tomorrow.ioSLO DRIVEN DEVELOPMENT, ALON NATIV, Tomorrow.io
SLO DRIVEN DEVELOPMENT, ALON NATIV, Tomorrow.io
 
ONBOARDING IN LOCKDOWN, HILA FOX, Augury
ONBOARDING IN LOCKDOWN, HILA FOX, AuguryONBOARDING IN LOCKDOWN, HILA FOX, Augury
ONBOARDING IN LOCKDOWN, HILA FOX, Augury
 
DON'T PANIC: GETTING YOUR INFRASTRUCTURE DRIFT UNDER CONTROL, ERAN BIBI, Firefly
DON'T PANIC: GETTING YOUR INFRASTRUCTURE DRIFT UNDER CONTROL, ERAN BIBI, FireflyDON'T PANIC: GETTING YOUR INFRASTRUCTURE DRIFT UNDER CONTROL, ERAN BIBI, Firefly
DON'T PANIC: GETTING YOUR INFRASTRUCTURE DRIFT UNDER CONTROL, ERAN BIBI, Firefly
 
KEYNOTE | WHAT'S COMING IN THE NEXT 10 YEARS OF DEVOPS? // ELLEN CHISA, bolds...
KEYNOTE | WHAT'S COMING IN THE NEXT 10 YEARS OF DEVOPS? // ELLEN CHISA, bolds...KEYNOTE | WHAT'S COMING IN THE NEXT 10 YEARS OF DEVOPS? // ELLEN CHISA, bolds...
KEYNOTE | WHAT'S COMING IN THE NEXT 10 YEARS OF DEVOPS? // ELLEN CHISA, bolds...
 
(Ignite) OPEN SOURCE - OPEN CHOICE: HOW TO CHOOSE AN OPEN-SOURCE PROJECT, HIL...
(Ignite) OPEN SOURCE - OPEN CHOICE: HOW TO CHOOSE AN OPEN-SOURCE PROJECT, HIL...(Ignite) OPEN SOURCE - OPEN CHOICE: HOW TO CHOOSE AN OPEN-SOURCE PROJECT, HIL...
(Ignite) OPEN SOURCE - OPEN CHOICE: HOW TO CHOOSE AN OPEN-SOURCE PROJECT, HIL...
 
(Ignite) HISTORY IS A WHEEL. TECH IS A SPIRAL, ERAN ZIMBLER, Alibaba Cloud
(Ignite) HISTORY IS A WHEEL. TECH IS A SPIRAL, ERAN ZIMBLER, Alibaba Cloud(Ignite) HISTORY IS A WHEEL. TECH IS A SPIRAL, ERAN ZIMBLER, Alibaba Cloud
(Ignite) HISTORY IS A WHEEL. TECH IS A SPIRAL, ERAN ZIMBLER, Alibaba Cloud
 
LGBTech at DevOpsDays Tel Aviv
LGBTech at DevOpsDays Tel AvivLGBTech at DevOpsDays Tel Aviv
LGBTech at DevOpsDays Tel Aviv
 
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.ioTHE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
 
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHubSOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
SOLVING MLOPS FROM FIRST PRINCIPLES, DEAN PLEBAN, DagsHub
 
WHEN THE FIREFIGHTERS COME KNOCKING, ARIEL PISETZKY & BARAK LUZON, TABOOLA
WHEN THE FIREFIGHTERS COME KNOCKING, ARIEL PISETZKY & BARAK LUZON, TABOOLAWHEN THE FIREFIGHTERS COME KNOCKING, ARIEL PISETZKY & BARAK LUZON, TABOOLA
WHEN THE FIREFIGHTERS COME KNOCKING, ARIEL PISETZKY & BARAK LUZON, TABOOLA
 
ONE-SIZE DOESN'T FIT ALL - EFFECTIVELY (RE)EVALUATE A DATA SOLUTION FOR YOUR ...
ONE-SIZE DOESN'T FIT ALL - EFFECTIVELY (RE)EVALUATE A DATA SOLUTION FOR YOUR ...ONE-SIZE DOESN'T FIT ALL - EFFECTIVELY (RE)EVALUATE A DATA SOLUTION FOR YOUR ...
ONE-SIZE DOESN'T FIT ALL - EFFECTIVELY (RE)EVALUATE A DATA SOLUTION FOR YOUR ...
 
HOW TO REPLACE A FLAT TIRE ON YOUR CAR WHILE IT'S ON FIRE, AVI KONSTANTINI, Wix
HOW TO REPLACE A FLAT TIRE ON YOUR CAR WHILE IT'S ON FIRE, AVI KONSTANTINI, WixHOW TO REPLACE A FLAT TIRE ON YOUR CAR WHILE IT'S ON FIRE, AVI KONSTANTINI, Wix
HOW TO REPLACE A FLAT TIRE ON YOUR CAR WHILE IT'S ON FIRE, AVI KONSTANTINI, Wix
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Recently uploaded (20)

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Editor's Notes

  1. Hi, my name's Dan and I'm an engineer on the Community team at Datadog. The aim of our brief time here together is to better understand the word "observability". It's a big word—13 letters, actually—and it gets thrown around a lot in our industry. But before we get into what it is (which is actually fairly complicated), let's start with what it isn't.
  2. Let's start with monitoring. To be clear, monitoring is important, and you should have lots of it—and it's usually part of a broader observability strategy—but it's not in and of itself observability. Well, not exactly at any rate (as we'll discover).
  3. Everybody loves dashboards right? And we look at (or observe) dashboards, so, that's observability, right? No. Again, dashboards can be useful, and can be part of that aforementioned strategy, but they're not what we're talking about today.
  4. Also—and this one is important—devops is not it either. That said, the CAMS can and does fit very nicely with observability principles and practices—and that's something that I would encourage us all to talk about in the open spaces over the next two days. So now that we've established what observability isn't, let's talk about what it is. And we're going to start with… **next slide**
  5. …a brief history lesson. This is a centrifugal governor. These were invented in the 17th century by Christian `Ha-hruns`, who is perhaps best known as an astronomer, but like a lot of naturalists in the 1600's, he was basically an expert at everything. The governor was—and is—used to regulate the speed of an engine—by which I mean an engine in the classical mechanical sense, as in something that converts energy into mechanical force—in this case to regulate distance and pressure between millstones.
  6. Along comes this man: James Clark Maxwell. The modern world exists in large part due to this man. When you talk about the great physicists of all time, you've got Newton, Einstein, and Maxwell. I would need an hour just to list the things that Maxwell accomplished. What's pertinent to us today is an 1868 he wrote a paper entitled "On Governors" which was an analysis of how centrifugal governors function. As a mere side-effect of this paper, a whole new field of research emerged, and that is **next slide**
  7. If you look it up in Wikipedia this is what it will tell you: Control theory deals with the control of dynamical systems in engineered processes and machines. The objective is to develop a model or algorithm governing the application of system inputs to drive the system to a desired state … and ensuring a level of control stability. Ok, I'm going to pause here for a moment. Why the history lesson? Because it's important to understand that control theory has been around for a long time. It's an extremely mature field of study, and on that is extremely focused on mechanical and industrial concepts. But it's also where we get the word observability. But before we get to that gold nugget, we need to talk about **next slide**
  8. … this. In order to accomplish the aforementioned objectives of control theory, something called a controller is itself theorised. The controller examines a value as measured from a system at a given point in time, and compares it to a reference value. The difference between these values is called the "error signal" (sound familiar?). This signal can then be used to apply some sort of corrective behaviour, the aim of which is to bring the delta towards zero. How does it do this? Via a property of a system called observability.
  9. This is the "classic" definition, taken directly from control theory. But there are lots of words in this definition that, themselves, are worth diving into, because every single word here means something specific.
  10. Let's start with "measure". It means to "ascertain the size, amount, or degree of something", but also to "assess the importance, effect, or value of something". In other words, we can measure both quantity and quality, and this dual-nature is important to understand.
  11. In English, "state" has a few definitions, but the one we want is "the particular condition that something is in at a specific time." The key words there are condition and time, and that last one is critical; as they say in life, timing is everything.
  12. So, system. This is "a set of things working together as parts of a mechanism or interconnecting network", but it's also "a set of principles or procedures by which something is done", like a scheme or method. So it's either literally a complex thing, or figuratively, a way of describing a complex thing.
  13. This brings us to knowledge, which, uh, is a big one (and honestly we'd need more than a few minutes to really get going), but for our purposes—here, today—let's go with this: "it is the sum of what is known".
  14. Historically, output is related to production—like how much fabric a power loom can produce in 1850. But today we can also think of output as information—in particular, the information that is produced by a system (hopefully on purpose).
  15. That basic definition of observability, then, belies a frankly incredible level of complexity. As I said, it's taken from control theory—which is important, because the definition taken in a vacuum (as it often is) is missing a key bit of context.
  16. …and that's the State Observer. In control theory, this is the mechanism that actually takes the measurements of the inputs and outputs—that actually gathers the quantities and qualities and times and conditions—and provides data that can become knowledge.
  17. Observers in control theory are all maths and algorithms and processes, and it gets deep fast. So, let's step back from the brink of madness and re-center ourselves. For us, as computer people, how can we understand observability in practical terms?
  18. The Three Pillars is an outdated definition but it's worth mentioning because it still pops up all over the place in docs and blogs posts, and was the first widely distributed and accepted interpretation of o11y as something that goes beyond monitoring. But today, as an industry, we've really moved past this basic definition.
  19. It's about a wide variety of perspectives, about having many different ways of introspecting and interpreting and exploring and explaining complex systems. You need to look at both sides of the coin, except in this case, the coin exists in 72 dimensions and it's impossible for a human to actually perceive it all at once.
  20. O11y is all about our capacity to ask questions in order to better understand and comprehend complex systems. There's two important things that are implicitly stated here: you need data, and you need the means to interact with and make sense of that data. To be clear, those are two separate things (and you need both).
  21. Once you have the data, and you have the ability to interact with that data in arbitrary ways, you can start to ask questions about things that you didn't know about ahead of time. And that's the difference between o11y and monitoring. Monitoring is about what you've already discovered; o11y is about what you have yet to explore.
  22. So for us as technologists, we can—and should—have a more expansive understanding of the word observability. For us, it's more than than a property—more than just state estimation via inferrence. It's about perspectives, and our capacity to understand and reason about our systems from different angles. It's also about data, and our capacity to both collect and interpret it. And finally it's about asking questions and gaining new insight—information that we didn't have before.
  23. This is where we start to diverge from that basic, oft-repeated definition. There's more to it than self-governing industrial machinery. The capacity to be observed is necessary but insufficient; we must consider who or what is ultimately using this data, and their half of the relationship, as well.
  24. In other words, the consumer of the information shares a relationship with what is being observed. And that relationship is important to recognise. Classical observability is today merely the starting point. We, as an industry, as technologists, as programmers and product managers and SREs, are poised for the next great evolution in how we understand and interact with complex systems.
  25. I submit this as a talking point, for us here today, and for us as an industry, to consider, debate, and—hopefully—use as a vehicle to improve ourselves and the work that we do: that observability as a property of a system is important, but so is the consumer's capacity to make use of that property. Thank you.