Sentry: Baselining, cloud-scale monitoring and auto-remediation with app mon and how dynatrace changes the game

•

1 gostou•233 visualizações

Dynatrace

Sentry

Tecnologia

#Perform2018
Enterprise monitoring
Baselining, Cloud-Scale Monitoring
and Auto-Remediation with
AppMon and how Dynatrace
changes the game
Brian Perrault
Platform Engineer
Thorsten Roth
Director, Product Management
#Perform2018

Who am I?
▪ Platform Engineer at Sentry Insurance
▪ Utilizing AppMon for 6 years
▪ Written or Updated many AppMon
Plugins for the community
▪ RabbitMQ, DB Query, Linux Filesystem, etc.
▪ Monitoring 82 Applications across
~1500 servers in all environments
#Perform2018
Brian Perrault

Moving to a Cloud environment
▪ Why move to the Cloud?
▪ Scalability
▪ Versatility
▪ Microarchitecture
▪ Challenges for monitoring?
▪ Servers no longer constant
▪ Not all applications have the
same performance expectations
▪ Automation pipeline integration
#Perform2018

Amazon Web Services (AWS)
▪ AWS is the chosen cloud provider
▪ Robust API
▪ Many different locations across the US
▪ Easy Server Image Management
▪ Elastic Load Balancer (ELB)
▪ Automatically add new instances to
load balancer
#Perform2018

Implementing baselining
▪ Automatic for all new added
applications
▪ Great for cloud where new instances
are spun up and down
▪ Grants flow rate calculation
▪ Not a hard limit
▪ Detect abnormal activity levels
▪ Granular detection for errors on the
page level
▪ Load Balancer only knows if the page is
serving HTML, not if it is operating properly
#Perform2018

#Perform2018
Example Baseline
#Perform2018

Implementing automation
▪ Integration with corporate automation solution
▪ Ability to pull details from AppMon Incident
▪ Ability to escalate issue if automated
remediation does not work
#Perform2018

#Perform2018
AppMon
incident action
config
#Perform2018

Server health automation
▪ Treat servers as cattle not pets
▪ Auto detection of server issues
▪ Call to automation tool and spin up
a new server then kill the old server
#Perform2018

#Perform2018
Example HPOO
flow – Resource
exhaustion
#Perform2018

Error automation
▪ Load Balancer cannot detect all issues an
application may have
▪ Automatic detection of errors
▪ Call to automation tool can remove a server
from the Load Balancer
▪ Try restarting the server
▪ If errors persist the server can be killed and replaced
#Perform2018

Scalability automation
▪ Staying ahead of the load
▪ Baseline monitoring detects an abnormal load
▪ Call out to automation tool which adds new
servers to config
▪ When incident ends a new flow to remove the
servers is run
#Perform2018

#Perform2018
Improve MTTR: Automate Mitigate with AI Data
#Perform2018
Auto Mitigate!
1 CPU Exhausted? Add a new service instance!
Escalate at 2AM?
2 High Garbage Collection? Adjust/Revert Memory Settings!
3 Hung threads? Restart Service!
5 Still ongoing? Initiate Rollback!
Escalate
? Still ongoing?
5
1
2
3
Mark Bad Commits
Update Dev Tickets
…
…
Impact Mitigated??
?

#Perform2018
Key takeaways
▪ Threshold based monitoring does not work in the cloud
▪ Alerts should be applied to all applications
▪ Automation, Automation, Automation
#Perform2018

Mais conteúdo relacionado

Mais procurados

Cloud-Native Workshop New York- DynatraceVMware Tanzu

Paypal, Barbri: Lost in the cloud? Top challenges facing CIOs in a cloud nati...Dynatrace

Zurich: Monitoring a sales force-based insurance application using dynatrace ...Dynatrace

An API-focused approach to Agile IntegrationJudy Breedlove

How to Make the API Economy a RealityWSO2

Red Hat: Three Pillars of IntegrationJudy Breedlove

Barbri barbri's journey from on-prem to cloud, featuring auto-remediation wi...Laura Stack

Dynatrace: The untouchables - the Dynatrace offering here and nowDynatrace

The 3 pillars of agile integration: Container, Connector and APIJudy Breedlove

Why Developers Care About Cloud PlatformsSonian

Pivotal Digital Transformation Forum: Journey to Become a Data-Driven EnterpriseVMware Tanzu

Putting data to workJudy Breedlove

An API-focused approach to Agile IntegrationJudy Breedlove

Transform the internal it landscape with APIsJudy Breedlove

AppSphere 15 - Monitoring Cloud & Asynchronous ApplicationsAppDynamics

Real time Analytics in IoT - Marcel Lattmann Codit Switzerland @.NET Day 2019Codit

Monetization: Unlock More Value from Your APIs Apigee | Google Cloud

DOES16 San Francisco - DevOps Workshop: Modern Technical PracticesGene Kim

Navigating Cloud Adoption: Trends that Challenge and Inspire DesignersJudy Breedlove

APIdays Paris 2019 - Michelin APIfication, Yes. But done right! by Antonin ...apidays

Mais procurados (20)

Cloud-Native Workshop New York- Dynatrace

Paypal, Barbri: Lost in the cloud? Top challenges facing CIOs in a cloud nati...

Zurich: Monitoring a sales force-based insurance application using dynatrace ...

An API-focused approach to Agile Integration

How to Make the API Economy a Reality

Red Hat: Three Pillars of Integration

Barbri barbri's journey from on-prem to cloud, featuring auto-remediation wi...

Dynatrace: The untouchables - the Dynatrace offering here and now

The 3 pillars of agile integration: Container, Connector and API

Why Developers Care About Cloud Platforms

Pivotal Digital Transformation Forum: Journey to Become a Data-Driven Enterprise

Putting data to work

An API-focused approach to Agile Integration

Transform the internal it landscape with APIs

AppSphere 15 - Monitoring Cloud & Asynchronous Applications

Real time Analytics in IoT - Marcel Lattmann Codit Switzerland @.NET Day 2019

Monetization: Unlock More Value from Your APIs

DOES16 San Francisco - DevOps Workshop: Modern Technical Practices

Navigating Cloud Adoption: Trends that Challenge and Inspire Designers

APIdays Paris 2019 - Michelin APIfication, Yes. But done right! by Antonin ...

Semelhante a Sentry: Baselining, cloud-scale monitoring and auto-remediation with app mon and how dynatrace changes the game

Office 365 Monitoring Best PracticesThousandEyes

Big Data LDN 2018: STREAM PROCESSING TAKES ON EVERYTHINGMatt Stubbs

Breaking Up the Monolith While Migrating to AWS (GPSTEC320) - AWS re:Invent 2018Amazon Web Services

AppDynamics User GroupMike Ruangutai

Stream Processing with Apache ApexPramod Immaneni

Building a Monitoring Plan.pdfAmazon Web Services

MuleSoft RPA Automation as APIs.pdfsumitahuja94

MuleSoft Composer | Patna MuleSoft Meetup #14shyamraj55

Fearless From Monolith to Serverless with DynatraceAmazon Web Services

Coordinating Microservices with AWS Step Functions.pdfAmazon Web Services

Slack in the Age of PrometheusGeorge Luong

MIE Trak Pro Introduction JonathanBurgmayer

Upgrade and Unleash the Power of CA Workload Automation AutoSys (AE) and CA W...CA Technologies

AWS 기반 Microservice 운영을 위한 데브옵스 사례와 Spinnaker 소개::김영욱::AWS Summit Seoul 2018Amazon Web Services Korea

Migrating database to cloudAmazon Web Services

REI: Evolving performance engineering for the move to cloud, microservices, c...Dynatrace

Company presentation english 1 2015Locanisag

SAP Change Control Management & TR import automation toolJustAcademy

Principal: How Principal takes monitoring into the future to face new technol...Dynatrace

Next generation business automation with the red hat decision manager and red...Masahiko Umeno

Semelhante a Sentry: Baselining, cloud-scale monitoring and auto-remediation with app mon and how dynatrace changes the game (20)

Office 365 Monitoring Best Practices

Big Data LDN 2018: STREAM PROCESSING TAKES ON EVERYTHING

Breaking Up the Monolith While Migrating to AWS (GPSTEC320) - AWS re:Invent 2018

AppDynamics User Group

Stream Processing with Apache Apex

Building a Monitoring Plan.pdf

MuleSoft RPA Automation as APIs.pdf

MuleSoft Composer | Patna MuleSoft Meetup #14

Fearless From Monolith to Serverless with Dynatrace

Coordinating Microservices with AWS Step Functions.pdf

Slack in the Age of Prometheus

MIE Trak Pro Introduction

Upgrade and Unleash the Power of CA Workload Automation AutoSys (AE) and CA W...

AWS 기반 Microservice 운영을 위한 데브옵스 사례와 Spinnaker 소개::김영욱::AWS Summit Seoul 2018

Migrating database to cloud

REI: Evolving performance engineering for the move to cloud, microservices, c...

Company presentation english 1 2015

SAP Change Control Management & TR import automation tool

Principal: How Principal takes monitoring into the future to face new technol...

Next generation business automation with the red hat decision manager and red...

Mais de Dynatrace

Virgin Money: Virgin Money's quest for digital performance perfectionDynatrace

SITA: How smart apps are making air travel easier, every step of the wayDynatrace

Pivotal: Join us for a fireside chat with CEO of PivotalDynatrace

Harrods: Re-inventing the luxury retail marketDynatrace

Altimeter Group: The new face of changeDynatrace

Alastair Humphreys: Life stories and inspiration from Alastair HumphreysDynatrace

AWS: Serverless Architecture - Beyond functions and into the future Dynatrace

SAP: How SAP fully automates the provisioning and operations of its dynatrace...Dynatrace

Pay pal paypal continuous performance as a self-service with fully-automated...Dynatrace

Optum: Optum user focused insight from mobile to mainframeDynatrace

Neiman Marcus: Neiman Marcus's journey into the cloud with dynatrace with an ...Dynatrace

Intuit shifting left by continuous performance testing in intuitDynatrace

First Tech: From bricks and mortar to cloud first api driven bankingDynatrace

Experian: Dynatrace real time feedback changed the development culture at exp...Dynatrace

Dynatrace: DevOps, shift-left & self-healing a performance clinic with andiDynatrace

Citrix: The transformation from waterfall to agile operations at citrixDynatrace

Beachbody: Beachbody's smarter operations with zero-dashboard monitoring lets...Dynatrace

Mais de Dynatrace (17)

Virgin Money: Virgin Money's quest for digital performance perfection

SITA: How smart apps are making air travel easier, every step of the way

Pivotal: Join us for a fireside chat with CEO of Pivotal

Harrods: Re-inventing the luxury retail market

Altimeter Group: The new face of change

Alastair Humphreys: Life stories and inspiration from Alastair Humphreys

AWS: Serverless Architecture - Beyond functions and into the future

SAP: How SAP fully automates the provisioning and operations of its dynatrace...

Pay pal paypal continuous performance as a self-service with fully-automated...

Optum: Optum user focused insight from mobile to mainframe

Neiman Marcus: Neiman Marcus's journey into the cloud with dynatrace with an ...

Intuit shifting left by continuous performance testing in intuit

First Tech: From bricks and mortar to cloud first api driven banking

Experian: Dynatrace real time feedback changed the development culture at exp...

Dynatrace: DevOps, shift-left & self-healing a performance clinic with andi

Citrix: The transformation from waterfall to agile operations at citrix

Beachbody: Beachbody's smarter operations with zero-dashboard monitoring lets...

Último

Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

🐬 The future of MySQL is Postgres 🐘RTylerCroy

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited

IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge

My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

CNv6 Instructor Chapter 6 Quality of Servicegiselly40

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Sentry: Baselining, cloud-scale monitoring and auto-remediation with app mon and how dynatrace changes the game

1. #Perform2018 Enterprise monitoring Baselining, Cloud-Scale Monitoring and Auto-Remediation with AppMon and how Dynatrace changes the game Brian Perrault Platform Engineer Thorsten Roth Director, Product Management #Perform2018

2. Who am I? ▪ Platform Engineer at Sentry Insurance ▪ Utilizing AppMon for 6 years ▪ Written or Updated many AppMon Plugins for the community ▪ RabbitMQ, DB Query, Linux Filesystem, etc. ▪ Monitoring 82 Applications across ~1500 servers in all environments #Perform2018 Brian Perrault

3. Moving to a Cloud environment ▪ Why move to the Cloud? ▪ Scalability ▪ Versatility ▪ Microarchitecture ▪ Challenges for monitoring? ▪ Servers no longer constant ▪ Not all applications have the same performance expectations ▪ Automation pipeline integration #Perform2018

4. Amazon Web Services (AWS) ▪ AWS is the chosen cloud provider ▪ Robust API ▪ Many different locations across the US ▪ Easy Server Image Management ▪ Elastic Load Balancer (ELB) ▪ Automatically add new instances to load balancer #Perform2018

5. #Perform2018 Showing all ELB instances

6. #Perform2018

7. Implementing baselining ▪ Automatic for all new added applications ▪ Great for cloud where new instances are spun up and down ▪ Grants flow rate calculation ▪ Not a hard limit ▪ Detect abnormal activity levels ▪ Granular detection for errors on the page level ▪ Load Balancer only knows if the page is serving HTML, not if it is operating properly #Perform2018

8. #Perform2018 Example Baseline #Perform2018

10.

11. Implementing automation ▪ Integration with corporate automation solution ▪ Ability to pull details from AppMon Incident ▪ Ability to escalate issue if automated remediation does not work #Perform2018

12. #Perform2018 Remediation flow diagram

13. #Perform2018 Remediation flow diagram

14. #Perform2018 Remediation flow diagram

15. #Perform2018 Remediation flow diagram

16. #Perform2018 Remediation flow diagram

17. #Perform2018 Remediation flow diagram

18. #Perform2018 Remediation flow diagram

19. #Perform2018 AppMon incident action config #Perform2018

20.

21.

22. Server health automation ▪ Treat servers as cattle not pets ▪ Auto detection of server issues ▪ Call to automation tool and spin up a new server then kill the old server #Perform2018

23. #Perform2018 Example HPOO flow – Resource exhaustion #Perform2018

24. Error automation ▪ Load Balancer cannot detect all issues an application may have ▪ Automatic detection of errors ▪ Call to automation tool can remove a server from the Load Balancer ▪ Try restarting the server ▪ If errors persist the server can be killed and replaced #Perform2018

25. Scalability automation ▪ Staying ahead of the load ▪ Baseline monitoring detects an abnormal load ▪ Call out to automation tool which adds new servers to config ▪ When incident ends a new flow to remove the servers is run #Perform2018

26.

27. #Perform2018

28. #Perform2018

29. #Perform2018

30. #Perform2018 Improve MTTR: Automate Mitigate with AI Data #Perform2018 Auto Mitigate! 1 CPU Exhausted? Add a new service instance! Escalate at 2AM? 2 High Garbage Collection? Adjust/Revert Memory Settings! 3 Hung threads? Restart Service! 5 Still ongoing? Initiate Rollback! Escalate ? Still ongoing? 5 1 2 3 Mark Bad Commits Update Dev Tickets … … Impact Mitigated?? ?

31. #Perform2018 Key takeaways ▪ Threshold based monitoring does not work in the cloud ▪ Alerts should be applied to all applications ▪ Automation, Automation, Automation #Perform2018

32. OPEN Q&A

33. Thank you

Sentry: Baselining, cloud-scale monitoring and auto-remediation with app mon and how dynatrace changes the game

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Semelhante a Sentry: Baselining, cloud-scale monitoring and auto-remediation with app mon and how dynatrace changes the game

Semelhante a Sentry: Baselining, cloud-scale monitoring and auto-remediation with app mon and how dynatrace changes the game (20)

Mais de Dynatrace

Mais de Dynatrace (17)

Último

Último (20)

Sentry: Baselining, cloud-scale monitoring and auto-remediation with app mon and how dynatrace changes the game