Applying AI to Performance Engineering: Shift-Left, Shift-Right, Self-Healing

•Transferir como PPTX, PDF•

6 gostaram•1,432 visualizações

The document discusses how artificial intelligence can be applied to performance engineering to make it self-healing and self-service. It describes how monitoring needs have evolved from just looking at dashboards and logs to dealing with dynamic cloud environments. It outlines how AI can be used for full-stack monitoring with one agent, automated end-to-end tracing, automated log analytics and change detection. It then discusses how AI can enable shifting work left to break the pipeline earlier, improve mean time to resolution with auto-mitigation, and shift work right with tags, deployments and events to create actionable feedback loops across development, operations and business teams.

Software

Confidential, Dynatrace, LLC
Applying Artificial Intelligence to Performance
Engineering – Self-Healing, Self-Service
Andreas Grabner - December 2017
@grabnerandi

Monitoring used to
be about looking at
dashboards …

.. and about
analyzing logs &
exceptions …

But the apps and
services we build
have transformed to
something more
dynamic…

Develop
Ship
Deploy
Run
Scale
Compute
nodejs mongo db netty cassandra redis
ansible jenkins puppet chef
docker cloudfoundry rh openshift rh atomic rocket
core os rancher kvm busybox
mesos marathon kubernetes swarm
Amazon azure openstack mesosphere calico weave
eureka/hystrix
A whole new technology stack & polyglot development
Amazon
DynamoDB AWS Lambda
AWS
CodeDeploy
Amazon EC2
Container Services
Amazon EC2
AWS Elastic
Beanstalk
Amazon
API Gateway

Cloud
OS, Disks
Containers, Processes,
Logs
Application- & Webserver
Mobile*
Services
Network
Browser
3rd parties
FullStackDiscovery,Modeling&Analytics

confidential
One Agent to monitor them all

confidential
DynatraceFullStackMonitoring

confidential
Automated End-to-End Tracing

All Timeseries Data you can wish for 
Network Container
Servers Hosts
Cloud

confidential
Everything automatically baselined!

confidential
Automated Log Analytics and Change Detection

Your Apps/ServicesYour Users Dynatrace OneAgent AI Supported Performance Engineering

Dev Perf/Test Ops Biz
Shift-Left: Break Pipeline Earlier
Improve MTTR: Auto-Mitigation
Shift-Right: Tags, Deploys, Events
Actionable Feedback Loops

$Shift-Right: Tags, Deployments & Events docker run –e DT_TAGS=BLUE dtcli tag srv CartServicev2 GREEN dtcli evt push host .*demo version=123 source={git_commit} dtcli evt push pg tomcat1 desc=JVMMemIncr hint=+100MB Dynatrace Smartscape Release Automation Dynatrace Automation API, CLI, Auto-Detection$

Improve MTTR: Automate Mitigate with AI Data
Auto Mitigate!
1 CPU Exhausted? Add a new service instance!
3 Issue with BLUE only? Switch back to GREEN!
?Escalate at 2AM?
2 High Garbage Collection? Adjust/Revert Memory Settings!
4 Hung threads? Restart Service!
5 Still ongoing? Initiate Rollback!
Escalate
? Still ongoing?5
1
2
3
4
Mark Bad Commits
Update Dev Tickets
…
…
Impact Mitigated??
?

Shift-Left: Break Pipeline Earlier
c0123bd
nov17
myservice:nov17 myservice:nov17
space:UAT
space:PERF
myservice:nov17
Selenium Perf Data
space:PERF
myservice:nov17myservice:nov16 space:PROD
myservice:BLUE
myservice:GREEN
myservice:nov17
space:PROD
myservice:BLUE
myservice:GREEN
space:PROD
myservice:nov16

Shift-Left: Performance as Self-Service
myservice:tmp57 myservice:tmp57
space:PERF
c0123bd

Actionable Feedback Loops: Business
Success Criteria
Labels become Key User Action
Live Data Queries
New Requirement Definition

Filter by tags, versions, …
Access to all key metrics
Access to every service
Actionable Feedback Loops: Architects

C:dynatrace-cli> py dtcli.py dqlr srv tags/?key=.*prod.* service.requestspermin[count%180:0],service.failurerate[avg%180:0]
Actionable Feedback Loops: SRE’s
Live data access
through REST API/CLI

Actionable Feedback Loops: Load Testing
Dynatrace Data

Actionable Feedback Loops: Load Testing
Extracted from HTTP Header Tag(s)
PurePath for Load
Test Requests

Actionable Feedback Loops: Operations
Filter by Infrastructure,
Service, Application

Confidential, Dynatrace, LLC
Applying Artificial Intelligence to Performance
Engineering
Andreas Grabner - November 2017
@grabnerandi

Mais conteúdo relacionado

Mais procurados

Shipping Code like a keptn: Continuous Delivery & Automated Operations on k8sAndreas Grabner

How to explain DevOps to your momAndreas Grabner

Continuous Delivery and Automated Operations on k8s with keptnAndreas Grabner

DevOps Pipelines and Metrics Driven Feedback LoopsAndreas Grabner

A Guide to Event-Driven SRE-inspired DevOpsAndreas Grabner

Jenkins Online Meetup - Automated SLI based Build Validation with KeptnAndreas Grabner

Metrics Driven DevOps - Automate Scalability and Performance Into your PipelineAndreas Grabner

AWS and Dynatrace: Moving your Cloud Strategy to the Next LevelDynatrace

Boston DevOps Days 2016: Implementing Metrics Driven DevOps - Why and HowAndreas Grabner

4 Node.js Gotchas: What your ops team needs to knowDynatrace

DevOps for AI AppsRichin Jain

Performance Metrics Driven CI/CD - Introduction to Continuous Innovation and ...Mike Villiger

Smarter Monitoring for Highly Distributed Cloud Foundry Application Environme...Dynatrace

Metrics-driven Continuous DeliveryAndrew Phillips

Modern Operations at Scale within Viasat – How to Structure Teams and Build A...Atlassian

OOP 2016 - Building Software That Eats The WorldAndreas Grabner

3 Tips to Deliver Fast Performance Across Mobile WebDynatrace

Full Stack Application Monitoring for AWS Powered by AIDynatrace

A DevOps State of Mind with Microservices, Containers and KubernetesAll Things Open

Top Java Performance Problems and Metrics To Check in Your PipelineAndreas Grabner

Mais procurados (20)

Shipping Code like a keptn: Continuous Delivery & Automated Operations on k8s

How to explain DevOps to your mom

Continuous Delivery and Automated Operations on k8s with keptn

DevOps Pipelines and Metrics Driven Feedback Loops

A Guide to Event-Driven SRE-inspired DevOps

Jenkins Online Meetup - Automated SLI based Build Validation with Keptn

Metrics Driven DevOps - Automate Scalability and Performance Into your Pipeline

AWS and Dynatrace: Moving your Cloud Strategy to the Next Level

Boston DevOps Days 2016: Implementing Metrics Driven DevOps - Why and How

4 Node.js Gotchas: What your ops team needs to know

DevOps for AI Apps

Performance Metrics Driven CI/CD - Introduction to Continuous Innovation and ...

Smarter Monitoring for Highly Distributed Cloud Foundry Application Environme...

Metrics-driven Continuous Delivery

Modern Operations at Scale within Viasat – How to Structure Teams and Build A...

OOP 2016 - Building Software That Eats The World

3 Tips to Deliver Fast Performance Across Mobile Web

Full Stack Application Monitoring for AWS Powered by AI

A DevOps State of Mind with Microservices, Containers and Kubernetes

Top Java Performance Problems and Metrics To Check in Your Pipeline

Semelhante a Applying AI to Performance Engineering: Shift-Left, Shift-Right, Self-Healing

YOW2018 Cloud Performance Root Cause Analysis at NetflixBrendan Gregg

Fallacies in Platform Engineering.pdfLibbySchulze

Serverless Architectures on AWS in practice - OSCON 2018Manish Pandit

NVIDIA DGX-1 超級電腦與人工智慧及深度學習NVIDIA Taiwan

Выявление и локализация проблем в сети с помощью инструментов RiverbedElena Marianenko

Launching Your First Big Data Project on AWSAmazon Web Services

Real time serverless data pipelines on AWSThe Incredible Automation Day

Lightbend Fast Data PlatformLightbend

All the Ops: DataOps with GitOps for Streaming data on Kafka and KubernetesDevOps.com

Lightbend Fast Data PlatformLightbend

Become a Performance Diagnostics HeroTechWell

Top conf serverlezzAntons Kranga

Cloud Native Applications on OpenShiftSerhat Dirik

Cloud-native Java EE-volutionQAware GmbH

cncf overview and building edge computing using kubernetesKrishna-Kumar

Big Data in the CloudAmazon Web Services

Innovation with ai at scale on the edge vt sept 2019 v0Ganesan Narayanasamy

Dynatrace: Going beyond APM and soaring to the futureDynatrace

Dynatrace: Davis - Hololens - AI update - Cloud announcements - Self driving ITDynatrace

Fluentd meetup #3Treasure Data, Inc.

Semelhante a Applying AI to Performance Engineering: Shift-Left, Shift-Right, Self-Healing (20)

YOW2018 Cloud Performance Root Cause Analysis at Netflix

Fallacies in Platform Engineering.pdf

Serverless Architectures on AWS in practice - OSCON 2018

NVIDIA DGX-1 超級電腦與人工智慧及深度學習

Выявление и локализация проблем в сети с помощью инструментов Riverbed

Launching Your First Big Data Project on AWS

Real time serverless data pipelines on AWS

Lightbend Fast Data Platform

All the Ops: DataOps with GitOps for Streaming data on Kafka and Kubernetes

Lightbend Fast Data Platform

Become a Performance Diagnostics Hero

Top conf serverlezz

Cloud Native Applications on OpenShift

Cloud-native Java EE-volution

cncf overview and building edge computing using kubernetes

Big Data in the Cloud

Innovation with ai at scale on the edge vt sept 2019 v0

Dynatrace: Going beyond APM and soaring to the future

Dynatrace: Davis - Hololens - AI update - Cloud announcements - Self driving IT

Fluentd meetup #3

Mais de Andreas Grabner

KCD Munich - Cloud Native Platform Dilemma - Turning it into an OpportunityAndreas Grabner

OpenTelemetry For GitOps: Tracing Deployments from Git Commit to ProductionAndreas Grabner

Don't Deploy Into the Dark: DORA Metrics for your K8s GitOps DeploymentsAndreas Grabner

Observability and Orchestration of your GitOps Deployments with KeptnAndreas Grabner

Adding Security to your SLO-based Release Validation with KeptnAndreas Grabner

Four Practices to Fix Your Top .NET Performance ProblemsAndreas Grabner

Docker/DevOps Meetup: Metrics-Driven Continuous Performance and ScalabiltyAndreas Grabner

JavaOne 2015: Top Performance Patterns Deep DiveAndreas Grabner

Application Quality Gates in Continuous Delivery: Deliver Better Software Fas...Andreas Grabner

Deploy Faster Without Failing Faster - Metrics-Driven - Dynatrace User Groups...Andreas Grabner

BTD2015 - Your Place In DevTOps is Finding Solutions - Not Just Bugs!Andreas Grabner

Mobile User Experience:Auto Drive through Performance MetricsAndreas Grabner

HSPS 2015 - SharePoint Performance Santiy ChecksAndreas Grabner

Mais de Andreas Grabner (13)

KCD Munich - Cloud Native Platform Dilemma - Turning it into an Opportunity

OpenTelemetry For GitOps: Tracing Deployments from Git Commit to Production

Don't Deploy Into the Dark: DORA Metrics for your K8s GitOps Deployments

Observability and Orchestration of your GitOps Deployments with Keptn

Adding Security to your SLO-based Release Validation with Keptn

Four Practices to Fix Your Top .NET Performance Problems

Docker/DevOps Meetup: Metrics-Driven Continuous Performance and Scalabilty

JavaOne 2015: Top Performance Patterns Deep Dive

Application Quality Gates in Continuous Delivery: Deliver Better Software Fas...

Deploy Faster Without Failing Faster - Metrics-Driven - Dynatrace User Groups...

BTD2015 - Your Place In DevTOps is Finding Solutions - Not Just Bugs!

Mobile User Experience:Auto Drive through Performance Metrics

HSPS 2015 - SharePoint Performance Santiy Checks

Último

Optimizing AI for immediate response in Smart CCTVshikhaohhpro

Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveCall Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

5 Signs You Need a Fashion PLM Software.pdfWave PLM

Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531

Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions

Microsoft AI Transformation Partner Playbook.pdfWilly Marroquin (WillyDevNET)

Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab

CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823

Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy

Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171

CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceanilsa9823

CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE9953056974 Low Rate Call Girls In Saket, Delhi NCR

Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812

How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes

A Secure and Reliable Document Management System is Essential.docxComplianceQuest1

SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI

HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai

Diamond Application Development Crafting Solutions with PrecisionSolGuruz

Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.

Right Money Management App For Your Financial GoalsJhone kinadey

Applying AI to Performance Engineering: Shift-Left, Shift-Right, Self-Healing

1. Confidential, Dynatrace, LLC Applying Artificial Intelligence to Performance Engineering – Self-Healing, Self-Service Andreas Grabner - December 2017 @grabnerandi

2. Monitoring used to be about looking at dashboards …

3. Process Memory (GB) CPU Graphs (%)

4. .. and about analyzing logs & exceptions …

5. confidential Top Exceptions Top Logs

6. But the apps and services we build have transformed to something more dynamic…

7. Develop Ship Deploy Run Scale Compute nodejs mongo db netty cassandra redis ansible jenkins puppet chef docker cloudfoundry rh openshift rh atomic rocket core os rancher kvm busybox mesos marathon kubernetes swarm Amazon azure openstack mesosphere calico weave eureka/hystrix A whole new technology stack & polyglot development Amazon DynamoDB AWS Lambda AWS CodeDeploy Amazon EC2 Container Services Amazon EC2 AWS Elastic Beanstalk Amazon API Gateway

8. Cloud OS, Disks Containers, Processes, Logs Application- & Webserver Mobile* Services Network Browser 3rd parties FullStackDiscovery,Modeling&Analytics

9. confidential One Agent to monitor them all

10. confidential DynatraceFullStackMonitoring

11. confidential Automated End-to-End Tracing

12. All Timeseries Data you can wish for  Network Container Servers Hosts Cloud

13. confidential Everything automatically baselined!

14. confidential Automated Log Analytics and Change Detection

15. Your Apps/ServicesYour Users Dynatrace OneAgent AI Supported Performance Engineering

16. Dev Perf/Test Ops Biz Shift-Left: Break Pipeline Earlier Improve MTTR: Auto-Mitigation Shift-Right: Tags, Deploys, Events Actionable Feedback Loops

17. Shift-Right: Tags, Deployments & Events docker run –e DT_TAGS=BLUE dtcli tag srv CartServicev2 GREEN dtcli evt push host .*demo version=123 source={git_commit} dtcli evt push pg tomcat1 desc=JVMMemIncr hint=+100MB Dynatrace Smartscape Release Automation Dynatrace Automation API, CLI, Auto-Detection

18. Improve MTTR: Automate Mitigate with AI Data Auto Mitigate! 1 CPU Exhausted? Add a new service instance! 3 Issue with BLUE only? Switch back to GREEN! ?Escalate at 2AM? 2 High Garbage Collection? Adjust/Revert Memory Settings! 4 Hung threads? Restart Service! 5 Still ongoing? Initiate Rollback! Escalate ? Still ongoing?5 1 2 3 4 Mark Bad Commits Update Dev Tickets … … Impact Mitigated?? ?

19. Shift-Left: Break Pipeline Earlier c0123bd nov17 myservice:nov17 myservice:nov17 space:UAT space:PERF myservice:nov17 Selenium Perf Data space:PERF myservice:nov17myservice:nov16 space:PROD myservice:BLUE myservice:GREEN myservice:nov17 space:PROD myservice:BLUE myservice:GREEN space:PROD myservice:nov16

20. Shift-Left: Performance as Self-Service myservice:tmp57 myservice:tmp57 space:PERF c0123bd

21. Actionable Feedback Loops: Business Success Criteria Labels become Key User Action Live Data Queries New Requirement Definition

22. Filter by tags, versions, … Access to all key metrics Access to every service Actionable Feedback Loops: Architects

23. C:dynatrace-cli> py dtcli.py dqlr srv tags/?key=.*prod.* service.requestspermin[count%180:0],service.failurerate[avg%180:0] Actionable Feedback Loops: SRE’s Live data access through REST API/CLI

24. Actionable Feedback Loops: Load Testing Dynatrace Data

25. Actionable Feedback Loops: Load Testing Extracted from HTTP Header Tag(s) PurePath for Load Test Requests

26. Actionable Feedback Loops: Operations Filter by Infrastructure, Service, Application

27. Dev Perf/Test Ops Biz Shift-Left: Break Pipeline Earlier Improve MTTR: Auto-Mitigation Shift-Right: Tags, Deploys, Events Actionable Feedback Loops

28. confidential Demo Time or Done 

29. Confidential, Dynatrace, LLC Applying Artificial Intelligence to Performance Engineering Andreas Grabner - November 2017 @grabnerandi

Notas do Editor

That may have worked well for static environments where you knew what you are looking at
If your apps gave you logs you could use log analytics to analyze the log files ->in case you knew what to look for and in case the log messages were actually written We could also correlate logs and exceptions to identify strange patterns
All of this worked well in case the applications were rather static – not too large and you had the people that understood how to analyze data provided by different tools BUT – the world has changed
These is the new technology stack we are dealing with – and it is by far not complete New players coming and going – allowing us to implement new types of apps with new architectural and deployment options
But there is more than production! There is more we can do throughout the whole DevOps Toolchain
But there is more than production! There is more we can do throughout the whole DevOps Toolchain

Applying AI to Performance Engineering: Shift-Left, Shift-Right, Self-Healing

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Semelhante a Applying AI to Performance Engineering: Shift-Left, Shift-Right, Self-Healing

Semelhante a Applying AI to Performance Engineering: Shift-Left, Shift-Right, Self-Healing (20)

Mais de Andreas Grabner

Mais de Andreas Grabner (13)

Último

Último (20)

Applying AI to Performance Engineering: Shift-Left, Shift-Right, Self-Healing

Notas do Editor