SlideShare a Scribd company logo
1 of 18
Download to read offline
Not everything that happens
in Vegas stays in Vegas
DevOps
or “getting devs to be on call for what they ship” :-)
Netflix development
Priorities
1. Speed of innovation
2. Availability
3. Running costs
a. “It’ll cost what it ends up costing”
In practise, they found that holding to the first two ended
up costing way less than otherwise expected.
Riot Games + League of Legends
Cloud == ideal for MMOs. Solve launch issues.
● chef gets used a lot here.
○ talked about their evolution with it, lessons learned
● What sucked?
○ 25 minute bootstrap runs
○ External dependencies (including S3)
○ Duplicating application deployment recipes
● golden masters and immutable servers simplify your
life drastically.
● “if you’re doing chef without BerkShelf you’re doing it
wrong”
● Make it easy to throw up new things
Testing in production
Netflix, Riot, Kickstarter - they all do this.
At scale.
Netflix
● 10s to 100s of code pushes per day
● 1000s to 100,000s of config changes per day
○ they tune their A/B testing constantly
Of course, they also have the instrumentation to react to
this.
How’re other people doing DevOps?
Good news - we’re at the “more sophisticated” end of the
spectrum.
Every “cloud native” was doing this.
Things other people did better:
● “Golden master” AMIs
● Immutable instances
● Absolute ownership of vertical slices
● Config-managment (chef/puppet) featured
prominently
● Extensive monitoring+logs+visibility == “table stakes”
○ for developers!
● Easy to throw up new things
● Run many small, simple, collaborating things
Who? Riot Games, Netflix, change.org, Kickstarter
Logging aggregation is
important
Logging aggregation is important
Lots of 3rd party companies are offering centralized
logging services, there's a huge appetite for logging
and monitoring.
● http://logentries.com/
● http://www.loggly.com/
● http://papertrailapp.com/
● https://www.splunkstorm.com/tour
● http://www.datadoghq.com/
● DIY - Lumberjacking slides
DEMO: Monitoring & Logging
https://app.datadoghq.com/infrastructure
● Tag Metrics, awesome Metric discoverability
● Cloud Watch integration
○ I never knew I could see ELB metrics :-)
● Alarms are integrated
● You can template Dashboards
https://papertrailapp.com/
● Can Search, Save Searches, Alerts on searches
● No alert on patterns
● Archive to S3 / Push to Redshift
Logging aggregation is FOR DEVELOPERS!!!
Saves lots of time when you’re on call.
Loggly Session
Benefit of logging as a service.
● When your infrastructure is in trouble, you do not
want to have your logging analytic system on the
same infrastructure.
AWS Services that loggly could use:
● Kafka + Storm vs Kinesis
● Elastic Search vs Cloud Search
Predictive Analytics using Storm, Hadoop, R and
AWS
http://www.youtube.com/watch?v=6Sl3eBmDheE
Loggly Session
● Provisioned IOPS solve all issues :)
● ELB do not perform with extremely high volume
of requests.
● DNS round robin is a very good basic load
balancing solution
● Cassandra works very well for application data.
● Cassandra does not work well as a queue system,
hard to track order of events.
● Keep the architecture simple.
Large Scale Load
Testing on AWS
Many types of load
● Load testing
○ (running a marathon), predict future load and
plan in advance
● Stress testing
○ Break things (figure out limits), mitigation
plans
● Resilience test
○ Figure out how many parts of the architecture
you can lose and still operate
● Performance test
○ How is latency and throughput changing when
the load increase
Phase roll out and measure
● Load Testing is necessary but not sufficient.
○ Deploy to alpha cluster.
○ The release cycle is important, phased
deployment, one box, monitor and ramp up.
○ Monitor performance and behaviour, look at
99% of the traffic, not at the average.
● Netflix record 1.2 billion metrics per day
○ 5 minutes SLA
Gameday
Gameday
We took part to the AWS Gameday
http://www.awsgameday.com/whatisgameday.html
Inspired by the 2012 Obama For America DevOps
and Amazon.com ops teams
● Build an Autoscaling application
● Exchange administrative IAM credentials with
other team
● Break your opponent's systems
● Restore your system
● Lessons learned
Who is interested if we wanted to run this?
It needs a full day, ~ 6 hours.
Weekday?
Weekend?
Twitter: @petemounce

More Related Content

What's hot

Swagger Code Generation
Swagger Code GenerationSwagger Code Generation
Swagger Code GenerationPlain Concepts
 
Git ops: Git based application deployment patterns for Kubernetes
Git ops: Git based application deployment patterns for KubernetesGit ops: Git based application deployment patterns for Kubernetes
Git ops: Git based application deployment patterns for KubernetesShahidh K Muhammed
 
Evolution of GitLab Frontend
Evolution of GitLab FrontendEvolution of GitLab Frontend
Evolution of GitLab FrontendFatih Acet
 
Building a µservice with Kotlin, Micronaut & GCP
Building a µservice with Kotlin, Micronaut & GCPBuilding a µservice with Kotlin, Micronaut & GCP
Building a µservice with Kotlin, Micronaut & GCPRobert MacLean
 
Serverless with Google Cloud Functions
Serverless with Google Cloud FunctionsServerless with Google Cloud Functions
Serverless with Google Cloud FunctionsJerry Jalava
 
Monitoring in the cloud with Puppet
Monitoring in the cloud with PuppetMonitoring in the cloud with Puppet
Monitoring in the cloud with PuppetKris Buytaert
 
A GitOps Kubernetes Native CICD Solution with Argo Events, Workflows, and CD
A GitOps Kubernetes Native CICD Solution with Argo Events, Workflows, and CDA GitOps Kubernetes Native CICD Solution with Argo Events, Workflows, and CD
A GitOps Kubernetes Native CICD Solution with Argo Events, Workflows, and CDJulian Mazzitelli
 
Lightning talk how to edit the Silverstripe CMS docs
Lightning talk how to edit the Silverstripe CMS docsLightning talk how to edit the Silverstripe CMS docs
Lightning talk how to edit the Silverstripe CMS docsMichaelPritchard21
 
Gitops: the kubernetes way
Gitops: the kubernetes wayGitops: the kubernetes way
Gitops: the kubernetes waysparkfabrik
 
KegKong: Automated Keg System, presented at the Flatiron School
KegKong: Automated Keg System, presented at the Flatiron SchoolKegKong: Automated Keg System, presented at the Flatiron School
KegKong: Automated Keg System, presented at the Flatiron SchoolJordan Trevino
 
Is a ORCHESTRATION a new milestone?
Is a ORCHESTRATION  a new milestone?Is a ORCHESTRATION  a new milestone?
Is a ORCHESTRATION a new milestone?Piotr Perzyna
 
Cloud Study Jam Exploring the Cloud
Cloud Study Jam Exploring the CloudCloud Study Jam Exploring the Cloud
Cloud Study Jam Exploring the CloudProttay Karim
 
Devops Porto - CI/CD at Gitlab
Devops Porto - CI/CD at GitlabDevops Porto - CI/CD at Gitlab
Devops Porto - CI/CD at GitlabFilipa Lacerda
 
Jump into React-Native (Class 5)
Jump into React-Native  (Class 5)Jump into React-Native  (Class 5)
Jump into React-Native (Class 5)Waqqas Jabbar
 

What's hot (20)

Lambda architecture
Lambda architectureLambda architecture
Lambda architecture
 
Swagger Code Generation
Swagger Code GenerationSwagger Code Generation
Swagger Code Generation
 
Git ops: Git based application deployment patterns for Kubernetes
Git ops: Git based application deployment patterns for KubernetesGit ops: Git based application deployment patterns for Kubernetes
Git ops: Git based application deployment patterns for Kubernetes
 
Evolution of GitLab Frontend
Evolution of GitLab FrontendEvolution of GitLab Frontend
Evolution of GitLab Frontend
 
Building a µservice with Kotlin, Micronaut & GCP
Building a µservice with Kotlin, Micronaut & GCPBuilding a µservice with Kotlin, Micronaut & GCP
Building a µservice with Kotlin, Micronaut & GCP
 
Serverless with Google Cloud Functions
Serverless with Google Cloud FunctionsServerless with Google Cloud Functions
Serverless with Google Cloud Functions
 
Monitoring in the cloud with Puppet
Monitoring in the cloud with PuppetMonitoring in the cloud with Puppet
Monitoring in the cloud with Puppet
 
A GitOps Kubernetes Native CICD Solution with Argo Events, Workflows, and CD
A GitOps Kubernetes Native CICD Solution with Argo Events, Workflows, and CDA GitOps Kubernetes Native CICD Solution with Argo Events, Workflows, and CD
A GitOps Kubernetes Native CICD Solution with Argo Events, Workflows, and CD
 
Lightning talk how to edit the Silverstripe CMS docs
Lightning talk how to edit the Silverstripe CMS docsLightning talk how to edit the Silverstripe CMS docs
Lightning talk how to edit the Silverstripe CMS docs
 
Git
GitGit
Git
 
Argocd up and running
Argocd up and runningArgocd up and running
Argocd up and running
 
Gitops: the kubernetes way
Gitops: the kubernetes wayGitops: the kubernetes way
Gitops: the kubernetes way
 
KegKong: Automated Keg System, presented at the Flatiron School
KegKong: Automated Keg System, presented at the Flatiron SchoolKegKong: Automated Keg System, presented at the Flatiron School
KegKong: Automated Keg System, presented at the Flatiron School
 
GitOps with Gitkube
GitOps with GitkubeGitOps with Gitkube
GitOps with Gitkube
 
Is a ORCHESTRATION a new milestone?
Is a ORCHESTRATION  a new milestone?Is a ORCHESTRATION  a new milestone?
Is a ORCHESTRATION a new milestone?
 
Cloud Study Jam - 2019
Cloud Study Jam - 2019Cloud Study Jam - 2019
Cloud Study Jam - 2019
 
Cloud Study Jam Exploring the Cloud
Cloud Study Jam Exploring the CloudCloud Study Jam Exploring the Cloud
Cloud Study Jam Exploring the Cloud
 
Devops Porto - CI/CD at Gitlab
Devops Porto - CI/CD at GitlabDevops Porto - CI/CD at Gitlab
Devops Porto - CI/CD at Gitlab
 
Meetup 23 - 03 - Application Delivery on K8S with GitOps
Meetup 23 - 03 - Application Delivery on K8S with GitOpsMeetup 23 - 03 - Application Delivery on K8S with GitOps
Meetup 23 - 03 - Application Delivery on K8S with GitOps
 
Jump into React-Native (Class 5)
Jump into React-Native  (Class 5)Jump into React-Native  (Class 5)
Jump into React-Native (Class 5)
 

Viewers also liked

How Chinese Government Fabricates Social Media Posts for Strategic Distractio...
How Chinese Government Fabricates Social Media Posts for Strategic Distractio...How Chinese Government Fabricates Social Media Posts for Strategic Distractio...
How Chinese Government Fabricates Social Media Posts for Strategic Distractio...Hasbulnallah Mohamed Razali
 
The Art of the Start... with a little help from management theory Jan Rosier
The Art of the Start... with a little help from management theory Jan Rosier The Art of the Start... with a little help from management theory Jan Rosier
The Art of the Start... with a little help from management theory Jan Rosier University College Dublin
 
Fy13 q2 plan by jinmei
Fy13 q2 plan by jinmeiFy13 q2 plan by jinmei
Fy13 q2 plan by jinmeiSammi Yu
 
profile mediavision
profile mediavisionprofile mediavision
profile mediavisionEhab Hamid
 
Fy13 q2 plan by jinmei
Fy13 q2 plan by jinmeiFy13 q2 plan by jinmei
Fy13 q2 plan by jinmeiSammi Yu
 
Free Trade Agreements Colombia
Free Trade Agreements ColombiaFree Trade Agreements Colombia
Free Trade Agreements ColombiaJHON JAIRO VERGARA
 
JUST EAT: Tools we use to enable our culture
JUST EAT: Tools we use to enable our cultureJUST EAT: Tools we use to enable our culture
JUST EAT: Tools we use to enable our culturePeter Mounce
 
MonkeyDough Webinar Sept 11, 2013
MonkeyDough Webinar Sept 11, 2013MonkeyDough Webinar Sept 11, 2013
MonkeyDough Webinar Sept 11, 2013WedoAuctions
 
JUST EAT: Embracing DevOps
JUST EAT: Embracing DevOpsJUST EAT: Embracing DevOps
JUST EAT: Embracing DevOpsPeter Mounce
 

Viewers also liked (14)

How Chinese Government Fabricates Social Media Posts for Strategic Distractio...
How Chinese Government Fabricates Social Media Posts for Strategic Distractio...How Chinese Government Fabricates Social Media Posts for Strategic Distractio...
How Chinese Government Fabricates Social Media Posts for Strategic Distractio...
 
The Art of the Start... with a little help from management theory Jan Rosier
The Art of the Start... with a little help from management theory Jan Rosier The Art of the Start... with a little help from management theory Jan Rosier
The Art of the Start... with a little help from management theory Jan Rosier
 
Dịch vụ thành lập công ty
Dịch vụ thành lập công tyDịch vụ thành lập công ty
Dịch vụ thành lập công ty
 
Fy13 q2 plan by jinmei
Fy13 q2 plan by jinmeiFy13 q2 plan by jinmei
Fy13 q2 plan by jinmei
 
profile mediavision
profile mediavisionprofile mediavision
profile mediavision
 
Fy13 q2 plan by jinmei
Fy13 q2 plan by jinmeiFy13 q2 plan by jinmei
Fy13 q2 plan by jinmei
 
Summer is over
Summer is overSummer is over
Summer is over
 
Free Trade Agreements Colombia
Free Trade Agreements ColombiaFree Trade Agreements Colombia
Free Trade Agreements Colombia
 
JUST EAT: Tools we use to enable our culture
JUST EAT: Tools we use to enable our cultureJUST EAT: Tools we use to enable our culture
JUST EAT: Tools we use to enable our culture
 
MonkeyDough Webinar Sept 11, 2013
MonkeyDough Webinar Sept 11, 2013MonkeyDough Webinar Sept 11, 2013
MonkeyDough Webinar Sept 11, 2013
 
JUST EAT: Embracing DevOps
JUST EAT: Embracing DevOpsJUST EAT: Embracing DevOps
JUST EAT: Embracing DevOps
 
YAHEEYA
YAHEEYAYAHEEYA
YAHEEYA
 
Outliers, the story of success
Outliers, the story of successOutliers, the story of success
Outliers, the story of success
 
Quasi Turbine
Quasi TurbineQuasi Turbine
Quasi Turbine
 

Similar to Aws uk ug #8 not everything that happens in vegas stay in vegas

Thinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriThinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriDemi Ben-Ari
 
Ensuring Performance in a Fast-Paced Environment (CMG 2014)
Ensuring Performance in a Fast-Paced Environment (CMG 2014)Ensuring Performance in a Fast-Paced Environment (CMG 2014)
Ensuring Performance in a Fast-Paced Environment (CMG 2014)Martin Spier
 
There is something about serverless
There is something about serverlessThere is something about serverless
There is something about serverlessgjdevos
 
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017Demi Ben-Ari
 
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...Demi Ben-Ari
 
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...Codemotion
 
Serverless? How (not) to develop, deploy and operate serverless applications.
Serverless? How (not) to develop, deploy and operate serverless applications.Serverless? How (not) to develop, deploy and operate serverless applications.
Serverless? How (not) to develop, deploy and operate serverless applications.gjdevos
 
Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...
Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...
Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...StormForge .io
 
DevOps at Obama for America(2012) and the DNC (DevOps Days NYC Jan 2013)
DevOps at Obama for America(2012) and the DNC (DevOps Days NYC Jan 2013)DevOps at Obama for America(2012) and the DNC (DevOps Days NYC Jan 2013)
DevOps at Obama for America(2012) and the DNC (DevOps Days NYC Jan 2013)John Schneider
 
Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...
Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...
Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...Codemotion
 
Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...
Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...
Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...Demi Ben-Ari
 
Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...
Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...
Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...Kaxil Naik
 
Automated Testing Environments With Kubernetes & GitLab
Automated Testing Environments With Kubernetes & GitLabAutomated Testing Environments With Kubernetes & GitLab
Automated Testing Environments With Kubernetes & GitLabVladislav Supalov
 
OSMC 2012 | Shinken by Jean Gabès
OSMC 2012 | Shinken by Jean GabèsOSMC 2012 | Shinken by Jean Gabès
OSMC 2012 | Shinken by Jean GabèsNETWAYS
 
DevOps for TYPO3 Teams and Projects
DevOps for TYPO3 Teams and ProjectsDevOps for TYPO3 Teams and Projects
DevOps for TYPO3 Teams and ProjectsFedir RYKHTIK
 
Data Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixData Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixC4Media
 
Triangle Devops Meetup 10/2015
Triangle Devops Meetup 10/2015Triangle Devops Meetup 10/2015
Triangle Devops Meetup 10/2015aspyker
 
DevOpsDays Taipei 2019 - Mastering IaC the DevOps Way
DevOpsDays Taipei 2019 - Mastering IaC the DevOps WayDevOpsDays Taipei 2019 - Mastering IaC the DevOps Way
DevOpsDays Taipei 2019 - Mastering IaC the DevOps Waysmalltown
 
AWS Techniques and lessons writing low cost autoscaling GitLab runners
AWS Techniques and lessons writing low cost autoscaling GitLab runnersAWS Techniques and lessons writing low cost autoscaling GitLab runners
AWS Techniques and lessons writing low cost autoscaling GitLab runnersAnthony Scata
 

Similar to Aws uk ug #8 not everything that happens in vegas stay in vegas (20)

Thinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriThinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-Ari
 
Ensuring Performance in a Fast-Paced Environment (CMG 2014)
Ensuring Performance in a Fast-Paced Environment (CMG 2014)Ensuring Performance in a Fast-Paced Environment (CMG 2014)
Ensuring Performance in a Fast-Paced Environment (CMG 2014)
 
There is something about serverless
There is something about serverlessThere is something about serverless
There is something about serverless
 
TestIstanbul 2015
TestIstanbul 2015TestIstanbul 2015
TestIstanbul 2015
 
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Berlin 2017
 
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
 
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
 
Serverless? How (not) to develop, deploy and operate serverless applications.
Serverless? How (not) to develop, deploy and operate serverless applications.Serverless? How (not) to develop, deploy and operate serverless applications.
Serverless? How (not) to develop, deploy and operate serverless applications.
 
Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...
Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...
Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...
 
DevOps at Obama for America(2012) and the DNC (DevOps Days NYC Jan 2013)
DevOps at Obama for America(2012) and the DNC (DevOps Days NYC Jan 2013)DevOps at Obama for America(2012) and the DNC (DevOps Days NYC Jan 2013)
DevOps at Obama for America(2012) and the DNC (DevOps Days NYC Jan 2013)
 
Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...
Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...
Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...
 
Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...
Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...
Monitoring Big Data Systems "Done the simple way" - Demi Ben-Ari - Codemotion...
 
Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...
Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...
Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...
 
Automated Testing Environments With Kubernetes & GitLab
Automated Testing Environments With Kubernetes & GitLabAutomated Testing Environments With Kubernetes & GitLab
Automated Testing Environments With Kubernetes & GitLab
 
OSMC 2012 | Shinken by Jean Gabès
OSMC 2012 | Shinken by Jean GabèsOSMC 2012 | Shinken by Jean Gabès
OSMC 2012 | Shinken by Jean Gabès
 
DevOps for TYPO3 Teams and Projects
DevOps for TYPO3 Teams and ProjectsDevOps for TYPO3 Teams and Projects
DevOps for TYPO3 Teams and Projects
 
Data Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixData Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFix
 
Triangle Devops Meetup 10/2015
Triangle Devops Meetup 10/2015Triangle Devops Meetup 10/2015
Triangle Devops Meetup 10/2015
 
DevOpsDays Taipei 2019 - Mastering IaC the DevOps Way
DevOpsDays Taipei 2019 - Mastering IaC the DevOps WayDevOpsDays Taipei 2019 - Mastering IaC the DevOps Way
DevOpsDays Taipei 2019 - Mastering IaC the DevOps Way
 
AWS Techniques and lessons writing low cost autoscaling GitLab runners
AWS Techniques and lessons writing low cost autoscaling GitLab runnersAWS Techniques and lessons writing low cost autoscaling GitLab runners
AWS Techniques and lessons writing low cost autoscaling GitLab runners
 

Aws uk ug #8 not everything that happens in vegas stay in vegas

  • 1. Not everything that happens in Vegas stays in Vegas
  • 2. DevOps or “getting devs to be on call for what they ship” :-)
  • 3. Netflix development Priorities 1. Speed of innovation 2. Availability 3. Running costs a. “It’ll cost what it ends up costing” In practise, they found that holding to the first two ended up costing way less than otherwise expected.
  • 4. Riot Games + League of Legends Cloud == ideal for MMOs. Solve launch issues. ● chef gets used a lot here. ○ talked about their evolution with it, lessons learned ● What sucked? ○ 25 minute bootstrap runs ○ External dependencies (including S3) ○ Duplicating application deployment recipes ● golden masters and immutable servers simplify your life drastically. ● “if you’re doing chef without BerkShelf you’re doing it wrong” ● Make it easy to throw up new things
  • 5. Testing in production Netflix, Riot, Kickstarter - they all do this. At scale. Netflix ● 10s to 100s of code pushes per day ● 1000s to 100,000s of config changes per day ○ they tune their A/B testing constantly Of course, they also have the instrumentation to react to this.
  • 6. How’re other people doing DevOps? Good news - we’re at the “more sophisticated” end of the spectrum. Every “cloud native” was doing this. Things other people did better: ● “Golden master” AMIs ● Immutable instances ● Absolute ownership of vertical slices ● Config-managment (chef/puppet) featured prominently ● Extensive monitoring+logs+visibility == “table stakes” ○ for developers! ● Easy to throw up new things ● Run many small, simple, collaborating things Who? Riot Games, Netflix, change.org, Kickstarter
  • 8. Logging aggregation is important Lots of 3rd party companies are offering centralized logging services, there's a huge appetite for logging and monitoring. ● http://logentries.com/ ● http://www.loggly.com/ ● http://papertrailapp.com/ ● https://www.splunkstorm.com/tour ● http://www.datadoghq.com/ ● DIY - Lumberjacking slides
  • 9. DEMO: Monitoring & Logging https://app.datadoghq.com/infrastructure ● Tag Metrics, awesome Metric discoverability ● Cloud Watch integration ○ I never knew I could see ELB metrics :-) ● Alarms are integrated ● You can template Dashboards https://papertrailapp.com/ ● Can Search, Save Searches, Alerts on searches ● No alert on patterns ● Archive to S3 / Push to Redshift Logging aggregation is FOR DEVELOPERS!!! Saves lots of time when you’re on call.
  • 10. Loggly Session Benefit of logging as a service. ● When your infrastructure is in trouble, you do not want to have your logging analytic system on the same infrastructure. AWS Services that loggly could use: ● Kafka + Storm vs Kinesis ● Elastic Search vs Cloud Search Predictive Analytics using Storm, Hadoop, R and AWS http://www.youtube.com/watch?v=6Sl3eBmDheE
  • 11. Loggly Session ● Provisioned IOPS solve all issues :) ● ELB do not perform with extremely high volume of requests. ● DNS round robin is a very good basic load balancing solution ● Cassandra works very well for application data. ● Cassandra does not work well as a queue system, hard to track order of events. ● Keep the architecture simple.
  • 13. Many types of load ● Load testing ○ (running a marathon), predict future load and plan in advance ● Stress testing ○ Break things (figure out limits), mitigation plans ● Resilience test ○ Figure out how many parts of the architecture you can lose and still operate ● Performance test ○ How is latency and throughput changing when the load increase
  • 14. Phase roll out and measure ● Load Testing is necessary but not sufficient. ○ Deploy to alpha cluster. ○ The release cycle is important, phased deployment, one box, monitor and ramp up. ○ Monitor performance and behaviour, look at 99% of the traffic, not at the average. ● Netflix record 1.2 billion metrics per day ○ 5 minutes SLA
  • 16. Gameday We took part to the AWS Gameday http://www.awsgameday.com/whatisgameday.html Inspired by the 2012 Obama For America DevOps and Amazon.com ops teams ● Build an Autoscaling application ● Exchange administrative IAM credentials with other team ● Break your opponent's systems ● Restore your system ● Lessons learned
  • 17. Who is interested if we wanted to run this? It needs a full day, ~ 6 hours. Weekday? Weekend?