SlideShare uma empresa Scribd logo
1 de 30
SRE vs DevOps
Feel the difference
1
Levon Avakyan / Competetive
Gaming /
l_avakyan@wargaming.net
Content 2
• Definitions – to be one page
• SRE vs DevOps – little bit of phylosophy
• Approach – how to do well
• Cases – how we are doing in Competitive
Gaming
What I will speak about
3
Definitions
To be on one page
Reliability 4
Little bit of the theory
Reliability is theoretically defined as the probability of
success (𝑹𝒆𝒍𝒊𝒂𝒃𝒊𝒍𝒊𝒕𝒚 = 𝟏 − 𝐏𝐫𝐨𝐛𝐚𝐛𝐢𝐥𝐢𝐭𝐲 𝐨𝐟 𝐅𝐚𝐢𝐥𝐮𝐫𝐞), as the
frequency of failures; or in terms of availability, as a
probability derived from reliability, testability and
maintainability. Reliability plays a key role in the cost-
effectiveness of systems.
Reliability Engineering 5
Little bit of the theory
• Reliability engineering is engineering that
emphasizes dependability in the lifecycle
management of a product.
• Reliability engineering deals with the estimation,
prevention and management of high levels of
"lifetime" engineering uncertainty and risks of
failure.
Software Reliability 6
Little bit of the theory
• Software Reliability (SR) depends on good
requirements, design and implementation. Software
reliability engineering relies heavily on a disciplined
software engineering process to anticipate and
design against unintended consequences.
Site reliability engineering 7
Little bit of the theory
Site reliability engineering (SRE) is a discipline that
incorporates aspects of software engineering and
applies that to operations whose goals are to create
ultra-scalable and highly-reliable software systems.
SRE might be considered a subset of Devops that
possesses additional skill sets.
Development Operations 8
Little bit of the theory
DevOps is a term used to refer to a set of practices that
emphasize the collaboration and communication of
both software developers and information technology
(IT) professionals while automating the process of
software delivery and infrastructure changes. It aims at
establishing a culture and environment where building,
testing, and releasing software can happen rapidly,
frequently, and more reliably
9
SRE VS DevOps
Little bit of philosofy
10
Site Reliability Engineering
• Main focus on to creation ultra-
scalable and highly reliable
software systems.
• It is a one of engineering
specializations
• Fully embedded in the lifecycle of
product
Development Operations
• Main focus on automated
deployment process on
production and staging
environments
• It is a role
• Mostly working with environments
SRE (SR) vs DevOps
Comprasion
SRE (SR) vs DevOps 11
Conclusion
• SRE (SR) is a broader concept than DevOps
• We cannot put versus between SRE (SR) and Devops
because they achieves the similar goals, but with
different approaches
12
Approach
How to do well
Product lifecycle 13
Paste one content item here. Field is obligatory to complete.
Pre-production 14
Main purpose:
• Create specification for Development
• Clarify with business all details
Main artefacts are requirements and high level design (HLD) of new
feature/product
SRE Role:
• Review and clarify HLD
• Adding specifically requirements to improve reliability and
reduce impact to players in case of failures
Development 15
Main purpose:
• To develop the application
• To test the application
Main artefacts are release tag, SDD, test suites,
regulations/automation for release
SRE Role:
• Review and clarify SDD
• Monitoring design
• Load and performance test (tooling, environments)
• Stress tests
• Release preparations (tooling, massive migrations, release time
estimation)
Release 16
Main purpose:
• Check that application is ready to go production
• To deliver application to production environment
Main artefacts are released application and release postmortem
SRE Role:
• Review regulations
• Automatize process with standard tools
Post-Release 17
Main purpose:
• Monitoring
• Maintains
• Mitigating risks and decrease impact for user in case of outgages
Main artefacts are bugs and improvments for dev team and data for
product management team to analyze it
SRE Role:
• L2+-L3 maintains
• Data collection tools
Conclusion 18
• SRE is embedded in all life cycle of life
product
• Main aim of SRE it is increase reliability
• The scope of the responsibilities is very
variable and depends on company layout
19
Сases
How we are doing in Competitive
Gaming
Cases 20
• World of Tanks football tournament
• Companies on WoT Global Map
World of Tanks Football Tournament 21
Features:
• Cross project product
• Great importance for players and company
• New battle type
Architecture 22
Wotld of Tanks Football Tournament
Risks 23
World of Tanks Football Tournament
• High load
• A very long route for battle - a lot of points of outage
• First big load for Team Management System
• A lot of separated teams are working on event
What we have done 24
World of Tanks Football Tournament
• Did end to end load and performance test of system
• Got the prediction of players count from publisher
• Based on numbers create recommendation for the
schedule
• Added safe day in schedule
• Created tooling to move groups, steps, battels of
tournament to the other date
• Isolated battle processing and API
• Created auto scale configuration for workers
Global Map 25
Global Map
Features:
• Potentially increasing battle counts to proccess
• Have no chance to fault because it will influence to
the results of 3-week event
Architecture 26
Global Map
Risks 27
Global Map
• High load
• New gameplay features
• New vector tiles engines
• No chances to move battles
What we have done 28
Global Map
• Massive load test of new tiles vector engine
• Additional monitoring that based on game logic
• Added requirements to have opportunity to scale
most of workers
Conclusion 29
• SRE (SR) is a broader concept than DevOps
• We cannot put versus between SRE (SR) and Devops
because they achieves the similar goals, but with
different approaches
• SRE is embedded in all life cycle of life product
• Main aim of SRE it is increase reliability
• The scope of the responsibilities is very variable and
depends on company layout
Thank you
30
Levon Avakyan/Competitive Gaming Reliability Team Lead/l_avakyan@wargaming.net

Mais conteúdo relacionado

Mais procurados

A Crash Course in Building Site Reliability
A Crash Course in Building Site ReliabilityA Crash Course in Building Site Reliability
A Crash Course in Building Site Reliability
Acquia
 

Mais procurados (20)

Site Reliability Engineering: An Enterprise Adoption Story (an ITSM Academy W...
Site Reliability Engineering: An Enterprise Adoption Story (an ITSM Academy W...Site Reliability Engineering: An Enterprise Adoption Story (an ITSM Academy W...
Site Reliability Engineering: An Enterprise Adoption Story (an ITSM Academy W...
 
SRE in Startup
SRE in StartupSRE in Startup
SRE in Startup
 
A Crash Course in Building Site Reliability
A Crash Course in Building Site ReliabilityA Crash Course in Building Site Reliability
A Crash Course in Building Site Reliability
 
Site Reliability Engineering (SRE) - Tech Talk by Keet Sugathadasa
Site Reliability Engineering (SRE) - Tech Talk by Keet SugathadasaSite Reliability Engineering (SRE) - Tech Talk by Keet Sugathadasa
Site Reliability Engineering (SRE) - Tech Talk by Keet Sugathadasa
 
SRE-iously! Reliability!
SRE-iously! Reliability!SRE-iously! Reliability!
SRE-iously! Reliability!
 
Site reliability engineering - Lightning Talk
Site reliability engineering - Lightning TalkSite reliability engineering - Lightning Talk
Site reliability engineering - Lightning Talk
 
DevOps & SRE at Google Scale
DevOps & SRE at Google ScaleDevOps & SRE at Google Scale
DevOps & SRE at Google Scale
 
Sre summary
Sre summarySre summary
Sre summary
 
What is Site Reliability Engineering (SRE)
What is Site Reliability Engineering (SRE)What is Site Reliability Engineering (SRE)
What is Site Reliability Engineering (SRE)
 
Site (Service) Reliability Engineering
Site (Service) Reliability EngineeringSite (Service) Reliability Engineering
Site (Service) Reliability Engineering
 
SRE-iously: Defining the Principles, Habits, and Practices of Site Reliabilit...
SRE-iously: Defining the Principles, Habits, and Practices of Site Reliabilit...SRE-iously: Defining the Principles, Habits, and Practices of Site Reliabilit...
SRE-iously: Defining the Principles, Habits, and Practices of Site Reliabilit...
 
DevOps Vs SRE Major Differences That You Need To Know - Hidden Brains Infotech
DevOps Vs SRE Major Differences That You Need To Know - Hidden Brains InfotechDevOps Vs SRE Major Differences That You Need To Know - Hidden Brains Infotech
DevOps Vs SRE Major Differences That You Need To Know - Hidden Brains Infotech
 
Implementing SRE practices: SLI/SLO deep dive - David Blank-Edelman - DevOpsD...
Implementing SRE practices: SLI/SLO deep dive - David Blank-Edelman - DevOpsD...Implementing SRE practices: SLI/SLO deep dive - David Blank-Edelman - DevOpsD...
Implementing SRE practices: SLI/SLO deep dive - David Blank-Edelman - DevOpsD...
 
Site reliability engineering
Site reliability engineeringSite reliability engineering
Site reliability engineering
 
Kks sre book_ch1,2
Kks sre book_ch1,2Kks sre book_ch1,2
Kks sre book_ch1,2
 
How to SRE when you have no SRE
How to SRE when you have no SREHow to SRE when you have no SRE
How to SRE when you have no SRE
 
Reconstructing the SRE
Reconstructing the SREReconstructing the SRE
Reconstructing the SRE
 
DevOps Torino Meetup - SRE Concepts
DevOps Torino Meetup - SRE ConceptsDevOps Torino Meetup - SRE Concepts
DevOps Torino Meetup - SRE Concepts
 
SRE From Scratch
SRE From ScratchSRE From Scratch
SRE From Scratch
 
Rapid Strategic SRE Assessments
Rapid Strategic SRE AssessmentsRapid Strategic SRE Assessments
Rapid Strategic SRE Assessments
 

Semelhante a SRE vs DevOps

boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifijboughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
akd3143
 
Introduction to DevOps slides-converted (1).pptx
Introduction to DevOps slides-converted (1).pptxIntroduction to DevOps slides-converted (1).pptx
Introduction to DevOps slides-converted (1).pptx
aasssss1
 
Software Engineering The Multiview Approach And Wisdm
Software Engineering   The Multiview Approach And WisdmSoftware Engineering   The Multiview Approach And Wisdm
Software Engineering The Multiview Approach And Wisdm
guestc990b6
 
Site-Reliability-Engineering-v2[6241].pdf
Site-Reliability-Engineering-v2[6241].pdfSite-Reliability-Engineering-v2[6241].pdf
Site-Reliability-Engineering-v2[6241].pdf
DeepakGupta747774
 

Semelhante a SRE vs DevOps (20)

From the sandbox to production
From the sandbox to productionFrom the sandbox to production
From the sandbox to production
 
From Sandbox to Production by Vadym Fedorov
From Sandbox to Production by Vadym FedorovFrom Sandbox to Production by Vadym Fedorov
From Sandbox to Production by Vadym Fedorov
 
Devops
DevopsDevops
Devops
 
Software engineering introduction
Software engineering   introductionSoftware engineering   introduction
Software engineering introduction
 
Prepare the sled in summer and project release at its beginning
Prepare the sled in summer and project release at its beginningPrepare the sled in summer and project release at its beginning
Prepare the sled in summer and project release at its beginning
 
ADDO_2020-Driving-Digital-Transformation-through-CloudOps-and-SRE.pdf
ADDO_2020-Driving-Digital-Transformation-through-CloudOps-and-SRE.pdfADDO_2020-Driving-Digital-Transformation-through-CloudOps-and-SRE.pdf
ADDO_2020-Driving-Digital-Transformation-through-CloudOps-and-SRE.pdf
 
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifijboughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
 
1 introduction
1 introduction1 introduction
1 introduction
 
1 introduction (1)
1 introduction (1)1 introduction (1)
1 introduction (1)
 
Challenges Scaling DevOps
Challenges Scaling DevOpsChallenges Scaling DevOps
Challenges Scaling DevOps
 
DevOps CD and Multispeed IT in regulated industries (FUG Presentation)
DevOps CD and Multispeed IT in regulated industries (FUG Presentation)DevOps CD and Multispeed IT in regulated industries (FUG Presentation)
DevOps CD and Multispeed IT in regulated industries (FUG Presentation)
 
Introduction to DevSecOps. An intuitiv approach
Introduction to DevSecOps. An intuitiv approachIntroduction to DevSecOps. An intuitiv approach
Introduction to DevSecOps. An intuitiv approach
 
Unlocking Faster Product Development Cycles
Unlocking Faster Product Development CyclesUnlocking Faster Product Development Cycles
Unlocking Faster Product Development Cycles
 
Introduction to devops
Introduction to devopsIntroduction to devops
Introduction to devops
 
Introduction to DevOps slides-converted (1).pptx
Introduction to DevOps slides-converted (1).pptxIntroduction to DevOps slides-converted (1).pptx
Introduction to DevOps slides-converted (1).pptx
 
Software Engineering The Multiview Approach And Wisdm
Software Engineering   The Multiview Approach And WisdmSoftware Engineering   The Multiview Approach And Wisdm
Software Engineering The Multiview Approach And Wisdm
 
Measuring Performance: See the Science of DevOps Measurement in Action
Measuring Performance: See the Science of DevOps Measurement in ActionMeasuring Performance: See the Science of DevOps Measurement in Action
Measuring Performance: See the Science of DevOps Measurement in Action
 
Modern software architect post the agile wave
Modern software architect post the agile waveModern software architect post the agile wave
Modern software architect post the agile wave
 
Measuring Performance: See the Science of DevOps Measurement in Action
Measuring Performance: See the Science of DevOps Measurement in ActionMeasuring Performance: See the Science of DevOps Measurement in Action
Measuring Performance: See the Science of DevOps Measurement in Action
 
Site-Reliability-Engineering-v2[6241].pdf
Site-Reliability-Engineering-v2[6241].pdfSite-Reliability-Engineering-v2[6241].pdf
Site-Reliability-Engineering-v2[6241].pdf
 

Mais de Levon Avakyan

Оперирование высоко нагруженными проектами. Или "Клановые войны" каждый день
Оперирование высоко нагруженными проектами. Или "Клановые войны" каждый деньОперирование высоко нагруженными проектами. Или "Клановые войны" каждый день
Оперирование высоко нагруженными проектами. Или "Клановые войны" каждый день
Levon Avakyan
 

Mais de Levon Avakyan (10)

Надежность World of Tanks Server
Надежность World of Tanks ServerНадежность World of Tanks Server
Надежность World of Tanks Server
 
Grpahite&grafana
Grpahite&grafanaGrpahite&grafana
Grpahite&grafana
 
Реляционные базы данных
Реляционные базы данныхРеляционные базы данных
Реляционные базы данных
 
Программирование как способ выражения мыслей.
Программирование как способ выражения мыслей. Программирование как способ выражения мыслей.
Программирование как способ выражения мыслей.
 
World of Tanks Experience of Using Kafka
World of Tanks Experience of Using KafkaWorld of Tanks Experience of Using Kafka
World of Tanks Experience of Using Kafka
 
Wargaming Clan Platform
Wargaming Clan PlatformWargaming Clan Platform
Wargaming Clan Platform
 
Архитектура мета игры Wargaming. Глобальная карта 2.0.
Архитектура мета игры Wargaming. Глобальная карта 2.0.Архитектура мета игры Wargaming. Глобальная карта 2.0.
Архитектура мета игры Wargaming. Глобальная карта 2.0.
 
Осознанный выбор. Python 3 для реализации сервисного шлюза клиента World of T...
Осознанный выбор. Python 3 для реализации сервисного шлюза клиента World of T...Осознанный выбор. Python 3 для реализации сервисного шлюза клиента World of T...
Осознанный выбор. Python 3 для реализации сервисного шлюза клиента World of T...
 
Кланы в Wargaming. От странички на танковом портале до мультиплатфермнного с...
Кланы в Wargaming. От странички на танковом портале до  мультиплатфермнного с...Кланы в Wargaming. От странички на танковом портале до  мультиплатфермнного с...
Кланы в Wargaming. От странички на танковом портале до мультиплатфермнного с...
 
Оперирование высоко нагруженными проектами. Или "Клановые войны" каждый день
Оперирование высоко нагруженными проектами. Или "Клановые войны" каждый деньОперирование высоко нагруженными проектами. Или "Клановые войны" каждый день
Оперирование высоко нагруженными проектами. Или "Клановые войны" каждый день
 

Último

The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
shinachiaurasa2
 

Último (20)

call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
BUS PASS MANGEMENT SYSTEM USING PHP.pptx
BUS PASS MANGEMENT SYSTEM USING PHP.pptxBUS PASS MANGEMENT SYSTEM USING PHP.pptx
BUS PASS MANGEMENT SYSTEM USING PHP.pptx
 
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfPayment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
 
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 

SRE vs DevOps

  • 1. SRE vs DevOps Feel the difference 1 Levon Avakyan / Competetive Gaming / l_avakyan@wargaming.net
  • 2. Content 2 • Definitions – to be one page • SRE vs DevOps – little bit of phylosophy • Approach – how to do well • Cases – how we are doing in Competitive Gaming What I will speak about
  • 4. Reliability 4 Little bit of the theory Reliability is theoretically defined as the probability of success (𝑹𝒆𝒍𝒊𝒂𝒃𝒊𝒍𝒊𝒕𝒚 = 𝟏 − 𝐏𝐫𝐨𝐛𝐚𝐛𝐢𝐥𝐢𝐭𝐲 𝐨𝐟 𝐅𝐚𝐢𝐥𝐮𝐫𝐞), as the frequency of failures; or in terms of availability, as a probability derived from reliability, testability and maintainability. Reliability plays a key role in the cost- effectiveness of systems.
  • 5. Reliability Engineering 5 Little bit of the theory • Reliability engineering is engineering that emphasizes dependability in the lifecycle management of a product. • Reliability engineering deals with the estimation, prevention and management of high levels of "lifetime" engineering uncertainty and risks of failure.
  • 6. Software Reliability 6 Little bit of the theory • Software Reliability (SR) depends on good requirements, design and implementation. Software reliability engineering relies heavily on a disciplined software engineering process to anticipate and design against unintended consequences.
  • 7. Site reliability engineering 7 Little bit of the theory Site reliability engineering (SRE) is a discipline that incorporates aspects of software engineering and applies that to operations whose goals are to create ultra-scalable and highly-reliable software systems. SRE might be considered a subset of Devops that possesses additional skill sets.
  • 8. Development Operations 8 Little bit of the theory DevOps is a term used to refer to a set of practices that emphasize the collaboration and communication of both software developers and information technology (IT) professionals while automating the process of software delivery and infrastructure changes. It aims at establishing a culture and environment where building, testing, and releasing software can happen rapidly, frequently, and more reliably
  • 9. 9 SRE VS DevOps Little bit of philosofy
  • 10. 10 Site Reliability Engineering • Main focus on to creation ultra- scalable and highly reliable software systems. • It is a one of engineering specializations • Fully embedded in the lifecycle of product Development Operations • Main focus on automated deployment process on production and staging environments • It is a role • Mostly working with environments SRE (SR) vs DevOps Comprasion
  • 11. SRE (SR) vs DevOps 11 Conclusion • SRE (SR) is a broader concept than DevOps • We cannot put versus between SRE (SR) and Devops because they achieves the similar goals, but with different approaches
  • 13. Product lifecycle 13 Paste one content item here. Field is obligatory to complete.
  • 14. Pre-production 14 Main purpose: • Create specification for Development • Clarify with business all details Main artefacts are requirements and high level design (HLD) of new feature/product SRE Role: • Review and clarify HLD • Adding specifically requirements to improve reliability and reduce impact to players in case of failures
  • 15. Development 15 Main purpose: • To develop the application • To test the application Main artefacts are release tag, SDD, test suites, regulations/automation for release SRE Role: • Review and clarify SDD • Monitoring design • Load and performance test (tooling, environments) • Stress tests • Release preparations (tooling, massive migrations, release time estimation)
  • 16. Release 16 Main purpose: • Check that application is ready to go production • To deliver application to production environment Main artefacts are released application and release postmortem SRE Role: • Review regulations • Automatize process with standard tools
  • 17. Post-Release 17 Main purpose: • Monitoring • Maintains • Mitigating risks and decrease impact for user in case of outgages Main artefacts are bugs and improvments for dev team and data for product management team to analyze it SRE Role: • L2+-L3 maintains • Data collection tools
  • 18. Conclusion 18 • SRE is embedded in all life cycle of life product • Main aim of SRE it is increase reliability • The scope of the responsibilities is very variable and depends on company layout
  • 19. 19 Сases How we are doing in Competitive Gaming
  • 20. Cases 20 • World of Tanks football tournament • Companies on WoT Global Map
  • 21. World of Tanks Football Tournament 21 Features: • Cross project product • Great importance for players and company • New battle type
  • 22. Architecture 22 Wotld of Tanks Football Tournament
  • 23. Risks 23 World of Tanks Football Tournament • High load • A very long route for battle - a lot of points of outage • First big load for Team Management System • A lot of separated teams are working on event
  • 24. What we have done 24 World of Tanks Football Tournament • Did end to end load and performance test of system • Got the prediction of players count from publisher • Based on numbers create recommendation for the schedule • Added safe day in schedule • Created tooling to move groups, steps, battels of tournament to the other date • Isolated battle processing and API • Created auto scale configuration for workers
  • 25. Global Map 25 Global Map Features: • Potentially increasing battle counts to proccess • Have no chance to fault because it will influence to the results of 3-week event
  • 27. Risks 27 Global Map • High load • New gameplay features • New vector tiles engines • No chances to move battles
  • 28. What we have done 28 Global Map • Massive load test of new tiles vector engine • Additional monitoring that based on game logic • Added requirements to have opportunity to scale most of workers
  • 29. Conclusion 29 • SRE (SR) is a broader concept than DevOps • We cannot put versus between SRE (SR) and Devops because they achieves the similar goals, but with different approaches • SRE is embedded in all life cycle of life product • Main aim of SRE it is increase reliability • The scope of the responsibilities is very variable and depends on company layout
  • 30. Thank you 30 Levon Avakyan/Competitive Gaming Reliability Team Lead/l_avakyan@wargaming.net

Notas do Editor

  1. Надежность может теоритический определятся как вероятность успеха, то есть надежность = 1 – вероятность отказа, частотой отказов с другой стороны в терминах доступности как вероятность полученная из надежности, тестируемости и ремонтопригодности. Надежность играет ключевую роль в экономической эффективности систем.
  2. Reliability engineering является разработка, которая подчеркивает надежность в управлении жизненным циклом продукта. Reliability engineering касается оценки, предотвращения и управления высокими уровнями «пожизненной» инженерной неопределенности и рисков отказа.
  3. SR зависит от корректных требований, архитектуры и реализации. SR программного обеспечения в значительной степени зависит от процесса разработки ПО, чтобы предугадывать и проектировать его , чтобы противостоять непредвиденным последствиям.
  4. SRE- это дисциплина, которая включает аспекты разработки программного обеспечения и применяется к операциям, целью которых является создание ультрамасштабируемых и высоконадежных программных систем. SRE можно рассматривать как подмножество Devops, обладающее дополнительными наборами навыков.
  5. DevOps - термин, используемый для обозначения набора практических методов, которые подчеркивают сотрудничество и коммуникацию как разработчиков программного обеспечения, так и специалистов в области информационных технологий (ИТ), в то же время автоматизируя процесс доставки программного обеспечения и изменения инфраструктуры. Он нацелен на создание культуры и среды, где создание, тестирование и выпуск программного обеспечения могут происходить быстро, часто и надежно