Customer Sharing: HTC - What is in AWS Cloud for me?

© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Edwin Chou, Enterprise Application Development, IT, HTC
May, 2016
What is in AWS Cloud for us?

Agenda
• Background
• Why we use AWS Cloud
• What you should know before start-up
• Case sharing
• Summary & next steps

Background
• Started to use AWS Cloud since 2010.
• Focus on managing multiple cloud platform to cooperate with
existing on premise environment.
• Support cloud management, software architect & cost
optimization
• High performance, availability & scalability AP development

Because
We did not have other choice in 2010

Actually
It is still a good choice in 2016

Reason - Time to market
• No lead time
• One spot for everything
• Global capability
• Security

Reason - Time to market - Security

Reason - Save money
• Avoid Resource Overprovision
• Project Risk Management
• Technique Refresh / Price Cut
• Pay by usage
• Computing/ Network/ Storage
• Software/ Service

Reason - Innovation
• Failure is the mother of innovation
• Experiment often
• Fail at low cost
• No need to reinvent the wheel
• Synergy from application & infrastructure
• API-based collaboration with Infrastructure
• Change design thinking
• Elastic Nature / Focus on business

What you should know
before start-up

Financial Part
Cost Visibility UP !
• CAPEX to OPEX
• Payment Process
• Communicate with Cost Center
• No Initial Investment
• Clear usage report
• Clear expectation (SLA)
• How we can help on cost optimization

Management Part
• Central Support Spot
• Volume discount
• Central AWS account creation
• Management Strategy
• Account management (IAM)
• Network & IP management
• Resource Naming & Tagging
• Foundation tool set
• Avoid reinventing everywhere
Payer account
?? ? ? ? ?

Cost saving part
Pay by usage Every usage you pay

Cost Saving Part – Use cloud in right way
• Understand price model & limitation of AWS service.
• Internal traffic across AZ, VPC, Region
• Hybrid cloud is also a good choice
• disaster recovery
• development & testing
• scalability for unpredictable spike
• Software architecture is the key.
• Monolithic to Micro-service / Modularization (Dynamic load)
• Reduce unnecessary usage (IO / Storage / Traffic)

Technical Part – Computing
• Right Size (Vertical Scaling, Right instance type)
• Right Number (Horizontal Scaling, Auto scaling)
• Right Time
• Season (E-commerce promotion)
• Month ( Financial Report )
• Daily ( Working days)
• Hourly ( Working hours)
• Request ( Server less)
• Right Density (Docker, AWS ECS, Free)

System Briefing
• Summary (50M+ Devices)
• Device Software update
• Device Configuration update
• Device management
• Response time < 2 seconds
• System Capacity
• 45 VM (without DR)
• > 150,000,000 requests / day
Corporate data center + CDN

What are the challenges?
• Disaster Recovery Requirement
• Large initial procurement (45 VM  90 VM)
• Heavy implementation cost
• Heavy Operation Cost & Risk
• Many manual routine jobs
• DR Drill Costs a lot
• Resource Overprovision
• Reservation for surge
• Firewalls of responsibility

How we solve them
• Disaster Recovery Requirement
• Large initial procurement
• Heavy implementation cost
• Heavy Operation Cost & Risk
• Many manual routine jobs
• DR Drill Costs a lot
• Resource Overprovision
• Reservation for surge
• Firewalls of responsibility
 Pay by usage
 No reinvent the wheel
 Auto-Scaling
 Docker to isolate env.
 Automation
 Automation

This is why we use cloud
and want to use it well

How we do
Visibility
Control
Automation Tools
SOP
&
Policy
Cloud-Native Design

How we do
• Application & VM monitor
• Logstash / Elasticsearch / Kibana
• AWS Cost Tools
• CloudWatch / Cost Alert
• Billing Console / Cost explorer
• Advanced Cost Dashboard
• TIBCO Spotfire
Visibility

How we do
• AWS CLI/ Docker/ ECS (Application)
• Cloud Formation Template (Environment)
• AWS Lambda (Event Trigger)Control Automation
tools
SOP
&
Policy
• VM Scheduler
• VM & Storage Backup
• Central application log store
• General Tag policy
• General Name policy
• General Log policy

How we do
• Auto-scaling
• Design for failure
• Recover by reboot (Self-Healing)
• Micro-service
• Cost optimization
Cloud-Native Design

New Architecture on AWS Cloud
• Active & Active mode
• No single point of failure
• Auto-scaling support
• RTO < 5 minutes
• RPO < 5 minutes
Active Active

When Disaster happened in main site (1)
• Route53 redirect traffic to
DR site.
• CloudWatch notify NOC
by SNS.
• NOC will decide to trigger
DR process or not.
Inactive ActiveActive

DR site will be promoted
to main site ( < 15 minutes)
Automation

• New DR Site can be
Created within 3 hours
• Service has no impact
• Active & Active mode
works again.
Active Active

Result
• TCO of 45 VMs  1,500 USD/ Months (with DR)
• Disaster Recovery implementation & drill are easier
• Lower operation cost & service risk
• Team growth & have fun

Summary – Lesson learn recap
Visibility
Control
Automation Tools
SOP
&
Policy
Cloud-Native Design

Next Steps
• Start to use your free account
• To be familiar with AWS Service lego bricks
• Innovate your business/product with customers

Customer Sharing: HTC - What is in AWS Cloud for me?

Customer Sharing: HTC - What is in AWS Cloud for me?

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Destaque

Destaque (20)

Semelhante a Customer Sharing: HTC - What is in AWS Cloud for me?

Semelhante a Customer Sharing: HTC - What is in AWS Cloud for me? (20)

Mais de Amazon Web Services

Mais de Amazon Web Services (20)

Último

Último (20)

Customer Sharing: HTC - What is in AWS Cloud for me?