In today's sophisticated IT Cloud world, how do I fuse multiple technologies, products, and clouds together to create an integrated High Availability, Disaster Recovery, and Business Continuity (HA/DR/BC) IT solution in 2012? This session complements product-specific and overview HA/DR/BC sessions by providing a proven, product-agnostic methodology to architect such a solution, including petabyte-level considerations. We provide a pragmatic, industry-proven, step-by-step methodology and toolset that you can use directly with clients to a) crisply elicit and distill HA/DR/BC requirements, b) efficiently organize and map those requirements, c) design an integrated, multi-product, phased-approach IT HA/DR/BC solution that properly combines backup/restore software, tape, tape libraries, de-dup, point-in-time and continuous disk replication, and storage virtualization products, and d) provide a template to clearly communicate the solution and gain consensus
across multiple levels of operations and management. John Sing is the author of three IBM Redbooks, including
SG24-6547-03, IBM System Storage Planning for Business Continuity. My only request when referencing this material in your work is that you give full credit to me, John Sing, and IBM as the authors of this material, research, and methodology. That having been said, please spread the good word.
1. Architect’s Guide to Designing Integrated
Multi-Product HA-DR-BC Solutions
John Sing, Executive Strategy, IBM Session E10
1
2. John Sing • 31 years of experience with IBM in high end servers, storage, and
software
– 2009 - Present: IBM Executive Strategy Consultant: IT Strategy and Planning, Enterprise
Large Scale Storage, Internet Scale Workloads and Data Center Design, Big Data Analytics,
HA/DR/BC
– 2002-2008: IBM IT Data Center Strategy, Large Scale Systems, Business Continuity,
HA/DR/BC, IBM Storage
– 1998-2001: IBM Storage Subsystems Group - Enterprise Storage Server Marketing
Manager, Planner for ESS Copy Services (FlashCopy, PPRC, XRC, Metro Mirror, Global
Mirror)
– 1994-1998: IBM Hong Kong, IBM China Marketing Specialist for High-End Storage
– 1989-1994: IBM USA Systems Center Specialist for High-End S/390 processors
– 1982-1989: IBM USA Marketing Specialist for S/370, S/390 customers (including VSE and
VSE/ESA)
• singj@us.ibm.com
• IBM colleagues may access my webpage:
– http://snjgsa.ibm.com/~singj/
• You may follow my daily IT research blog
– http://www.delicious.com/atsf_arizona
2
3. Agenda
• Understand today’s challenges and best
practices
– for IT High Availability and IT Business Continuity
• What has changed? What is the same?
• Strategies for:
– Requirements, design, implementation
• Step by step approach
– Essential role of automation
– Accommodating petabyte scale
– Exploiting Cloud
2012 Cloud deployment options
3
4. Agenda
1. Solving Today’s HA-DR-BC Challenges
2. Guiding HA-DR-BC Principles to mitigate chaos
3. Traditional Workloads vs. Internet Scale Workloads
4. Master Vision and Best Practices Methodology
4
5. Recovering today’s real-time massive streaming workflows is challenging
Chart in public domain: IEEE Massive File Storage presentation, author: Bill Kramer, NCSA: http://storageconference.org/2010/Presentations/MSST/1.Kramer.pdf:
5
7. Inter-disciplinary: many options, including many non-traditional alternatives for user deployments, workload hosting, and recovery models
• Traditional alternatives
• Non-traditional alternatives:
– The Cloud, the Developing World
– Other platforms
– Other vendors
Illustrative Cloud examples only; no endorsement is implied or expressed
7
8. Finally, we have this 'little' problem regarding Mobile proliferation
Clayton Christensen, Harvard Business School: http://en.wikipedia.org/wiki/Disruptive_innovation
• From an IT standpoint, we are clearly seeing the "consumerization of IT"
• The key is to recognize and exploit the hyper-pace reality of BYOD's associated data
• Not just the technology
• Also the recovery model ("cloud"), the business model, and the required ecosystem
8
9. So how do we affordably architect HA / BC / DR in 2012?
9
10. What has remained the same?
(Continued good Guiding Principles that mitigate
HA/DR/BC chaos)
Storage Efficiency • Service Management • Data Protection
10
11. The Business Process is still the Recoverable Unit
(Diagram: business processes A through G run on applications 1, 2, and 3, built on WebSphere, MQSeries, DB2, SQL, analytics, and reporting, which in turn run on the IT infrastructure.)
1. An error occurs on a storage device that corrupts a database
2. The error impacts the ability of two or more applications to share critical data
3. The loss of both applications affects two distinctly different business processes
IT Business Continuity must correspondingly recover at the business process level
11
12. Cloud does not change the business process; it is still the recovery unit
(Diagram: the same business processes A through G and application stack as the previous chart, with part of the workload deployed in a Cloud.)
1. Data is input to the cloud
2. A Cloud provider outage occurs
3. The loss of Cloud output affects two distinctly different business processes
Cloud is simply another deployment option, but it doesn't change the fundamental HA/BC approach
12
13. When can Cloud recovery provide extremely fast time to project completion?
• Where entire business process recoverable units can be out-sourced to a Cloud provider
– Production example: out-sourcing production, or backup/restore, or an integrated standalone application, to a provider
– Cloud application-as-a-service (AaaS) example: Salesforce.com, etc.
(Diagram: business processes A through G on the application and infrastructure stack, as in the previous charts.)
13
14. The trick to leveraging Cloud is:
Understanding that Cloud is simply another
(albeit powerful) deployment choice
Good news:
Fundamental principles for HA/DR/BC haven’t changed
It’s only the deployment options that have changed
14
15. Still true: synergistic overlap of valid data protection techniques
IT Data Protection:
1. High Availability: fault-tolerant, failure-resistant, streamlined infrastructure with an affordable cost foundation
2. Continuous Operations: non-disruptive backups and system maintenance coupled with continuous availability of applications
3. Disaster Recovery: protection against unplanned outages such as disasters through reliable, predictable recovery
Protection of critical business data • Operations continue after a disaster • Recovery is predictable and reliable • Costs are predictable and manageable
15
16. Four Stages of Data Center Efficiency: (pre-req’s for HA/BC/DR)
April 2012
http://www-935.ibm.com/services/us/igs/smarterdatacenter.html
http://public.dhe.ibm.com/common/ssi/ecm/en/rlw03007usen/RLW03007USEN.PDF
16
17. Still true: Timeline of an IT Recovery
Telecom bandwidth is still the major delimiter for any fast recovery.
(Timeline diagram: production is running when an outage occurs; the recovery proceeds through assessment, physical facilities, telecom network, operating system, and management control data, then operations, network, and applications staff execute hardware, operating system, and data integrity recovery, then application and transaction integrity recovery, until "Now we're done!")
• Recovery Point Objective (RPO): how much data must be recreated?
• Recovery Time Objective (RTO) of hardware data integrity, followed by the RTO of transaction integrity
17
18. Still true: value of Automation for real-time failover
(The same recovery timeline as the previous chart, compressed by automation: assessment, physical facilities, telecom network, operating system, management control data, hardware recovery, then application and transaction recovery.)
• Recovery Point Objective (RPO): how much data must be recreated?
• RTO of hardware recovery, followed by the RTO of transaction integrity
Value of automation: reliability, repeatability, scalability, frequent testing
18
19. Still true: organize High Availability, Business Continuity technologies by balancing recovery time objective with cost / value
Recovery from a disk image vs. recovery from a tape copy:
• BC Tier 7 – Add server or storage replication with end-to-end automated server recovery
• BC Tier 6 – Add real-time continuous data replication, server or storage
• BC Tier 5 – Add application/database integration to backup/restore
• BC Tier 4 – Add point-in-time replication to backup/restore
• BC Tier 3 – VTL, data de-dup, remote vault
• BC Tier 2 – Tape libraries + automation
• BC Tier 1 – Restore from tape
(Vertical axis: Cost / Value. Horizontal axis, Recovery Time Objective, guidelines only: 15 min., 1-4 hr., 4-8 hr., 8-12 hr., 12-16 hr., 24 hr., days.)
19
20. Still true: Replication Technology Drives RPO
For example:
(Axis: weeks, days, hours, minutes, seconds before the outage = Recovery Point; seconds, minutes, hours, days, weeks after = Recovery Time.)
• Tape backup / periodic replication
• Asynchronous replication
• Synchronous replication / HA
20
21. Still true: Recovery Automation Drives Recovery Time
For example:
(Axis: weeks, days, hours, minutes, seconds before the outage = Recovery Point; seconds, minutes, hours, days, weeks after = Recovery Time. From fastest to slowest: end-to-end automated recovery, clustering, storage automation, manual tape restore.)
Recovery Time includes:
– Fault detection
– Recovering data
– Bringing applications back online
– Network access
(A worked RPO/RTO sketch follows this chart.)
21
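To make the relationship on charts 20 and 21 concrete, here is a minimal, hypothetical sketch in Python: the RPO follows from how far behind the chosen replication technology leaves the copy, while the RTO is the sum of the recovery steps listed above, which automation compresses. All step names, durations, and the automation factor are illustrative assumptions, not figures from this material.

```python
# Hypothetical illustration only: RPO comes from replication lag,
# RTO from summing the recovery steps; all numbers are made up.
replication_lag_minutes = {
    "tape_backup": 24 * 60,         # roughly the last nightly backup
    "asynchronous_replication": 5,  # seconds to minutes behind
    "synchronous_replication": 0,   # no committed data lost
}

recovery_steps_minutes = {          # fault detection .. network access
    "fault_detection": 30,
    "recover_data": 120,
    "bring_applications_online": 90,
    "network_access": 60,
}

def rto(steps, automation_factor=1.0):
    """Total recovery time; automation compresses every manual step."""
    return sum(duration * automation_factor for duration in steps.values())

print("RPO with async replication: %d min" % replication_lag_minutes["asynchronous_replication"])
print("RTO, manual recovery      : %d min" % rto(recovery_steps_minutes))
print("RTO, automated recovery   : %d min" % rto(recovery_steps_minutes, automation_factor=0.1))
```

The point of the sketch is the structure, not the numbers: better replication shrinks only the first quantity, and only automation shrinks the second.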
22. Still true: "ideal world" construct for IT High Availability and Business Continuity
Business processes drive strategies, and they are integral to the Continuity of Business Operations. A company cannot be resilient without having strategies for alternate workspace, staff members, call centers and communications channels.
(Diagram: a Resilience Program Management construct. Business prioritization and integration into IT are sustained by awareness, regular validation, change management, and quarterly management briefings. The program phases are: risk assessment; business impact analysis of outages, threats, and vulnerabilities; estimate of current recovery capabilities and RTO/RPO; program design; strategy; validation; implementation. The program covers crisis team design, business resumption, disaster recovery, and the high availability program, measured by a maturity model, ROI, and roadmap. High Availability design spans: 1. People, 2. Processes, 3. Plans, 4. Strategies, 5. Networks, 6. Platforms, 7. Facilities, across servers, storage, data replication, and database and software design.)
Source: IBM STG, IBM Global Services
22
23. The 2012 Bottom line: IT Business Continuity Planning Steps
For today's real-world environment, we need a faster way than even this simplified 2007 version:
1. Collect information for prioritization
2. Vulnerability, risk assessment, scope
3. Define BC targets based on scope
4. Solution option design and evaluation
5. Recommend solutions and products
6. Recommend strategy and roadmap
2012 key #1: to streamline this "ideal" process, you need a basic Data Strategy
2012 key #2: exploit Workload type
(Background: the same resilience program management construct as the previous chart.)
23
24. Streamlined BC Actions, 2005 version (Input, Step, Output)
1. Collect info for prioritization. Input: scope, resources, business processes, key performance indicators, IT inventory. Output: business impact, component effect on business processes.
2. Vulnerability / risk assessment. Input: list of vulnerabilities. Output: defined vulnerabilities.
3. Define desired HA/BC targets based on scope. Input: existing BC capability, targets, and success rate. Output: defined BC baseline, KPIs, targets, architecture, decision and success criteria.
4. Solution design and evaluation. Input: technologies and solution options. Output: business process segments and solutions.
5. Recommend solutions and products. Input: generic solutions that meet criteria. Output: recommended IBM solutions and benefits.
6. Recommend strategy and roadmap. Input: budget, major project milestones, resource availability, business process priority. Output: baseline Business Continuity strategy, roadmap, benefits, challenges, financial implications and justification.
24
25. Streamlined BC Actions, 2012 version
The same six steps, inputs, and outputs as the 2005 version, with two additions:
1. Collect info for prioritization (do a basic HA/DR Data Strategy here)
2. Vulnerability / risk assessment
3. Define desired HA/BC targets based on scope
4. Solution design and evaluation
5. Recommend solutions and products (exploit Workload Type here)
6. Recommend strategy and roadmap
25
26. How do we get there in 2012?
Bottom line #1: have a basic Data Strategy
Bottom line #2: Exploit Workload type
Storage Efficiency • Service Management • Data Protection
26
27. i.e. #1: It's all about the Data
Now, what do I mean by that?
27
28. What is a basic Data Strategy? Specify data usage over its lifespan
(Diagram: applications create data; information and data management; information archive / retain / delete. Frequency of access and use declines over time. A lifecycle sketch follows this chart.)
28
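As a sketch of what "specify data usage over its lifespan" could look like in practice, here is a hypothetical helper that maps a dataset's age since last access to a lifecycle action. The thresholds and actions are illustrative assumptions only, not recommendations from this material.

```python
from datetime import datetime, timedelta

# Hypothetical lifecycle policy: frequency of access declines over time,
# so older, colder data moves toward archive / retain / delete.
POLICY = [
    (timedelta(days=30),   "active: keep on primary storage (frequently accessed)"),
    (timedelta(days=365),  "manage: move to lower-cost / nearline storage"),
    (timedelta(days=2555), "archive: retain per compliance requirements"),
]

def lifecycle_action(last_access: datetime, now: datetime = None) -> str:
    """Return the lifecycle stage for a dataset based on its last access time."""
    now = now or datetime.utcnow()
    age = now - last_access
    for threshold, action in POLICY:
        if age <= threshold:
            return action
    return "delete: past retention, eligible for disposal"

# Example: a dataset last touched 400 days ago falls into the archive band
print(lifecycle_action(datetime.utcnow() - timedelta(days=400)))
```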
29. Data strategy = collecting information, prioritizing, vulnerability/risk, scope
Business processes drive strategies, and they are integral to the Continuity of Business Operations. A company cannot be resilient without having strategies for alternate workspace, staff members, call centers and communications channels.
(Diagram: the same resilience program management construct as chart 22, with the Data Strategy overlaid on the early phases: risk assessment, business impact analysis, prioritization, and scope.)
Source: IBM STG, IBM Global Services
29
30. Data Strategy Defined
Data Strategy: relationship to the Business and IT Strategies
(Diagram: the Business Strategy, with its business scope, distinct competencies, business governance, organization, infrastructure, and process, drives the IT Strategy, with its technology scope, system IT competencies, IT governance, IT infrastructure and processes. Business Strategies, the IT Strategy, the Data Strategy, the Enterprise IT Architecture, and the IT Infrastructure layer on one another; the IT infrastructure comprises people, process, technology, data, structure, skills, and tools.)
30
31. Data Strategy Defined
The role of the basic "Data Strategy" for HA / BC purposes
• Define major data types "good enough"
– i.e. by major application, by business line...
– An ongoing journey
• For each data type (you have to know your data):
– Usage
– Performance and measurement
– Security
– Availability
– Criticality
– Organizational role
– Who manages it
– What standards apply to this data: what type of storage it is deployed on, what database, what virtualization
• Be pragmatic (and have a basic strategy for it)
– Create a basic, "good enough" data strategy for HA/BC purposes
– Acquire tools that help you know your data
(Sidebar: Business Strategies, IT Strategy, Data Strategy, Enterprise IT Architecture, IT Infrastructure: people, data, process, technology, structure. An example inventory record follows this chart.)
31
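One pragmatic way to capture a "good enough" data strategy entry per data type is a simple inventory record. The sketch below (hypothetical, in Python) mirrors the bullets on this chart; the field names and the example values are assumptions for illustration.

```python
from dataclasses import dataclass

@dataclass
class DataTypeRecord:
    """One 'good enough' data-strategy entry for a major data type."""
    name: str               # by major application or business line
    usage: str
    performance: str        # performance and measurement
    security: str
    availability: str       # e.g. target RTO/RPO class
    criticality: str
    organizational_role: str
    owner: str              # who manages it
    storage_standard: str   # what type of storage it is deployed on
    database: str
    virtualization: str

# Hypothetical example entry
order_db = DataTypeRecord(
    name="Order management database",
    usage="OLTP, 24x7",
    performance="Sub-second response, measured hourly",
    security="Customer PII, encrypted at rest",
    availability="RPO = minutes, RTO < 2 hours",
    criticality="Mission critical",
    organizational_role="Revenue capture",
    owner="DBA team with line-of-business sponsor",
    storage_standard="Replicated enterprise disk",
    database="DB2",
    virtualization="Storage virtualization pool A",
)
print(order_db.name, "->", order_db.availability)
```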
32. Here's the major difference for 2012: there are two major types of workloads
• HA, Business Continuity, Disaster Recovery characteristics. Traditional IT workloads: HA/DR/BC can be done "agnostic / after the fact" using replication. Internet Scale workloads: HA/DR/BC must be "designed into the software stack from the beginning".
• Data Strategy. Traditional IT: use traditional tools/concepts to understand / know the data; storage/server virtualization and pooling. Internet Scale: a proven Open Source toolset to implement failure tolerance and redundancy in the application stack.
• Automation. Traditional IT: end-to-end automation of server / storage virtualization. Internet Scale: end-to-end automation of the application software stack providing failure tolerance.
• Commonality. Both: apply the master vision and lessons learned from internet scale data centers.
32
33. Choices for high availability and replication architectures
(Diagram: a production site and a geographic site, each with a geographic load balancer, site load balancer, web server clusters, application / DB server clusters, and disk storage. Options between the sites: workload balancer, application or database replication, server replication, and storage replication, plus local backup, point-in-time image, and tape backup at other site(s).)
33
34. Comparing IT BC architectural methods
(Diagram: the same production site / geographic site architecture as the previous chart, with workload balancer, application / database replication, server replication, and storage replication options, plus local backup, replication, point-in-time image, and tape at multiple sites.)
• Application / database / file system replication / workload balancer (file system, DB, and application aware)
– Typically requires the least bandwidth
– May be required if the scale of storage is very large (i.e. internet scale)
– Span of consistency is that application, database, or file system only
– Well understood by database, application, and file system administrators
– Can be a more complex implementation; must be implemented for each application
• Replication – Server (traditional IT)
– Well understood by operating system administrators
– Storage and application independent; uses server cycles
– Span of recovery is limited to that server platform
• Replication – Storage (traditional IT) (file system, DB, and application agnostic)
– Can provide common recovery across multiple application stacks and multiple server platforms
– Usually requires more bandwidth
– Requires a storage replication skill set
34
36. Internet Scale Workload Characteristics - 1
• Embarrassingly parallel Internet workload
– Immense data sets, but relatively independent records being processed
• Example: billions of web pages, billions of log / cookie / click entries
– Web requests from different users are essentially independent of each other
• Creating natural units of data partitioning and concurrency (i.e. very low inter-process communication; see the sketch after this chart)
• Lends itself well to cluster-level scheduling / load-balancing
– Independence = peak server performance is not important
– What's important is the aggregate throughput of 100,000s of servers
• Workload churn
– Well-defined, stable high-level APIs (i.e. simple URLs)
– Software release cycles on the order of every couple of weeks
• Means Google's entire core of search services was rewritten in 2 years
– Great for rapid innovation
• Expect significant software re-writes to fix problems on an ongoing basis
– New products emerge hyper-frequently
• Often with workload-altering characteristics, example = YouTube
36
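A minimal sketch of the "natural units of data partitioning and concurrency" point above: independent log or click records can be hash-partitioned by user so each worker processes its bucket with no inter-process communication. The records, the number of workers, and the partitioning key are hypothetical.

```python
from collections import defaultdict
import hashlib

# Hypothetical, independent click/log records: no record depends on another
records = [
    {"user": "u1", "url": "/a"}, {"user": "u2", "url": "/b"},
    {"user": "u3", "url": "/a"}, {"user": "u1", "url": "/c"},
]

NUM_WORKERS = 3

def partition(record):
    """Stable hash of the user id -> worker index (a natural unit of concurrency)."""
    digest = hashlib.md5(record["user"].encode()).hexdigest()
    return int(digest, 16) % NUM_WORKERS

buckets = defaultdict(list)
for rec in records:
    buckets[partition(rec)].append(rec)

for worker, recs in sorted(buckets.items()):
    # Each worker handles its bucket independently; what matters is
    # the aggregate throughput across all workers, not any single one.
    print("worker %d processes %d records" % (worker, len(recs)))
```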
37. Internet Scale Workload Characteristics - 2
• Platform homogeneity
– A single company owns, has the technical capability for, and runs the entire platform end-to-end, including an ecosystem
– Most Web applications are more homogeneous than traditional IT
– With an immense number of independent worldwide users
• Fault-free operation via application middleware (1% - 2% of all Internet requests fail*)
– Some type of failure every few hours, including software bugs
– All hidden from users by fault-tolerant middleware (users can't tell the difference between the Internet being down and your system being down)
– Means the hardware and software don't have to be perfect; hence 99% is good enough
• Immense scale:
– The workload can't be held within one server, or within a maximum-size, tightly-clustered, memory-shared SMP
– Requires clusters of 1,000s to 10,000s of servers with corresponding PBs of storage, network, power, cooling, and software
– The scale of compute power also makes possible apps such as Google Maps, Google Translate, Amazon Web Services EC2, Facebook, etc.
*The Data Center as a Computer: Introduction to Warehouse Scale Computing, p.81, Barroso, Holzle
http://www.morganclaypool.com/doi/pdf/10.2200/S00193ED1V01Y200905CAC006
37
38. IT architecture at internet scale
• Internet scale architectures' fundamental assumptions:
– Distributed aggregation of data
– High Availability / failure tolerance functionality is in software on the server
– Time to Market is everything
• Breakage = "OK" if I can insulate it from the user
– Affordability is everything
– Use open source software wherever possible
– Expect that something somewhere in the infrastructure will always be broken
– The infrastructure is designed top-to-bottom to address this
• Criteria: cost, plus the extremes of scale, parallelism, performance, real time, and time to market
• All other criteria are driven off of these
38
39. For Internet Scale workloads: an Open Source based internet-scale software stack
Example shown is the 2003-2008 Google version:
1. Google File System Architecture – GFS II
2. Google Database – Bigtable
3. Google Computation – MapReduce
4. Google Scheduling – GWQ
Reliability and redundancy are all in the "application stack"; the OS or HW doesn't do any of the redundancy
39
40. Internet-scale HA/DR/BC IT infrastructure for Internet Scale workloads
(Diagram: input from the Internet and your customers flows into racks of servers; each red block is an inexpensive server, plenty of power for its portion of the workflow.)
40
41. Warehouse Scale Computer programmer productivity framework example
• Hadoop – overall name of the software stack
• HDFS – Hadoop Distributed File System
• MapReduce – software compute framework (Map = queries, Reduce = aggregates answers; a small sketch follows this chart)
• Hive – Hadoop-based data warehouse
• Pig – Hadoop-based language
• HBase – non-relational database for fast lookups
• Flume – populates Hadoop with data
• Oozie – workflow processing system
• Whirr – libraries to spin up Hadoop on Amazon EC2, Rackspace, etc.
• Avro – data serialization
• Mahout – data mining
• Sqoop – connectivity to non-Hadoop data stores
• BigTop – packaging / interop of all Hadoop components
http://wikibon.org/wiki/v/Big_Data:_Hadoop%2C_Business_Analytics_and_Beyond
41
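To illustrate the "Map = queries, Reduce = aggregates answers" split named above, here is a small local simulation of the MapReduce pattern in plain Python. It is a sketch of the programming model only, not the Hadoop API, and the sample documents are invented.

```python
from collections import defaultdict

# Local, single-process simulation of MapReduce:
# map emits (key, value) pairs, the shuffle groups them by key,
# and reduce aggregates each group.
documents = ["storage replication storage", "backup restore backup backup"]

def map_phase(doc):
    """Map: emit (word, 1) for every word in the document."""
    for word in doc.split():
        yield (word, 1)

def reduce_phase(word, counts):
    """Reduce: aggregate the counts for one word."""
    return (word, sum(counts))

# Shuffle: group the intermediate pairs by key
grouped = defaultdict(list)
for doc in documents:
    for word, count in map_phase(doc):
        grouped[word].append(count)

results = [reduce_phase(word, counts) for word, counts in grouped.items()]
print(sorted(results))
# [('backup', 3), ('replication', 1), ('restore', 1), ('storage', 2)]
```

On a real cluster the same mapper/reducer pair would run distributed over data in HDFS, which is where the stack's built-in redundancy and scale come from.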
42. Summary: two major types of approaches, depending on workload type
• HA, Business Continuity, Disaster Recovery characteristics. Traditional IT workloads: HA/DR/BC can be done "agnostic / after the fact" using replication. Internet Scale workloads: HA/DR/BC must be "designed into the software stack from the beginning".
• Data Strategy. Traditional IT: use traditional tools/concepts to understand / know the data; storage/server virtualization and pooling. Internet Scale: a proven Open Source toolset to implement failure tolerance and redundancy in the application stack.
• Automation. Traditional IT: end-to-end automation of server / storage virtualization. Internet Scale: end-to-end automation of the application software stack providing failure tolerance.
• Commonality. Both: apply the master vision and lessons learned from internet scale data centers.
42
44. Key strategy: segment data into logical storage pools by appropriate Data Protection characteristics (animated chart)
From mission critical to lower cost:
• Continuous Availability (CA): end-to-end automation enhances RDR
– RTO = near continuous, RPO = as small as possible (Tier 7)
– Priority = uptime, with high-value justification
• Rapid Data Recovery (RDR): enhance backup/restore
– For data that requires it
– RTO = minutes to (approximate range) 2 to 6 hours
– BC Tiers 6, 4
– Balanced priorities = uptime and cost/value
• Backup/Restore (B/R): assure an efficient foundation
– Standardize the base backup/restore foundation
– Provide universal 24-hour to 12-hour (approx.) recovery capability
– Address requirements for archival, compliance, green energy
– Priority = cost
Enabled by virtualization. Know and categorize your data; this provides the foundation for affordable data protection. (A pool-classification sketch follows this chart.)
44
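As a sketch of how the three pools above might be applied, here is a hypothetical classifier that assigns a dataset to Continuous Availability, Rapid Data Recovery, or Backup/Restore from its required RTO. The thresholds simply restate the approximate ranges on this chart; the dataset names and RTO values are invented.

```python
def protection_pool(required_rto_hours: float) -> str:
    """Map a business RTO requirement to a data protection pool.

    Thresholds restate the approximate ranges on this chart:
    near-continuous -> CA (Tier 7), minutes to ~6 hours -> RDR (Tiers 6, 4),
    otherwise the ~12-24 hour backup/restore foundation.
    """
    if required_rto_hours <= 0.25:
        return "Continuous Availability (CA)"
    if required_rto_hours <= 6:
        return "Rapid Data Recovery (RDR)"
    return "Backup/Restore (B/R)"

# Hypothetical datasets and their required RTOs in hours
datasets = {"payments ledger": 0.1, "order history": 4, "marketing archive": 24}
for name, rto in datasets.items():
    print("%-18s -> %s" % (name, protection_pool(rto)))
```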
46. Consolidated virtualized systems become the Recoverable Units for IT Business Continuity (Virtualization)
(Diagram: a virtualized IT infrastructure supporting the business processes.)
Virtualized systems become the resource pools that enable the recoverability
46
47. High Availability, Business Continuity: a step-by-step virtualization journey
Balancing recovery time objective with cost / value; recovery from a disk image vs. recovery from a tape copy:
• BC Tier 7 – Add server or storage replication with end-to-end automated server recovery
• BC Tier 6 – Add real-time continuous data replication, server or storage
• BC Tier 5 – Add application/database integration to backup/restore
• BC Tier 4 – Add point-in-time replication to backup/restore
• BC Tier 3 – VTL, data de-dup, remote vault
• BC Tier 2 – Tape libraries + automation
• BC Tier 1 – Restore from tape
(Vertical axis: Cost / Value. Horizontal axis, Recovery Time Objective: 15 min., 1-4 hr., 4-8 hr., 8-12 hr., 12-16 hr., 24 hr., days.)
Foundation: storage pools
47
48. Storage Pools: apply the appropriate storage technology, and add automated failover to servers and replicated storage
• Real-time replication (storage or server or software)
• Periodic point-in-time replication: file system, point-in-time disk, VTL to VTL with de-dup
• Removable media: foundation backup/restore, physical or electronic transport
• Petabyte unstructured: due to usage and large scale, typically uses application-level intelligent redundancy / failure-toleration design, or file, application, or disk-to-disk periodic replication
48
49. Methodology: Traditional IT HA / BC / DR in stages, from the bottom up
• Foundation: standardized, automated tape backup (Tiers 1, 2)
• Foundation: electronic vaulting, automation, tape library (Tier 3): VTL, de-dup, and remote replication at the tape level (e.g. IBM ProtecTier, IBM Virtual Tape Library, IBM Tivoli Storage Manager backup/restore)
• Add: point-in-time copy, disk to disk, tiered storage (Tier 4) (e.g. IBM FlashCopy, SnapShot; IBM XIV, SVC, DS, SONAS; IBM Tivoli Storage Productivity Center 5.1)
(Diagram: SAN, disk, and VTL/de-dup at two sites; cost rises as the Recovery Time Objective shortens.)
49
50. Methodology: traditional IT HA / BC / DR in stages, from the bottom up (continued)
• Foundation: standardized, automated tape backup (Tiers 1, 2)
• Foundation: electronic vaulting, automation, tape library (Tier 3)
• Add: point-in-time copy, disk to disk for backup/restore (Tier 4)
• Automate applications and database for replication and automation (Tier 5): application integration (e.g. server virtualization, Tivoli FlashCopy Manager)
• Consolidate and implement real-time data availability (Tier 6): data replication between sites; if storage-based, Metro Mirror, Global Mirror, Hitachi UR on XIV, SVC, DS, or other storage, with TPC 5.1
• End-to-end automated site failover of servers, storage, and applications (Tier 7): automated application failover and dynamic server / storage integration (e.g. VMware, PowerHA on Power)
(Diagram: applications, servers, SAN, disk, and VTL/de-dup at two sites with data replication between them; cost rises as the Recovery Time Objective shortens.)
50
51. Technology Deployments in Cloud
(Diagram: a continuum of cloud deployment options, numbered 1 through 5 from private to public, serving Enterprises A, B, and C and their users:
• Private Cloud: client-managed implementation in the enterprise data center; internal or partner cloud services
• Managed Private Cloud: co-lo operated, in the enterprise data center
• Hosted Private Cloud: co-lo owned and operated; consumption models including client-owned and provider-owned assets; delivery options including client premise and hosted; Strategic Outsourcing clients with standardized services
• Shared Cloud Services: standardized, multi-tenant service; pay-per-usage model with provider-owned assets
• Public Cloud Services: provider-owned assets; pay-per-usage; finer granularity in the multi-tenancy model; supporting compute-centric workloads with persistent storage and compute cloud.)
51
52. Cloud as remote site deployment options
Production on premises, recovery in the Cloud:
• Real-time replication (storage or server or software)
• Periodic point-in-time replication: file system, point-in-time disk, VTL to VTL with de-dup, point-in-time copies, physical or electronic transport
• Petabyte unstructured: petabyte-level storage typically uses intelligent file or application replication, due to its large scale and usage patterns
52
53. Virtualized storage and data strategy: automated failover to a remote cloud
• Real-time replication (storage or server or software)
• Periodic point-in-time replication: file system, point-in-time disk, VTL to VTL with de-dup, point-in-time copies, physical or electronic transport, removable media
• Disk-to-disk replication
• Petabyte unstructured: petabyte-level storage typically uses intelligent file or application replication, due to its large scale and usage patterns
53
55. Cloud provider responsibility for HA and BC
Your production in the Cloud, recovery by the Cloud provider:
• Real-time replication (storage or server or software)
• Periodic point-in-time replication: file system, point-in-time disk, VTL to VTL with de-dup, point-in-time copies, physical or electronic transport
• Petabyte unstructured: petabyte-level storage typically uses intelligent file or application replication, due to its large scale and usage patterns
55
56. Today's world: High Availability, Business Continuity is a step-by-step data strategy / workload journey, with Cloud deployment if needed
Balancing recovery time objective with cost / value; recovery from a disk image vs. recovery from a tape copy:
• BC Tier 7 – Add server or storage replication with end-to-end automated server recovery
• BC Tier 6 – Add real-time continuous data replication, server or storage
• BC Tier 5 – Add application/database integration to backup/restore
• BC Tier 4 – Add point-in-time replication to backup/restore
• BC Tier 3 – VTL, data de-dup, remote vault
• BC Tier 2 – Tape libraries + automation
• BC Tier 1 – Restore from tape
(Vertical axis: Cost / Value. Horizontal axis, Recovery Time Objective: 15 min., 1-4 hr., 4-8 hr., 8-12 hr., 12-16 hr., 24 hr., days.)
Foundation: Data Strategy, Workload Types
56
57. Step by Step Virtualization, High Availability, Business Continuity data strategy, with Cloud deployment if needed
Balancing recovery time objective with cost / value; recovery from a disk image vs. recovery from a tape copy:
Continuous Availability:
• BC Tier 7 – Add server or storage replication with end-to-end automated server recovery
Rapid Data Recovery:
• BC Tier 6 – Add real-time continuous data replication, server or storage
• BC Tier 5 – Add application/database integration to backup/restore
• BC Tier 4 – Add point-in-time replication to backup/restore
Backup/Restore:
• BC Tier 3 – VTL, data de-dup, remote vault
• BC Tier 2 – Tape libraries + automation
• BC Tier 1 – Restore from tape
(Vertical axis: Cost / Value. Horizontal axis, Recovery Time Objective: 15 min., 1-4 hr., 4-8 hr., 8-12 hr., 12-16 hr., 24 hr., days.)
Foundation: Data Strategy, Workload types
57
58. Summary – IT High Availability / Business Continuity Best Practices 2012
• Continuous Availability: implement BC Tier 7; standardize the use of Continuous Availability automated failover
• Rapid Data Recovery: implement Tier 6, standardizing the high-volume data replication method; implement Tier 4, standardizing the use of disk-to-disk and point-in-time disk copy
• Backup / Restore: implement Tier 3; consolidate and standardize backup/restore methods; implement tape VTL, data de-dup, server / storage virtualization / management tools, and basic automation
• Production foundation: Backup/Restore Tiers 1, 2; storage and server virtualization and consolidation; understand my data; define the scope of recovery; data strategy; workload types
• Recovery foundation: replicated Backup/Restore Tiers 1, 2; SAN and server virtualization and consolidation; implement remote sites (Tiers 1, 2)
58
59. Summary
• Understand today's best practices for IT High Availability and IT Business Continuity (Data Strategy, Workload types)
• What has changed? What is the same?
– Principles for requirements = no change
• Data Strategy
– Deployment for true internet-scale workloads: application-level redundancy
• Strategies for:
– Requirements, design, implementation
– In-house vs. out-sourcing (Cloud deployment options)
• Step by step approach
– Automation and virtualization are essential
– Segment workloads: traditional vs. petabyte scale
– Exploiting Cloud
59
There are three primary aspects of providing business continuity for key applications and business processes: High Availability, Continuous Operations, and Disaster Recovery. Generally, the higher in the organization, the simpler the term to use. Senior execs are responsible for setting vision and strategy; mid-level management is more responsible for implementation. So you can get in the door with just "Business Continuity" at the senior level, but you need BC plus HA, CO, and DR to get in at the Manager or Director level. "Business Continuity" was preferred by senior IT executives and line-of-business titles. Lower IT titles preferred more detailed naming that spelled out the solution components; they wanted to make it relevant to their more limited responsibilities.
High Availability is the ability to provide access to applications. High availability is often provided by clustering solutions that work with operating systems, coupled with hardware infrastructure that has no single points of failure. If a server that is running an application suffers a failure, the application is picked up by another server in the cluster, and users see minimal or no interruption. Today's servers and storage systems are also built with fault-tolerant architectures to minimize application outages due to hardware failures. In addition, there are many aspects of security embedded in the hardware, from servers to storage to network components, to help protect against unauthorized access. You can think of high availability as resilient IT infrastructure that masks failures, and thus continues to provide access to applications.
Continuous Operations: Sometimes you must take important applications down for purposes of updating files or taking backups. Fortunately, great progress has been made in recent years in technology for online backups, but even with these advances, sometimes applications must be taken down as planned outages for maintenance or upgrading of servers or storage. You can think of continuous operations as the ability to keep things running when everything is working right, where you do not have to take applications down merely to do scheduled backups or planned maintenance.
Disaster Recovery is the ability to recover a data center at a different site if a disaster destroys the primary site or otherwise renders it inoperable. The characteristics of a disaster recovery solution are that processing resumes at a different site, and on different hardware. (A non-disaster problem, such as corruption of a key customer database, may indeed be a catastrophe for a business, but it is not a disaster, in this sense of the term, unless processing must be resumed at a different location and on different hardware.) You can think of disaster recovery as the ability to recover from unplanned outages at a different site, something you do after something has gone wrong.
Fortunately, some of the solutions that you can implement as preparedness for disaster recovery can also help with High Availability and with Continuous Operations. In this way, your investment in disaster recovery can help your operations even if you never suffer a disaster. The goal of business continuity is to protect critical business data, to make key applications available, and to enable operations to continue after a disaster. This must be done in such a way that recovery time is both predictable and reliable, and such that costs are predictable and manageable.
This animated chart is used to organize "who does what" in a recovery, and to define Recovery Time Objective (RTO) and Recovery Point Objective (RPO). Hardware (servers, storage) can only handle the blue portion of the recovery. All the other necessary processes are important; they are just outside the ability of the hardware/servers/storage to control. Hence they should be acknowledged as important, but treated as supplemental topics to be discussed with the Services team, and thus outside the scope of a storage-only or Tivoli-only discussion. It's good to use this chart to help the audience visually organize who does what, in what order, in a recovery.
This animation shows that the previous timeline still applies today. Automation simply makes the multiple steps of the Timeline of an IT Recovery consistent. Automation also provides an affordable way to handle testing and compliance of the Data Protection solution.
In summary, the animation shows the storage pool concept – mapped to the different technologies: (click) Backup/Restore (click) Rapid Data Recovery (click) Continuous Availability
This slide shows that technology only addresses RPO (i.e. how current is the data?). As we improve the technology, we improve RPO. Notice that RTO (Recovery Time Objective) is not driven by technology. (Next chart)
Here we see that automation drives the RTO (recovery time objective). Automation is what affects the RTO, because it addresses all the non-technology factors that take time.
First, let’s review important IBM 2009 messaging.
Rework title – All Information has a lifespan based on business value
Client Issue: How will technologies evolve to meet the needs of business continuity planning? Strategic Planning Assumption: Data replication for disaster recovery will increase in large enterprises from 25 percent in 2004 to 75 percent by 2006 (0.7 probability).
Example of Application / Database replication: DB2 Queue Replication URL: http://www-128.ibm.com/developerworks/db2/library/techarticle/dm-0503aschoff/
*The Data Center as a Computer: Introduction to Warehouse Scale Computing, p.81 Barroso, Holzle http://www.morganclaypool.com/doi/pdf/10.2200/S00193ED1V01Y200905CAC006
Speed of Decision Making: Data volumes have a major effect on "time to analysis" (i.e., the elapsed time between data reception, analysis, presentation, and decision-maker activities). There are four architectural options (i.e., CEP, OLTP/ODS, EDW, and big data), and big data is most appropriate when addressing slow decision cycles that are based on large data volumes. CEP's requirement for processing hundreds or thousands of transactions per second requires that the decision making be automated using models or business rules. OLTP and ODS support the operational reporting function in which decisions are made at human speed and based on recent data. The EDW — with the time to integrate data from disparate operational systems, process transformations, and compute aggregations — supports historic trend analysis and forecasting. Big data analysis enables the analysis of large volumes of data — larger than can be processed within the EDW — and so supports long-term/strategic and one-off transactional and behavioral analysis.
Processing Complexity: Processing complexity is the inverse of the speed of decision making. In general, CEP has a relatively simple processing model, although CEP often includes the application of behavioral models and business rules that require complex processing on historic data occurring in the EDW or big data analytics phases of the data-processing pipeline. The requirement to process unstructured data at real-time speeds — for example, in surveillance and intelligence applications — is changing this model. Processing complexity increases through OLTP, ODS, and EDW. Two trends are emerging: OLTP is beginning to include an analytics component within the business process and to utilize in-database analytics, and the EDW is exploiting the increasing computational power of the database engine. Processing complexities, and the associated data volumes, are so high within the big data analytics phase that parallel processing is the preferred architectural and algorithmic pattern.
Transactional Data Volumes: Transactional data volume is the amount of data (either the number of records/events or event size) processed within a single transaction or analysis operation. Modern internet-scale IT architectures process a huge number of discrete base events to compute sophisticated, high-value output. OLTP is similarly concerned with transactional or atomic events. Analysis, with its requirement to process many records simultaneously, starts with ODS, and its complexity grows within the EDW. Big data analytics — with the requirement to model long-term trends and customer behavior on Web clickstream data — processes even larger transactional data volumes.
Data Structure: The prevalence of non-structured data (semi-, quasi-, and unstructured) increases as the data-processing pipeline is traversed from CEP to big data. The EDW layer is increasingly becoming more heterogeneous as other, often non-structured, data sources are required by the analysis being undertaken. This is having a corresponding effect on processing complexity. The mining of structured data is advanced, and systems and products are optimized for this form of analysis. The mining of non-structured data (e.g., text analytics and image processing) is less well understood, computationally expensive, and often not integrated into the many commercially available analysis tools and packages. One of the primary uses of big data analysis is processing Web clickstream data, which is quasi-structured.
In addition, the data is not stored within databases; rather, it is collected and stored within files. Some examples of non-structured data that fit the big data definition include: log files, clickstream data, shopping cart data, social media data, call or support center logs, and telephone call data records (CDRs). There is an increasing requirement to process unstructured data at real-time speeds — for example in surveillance and intelligence applications — so this class of data is becoming more important in CEP processing.
Flexibility of Processing/Analysis: Data management stakeholders understand the processing and scheduling requirements of transactional processing and operational reporting. The stakeholder's ability to build analysis models is well proven. Peaks and troughs commonly occur across various time intervals (e.g., overnight batch processing window or peak holiday period), but these variations have been studied through trending and forecasting. Big data analysis and a growing percentage of EDW processing are ad hoc or one-off in nature. Data relationships may be poorly understood and require experimentation to refine the analysis. Big data analysis models ("analytic heroes") are continually being challenged by new or refined models ("challengers") to see which has better performance or yields better accuracy. The flexibility of such processing is high, and conversely, the governance that can be applied to such processing is low.
Throughput: Throughput, a measure of the degree of simultaneous execution of transactions, is high in transactional and reporting processing. The high data volumes and complex processing that characterize big data analysis are often hardware constrained and have a low concurrency. The scheduling of big data analysis processing is not time-critical. Big data analysis is therefore not suitable for real-time or near-real-time requirements.
Source for graphic: "InfoSphere Streams Architecture", Mike Spicer, Chief Architect, InfoSphere Streams, June 2, 2011. Source for quote: Dr. Steve Pratt, CenterPoint Energy, May 25, 2011, IBM Smarter Computing Summit, "Managing the Information Explosion" with Brian Truskowski, between 8:20 and 20:40, http://centerlinebeta.net/smarter-computing-palm-springs/index.html
http://wikibon.org/wiki/v/Big_Data:_Hadoop%2C_Business_Analytics_and_Beyond
A Hadoop "stack" is made up of a number of components. They include:
• Hadoop Distributed File System (HDFS): the default storage layer in any given Hadoop cluster.
• Name Node: the node in a Hadoop cluster that provides the client information on where in the cluster particular data is stored and if any nodes fail.
• Secondary Node: a backup to the Name Node, it periodically replicates and stores data from the Name Node should it fail.
• Job Tracker: the node in a Hadoop cluster that initiates and coordinates MapReduce jobs, or the processing of the data.
• Slave Nodes: the grunts of any Hadoop cluster, slave nodes store data and take direction to process it from the Job Tracker.
In addition to the above, the Hadoop ecosystem is made up of a number of complementary sub-projects. NoSQL data stores like Cassandra and HBase are also used to store the results of MapReduce jobs in Hadoop. In addition to Java, some MapReduce jobs and other Hadoop functions are written in Pig, an open source language designed specifically for Hadoop. Hive is an open source data warehouse originally developed by Facebook that allows for analytic modeling within Hadoop.
Following is a guide to Hadoop's components:
• Hadoop Distributed File System: HDFS, the storage layer of Hadoop, is a distributed, scalable, Java-based file system adept at storing large volumes of unstructured data.
• MapReduce: MapReduce is a software framework that serves as the compute layer of Hadoop. MapReduce jobs are divided into two (obviously named) parts. The "Map" function divides a query into multiple parts and processes data at the node level. The "Reduce" function aggregates the results of the "Map" function to determine the "answer" to the query.
• Hive: Hive is a Hadoop-based data warehouse developed by Facebook. It allows users to write queries in SQL, which are then converted to MapReduce. This allows SQL programmers with no MapReduce experience to use the warehouse and makes it easier to integrate with business intelligence and visualization tools such as Microstrategy, Tableau, Revolution Analytics, etc.
• Pig: Pig Latin is a Hadoop-based language developed by Yahoo. It is relatively easy to learn and is adept at very deep, very long data pipelines (a limitation of SQL).
• HBase: HBase is a non-relational database that allows for low-latency, quick lookups in Hadoop. It adds transactional capabilities to Hadoop, allowing users to conduct updates, inserts and deletes. eBay and Facebook use HBase heavily.
• Flume: Flume is a framework for populating Hadoop with data. Agents are populated throughout one's IT infrastructure – inside web servers, application servers and mobile devices, for example – to collect data and integrate it into Hadoop.
• Oozie: Oozie is a workflow processing system that lets users define a series of jobs written in multiple languages – such as MapReduce, Pig and Hive – then intelligently link them to one another. Oozie allows users to specify, for example, that a particular query is only to be initiated after specified previous jobs on which it relies for data are completed.
• Whirr: Whirr is a set of libraries that allows users to easily spin up Hadoop clusters on top of Amazon EC2, Rackspace or any virtual infrastructure. It supports all major virtualized infrastructure vendors on the market.
• Avro: Avro is a data serialization system that allows for encoding the schema of Hadoop files. It is adept at parsing data and performing remote procedure calls.
• Mahout: Mahout is a data mining library. It takes the most popular data mining algorithms for performing clustering, regression testing and statistical modeling and implements them using the MapReduce model.
• Sqoop: Sqoop is a connectivity tool for moving data from non-Hadoop data stores – such as relational databases and data warehouses – into Hadoop. It allows users to specify the target location inside of Hadoop and instruct Sqoop to move data from Oracle, Teradata or other relational databases to the target.
• BigTop: BigTop is an effort to create a more formal process or framework for packaging and interoperability testing of Hadoop's sub-projects and related components, with the goal of improving the Hadoop platform as a whole.
Understanding your data, and categorizing it by recovery time, is essential in order to build a cost-justifiable, affordable solution. Finally, not every client can justify near-continuous availability or rapid data recovery solutions. A balance between the priorities of uptime and cost, in concert with the needs of the business, is always necessary. For example, many clients may find that the appropriate cost/recovery time equation is that it is not necessary for the data at the remote site to be within seconds; the requirement is only for the data at the remote site to be no more than 12 hours old. These types of recoveries do not require ongoing, real-time consistent update of data at a remote site. Rather, only a periodic point-in-time copy needs to be made (on disk, or on tape for the lower tiers), and then the copies are simply replicated to a remote site. Server and workload restart is semi-automated or manual.
Data center complexity has reached crisis levels and is continuing to increase, thereby limiting improvement and growth. Businesses spend a large fraction of their IT budgets on data center resource management rather than on valuable applications and business processes. IT management costs are the dominant IT cost component today and have increased over the past ten years in rough proportion to increasing scale-out sprawl.
Basic forces will drive continuing increases in IT complexity. The number of systems deployed will continue to grow rapidly, driven largely by new applications (for Web 2.0, surveillance, operational asset management, and so on) and by improving hardware price/performance and utilization (more systems per server). The diversity of IT products will increase as competing suppliers continue to introduce new applications, systems, and management software products. The coupling of IT components is extensive and increasing, driven by application tiering, growing SOA usage, and advances in high-performance standard networks.
The resulting increase in IT complexity will further exacerbate the current IT management cost crisis. Managing the increasing IT complexity and scale-out sprawl with traditional IT management software will be increasingly difficult and costly. New approaches to Data Center Architectures are needed to simplify IT management and enable growth.
In summary, the animation shows the storage pool concept – mapped to the different technologies: (click) Backup/Restore (click) Rapid Data Recovery (click) Continuous Availability
This is an optional chart, showing the typical Information Availability System Storage technologies that we would apply to the various pools of storage (click) including the fact that large unstructured data probably needs to be recovered using file system or application involvement.
Here is another way of showing the same step by step, incremental-improve concept. It’s a ‘big picture’ positioning the various kinds of technologies that can be deployed, step by step, to provide IT BC solutions – starting from low and moving to the high end of the cost curve. Click to show each one of the steps to come up. Note how the icons show where the data flows, through different types of technologies that we will discuss further today.
Building upon the previous chart, we continue clicking to show enhancements to Rapid Data Recovery capabilities, followed by Continuous Availability capabilities. (This chart starts from where the previous chart left off.)
This is an optional chart, showing the typical Information Availability System Storage technologies that we would apply to the various pools of storage (click) including the fact that large unstructured data probably needs to be recovered using file system or application involvement.
This is an optional chart, showing the typical Information Availability System Storage technologies that we would apply to the various pools of storage (click) including the fact that large unstructured data probably needs to be recovered using file system or application involvement.
This is an optional chart, showing the typical Information Availability System Storage technologies that we would apply to the various pools of storage (click) including the fact that large unstructured data probably needs to be recovered using file system or application involvement.
This is an optional chart, showing the typical Information Availability System Storage technologies that we would apply to the various pools of storage (click) including the fact that large unstructured data probably needs to be recovered using file system or application involvement.
In summary, the animation shows the storage pool concept – mapped to the different technologies: (click) Backup/Restore (click) Rapid Data Recovery (click) Continuous Availability
In summary, the animation shows the storage pool concept – mapped to the general categories: (click) Backup/Restore (click) Rapid Data Recovery (click) Continuous Availability
Here's yet another way to look at this process. Each step of the process that we've reviewed is shown here in a step-by-step, build-up project visualization. In this case, we show how the Timeline of an IT Recovery is improved at each step.