SlideShare a Scribd company logo
1 of 25
How RightScale Architects its Databases
    (for World-wide Scale, HA and DR scenarios)



Josep Blanquer
Senior Systems Architect, RightScale
2#




                                Menu
                                 Intro
                           Data Taxonomy
                         Data Storage Design
                            Scale, HA and DR

                             Conclusion




Talk with the Experts.
3#



Intro: Expectations and scope
                         What this is and what is not
   • IS a talk about:
        • how RightScale has designed and implemented its backing datastores
        • …for a few of the most representative internal systems
        • …with the rationale behind it
   • Is NOT a talk about
        • RightScale’s overall architecture
        • Nodes or hosts, it’s about Systems
        • RightScale’s data modeling

    Note: Most of the design is implemented and in production but some of the
     most advanced things that are still in beta, or we’re still being worked on


Talk with the Experts.
4#



Intro: Tools and Technologies
   • RightScale uses a mix of RDBMS and NoSQL technologies:
        • MySQL , Cassandra and S3 (for backups and archiving)


   • Transactionality:
        • MySQL: strong ACID properties
        • Cassandra: no Atomicity, eventually Consistent, some Isolation, Durable
   • Availability:
        • MySQL: async replication. Master-SlaveN or Master-Master
        • Cassandra: Distributed, master-less, highly-replicated (multi-DC)
   • Sharding:
        • MySQL: no explicit inter-node tools. (Sharding done by application)
        • Cassandra: partitions data internally across nodes.


Talk with the Experts.
5#



Taxonomy of RightScale’s Data
        Representative systems
     with different data semantics:
             Global Objects
                 Marketplace Assets
             Dashboard Objects
                 Audits
                 Tags
                 Recent Events
             Cloud Polling Data
             Routing Data
             Monitoring/Syslog
Talk with the Experts.
6#



Taxonomy of RightScale’s Data
        Representative systems
     with different data semantics:
             Global Objects
                 Marketplace Assets   Common across accounts:
                                        Users
             Dashboard Objects          Plans
                 Audits                Settings
                                        MultiCloud Marketplace:
                 Tags
                                              Published Assets
                 Recent Events               Sharing Groups
                                              …
             Cloud Polling Data
             Routing Data
             Monitoring/Syslog
Talk with the Experts.
7#



Taxonomy of RightScale’s Data
        Representative systems
     with different data semantics:
             Global Objects
                 Marketplace Assets
             Dashboard Objects
                 Audits               Private to each account:
                 Tags                  Deployments
                                        Imported assets
                 Recent Events         Alert Specifications
                                        Server Inputs
             Cloud Polling Data
                                          Audit
             Routing Data                 Tags
                                          User Events
             Monitoring/Syslog            …

Talk with the Experts.
8#



Taxonomy of RightScale’s Data
        Representative systems
     with different data semantics:
             Global Objects
                 Marketplace Assets
             Dashboard Objects
                 Audits               Private to each account:
                 Tags                  Cloud resource states (cache)
                                        Cloud credentials
                 Recent Events
             Cloud Polling Data
             Routing Data
             Monitoring/Syslog
Talk with the Experts.
9#



Taxonomy of RightScale’s Data
        Representative systems
     with different data semantics:
             Global Objects
                 Marketplace Assets
             Dashboard Objects
                 Audits
                                       Private to each account:
                 Tags                  Instance agents location
                 Recent Events         Core agents location
                                        Agent action registry
             Cloud Polling Data         …

             Routing Data
             Monitoring/Syslog
Talk with the Experts.
10#



Taxonomy of RightScale’s Data
        Representative systems
     with different data semantics:
             Global Objects
                 Marketplace Assets
             Dashboard Objects
                 Audits
                 Tags
                 Recent Events
                                       Private to each account:
             Cloud Polling Data         Collected metric data
                                        Collected syslog data
             Routing Data               …


             Monitoring/Syslog
Talk with the Experts.
11#



Taxonomy of RightScale’s Data

                                       Who uses the data?
              Global Objects           •   Users through the Dash/API
                 Marketplace Assets   •   Instances from the Cloud
  Users




              Dashboard Objects
                 Audits
                                       Data close to the Users
                 Tags
                 Recent Events
                                               Data Placement
              Cloud Polling Data
  Instances




              Routing Data             Data close to the Cloud
              Monitoring/Syslog
Talk with the Experts.
12#



Taxonomy of RightScale’s Data

                                           Which data do we need?
       X-acct




                 Global Objects            •   Data for all accounts
                    Marketplace Assets    •   Data for a single account

                 Dashboard Objects         Data shared between accounts
                    Audits
                    Tags                 Data scope and containment
       Account




                    Recent Events
                 Cloud Polling Data        Data required within scope
                                           of a single account
                 Routing Data
                 Monitoring/Syslog
Talk with the Experts.
13#



Taxonomy of RightScale’s Data
                        Who uses the data? Proximity to User vs. Cloud
                         Which data do we need? Scope of data available
              X-acct




                         Global Objects                        Close to user
                             Marketplace Assets               Globally accessible data
  Users




                         Dashboard Objects
                             Audits                           Close to user
                             Tags                             Account-shardable data
              Account




                             Recent Events
                         Cloud Polling Data
  Instances




                                                               Close to cloud resources
                         Routing Data                          Account-shardable* data
                         Monitoring/Syslog
Talk with the Experts.
14#

            X-Account    Account
Users
Instances




Talk with the Experts.
15#

            X-Account


                                                  Why custom? More control
                                                  •   Multiple sources
Users




              global
                                                  •   Individual columns
                             Custom replication   •   Apply transformations
                                                  •   Smart re-sync features




             Global: MySQL
              • ACID semantics
              • Master-Slave replication
Instances




Talk with the Experts.
16#

            X-Account               Account
                                                                         Data archive: S3
                                                         S3               • Low read rate
                                      tags                                • Globally accessible
Users




              global     dash                   audit

                                                          Other systems: Cassandra
                                     events                   •   Simpler Key-Value access
                                                              •   Great scalability
                                                              •   Great replica control
                                                              •   High write availability
                                                              •   Time-to-live expiration as cache
                         Dashboard: MySQL
                                                              •   Rows tagged by account
                         •   ACID semantics
                         •   Master-SlaveN replication
                         •   Slave reads
Instances




                         •   Rows tagged by account




Talk with the Experts.
17#

            X-Account           Account

                                                  S3
                                 tags                                   tags
Users




              global     dash             audit            dash                 audit


                                events                                 events




                                   So we can horizontally scale our
                                   dashboard by partitioning objects
                                      based on account groups:

                                           Clusters
Instances




Talk with the Experts.
18#

                           Account


                                              S3                                          S3                                              S3
                             tags                                        tags                                            tags




                                                                                                     Cluster N
        Cluster 1




                                                   Cluster 3
                                                                                               …
                    dash              audit                    dash               audit                          dash            audit


                            events                                      events                                          events




                                                                                               Features:
Users




                                                                                                   • 1 cluster: N accounts
                                                                                                   • 1 account: 1 home
                                                               RightScale Accounts
                                                                                                   • Migratable accounts

                                                                                               Benefits:
                                                                                                   • Great horizontal growth
                                                                  Account Set 2
                      Account Set 1                                                                • Better failure isolation
                                                                                                   • Independent scale
                                                                                                   • Load rebalancing
                                                                                                   • Versionable code
                                                                                                   • Differentiated service


Talk with the Experts.
19#

            X-Account               Account

                                                       S3
                                      tags                          tags
Users




              global     dash                  audit        dash            audit


                                    events                         events




                                             gateway
Instances




                                monitor


                                             routing




Talk with the Experts.
20#

            X-Account            Account

                                                      S3
                                   tags
                         dash And partition our cloud objects based on the cloud
Users




              global                        audit
                                       the instances of an account run on:

                                  events         Islands

                                           gateway                           gateway
Instances




                             monitor                          monitor


                                           routing                           routing



Talk with the Experts.
21#

                                        Account


                  gateway                                        gateway
            Gateway: MySQL                                       gateway                                     gateway

            •   Master-Slave replication                                                  Monitoring: Custom
                                                                                          • Replicated files




                                                                                                  Island N
            •
                Island 1




                                                      Island 2
                Can port to NoSQL easily
            •   Mostly a resource monitor
                                   cache                                        monitor
                                                                                monitor   • Backup to S3               monitor

            •   But cloud partitionable                                                   • Archive to S3
                              routing                            routing
                                                                 routing                                     routing
Instances




                                                                                          Features:
                                                                                            • 1 instance: 1 home island
                                                                                            • 1 Island can serve N clouds
                                                  Routing: Cassandra                        • Core Agents: global data
                                                  •   Simpler Key-Value access
                                                  •   Very high availability              Benefits:
                           Services co-located    •       Services co-located
                                                      Great scalability                     • Close Services co-located
                                                                                                    to cloud resources
                             with resources       •         with resources
                                                      Great replica control                           with resources
                                                  •   Plus cross DC replication*
                                                                                            • Good failure isolation
                                                                                                 • As good as cloud 
                                  Cloud 1                             Cloud 2                             Cloud N
                                                                                            • Good scale: global replicas
                                                                                              across cassandra DCs

Talk with the Experts.
22#

                                    Account


                                                           S3                                               S3                                                       S3
                                         tags                                              tags                                                   tags




                                                                                                                     Cluster N
            Cluster 1




                                                                Cluster 3
Users




                                                                                                                 …
                        dash                    audit                         dash                  audit                        dash                     audit


                                       events                                             events                                                 events



                                                                Different Geographies
                                                                    What if the cloud
                                                                    where the cluster
                                                                    is deployed on…
                                    gateway                                          Fails?
                                                                               gateway                                                      gateway
Instances




                                                                                                                                 Island N
                         Island 1




                                                                 Island 2




                                                 monitor                                      monitor                                                     monitor




                                    routing                                     routing                                                     routing




                                                                            Different Clouds
Talk with the Experts.
23#

                                    Account
                                                    Sister Clusters

                                                           S3                                          S3                                                  S3
                                         tags                                          tags                                             tags




                                                                                                                Cluster N
            Cluster 1




                                                                  Cluster 3
Users




                                                        Full replica
                                                                                                            …
                        dash                    audit                         dash             audit                        dash                audit


                                       events                                         events                                           events




                                                                              Features:
                                                                               • Each master has an extra remote slave
                                    gateway                                    gateway
                                                                               • Each cluster in a pair is a DC replica of the other’s
                                                                                                                       gateway
Instances




                                                                                 localring




                                                                                                                            Island N
                         Island 1




                                                                   Island 2




                                                 monitor                      At Disaster Recovery time:
                                                                                        monitor                                                 monitor

                                                                               • Apps are told to start serving an extra shard
                                    routing                                    •routing need to provision more infrastructure to recover
                                                                                  No                                      routing

                                                                                  (try to avoid since everybody is on the same boat)
                                                                               • New resources can be allocated over time to help
                                                                                  offload existing ones


Talk with the Experts.
24#



Conclusions
   • Shown that RightScale uses multiple database technologies:
        • RDBMS – MySQL for the ACID semantics and ‘queryability’
             • Using a Master to N-Slaves for RO scale, and quick failure recovery
             • And ReadOnly Provisioning – To increase RO availability and scale remote systems
        • NoSQL: Cassandra for Availability and Scalability
             • for higher Read/Write availability within a cluster
             • For fully replicated regions across the globe (for Read/Write!)

   • Shown how RightScale uses them in different techniques
        • It partitions resource data into Islands based on cloud proximity
             • Can achieve in-cloud polling,and keep monitoring/syslog data storage next to instances
             • Can provide routing availability, colocated with instances for any world region
        • It partitions core data into Clusters based on account groups
             • To scale the core horizontally, and independently and achieve account isolation/differentiation
             • Enhances fault isolation: Assigning accounts to Clusters deployed away their cloud resources
        • It maintains cluster pairs (sister sites)
             • To recover from full cloud region failures
             • It doesn’t require massive amounts of new resources to recover
Talk with the Experts.
Questions?




Talk with the Experts.

More Related Content

Similar to How RightScale Architects Its Own Databases for Worldwide Scale, HA, and DR Scenarios

RightScale Webinar: How RightScale Architects Its Databases (for Worldwide Sc...
RightScale Webinar: How RightScale Architects Its Databases (for Worldwide Sc...RightScale Webinar: How RightScale Architects Its Databases (for Worldwide Sc...
RightScale Webinar: How RightScale Architects Its Databases (for Worldwide Sc...RightScale
 
Klout changing landscape of social media
Klout changing landscape of social mediaKlout changing landscape of social media
Klout changing landscape of social mediaDataWorks Summit
 
Building RightScale's Globally Distributed Datastore - RightScale Compute 2013
Building RightScale's Globally Distributed Datastore - RightScale Compute 2013Building RightScale's Globally Distributed Datastore - RightScale Compute 2013
Building RightScale's Globally Distributed Datastore - RightScale Compute 2013RightScale
 
Digital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdfDigital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdfssuserd23711
 
Telco analytics at scale
Telco analytics at scaleTelco analytics at scale
Telco analytics at scaledatamantra
 
How Klout is changing the landscape of social media with Hadoop and BI
How Klout is changing the landscape of social media with Hadoop and BIHow Klout is changing the landscape of social media with Hadoop and BI
How Klout is changing the landscape of social media with Hadoop and BIDenny Lee
 
Using Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SFUsing Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SFAmazon Web Services
 
Implementing Big Data at the Speed of Business
Implementing Big Data at the Speed of BusinessImplementing Big Data at the Speed of Business
Implementing Big Data at the Speed of BusinessDataWorks Summit
 
ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWSACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWSAWS User Group Kochi
 
16h00 globant - aws globant-big-data_summit2012
16h00   globant - aws globant-big-data_summit201216h00   globant - aws globant-big-data_summit2012
16h00 globant - aws globant-big-data_summit2012infolive
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Amazon Web Services
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Amazon Web Services
 
The AWS Big Data Platform – Overview
The AWS Big Data Platform – OverviewThe AWS Big Data Platform – Overview
The AWS Big Data Platform – OverviewAmazon Web Services
 
Data & analytics challenges in a microservice architecture
Data & analytics challenges in a microservice architectureData & analytics challenges in a microservice architecture
Data & analytics challenges in a microservice architectureNiels Naglé
 

Similar to How RightScale Architects Its Own Databases for Worldwide Scale, HA, and DR Scenarios (20)

RightScale Webinar: How RightScale Architects Its Databases (for Worldwide Sc...
RightScale Webinar: How RightScale Architects Its Databases (for Worldwide Sc...RightScale Webinar: How RightScale Architects Its Databases (for Worldwide Sc...
RightScale Webinar: How RightScale Architects Its Databases (for Worldwide Sc...
 
Analytics&IoT
Analytics&IoTAnalytics&IoT
Analytics&IoT
 
Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3
 
Klout changing landscape of social media
Klout changing landscape of social mediaKlout changing landscape of social media
Klout changing landscape of social media
 
Building RightScale's Globally Distributed Datastore - RightScale Compute 2013
Building RightScale's Globally Distributed Datastore - RightScale Compute 2013Building RightScale's Globally Distributed Datastore - RightScale Compute 2013
Building RightScale's Globally Distributed Datastore - RightScale Compute 2013
 
Digital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdfDigital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdf
 
Telco analytics at scale
Telco analytics at scaleTelco analytics at scale
Telco analytics at scale
 
How Klout is changing the landscape of social media with Hadoop and BI
How Klout is changing the landscape of social media with Hadoop and BIHow Klout is changing the landscape of social media with Hadoop and BI
How Klout is changing the landscape of social media with Hadoop and BI
 
Using Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SFUsing Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SF
 
Implementing Big Data at the Speed of Business
Implementing Big Data at the Speed of BusinessImplementing Big Data at the Speed of Business
Implementing Big Data at the Speed of Business
 
Using Data Lakes
Using Data LakesUsing Data Lakes
Using Data Lakes
 
ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWSACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
 
Globant and Big Data on AWS
Globant and Big Data on AWSGlobant and Big Data on AWS
Globant and Big Data on AWS
 
16h00 globant - aws globant-big-data_summit2012
16h00   globant - aws globant-big-data_summit201216h00   globant - aws globant-big-data_summit2012
16h00 globant - aws globant-big-data_summit2012
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
 
RavenDB overview
RavenDB overviewRavenDB overview
RavenDB overview
 
Using Data Lakes
Using Data Lakes Using Data Lakes
Using Data Lakes
 
The AWS Big Data Platform – Overview
The AWS Big Data Platform – OverviewThe AWS Big Data Platform – Overview
The AWS Big Data Platform – Overview
 
Data & analytics challenges in a microservice architecture
Data & analytics challenges in a microservice architectureData & analytics challenges in a microservice architecture
Data & analytics challenges in a microservice architecture
 

More from RightScale

10 Must-Have Automated Cloud Policies for IT Governance
10 Must-Have Automated Cloud Policies for IT Governance10 Must-Have Automated Cloud Policies for IT Governance
10 Must-Have Automated Cloud Policies for IT GovernanceRightScale
 
Kubernetes and Terraform in the Cloud: How RightScale Does DevOps
Kubernetes and Terraform in the Cloud: How RightScale Does DevOpsKubernetes and Terraform in the Cloud: How RightScale Does DevOps
Kubernetes and Terraform in the Cloud: How RightScale Does DevOpsRightScale
 
Optimize Software, SaaS, and Cloud with Flexera and RightScale
Optimize Software, SaaS, and Cloud with Flexera and RightScaleOptimize Software, SaaS, and Cloud with Flexera and RightScale
Optimize Software, SaaS, and Cloud with Flexera and RightScaleRightScale
 
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About Now
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About NowPrepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About Now
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About NowRightScale
 
How to Set Up a Cloud Cost Optimization Process for your Enterprise
How to Set Up a Cloud Cost Optimization Process for your EnterpriseHow to Set Up a Cloud Cost Optimization Process for your Enterprise
How to Set Up a Cloud Cost Optimization Process for your EnterpriseRightScale
 
Multi-Cloud Management with RightScale CMP (Demo)
Multi-Cloud Management with RightScale CMP (Demo)Multi-Cloud Management with RightScale CMP (Demo)
Multi-Cloud Management with RightScale CMP (Demo)RightScale
 
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBM
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBMComparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBM
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBMRightScale
 
How to Allocate and Report Cloud Costs with RightScale Optima
How to Allocate and Report Cloud Costs with RightScale OptimaHow to Allocate and Report Cloud Costs with RightScale Optima
How to Allocate and Report Cloud Costs with RightScale OptimaRightScale
 
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...RightScale
 
Using RightScale CMP with Cloud Provider Tools
Using RightScale CMP with Cloud Provider ToolsUsing RightScale CMP with Cloud Provider Tools
Using RightScale CMP with Cloud Provider ToolsRightScale
 
Best Practices for Multi-Cloud Security and Compliance
Best Practices for Multi-Cloud Security and ComplianceBest Practices for Multi-Cloud Security and Compliance
Best Practices for Multi-Cloud Security and ComplianceRightScale
 
Automating Multi-Cloud Policies for AWS, Azure, Google, and More
Automating Multi-Cloud Policies for AWS, Azure, Google, and MoreAutomating Multi-Cloud Policies for AWS, Azure, Google, and More
Automating Multi-Cloud Policies for AWS, Azure, Google, and MoreRightScale
 
The 5 Stages of Cloud Management for Enterprises
The 5 Stages of Cloud Management for EnterprisesThe 5 Stages of Cloud Management for Enterprises
The 5 Stages of Cloud Management for EnterprisesRightScale
 
9 Ways to Reduce Cloud Storage Costs
9 Ways to Reduce Cloud Storage Costs9 Ways to Reduce Cloud Storage Costs
9 Ways to Reduce Cloud Storage CostsRightScale
 
Serverless Comparison: AWS vs Azure vs Google vs IBM
Serverless Comparison: AWS vs Azure vs Google vs IBMServerless Comparison: AWS vs Azure vs Google vs IBM
Serverless Comparison: AWS vs Azure vs Google vs IBMRightScale
 
Best Practices for Cloud Managed Services Providers: The Path to CMP Success
Best Practices for Cloud Managed Services Providers: The Path to CMP SuccessBest Practices for Cloud Managed Services Providers: The Path to CMP Success
Best Practices for Cloud Managed Services Providers: The Path to CMP SuccessRightScale
 
Cloud Storage Comparison: AWS vs Azure vs Google vs IBM
Cloud Storage Comparison: AWS vs Azure vs Google vs IBMCloud Storage Comparison: AWS vs Azure vs Google vs IBM
Cloud Storage Comparison: AWS vs Azure vs Google vs IBMRightScale
 
2018 Cloud Trends: RightScale State of the Cloud Report
2018 Cloud Trends: RightScale State of the Cloud Report2018 Cloud Trends: RightScale State of the Cloud Report
2018 Cloud Trends: RightScale State of the Cloud ReportRightScale
 
Got a Multi-Cloud Strategy? How RightScale CMP Helps
Got a Multi-Cloud Strategy? How RightScale CMP HelpsGot a Multi-Cloud Strategy? How RightScale CMP Helps
Got a Multi-Cloud Strategy? How RightScale CMP HelpsRightScale
 
How to Manage Cloud Costs with RightScale Optima
How to Manage Cloud Costs with RightScale OptimaHow to Manage Cloud Costs with RightScale Optima
How to Manage Cloud Costs with RightScale OptimaRightScale
 

More from RightScale (20)

10 Must-Have Automated Cloud Policies for IT Governance
10 Must-Have Automated Cloud Policies for IT Governance10 Must-Have Automated Cloud Policies for IT Governance
10 Must-Have Automated Cloud Policies for IT Governance
 
Kubernetes and Terraform in the Cloud: How RightScale Does DevOps
Kubernetes and Terraform in the Cloud: How RightScale Does DevOpsKubernetes and Terraform in the Cloud: How RightScale Does DevOps
Kubernetes and Terraform in the Cloud: How RightScale Does DevOps
 
Optimize Software, SaaS, and Cloud with Flexera and RightScale
Optimize Software, SaaS, and Cloud with Flexera and RightScaleOptimize Software, SaaS, and Cloud with Flexera and RightScale
Optimize Software, SaaS, and Cloud with Flexera and RightScale
 
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About Now
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About NowPrepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About Now
Prepare Your Enterprise Cloud Strategy for 2019: 7 Things to Think About Now
 
How to Set Up a Cloud Cost Optimization Process for your Enterprise
How to Set Up a Cloud Cost Optimization Process for your EnterpriseHow to Set Up a Cloud Cost Optimization Process for your Enterprise
How to Set Up a Cloud Cost Optimization Process for your Enterprise
 
Multi-Cloud Management with RightScale CMP (Demo)
Multi-Cloud Management with RightScale CMP (Demo)Multi-Cloud Management with RightScale CMP (Demo)
Multi-Cloud Management with RightScale CMP (Demo)
 
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBM
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBMComparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBM
Comparing Cloud VM Types and Prices: AWS vs Azure vs Google vs IBM
 
How to Allocate and Report Cloud Costs with RightScale Optima
How to Allocate and Report Cloud Costs with RightScale OptimaHow to Allocate and Report Cloud Costs with RightScale Optima
How to Allocate and Report Cloud Costs with RightScale Optima
 
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
Should You Move Between AWS, Azure, or Google Clouds? Considerations, Pros an...
 
Using RightScale CMP with Cloud Provider Tools
Using RightScale CMP with Cloud Provider ToolsUsing RightScale CMP with Cloud Provider Tools
Using RightScale CMP with Cloud Provider Tools
 
Best Practices for Multi-Cloud Security and Compliance
Best Practices for Multi-Cloud Security and ComplianceBest Practices for Multi-Cloud Security and Compliance
Best Practices for Multi-Cloud Security and Compliance
 
Automating Multi-Cloud Policies for AWS, Azure, Google, and More
Automating Multi-Cloud Policies for AWS, Azure, Google, and MoreAutomating Multi-Cloud Policies for AWS, Azure, Google, and More
Automating Multi-Cloud Policies for AWS, Azure, Google, and More
 
The 5 Stages of Cloud Management for Enterprises
The 5 Stages of Cloud Management for EnterprisesThe 5 Stages of Cloud Management for Enterprises
The 5 Stages of Cloud Management for Enterprises
 
9 Ways to Reduce Cloud Storage Costs
9 Ways to Reduce Cloud Storage Costs9 Ways to Reduce Cloud Storage Costs
9 Ways to Reduce Cloud Storage Costs
 
Serverless Comparison: AWS vs Azure vs Google vs IBM
Serverless Comparison: AWS vs Azure vs Google vs IBMServerless Comparison: AWS vs Azure vs Google vs IBM
Serverless Comparison: AWS vs Azure vs Google vs IBM
 
Best Practices for Cloud Managed Services Providers: The Path to CMP Success
Best Practices for Cloud Managed Services Providers: The Path to CMP SuccessBest Practices for Cloud Managed Services Providers: The Path to CMP Success
Best Practices for Cloud Managed Services Providers: The Path to CMP Success
 
Cloud Storage Comparison: AWS vs Azure vs Google vs IBM
Cloud Storage Comparison: AWS vs Azure vs Google vs IBMCloud Storage Comparison: AWS vs Azure vs Google vs IBM
Cloud Storage Comparison: AWS vs Azure vs Google vs IBM
 
2018 Cloud Trends: RightScale State of the Cloud Report
2018 Cloud Trends: RightScale State of the Cloud Report2018 Cloud Trends: RightScale State of the Cloud Report
2018 Cloud Trends: RightScale State of the Cloud Report
 
Got a Multi-Cloud Strategy? How RightScale CMP Helps
Got a Multi-Cloud Strategy? How RightScale CMP HelpsGot a Multi-Cloud Strategy? How RightScale CMP Helps
Got a Multi-Cloud Strategy? How RightScale CMP Helps
 
How to Manage Cloud Costs with RightScale Optima
How to Manage Cloud Costs with RightScale OptimaHow to Manage Cloud Costs with RightScale Optima
How to Manage Cloud Costs with RightScale Optima
 

Recently uploaded

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

How RightScale Architects Its Own Databases for Worldwide Scale, HA, and DR Scenarios

  • 1. How RightScale Architects its Databases (for World-wide Scale, HA and DR scenarios) Josep Blanquer Senior Systems Architect, RightScale
  • 2. 2# Menu Intro Data Taxonomy Data Storage Design Scale, HA and DR Conclusion Talk with the Experts.
  • 3. 3# Intro: Expectations and scope What this is and what is not • IS a talk about: • how RightScale has designed and implemented its backing datastores • …for a few of the most representative internal systems • …with the rationale behind it • Is NOT a talk about • RightScale’s overall architecture • Nodes or hosts, it’s about Systems • RightScale’s data modeling Note: Most of the design is implemented and in production but some of the most advanced things that are still in beta, or we’re still being worked on Talk with the Experts.
  • 4. 4# Intro: Tools and Technologies • RightScale uses a mix of RDBMS and NoSQL technologies: • MySQL , Cassandra and S3 (for backups and archiving) • Transactionality: • MySQL: strong ACID properties • Cassandra: no Atomicity, eventually Consistent, some Isolation, Durable • Availability: • MySQL: async replication. Master-SlaveN or Master-Master • Cassandra: Distributed, master-less, highly-replicated (multi-DC) • Sharding: • MySQL: no explicit inter-node tools. (Sharding done by application) • Cassandra: partitions data internally across nodes. Talk with the Experts.
  • 5. 5# Taxonomy of RightScale’s Data Representative systems with different data semantics: Global Objects  Marketplace Assets Dashboard Objects  Audits  Tags  Recent Events Cloud Polling Data Routing Data Monitoring/Syslog Talk with the Experts.
  • 6. 6# Taxonomy of RightScale’s Data Representative systems with different data semantics: Global Objects  Marketplace Assets Common across accounts:  Users Dashboard Objects  Plans  Audits  Settings  MultiCloud Marketplace:  Tags  Published Assets  Recent Events  Sharing Groups  … Cloud Polling Data Routing Data Monitoring/Syslog Talk with the Experts.
  • 7. 7# Taxonomy of RightScale’s Data Representative systems with different data semantics: Global Objects  Marketplace Assets Dashboard Objects  Audits Private to each account:  Tags  Deployments  Imported assets  Recent Events  Alert Specifications  Server Inputs Cloud Polling Data  Audit Routing Data  Tags  User Events Monitoring/Syslog  … Talk with the Experts.
  • 8. 8# Taxonomy of RightScale’s Data Representative systems with different data semantics: Global Objects  Marketplace Assets Dashboard Objects  Audits Private to each account:  Tags  Cloud resource states (cache)  Cloud credentials  Recent Events Cloud Polling Data Routing Data Monitoring/Syslog Talk with the Experts.
  • 9. 9# Taxonomy of RightScale’s Data Representative systems with different data semantics: Global Objects  Marketplace Assets Dashboard Objects  Audits Private to each account:  Tags  Instance agents location  Recent Events  Core agents location  Agent action registry Cloud Polling Data  … Routing Data Monitoring/Syslog Talk with the Experts.
  • 10. 10# Taxonomy of RightScale’s Data Representative systems with different data semantics: Global Objects  Marketplace Assets Dashboard Objects  Audits  Tags  Recent Events Private to each account: Cloud Polling Data  Collected metric data  Collected syslog data Routing Data  … Monitoring/Syslog Talk with the Experts.
  • 11. 11# Taxonomy of RightScale’s Data Who uses the data? Global Objects • Users through the Dash/API  Marketplace Assets • Instances from the Cloud Users Dashboard Objects  Audits Data close to the Users  Tags  Recent Events Data Placement Cloud Polling Data Instances Routing Data Data close to the Cloud Monitoring/Syslog Talk with the Experts.
  • 12. 12# Taxonomy of RightScale’s Data Which data do we need? X-acct Global Objects • Data for all accounts  Marketplace Assets • Data for a single account Dashboard Objects Data shared between accounts  Audits  Tags Data scope and containment Account  Recent Events Cloud Polling Data Data required within scope of a single account Routing Data Monitoring/Syslog Talk with the Experts.
  • 13. 13# Taxonomy of RightScale’s Data Who uses the data? Proximity to User vs. Cloud Which data do we need? Scope of data available X-acct Global Objects Close to user  Marketplace Assets Globally accessible data Users Dashboard Objects  Audits Close to user  Tags Account-shardable data Account  Recent Events Cloud Polling Data Instances Close to cloud resources Routing Data Account-shardable* data Monitoring/Syslog Talk with the Experts.
  • 14. 14# X-Account Account Users Instances Talk with the Experts.
  • 15. 15# X-Account Why custom? More control • Multiple sources Users global • Individual columns Custom replication • Apply transformations • Smart re-sync features Global: MySQL • ACID semantics • Master-Slave replication Instances Talk with the Experts.
  • 16. 16# X-Account Account Data archive: S3 S3 • Low read rate tags • Globally accessible Users global dash audit Other systems: Cassandra events • Simpler Key-Value access • Great scalability • Great replica control • High write availability • Time-to-live expiration as cache Dashboard: MySQL • Rows tagged by account • ACID semantics • Master-SlaveN replication • Slave reads Instances • Rows tagged by account Talk with the Experts.
  • 17. 17# X-Account Account S3 tags tags Users global dash audit dash audit events events So we can horizontally scale our dashboard by partitioning objects based on account groups: Clusters Instances Talk with the Experts.
  • 18. 18# Account S3 S3 S3 tags tags tags Cluster N Cluster 1 Cluster 3 … dash audit dash audit dash audit events events events Features: Users • 1 cluster: N accounts • 1 account: 1 home RightScale Accounts • Migratable accounts Benefits: • Great horizontal growth Account Set 2 Account Set 1 • Better failure isolation • Independent scale • Load rebalancing • Versionable code • Differentiated service Talk with the Experts.
  • 19. 19# X-Account Account S3 tags tags Users global dash audit dash audit events events gateway Instances monitor routing Talk with the Experts.
  • 20. 20# X-Account Account S3 tags dash And partition our cloud objects based on the cloud Users global audit the instances of an account run on: events Islands gateway gateway Instances monitor monitor routing routing Talk with the Experts.
  • 21. 21# Account gateway gateway Gateway: MySQL gateway gateway • Master-Slave replication Monitoring: Custom • Replicated files Island N • Island 1 Island 2 Can port to NoSQL easily • Mostly a resource monitor cache monitor monitor • Backup to S3 monitor • But cloud partitionable • Archive to S3 routing routing routing routing Instances Features: • 1 instance: 1 home island • 1 Island can serve N clouds Routing: Cassandra • Core Agents: global data • Simpler Key-Value access • Very high availability Benefits: Services co-located • Services co-located Great scalability • Close Services co-located to cloud resources with resources • with resources Great replica control with resources • Plus cross DC replication* • Good failure isolation • As good as cloud  Cloud 1 Cloud 2 Cloud N • Good scale: global replicas across cassandra DCs Talk with the Experts.
  • 22. 22# Account S3 S3 S3 tags tags tags Cluster N Cluster 1 Cluster 3 Users … dash audit dash audit dash audit events events events Different Geographies What if the cloud where the cluster is deployed on… gateway Fails? gateway gateway Instances Island N Island 1 Island 2 monitor monitor monitor routing routing routing Different Clouds Talk with the Experts.
  • 23. 23# Account Sister Clusters S3 S3 S3 tags tags tags Cluster N Cluster 1 Cluster 3 Users Full replica … dash audit dash audit dash audit events events events Features: • Each master has an extra remote slave gateway gateway • Each cluster in a pair is a DC replica of the other’s gateway Instances localring Island N Island 1 Island 2 monitor At Disaster Recovery time: monitor monitor • Apps are told to start serving an extra shard routing •routing need to provision more infrastructure to recover No routing (try to avoid since everybody is on the same boat) • New resources can be allocated over time to help offload existing ones Talk with the Experts.
  • 24. 24# Conclusions • Shown that RightScale uses multiple database technologies: • RDBMS – MySQL for the ACID semantics and ‘queryability’ • Using a Master to N-Slaves for RO scale, and quick failure recovery • And ReadOnly Provisioning – To increase RO availability and scale remote systems • NoSQL: Cassandra for Availability and Scalability • for higher Read/Write availability within a cluster • For fully replicated regions across the globe (for Read/Write!) • Shown how RightScale uses them in different techniques • It partitions resource data into Islands based on cloud proximity • Can achieve in-cloud polling,and keep monitoring/syslog data storage next to instances • Can provide routing availability, colocated with instances for any world region • It partitions core data into Clusters based on account groups • To scale the core horizontally, and independently and achieve account isolation/differentiation • Enhances fault isolation: Assigning accounts to Clusters deployed away their cloud resources • It maintains cluster pairs (sister sites) • To recover from full cloud region failures • It doesn’t require massive amounts of new resources to recover Talk with the Experts.