How to Build Scalable Websites in the Cloud

How to BuildScalable Web Applicationsin the CloudReference Architectures and Best PracticesDecember 14, 2010

Your Panel Today Presenting: ,[object Object]

Brian Adler, Sr. Professional Services ArchitectQ&A: ,[object Object],Please use the questions window to ask questions any time!

Comparison of traditional and cloud resource models

Reference architecture for a scalable web application

Best practices for each tier of the reference architecture

Summary & conclusionsPlease use the questions window to ask questions any time!

Scalable Web Application What? An application built on an architecture that can adapt to changing conditions

Scalable Web Application What? An application layered on an architecture that can adapt to changing conditions Why? Traffic and load patterns are unpredictable Viral or flash-mob events can result in very dynamic conditions Availability and Reliability Application must be distributed to increase likelihood of end-user accessibility Overprovision Under-utilized resources == wasted $$$ Underprovision Missed opportunities – users unable to access your site/product Don’t be a victim of your own success

Load Balancing HAProxy + Apache Can handle SSL termination on the load balancer Connection statistics available via socket connection and status web page Each instance can handle a specific amount of traffic with no ramp-up time Each instance can only handle a specific amount of traffic Addition of load balancers is possible, but requires DNS modifications

Load Balancing HAProxy + Apache Can handle SSL termination on the load balancer Connection statistics available via socket connection and status web page Each instance can handle a specific amount of traffic with no ramp-up time Each instance can only handle a specific amount of traffic Addition of load balancers is possible, but requires DNS modifications Elastic Load Balancer (ELB) SSL termination is now supported Can scale to handle large amounts of traffic, but can be slow to ramp-up Options do exist for “pre-warming” the ELB Only need one – it will scale to accommodate traffic load Essentially a load balancing appliance No visibility into inner-workings and/or connection rates, statistics, failures, etc. (RightScale has a technical white paper on load balancing solutions that is available at www.RightScale.com/whitepapers)

Load Balancing Load Balancer + Application server Possible, and good for test and dev Not a best practice for a production environment Traffic spikes can cause instance to perform both load balancing and application functions…poorly

Load Balancing Load Balancer + Application server Possible, and good for test and dev Not a best practice for a production environment Traffic spikes can cause instance to perform both load balancing and application functions…poorly Recommendation: Minimum of two load balancers Each load balancer should be in a different availability zone (AZ) to increase reliability and availability RightScale testing has shown that m1.large is a good choice for load balancers Due to 100K-120K packet-per-second limit, larger instances do not provide much gain in throughput Roughly 5K responses/second can be handled by m1.large With the 5K threshold in mind, select the number of load balancers required to handle your peak traffic

Application Server Tier Puts the “scalable” in a scalable application True autoscaling a must in any dynamic/unpredictable environment

Application Server Tier Autoscaling Fully automated server launch based on autoscaling triggers No manual intervention (can be challenging in certain environments, i.e. Windows) Download and install application code from common repository to ensure identical configuration of all servers in the tier

Application Server Tier Autoscaling Fully automated server launch based on autoscaling triggers No manual intervention (can be challenging in certain environments, i.e. Windows) Download and install application code from common repository to ensure identical configuration of all servers in the tier Triggers Common CPU idle Free memory System load Custom Web server connections Application-specific metrics

Application Server Tier When to scale? Conservatively. Both up and down

Application Server Tier When to scale? Conservatively. Both up and down Up Allow adequate lead time for new servers to become operational Before system is negatively impacted Look for trends in activity and react early Worst that can happen: Charged for an extra instance hour

Application Server Tier When to scale? Conservatively. Both up and down Up Allow adequate lead time for new servers to become operational Before system is negatively impacted Look for trends in activity and react early Worst that can happen: Charged for an extra instance hour Down When system has been underutilized for a consistent, consecutive period of time Scale down fewer servers than in a scale-up event Again, only downside is an extra hour of instance charge Better safe than sorry

Application Server Tier Array considerations

Application Server Tier Array considerations Weight the array across all availability zones (not regions) Increases reliability of application NOTE: Traffic within an AZ on private IPs is free. Traffic between AZs incurs a per-gigabyte charge Traffic between regions is charged at public Internet rates

Application Server Tier Array considerations Weight the array across all availability zones (not regions) Increases reliability of application NOTE: Traffic within an AZ on private IPs is free. Traffic between AZs incurs a per-gigabyte charge Traffic between regions is charged at public Internet rates Set minimums and maximums appropriately Minimum can assist in cost savings in times of low usage Maximum can limit overall cost exposure

Application Server Tier Array considerations Weight the array across all availability zones (not regions) Increases reliability of application NOTE: Traffic within an AZ on private IPs is free. Traffic between AZs incurs a per-gigabyte charge Traffic between regions is charged at public Internet rates Set minimums and maximums appropriately Minimum can assist in cost savings in times of low usage Maximum can limit overall cost exposure Instance size m1.large is typically a good choice for array-based servers in a production environment m1.smalls (and even micro instances) can be used in test and development environments Every application is different, so run load tests and benchmarks to find the optimal solution for your environment

Application Server Tier Array considerations Weight the array across all availability zones (not regions) Increases reliability of application NOTE: Traffic within an AZ on private IPs is free. Traffic between AZs incurs a per-gigabyte charge Traffic between regions is charged at public Internet rates Set minimums and maximums appropriately Minimum can assist in cost savings in times of low usage Maximum can limit overall cost exposure Instance size m1.large is typically a good choice for array-based servers in a production environment m1.smalls (and even micro instances) can be used in test and development environments Every application is different, so run load tests and benchmarks to find the optimal solution for your environment Code Deployment Updated code can be pushed to all servers in an array via a single click of a button

Caching Tier Caching can dramatically decrease the load on the database Particularly in read-intensive applications Costs of caching Application complexity/modification Additional instance hours to support the cache

Caching Tier Best practice is to have a separate, dedicated caching tier Caching can be implemented on each application server Prevents the use of a distributed cache Local cache should only be used by the co-resident application server Application complexities Database performance degradation

Caching Tier Best practice is to have a separate, dedicated caching tier Caching can be implemented on each application server Prevents the use of a distributed cache Local cache should only be used by the co-resident application server Application complexities Database performance degradation Instance Size and Count Determine memory caching footprint Select instance size and count that spreads the load over several servers Prevents loss of entire cache if a single instance fails Distribute caching servers across AZs for reliability Overprovision if possible Provide capacity for system to grow to fully utilize cache (budget permitting)

Caching Tier Best practice is to have a separate, dedicated caching tier Caching can be implemented on each application server Prevents the use of a distributed cache Local cache should only be used by the co-resident application server Application complexities Database performance degradation Instance Size and Count Determine memory caching footprint Select instance size and count that spreads the load over several servers Prevents loss of entire cache if a single instance fails Distribute caching servers across AZs for reliability Overprovision if possible Provide capacity for system to grow to fully utilize cache (budget permitting) Manually scaling caching servers is possible but non-trivial Involves application and database performance degradation Time To Lives (TTLs) Always set to expire

Caching Tier Write-intensive applications Don’t see as large a performance gain as read-intensive apps

Caching Tier Write-intensive applications Don’t see as large a performance gain as read-intensive apps Third-party providers Vendor solutions exist that allow dynamic memcached scaling

Database Tier Numerous database architecture options exist No “one size fits all” solution, so testing and benchmarking are critical to determine proper configuration

Database Tier Masters and Slave(s) Multiple Slaves if budget permits Distribute Master and Slave(s) across AZs Always use same instance size for Master and Slaves

Database Tier Masters and Slave(s) Multiple Slaves if budget permits Distribute Master and Slave(s) across AZs Always use same instance size for Master and Slaves Data Storage EBS volumes for data store Never use ephemeral storage for persistent data Back up Master and Slaves frequently Upload snapshots to S3 or some other persistent, redundant storage

Database Tier Masters and Slave(s) Multiple Slaves if budget permits Distribute Master and Slave(s) across AZs Always use same instance size for Master and Slaves Data Storage EBS volumes for data store Never use ephemeral storage for persistent data Back up Master and Slaves frequently Upload snapshots to S3 or some other persistent, redundant storage Instance Size Varies greatly based on the nature of the application and site traffic Load testing and benchmarking can assist in identifying a reasonable initial size

Database Tier Relational Database Service (RDS) Database Appliance No access to instance (no visibility into CPU utilization, memory usage, slow-query logs, etc.) Requires scheduled downtime Announcement of multi-AZ functionality in May 2010 allows 24/7 operation Read replicas announced in October 2010

Database Scaling Vertical “Grow” or “shrink” a database server from one instance size to another

Database Scaling Vertical “Grow” or “shrink” a database server from one instance size to another Horizontal Add additional servers to spread the database load

Database Vertical/Horizontal Scaling

Horizontal Database Scaling Addition of read Slaves Effective for read-intensive applications Only writes need to access the master Replication lag to slaves must be considered

Horizontal Database Scaling Addition of read Slaves Effective for read-intensive applications Only writes need to access the master Replication lag to slaves must be considered Effective mechanism is to use MySQL Proxy

Horizontal Database Scaling Sharding Concept is to partition the database into distinct, non-overlapping pieces “Horizontal slicing” of the database tables into groups of rows Forethought required in setting up shards since cross-shard joins are resource intensive

Horizontal Database Scaling Before Sharding

Horizontal Database Scaling After Sharding

Horizontal Database Scaling Master-Master Two (or more) Master DBs Any Master can modify any database object Replication lag can result in database inconsistencies Poorly-designed applications can cause data object collisions and leave databases in indeterminate state Not a recommended best practice, nor supported by RightScale

Horizontal Database Scaling NoSQL solutions Many options exist – SimpleDB, Cassandra, Membase, CouchDB, MongoDB, Riak, etc. Basically a Key/Value store No complex operations between data objects (no relational operations) Multiple nodes can be used to implement a distributed data store Coordinated backup and recovery can be challenging Some RightScale customers are beginning to use some NoSQL solutions in specific use cases.

So What’s Best? As is typical in many technology discussions…

So What’s Best? As is typical in many technology discussions… “It depends”

So What’s Best? As is typical in many technology discussions… Many scalable environments share some common underlying architecture concepts “It depends”

How to Build Scalable Websites in the Cloud

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Destaque

Destaque (20)

Semelhante a How to Build Scalable Websites in the Cloud

Semelhante a How to Build Scalable Websites in the Cloud (20)

Mais de RightScale

Mais de RightScale (20)

Último

Último (20)

How to Build Scalable Websites in the Cloud