SlideShare uma empresa Scribd logo
1 de 52
Baixar para ler offline
Scalability and
database virtualization
How virtualizing your databases improves
performance, and lowers costs
New York City MySQL Meetup, October 3, 2013
What’s this presentation about?
• Scalability and the database tier
•
•
•
•
•
•

What’s the problem?
How did we get here?
Some proposed solutions
What are parallel databases?
What’s ParElastic?
How do I get ParElastic?

• Q&A

October 3, 2013

Tweet this presentation
#parelastic

Scalability and the database tier | NYC MySQL Meetup

2
What is the scalability
problem?

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

3
What is the scalability problem?
• Has many faces
•
•
•
•

Connections and Concurrency
Data Volume and Retention Period
Databases and Tenants
Read vs. Write

• Your problem(s)
• May be more than one
• May change over time

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

4
Connections and Concurrency
• More [Active] Connections
• Worse Performance

• Sizing your database

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

5
Data Volume and Retention Period
• Longer Retention Period
• More Data

• More Data
• Worse Performance

• Progressive deterioration
• All data in memory 
• All indexes in memory
• Not enough memory 

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

6
Databases and “Tenants”
• Common paradigm in SaaS applications
• Each tenant’s application instance has a database
• Several databases on each database instance

• More databases per instance
• Worse Performance
In one customer engagement we were informed that no more than 1000
tenants could be located on one database instance before performance
became unacceptable
October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

7
Read vs. Write
• Simple read (SELECT) queries could scale well
• Key based lookups
• With favorable indexes

• Things that cause heartburn
• Complex joins (with large data sets)
• Sorts
• Aggregation

• Reads are easier to scale than writes

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

8
How did we get here?
A brief history lesson 

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

9
How did we get here? [1]
• A combination of factors
• Changes in the application user/usage
• Driven by the Internet and mobile computing
• “News Cycles” are getting shorter

• Economics
• Commodity computing is cheap and getting cheaper
• Solutions that can “scale-out” win, others lose

• Ability to leverage higher core-densities
• Other databases does a better job at this than MySQL
• MySQL would do great if you had a 20GHz processor ;)

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

10
How did we get here? [2]
• The Evolution of the Database Management System
• A battle between “generalized” and “specialized”

• The Relational Database Management System (RDBMS)
• Designed for monolithic systems
• SMP
• Scale-Up

• Applications evolve quickly!
• Databases respond slowly

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

11
How did we get here? [3]
• Moore’s Law
• Scale-Up seemed like a fine answer

• But there are limits …

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

12
How did we get here? [4]
• Database architectures traditionally were
• Shared CPU/Memory/Disk
• Also known as “Shared-Everything”

• But “Shared-Everything” doesn’t scale 
• At least not for databases
A server costing twice as much doesn’t always give you twice as much
database “power”. You reach a point of diminishing returns.

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

13
How did we get here? [5]
• You can pay more but you may not get more 

Source: Amazon RDS TPC-C Benchmark. Md. Borhan Uddin, Bo He,
Radu Sion, Cloud Computing Center, SUNY Stony Brook.
Viewed online http://digitalpiglet.org/research/sion2010cloud-rds.pdf

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

14
Some proposed
solutions

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

15
Some proposed solutions
• Several strategies have been advocated
•
•
•
•
•

Cache, Cache, Cache,…
Get a bigger server [a.k.a. Scale-Up]
Sharding [a form of Scale-Out]
NoSQL or NewSQL [typically Scale-Out]
Replication and variants

• We look at each one in more detail

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

16
Cache, Cache, Cache!
That’s easy! Do
some caching!

caching transitive verb
to cache
cache
noun
Temporary computer storage used for quick retrieval
of data in order to increase processing speed.

• Caching only addresses
‘read’; not ‘write’
• Social Media workloads
are 'write heavy‘,
'interactive‘ and ‘highly
personalized’
October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

17
Get a bigger server [Scale-Up]
I will use a
bigger database
server
Can I even get a
bigger server?
What if
m2.4xlarge isn’t
enough?
Maybe I just
have too much
data?
Maybe I have
too many users?

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

18
Sharding [a form of Scale-Out]
Sharding will solve
my problem!

shard
noun ˈshärd
a piece or fragment of a brittle
substance <shards of glass>; broadly :
a small piece or part
sharding
noun ˈshär-diŋ
(a) to make ones application brittle or
fragmented;
(b) to take one big problem and make
many small problems;
(c) to complicate an application while
claiming to solve a scalability
problem;
(d) to decrease developer
productivity;
(e) a bad idea;
(f) sharding library: a mechanism
that attempts (unsuccessfully) to
hide the bad taste of sharding

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

19
NoSQL or NewSQL?
You need NoSQL
or NewSQL!

• Yes, I have to rewrite my
application

• Yes, not all queries will work
• No, there’s no standard query
language
• No, most do not have ACID
guarantees; hell some don’t even
guarantee Durability
• Yes, most are somewhat untried
science-experiments
• More flavors than Ben & Jerry’s
Ice Cream [yes, really]
• But, all the cool kids are doing it!

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

20
Replication and variants
• Replication based solutions (typically called clustering)
•
•
•
•

Many copies of the data
Distribute queries across the copies
Keep the copies synchronized: like herding cats
Write bottleneck

• Read/Write splitting
•
•
•
•

Single Master (gets all the writes)
Many Slaves (share the reads)
Unpredictable latency
Write bottleneck

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

21
What about MySQL Cluster?
• MySQL Cluster is a strange beast
• For best results, you must use the NDB interface
• Only supports the NDB storage engine
• Primarily a distributed in-memory Key-Value Store
• That is ACID compliant and supports joins and things if you
use the SQL interface
• But no one tells you about the performance of this path!

• Published benchmarks are all “FlexAsync” which talk
directly to the NDB interface
• And READ-ONLY
For more details visit http://www.parelastic.com/blog/mysql-cluster-and-benchmarks
Or stick around after the presentation and we can chat!
October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

22
What are parallel
databases?

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

23
What are parallel databases?
1

• A database architecture proposed in 1992
• Very successfully applied to many database problems
• Oracle Exadata, Netezza, Teradata, Greenplum, …

• An example of the “Shared Nothing” database
2
paradigm

1

Parallel Database Systems: The future of high performance database processing [1992, Dewitt, Gray,
ftp://ftp.cs.wisc.edu/pub/techreports/1992/TR1079.pdf]
2
The Case for Shared Nothing [1986, Stonebraker, http://db.cs.berkeley.edu/papers/hpts85-nothing.pdf]

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

24
How parallel databases execute queries

Image from “Parallel Database Systems: The future of high performance database processing” [1992, Dewitt,
Gray, ftp://ftp.cs.wisc.edu/pub/techreports/1992/TR1079.pdf]

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

25
Benefits of parallel databases
• Linear improvement in “reads”
• Linear improvements in “writes”
• Better than linear improvement in “joins”
• Better than linear improvement in “aggregation”
• Better than linear improvement in “sorts”

For more details, refer “Parallel Database Systems: The future of high performance database processing”
[1992, Dewitt, Gray, ftp://ftp.cs.wisc.edu/pub/techreports/1992/TR1079.pdf]

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

26
Parallel Databases vs. Sharding
• Parallel Database
• Database architecture
• Application is data
location agnostic
• Application perceives a
single database
• Requires no application
rewrites

• Application is not
constrained by parallel
database architecture
• A parallel database
handles any schema
October 3, 2013

• Sharding
• Application architecture
• Application is data location
aware
• Application perceives a
collection of databases
• Requires application
rewrites

• Application is constrained
to the limitations of the
sharding architecture
• Not all schemas are
shard’able

Scalability and the database tier | NYC MySQL Meetup

27
What is ParElastic?
Hypervisor for databases

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

28
What is ParElastic?
• An approach to relational database virtualization
• Addresses issues of scalability in relational databases
• A parallel database architecture
• Built on standard MySQL or MySQL variant databases
• Horizontal Scalability
• Elastic

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

29
ParElastic: System Architecture

ParElastic Architecture protected by US8214356, “Apparatus for elastic database processing with heterogeneous data”

10/7/2013

Flex Your Database | ParElastic ® Database Virtualization
Engine

30
Data Distribution: How it works
• User data is “distributed” across multiple storage nodes
• Queries are executed in parallel by some [or all] nodes
• Multiple distribution models supported
•
•
•
•

Range
Hash
Broadcast
Random

• ParElastic guarantees co-location and query execution

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

31
Storage Elasticity: How it works
• A “generational scheme”
• Storage Nodes added over time
• Each creates a new “generation”

• Unnecessary to migrate large amounts of data
• A key drawback with “sharding” that requires “resharding”

Storage Elasticity protected by US8478790, US8386532 and other patents.

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

32
ParElastic: How It Works

10/7/2013

Flex Your Database | ParElastic ® Database Virtualization
Engine

33
ParElastic: Simple query processing example

SELECT COUNT(*)
FROM CUSTOMER;
count(*)
-------2771
(1 row affected)

PROVISION 1 DYNAMIC NODE
ON DYNAMIC NODE
CREATE TEMP TABLE
T1
( C INT );
ON ALL STORAGE NODES
SELECT COUNT(*)
FROM CUSTOMER
AND REDISTRIBUTE
TO T1
ON DYNAMIC NODE
SELECT SUM(C)
FROM T1;

10/7/2013

Flex Your Database | ParElastic ® Database Virtualization
Engine

34
ParElastic Performance Benefits
• Connection Scalability
• ParElastic Tier Elasticity; have more or less ParElastic servers

• Storage / Data Volume Scalability
• Add ParElastic Persistent Nodes as data volumes increase
• Multiple machines working together

• Workloads are variable
• Compute Node Elasticity; have more or less as required

• Databases and Tenants [SaaS applications]
• ParElastic Adaptive Multi-tenancy ™

• No application change
• Queries processed by, data stored on standard MySQL!
10/7/2013

Flex Your Database | ParElastic ® Database Virtualization
Engine

35
ParElastic Multi-Tenancy

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

36
ParElastic Concurrency [1]

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

37
ParElastic Concurrency [2]

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

38
ParElastic data “ingest”
One Million rows/s!
15 Storage Nodes, 2 ParElastic Servers

Tests conducted in Amazon Cloud. Native MySQL testing on m1.xlarge server, standard MySQL, standard EBS volumes. Test driver was a c1.xlarge server to provide
sufficient CPU head-room to generate load. ParElastic run with 5 and 15 persistent storage nodes identically configured, m1.xlarge, standard MySQL, standard EBS
Volumes. 15 node test employed two c1.xlarge test drivers. Best ParElastic performance was with 10 threads, 10 persistent storage nodes and an insert batch size of
5,000 tuples per insert batch. Best native MySQL performance was with 2 threads and a batch size of 10,000 tuples per insert batch.

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

39
What’s the ParElastic Overhead?
Query Time
15.72ms
Test Client
Machine 1

Query Time
17.03ms

ParElastic overhead ~ 1.31ms
Network RTT
0.35ms

Machine 1

ParElastic
Machine 2

mysqld

mysqld

Machine 2

Machine 3

October 3, 2013

Test Client

mysqld
…

Scalability and the database tier | NYC MySQL Meetup

Machine 4

40
Characterizing ParElastic Performance
• A “fixed cost”, the overhead per query
• A “variable cost” for query processing
• Consider this example, a simple “COUNT” query.

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

41
Some things to keep in mind
• Horizontal Scale-Out benefits from
• Being “stateless”, or at least having less state
• Adhering to a truly “shared nothing” approach

• Horizontal Scale-Out is impeded by
• Complex or Shared “State”
• Things that violate the “shared nothing” paradigm

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

42
What is ParElastic?
• An approach to relational database virtualization
• "A Hypervisor for the Database Tier"

• Scale out database capacity across many servers
• Effectively handle workloads too big for one server

• Share this pool of database among many applications
• Efficiently allocate database capacity to workload

• An elastic, multi-tenant, parallel database architecture
• Built on standard MySQL or MySQL variant databases
• Horizontal Scalability
• Elastic

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

43
Some target markets
• Database Virtualization – “Hypervisor for the Database”
• Reduce capex and simplify administration for development
and test

• SaaS Enablement
• Simplified deployment of SaaS applications using multitenancy

• High Volume Database Applications
• High traffic websites, (e.g. social, ecommerce, on-line games)
• High speed data ingest (e.g. click tracking, sensor arrays,
mobile)

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

45
Where do I get
ParElastic?

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

46
Getting ParElastic
• For Evaluations
• Available at no charge on Amazon Marketplace
• Preconfigured for evaluation purposes; not performance
testing
• Runs completely on a single EC2 instance

• For Larger Configurations
•
•
•
•

Contact ParElastic
Email: info@parelastic.com
Twitter: @parelastic
Web: http://www.parelastic.com

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

47
Getting ParElastic
• On the Amazon AWS Marketplace
(aws.amazon.com/marketplace)

• Quick start guide and simple (two-step) setup wizard
provided.

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

48
Conclusion

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

49
Conclusion
• Database Scalability is a very real problem
• The Cloud has put a very complicated wrinkle in it

• The problem was seen before with commodity servers
• Virtualization was able to address this problem

• Several “hacks” have been proposed
• Not really solutions, just hacks

• ParElastic is a database virtualization solution
• Based on standard relational databases
• Provides benefits of horizontal scalability and multi-tenancy

• ParElastic is available for evaluation on many platforms
• Free evaluation also available on Amazon Marketplace
October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

50
Contacting ParElastic
• Look us up online
– http://www.parelastic.com

• Watch an explainer video
– http://www.parelastic.com/video

• Contact us
– Email: info@parelastic.com

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

51
Q&A

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

52
Image Credits
•

Moore’s Law
•

•

Hercules slays the Hydra
•

•

Wikipedia [http://commons.wikimedia.org/wiki/File%3AHercules_slaying_the_Hydra.jpg]

CPU History
•

•

Wikipedia [http://commons.wikimedia.org/wiki/File%3ATransistor_Count_and_Moore's_Law_-_2011.svg]

Phillip E. Ross, “Why CPU Frequency Stalled” [http://spectrum.ieee.org/computing/hardware/why-cpu-frequency-stalled]

Herding Cats
•

Image from [http://wodongatafe.wordpress.com/2011/05/27/herding-cats-or-facilitating-a-webinar-whats-the-difference/]

October 3, 2013

Scalability and the database tier | NYC MySQL Meetup

53

Mais conteúdo relacionado

Mais procurados

NoSQL databases and managing big data
NoSQL databases and managing big dataNoSQL databases and managing big data
NoSQL databases and managing big dataSteven Francia
 
Considerations for using NoSQL technology on your next IT project
Considerations for using NoSQL technology on your next IT projectConsiderations for using NoSQL technology on your next IT project
Considerations for using NoSQL technology on your next IT projectAkmal Chaudhri
 
SQL or NoSQL, that is the question!
SQL or NoSQL, that is the question!SQL or NoSQL, that is the question!
SQL or NoSQL, that is the question!Andraz Tori
 
Big Data and NoSQL in Microsoft-Land
Big Data and NoSQL in Microsoft-LandBig Data and NoSQL in Microsoft-Land
Big Data and NoSQL in Microsoft-LandAndrew Brust
 
NoSQL Data Architecture Patterns
NoSQL Data ArchitecturePatternsNoSQL Data ArchitecturePatterns
NoSQL Data Architecture PatternsMaynooth University
 
SQL/NoSQL How to choose ?
SQL/NoSQL How to choose ?SQL/NoSQL How to choose ?
SQL/NoSQL How to choose ?Venu Anuganti
 
NoSQL Now! NoSQL Architecture Patterns
NoSQL Now! NoSQL Architecture PatternsNoSQL Now! NoSQL Architecture Patterns
NoSQL Now! NoSQL Architecture PatternsDATAVERSITY
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sqlRam kumar
 
Nonrelational Databases
Nonrelational DatabasesNonrelational Databases
Nonrelational DatabasesUdi Bauman
 
NOSQL Databases types and Uses
NOSQL Databases types and UsesNOSQL Databases types and Uses
NOSQL Databases types and UsesSuvradeep Rudra
 
Cassandra Community Webinar: From Mongo to Cassandra, Architectural Lessons
Cassandra Community Webinar: From Mongo to Cassandra, Architectural LessonsCassandra Community Webinar: From Mongo to Cassandra, Architectural Lessons
Cassandra Community Webinar: From Mongo to Cassandra, Architectural LessonsDataStax
 
Big Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI ProsBig Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI ProsAndrew Brust
 
NoSQL databases pros and cons
NoSQL databases pros and consNoSQL databases pros and cons
NoSQL databases pros and consFabio Fumarola
 
Chapter1: NoSQL: It’s about making intelligent choices
Chapter1: NoSQL: It’s about making intelligent choicesChapter1: NoSQL: It’s about making intelligent choices
Chapter1: NoSQL: It’s about making intelligent choicesMaynooth University
 

Mais procurados (20)

Rdbms vs. no sql
Rdbms vs. no sqlRdbms vs. no sql
Rdbms vs. no sql
 
Relational vs. Non-Relational
Relational vs. Non-RelationalRelational vs. Non-Relational
Relational vs. Non-Relational
 
NoSQL databases and managing big data
NoSQL databases and managing big dataNoSQL databases and managing big data
NoSQL databases and managing big data
 
Considerations for using NoSQL technology on your next IT project
Considerations for using NoSQL technology on your next IT projectConsiderations for using NoSQL technology on your next IT project
Considerations for using NoSQL technology on your next IT project
 
SQL or NoSQL, that is the question!
SQL or NoSQL, that is the question!SQL or NoSQL, that is the question!
SQL or NoSQL, that is the question!
 
Big Data and NoSQL in Microsoft-Land
Big Data and NoSQL in Microsoft-LandBig Data and NoSQL in Microsoft-Land
Big Data and NoSQL in Microsoft-Land
 
NoSQL Data Architecture Patterns
NoSQL Data ArchitecturePatternsNoSQL Data ArchitecturePatterns
NoSQL Data Architecture Patterns
 
SQL/NoSQL How to choose ?
SQL/NoSQL How to choose ?SQL/NoSQL How to choose ?
SQL/NoSQL How to choose ?
 
Selecting best NoSQL
Selecting best NoSQL Selecting best NoSQL
Selecting best NoSQL
 
Databases in the Cloud
Databases in the CloudDatabases in the Cloud
Databases in the Cloud
 
NoSQL Now! NoSQL Architecture Patterns
NoSQL Now! NoSQL Architecture PatternsNoSQL Now! NoSQL Architecture Patterns
NoSQL Now! NoSQL Architecture Patterns
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sql
 
SQL vs. NoSQL
SQL vs. NoSQLSQL vs. NoSQL
SQL vs. NoSQL
 
NoSQL databases
NoSQL databasesNoSQL databases
NoSQL databases
 
Nonrelational Databases
Nonrelational DatabasesNonrelational Databases
Nonrelational Databases
 
NOSQL Databases types and Uses
NOSQL Databases types and UsesNOSQL Databases types and Uses
NOSQL Databases types and Uses
 
Cassandra Community Webinar: From Mongo to Cassandra, Architectural Lessons
Cassandra Community Webinar: From Mongo to Cassandra, Architectural LessonsCassandra Community Webinar: From Mongo to Cassandra, Architectural Lessons
Cassandra Community Webinar: From Mongo to Cassandra, Architectural Lessons
 
Big Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI ProsBig Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI Pros
 
NoSQL databases pros and cons
NoSQL databases pros and consNoSQL databases pros and cons
NoSQL databases pros and cons
 
Chapter1: NoSQL: It’s about making intelligent choices
Chapter1: NoSQL: It’s about making intelligent choicesChapter1: NoSQL: It’s about making intelligent choices
Chapter1: NoSQL: It’s about making intelligent choices
 

Semelhante a Making MySQL Flexible with ParElastic Database Scalability, Amrith Kumar, Founder CTO, ParElastic

NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabasesAdi Challa
 
SQL vs. NoSQL. It's always a hard choice.
SQL vs. NoSQL. It's always a hard choice.SQL vs. NoSQL. It's always a hard choice.
SQL vs. NoSQL. It's always a hard choice.Denis Reznik
 
[db tech showcase Tokyo 2017] C16: Azure SQL Database - Are you ready for the...
[db tech showcase Tokyo 2017] C16: Azure SQL Database - Are you ready for the...[db tech showcase Tokyo 2017] C16: Azure SQL Database - Are you ready for the...
[db tech showcase Tokyo 2017] C16: Azure SQL Database - Are you ready for the...Insight Technology, Inc.
 
Storage Systems For Scalable systems
Storage Systems For Scalable systemsStorage Systems For Scalable systems
Storage Systems For Scalable systemselliando dias
 
What can we learn from NoSQL technologies?
What can we learn from NoSQL technologies?What can we learn from NoSQL technologies?
What can we learn from NoSQL technologies?Ivan Zoratti
 
UNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptxUNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptxRahul Borate
 
Modern ETL: Azure Data Factory, Data Lake, and SQL Database
Modern ETL: Azure Data Factory, Data Lake, and SQL DatabaseModern ETL: Azure Data Factory, Data Lake, and SQL Database
Modern ETL: Azure Data Factory, Data Lake, and SQL DatabaseEric Bragas
 
Big Data technology Landscape
Big Data technology LandscapeBig Data technology Landscape
Big Data technology LandscapeShivanandaVSeeri
 
Solr cloud the 'search first' nosql database extended deep dive
Solr cloud the 'search first' nosql database   extended deep diveSolr cloud the 'search first' nosql database   extended deep dive
Solr cloud the 'search first' nosql database extended deep divelucenerevolution
 
UNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptxUNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptxRahul Borate
 
Understanding data
Understanding dataUnderstanding data
Understanding dataShahd Salama
 
Visualizing big data in the browser using spark
Visualizing big data in the browser using sparkVisualizing big data in the browser using spark
Visualizing big data in the browser using sparkDatabricks
 
1. Lecture1_NOSQL_Introduction.pdf
1. Lecture1_NOSQL_Introduction.pdf1. Lecture1_NOSQL_Introduction.pdf
1. Lecture1_NOSQL_Introduction.pdfShaimaaMohamedGalal
 
How & When to Use NoSQL at Websummit Dublin
How & When to Use NoSQL at Websummit DublinHow & When to Use NoSQL at Websummit Dublin
How & When to Use NoSQL at Websummit DublinAmazon Web Services
 

Semelhante a Making MySQL Flexible with ParElastic Database Scalability, Amrith Kumar, Founder CTO, ParElastic (20)

NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabases
 
SQL vs. NoSQL. It's always a hard choice.
SQL vs. NoSQL. It's always a hard choice.SQL vs. NoSQL. It's always a hard choice.
SQL vs. NoSQL. It's always a hard choice.
 
How and when to use NoSQL
How and when to use NoSQLHow and when to use NoSQL
How and when to use NoSQL
 
[db tech showcase Tokyo 2017] C16: Azure SQL Database - Are you ready for the...
[db tech showcase Tokyo 2017] C16: Azure SQL Database - Are you ready for the...[db tech showcase Tokyo 2017] C16: Azure SQL Database - Are you ready for the...
[db tech showcase Tokyo 2017] C16: Azure SQL Database - Are you ready for the...
 
iForum 2015: SQL vs. NoSQL
iForum 2015: SQL vs. NoSQLiForum 2015: SQL vs. NoSQL
iForum 2015: SQL vs. NoSQL
 
Storage Systems For Scalable systems
Storage Systems For Scalable systemsStorage Systems For Scalable systems
Storage Systems For Scalable systems
 
What can we learn from NoSQL technologies?
What can we learn from NoSQL technologies?What can we learn from NoSQL technologies?
What can we learn from NoSQL technologies?
 
UNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptxUNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptx
 
Modern ETL: Azure Data Factory, Data Lake, and SQL Database
Modern ETL: Azure Data Factory, Data Lake, and SQL DatabaseModern ETL: Azure Data Factory, Data Lake, and SQL Database
Modern ETL: Azure Data Factory, Data Lake, and SQL Database
 
Big Data technology Landscape
Big Data technology LandscapeBig Data technology Landscape
Big Data technology Landscape
 
Solr cloud the 'search first' nosql database extended deep dive
Solr cloud the 'search first' nosql database   extended deep diveSolr cloud the 'search first' nosql database   extended deep dive
Solr cloud the 'search first' nosql database extended deep dive
 
UNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptxUNIT I Introduction to NoSQL.pptx
UNIT I Introduction to NoSQL.pptx
 
NoSQL and MongoDB
NoSQL and MongoDBNoSQL and MongoDB
NoSQL and MongoDB
 
Understanding data
Understanding dataUnderstanding data
Understanding data
 
Visualizing big data in the browser using spark
Visualizing big data in the browser using sparkVisualizing big data in the browser using spark
Visualizing big data in the browser using spark
 
Cassandra at scale
Cassandra at scaleCassandra at scale
Cassandra at scale
 
1. Lecture1_NOSQL_Introduction.pdf
1. Lecture1_NOSQL_Introduction.pdf1. Lecture1_NOSQL_Introduction.pdf
1. Lecture1_NOSQL_Introduction.pdf
 
mongodb_DS.pptx
mongodb_DS.pptxmongodb_DS.pptx
mongodb_DS.pptx
 
How & When to Use NoSQL at Websummit Dublin
How & When to Use NoSQL at Websummit DublinHow & When to Use NoSQL at Websummit Dublin
How & When to Use NoSQL at Websummit Dublin
 
NoSQL.pptx
NoSQL.pptxNoSQL.pptx
NoSQL.pptx
 

Mais de ✔ Eric David Benari, PMP

SVP of Couchbase: The Exciting World of NoSQL: Scaling NoSQL Data, N1QL vs. S...
SVP of Couchbase: The Exciting World of NoSQL: Scaling NoSQL Data, N1QL vs. S...SVP of Couchbase: The Exciting World of NoSQL: Scaling NoSQL Data, N1QL vs. S...
SVP of Couchbase: The Exciting World of NoSQL: Scaling NoSQL Data, N1QL vs. S...✔ Eric David Benari, PMP
 
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTODatabase Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO✔ Eric David Benari, PMP
 
Database Camp 2016 @ United Nations, NYC - Michael Glukhovsky, Co-Founder, Re...
Database Camp 2016 @ United Nations, NYC - Michael Glukhovsky, Co-Founder, Re...Database Camp 2016 @ United Nations, NYC - Michael Glukhovsky, Co-Founder, Re...
Database Camp 2016 @ United Nations, NYC - Michael Glukhovsky, Co-Founder, Re...✔ Eric David Benari, PMP
 
Database Camp 2016 @ United Nations, NYC - Minerva Tantoco, CTO of the City o...
Database Camp 2016 @ United Nations, NYC - Minerva Tantoco, CTO of the City o...Database Camp 2016 @ United Nations, NYC - Minerva Tantoco, CTO of the City o...
Database Camp 2016 @ United Nations, NYC - Minerva Tantoco, CTO of the City o...✔ Eric David Benari, PMP
 
Database Camp 2016 @ United Nations, NYC - Amir Orad, CEO, Sisense
Database Camp 2016 @ United Nations, NYC - Amir Orad, CEO, SisenseDatabase Camp 2016 @ United Nations, NYC - Amir Orad, CEO, Sisense
Database Camp 2016 @ United Nations, NYC - Amir Orad, CEO, Sisense✔ Eric David Benari, PMP
 
Database Camp 2016 @ United Nations, NYC - Brad Bebee, CEO, Blazegraph
Database Camp 2016 @ United Nations, NYC - Brad Bebee, CEO, BlazegraphDatabase Camp 2016 @ United Nations, NYC - Brad Bebee, CEO, Blazegraph
Database Camp 2016 @ United Nations, NYC - Brad Bebee, CEO, Blazegraph✔ Eric David Benari, PMP
 
Database Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, Couchbase
Database Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, CouchbaseDatabase Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, Couchbase
Database Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, Couchbase✔ Eric David Benari, PMP
 
MariaDB 10.2 & MariaDB 10.1 by Michael Monty Widenius at Database Camp 2016 @ UN
MariaDB 10.2 & MariaDB 10.1 by Michael Monty Widenius at Database Camp 2016 @ UNMariaDB 10.2 & MariaDB 10.1 by Michael Monty Widenius at Database Camp 2016 @ UN
MariaDB 10.2 & MariaDB 10.1 by Michael Monty Widenius at Database Camp 2016 @ UN✔ Eric David Benari, PMP
 
NoSQL Object DB & NewSQL Columnar DB, A Tale of Two Databases
NoSQL Object DB & NewSQL Columnar DB, A Tale of Two DatabasesNoSQL Object DB & NewSQL Columnar DB, A Tale of Two Databases
NoSQL Object DB & NewSQL Columnar DB, A Tale of Two Databases✔ Eric David Benari, PMP
 
MySQL to Cassandra: Big Data, High Scale, Data Migration... Oh My! Scott Bonn...
MySQL to Cassandra: Big Data, High Scale, Data Migration... Oh My! Scott Bonn...MySQL to Cassandra: Big Data, High Scale, Data Migration... Oh My! Scott Bonn...
MySQL to Cassandra: Big Data, High Scale, Data Migration... Oh My! Scott Bonn...✔ Eric David Benari, PMP
 

Mais de ✔ Eric David Benari, PMP (10)

SVP of Couchbase: The Exciting World of NoSQL: Scaling NoSQL Data, N1QL vs. S...
SVP of Couchbase: The Exciting World of NoSQL: Scaling NoSQL Data, N1QL vs. S...SVP of Couchbase: The Exciting World of NoSQL: Scaling NoSQL Data, N1QL vs. S...
SVP of Couchbase: The Exciting World of NoSQL: Scaling NoSQL Data, N1QL vs. S...
 
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTODatabase Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
 
Database Camp 2016 @ United Nations, NYC - Michael Glukhovsky, Co-Founder, Re...
Database Camp 2016 @ United Nations, NYC - Michael Glukhovsky, Co-Founder, Re...Database Camp 2016 @ United Nations, NYC - Michael Glukhovsky, Co-Founder, Re...
Database Camp 2016 @ United Nations, NYC - Michael Glukhovsky, Co-Founder, Re...
 
Database Camp 2016 @ United Nations, NYC - Minerva Tantoco, CTO of the City o...
Database Camp 2016 @ United Nations, NYC - Minerva Tantoco, CTO of the City o...Database Camp 2016 @ United Nations, NYC - Minerva Tantoco, CTO of the City o...
Database Camp 2016 @ United Nations, NYC - Minerva Tantoco, CTO of the City o...
 
Database Camp 2016 @ United Nations, NYC - Amir Orad, CEO, Sisense
Database Camp 2016 @ United Nations, NYC - Amir Orad, CEO, SisenseDatabase Camp 2016 @ United Nations, NYC - Amir Orad, CEO, Sisense
Database Camp 2016 @ United Nations, NYC - Amir Orad, CEO, Sisense
 
Database Camp 2016 @ United Nations, NYC - Brad Bebee, CEO, Blazegraph
Database Camp 2016 @ United Nations, NYC - Brad Bebee, CEO, BlazegraphDatabase Camp 2016 @ United Nations, NYC - Brad Bebee, CEO, Blazegraph
Database Camp 2016 @ United Nations, NYC - Brad Bebee, CEO, Blazegraph
 
Database Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, Couchbase
Database Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, CouchbaseDatabase Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, Couchbase
Database Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, Couchbase
 
MariaDB 10.2 & MariaDB 10.1 by Michael Monty Widenius at Database Camp 2016 @ UN
MariaDB 10.2 & MariaDB 10.1 by Michael Monty Widenius at Database Camp 2016 @ UNMariaDB 10.2 & MariaDB 10.1 by Michael Monty Widenius at Database Camp 2016 @ UN
MariaDB 10.2 & MariaDB 10.1 by Michael Monty Widenius at Database Camp 2016 @ UN
 
NoSQL Object DB & NewSQL Columnar DB, A Tale of Two Databases
NoSQL Object DB & NewSQL Columnar DB, A Tale of Two DatabasesNoSQL Object DB & NewSQL Columnar DB, A Tale of Two Databases
NoSQL Object DB & NewSQL Columnar DB, A Tale of Two Databases
 
MySQL to Cassandra: Big Data, High Scale, Data Migration... Oh My! Scott Bonn...
MySQL to Cassandra: Big Data, High Scale, Data Migration... Oh My! Scott Bonn...MySQL to Cassandra: Big Data, High Scale, Data Migration... Oh My! Scott Bonn...
MySQL to Cassandra: Big Data, High Scale, Data Migration... Oh My! Scott Bonn...
 

Último

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

Último (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Making MySQL Flexible with ParElastic Database Scalability, Amrith Kumar, Founder CTO, ParElastic

  • 1. Scalability and database virtualization How virtualizing your databases improves performance, and lowers costs New York City MySQL Meetup, October 3, 2013
  • 2. What’s this presentation about? • Scalability and the database tier • • • • • • What’s the problem? How did we get here? Some proposed solutions What are parallel databases? What’s ParElastic? How do I get ParElastic? • Q&A October 3, 2013 Tweet this presentation #parelastic Scalability and the database tier | NYC MySQL Meetup 2
  • 3. What is the scalability problem? October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 3
  • 4. What is the scalability problem? • Has many faces • • • • Connections and Concurrency Data Volume and Retention Period Databases and Tenants Read vs. Write • Your problem(s) • May be more than one • May change over time October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 4
  • 5. Connections and Concurrency • More [Active] Connections • Worse Performance • Sizing your database October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 5
  • 6. Data Volume and Retention Period • Longer Retention Period • More Data • More Data • Worse Performance • Progressive deterioration • All data in memory  • All indexes in memory • Not enough memory  October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 6
  • 7. Databases and “Tenants” • Common paradigm in SaaS applications • Each tenant’s application instance has a database • Several databases on each database instance • More databases per instance • Worse Performance In one customer engagement we were informed that no more than 1000 tenants could be located on one database instance before performance became unacceptable October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 7
  • 8. Read vs. Write • Simple read (SELECT) queries could scale well • Key based lookups • With favorable indexes • Things that cause heartburn • Complex joins (with large data sets) • Sorts • Aggregation • Reads are easier to scale than writes October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 8
  • 9. How did we get here? A brief history lesson  October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 9
  • 10. How did we get here? [1] • A combination of factors • Changes in the application user/usage • Driven by the Internet and mobile computing • “News Cycles” are getting shorter • Economics • Commodity computing is cheap and getting cheaper • Solutions that can “scale-out” win, others lose • Ability to leverage higher core-densities • Other databases does a better job at this than MySQL • MySQL would do great if you had a 20GHz processor ;) October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 10
  • 11. How did we get here? [2] • The Evolution of the Database Management System • A battle between “generalized” and “specialized” • The Relational Database Management System (RDBMS) • Designed for monolithic systems • SMP • Scale-Up • Applications evolve quickly! • Databases respond slowly October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 11
  • 12. How did we get here? [3] • Moore’s Law • Scale-Up seemed like a fine answer • But there are limits … October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 12
  • 13. How did we get here? [4] • Database architectures traditionally were • Shared CPU/Memory/Disk • Also known as “Shared-Everything” • But “Shared-Everything” doesn’t scale  • At least not for databases A server costing twice as much doesn’t always give you twice as much database “power”. You reach a point of diminishing returns. October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 13
  • 14. How did we get here? [5] • You can pay more but you may not get more  Source: Amazon RDS TPC-C Benchmark. Md. Borhan Uddin, Bo He, Radu Sion, Cloud Computing Center, SUNY Stony Brook. Viewed online http://digitalpiglet.org/research/sion2010cloud-rds.pdf October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 14
  • 15. Some proposed solutions October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 15
  • 16. Some proposed solutions • Several strategies have been advocated • • • • • Cache, Cache, Cache,… Get a bigger server [a.k.a. Scale-Up] Sharding [a form of Scale-Out] NoSQL or NewSQL [typically Scale-Out] Replication and variants • We look at each one in more detail October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 16
  • 17. Cache, Cache, Cache! That’s easy! Do some caching! caching transitive verb to cache cache noun Temporary computer storage used for quick retrieval of data in order to increase processing speed. • Caching only addresses ‘read’; not ‘write’ • Social Media workloads are 'write heavy‘, 'interactive‘ and ‘highly personalized’ October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 17
  • 18. Get a bigger server [Scale-Up] I will use a bigger database server Can I even get a bigger server? What if m2.4xlarge isn’t enough? Maybe I just have too much data? Maybe I have too many users? October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 18
  • 19. Sharding [a form of Scale-Out] Sharding will solve my problem! shard noun ˈshärd a piece or fragment of a brittle substance <shards of glass>; broadly : a small piece or part sharding noun ˈshär-diŋ (a) to make ones application brittle or fragmented; (b) to take one big problem and make many small problems; (c) to complicate an application while claiming to solve a scalability problem; (d) to decrease developer productivity; (e) a bad idea; (f) sharding library: a mechanism that attempts (unsuccessfully) to hide the bad taste of sharding October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 19
  • 20. NoSQL or NewSQL? You need NoSQL or NewSQL! • Yes, I have to rewrite my application • Yes, not all queries will work • No, there’s no standard query language • No, most do not have ACID guarantees; hell some don’t even guarantee Durability • Yes, most are somewhat untried science-experiments • More flavors than Ben & Jerry’s Ice Cream [yes, really] • But, all the cool kids are doing it! October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 20
  • 21. Replication and variants • Replication based solutions (typically called clustering) • • • • Many copies of the data Distribute queries across the copies Keep the copies synchronized: like herding cats Write bottleneck • Read/Write splitting • • • • Single Master (gets all the writes) Many Slaves (share the reads) Unpredictable latency Write bottleneck October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 21
  • 22. What about MySQL Cluster? • MySQL Cluster is a strange beast • For best results, you must use the NDB interface • Only supports the NDB storage engine • Primarily a distributed in-memory Key-Value Store • That is ACID compliant and supports joins and things if you use the SQL interface • But no one tells you about the performance of this path! • Published benchmarks are all “FlexAsync” which talk directly to the NDB interface • And READ-ONLY For more details visit http://www.parelastic.com/blog/mysql-cluster-and-benchmarks Or stick around after the presentation and we can chat! October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 22
  • 23. What are parallel databases? October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 23
  • 24. What are parallel databases? 1 • A database architecture proposed in 1992 • Very successfully applied to many database problems • Oracle Exadata, Netezza, Teradata, Greenplum, … • An example of the “Shared Nothing” database 2 paradigm 1 Parallel Database Systems: The future of high performance database processing [1992, Dewitt, Gray, ftp://ftp.cs.wisc.edu/pub/techreports/1992/TR1079.pdf] 2 The Case for Shared Nothing [1986, Stonebraker, http://db.cs.berkeley.edu/papers/hpts85-nothing.pdf] October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 24
  • 25. How parallel databases execute queries Image from “Parallel Database Systems: The future of high performance database processing” [1992, Dewitt, Gray, ftp://ftp.cs.wisc.edu/pub/techreports/1992/TR1079.pdf] October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 25
  • 26. Benefits of parallel databases • Linear improvement in “reads” • Linear improvements in “writes” • Better than linear improvement in “joins” • Better than linear improvement in “aggregation” • Better than linear improvement in “sorts” For more details, refer “Parallel Database Systems: The future of high performance database processing” [1992, Dewitt, Gray, ftp://ftp.cs.wisc.edu/pub/techreports/1992/TR1079.pdf] October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 26
  • 27. Parallel Databases vs. Sharding • Parallel Database • Database architecture • Application is data location agnostic • Application perceives a single database • Requires no application rewrites • Application is not constrained by parallel database architecture • A parallel database handles any schema October 3, 2013 • Sharding • Application architecture • Application is data location aware • Application perceives a collection of databases • Requires application rewrites • Application is constrained to the limitations of the sharding architecture • Not all schemas are shard’able Scalability and the database tier | NYC MySQL Meetup 27
  • 28. What is ParElastic? Hypervisor for databases October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 28
  • 29. What is ParElastic? • An approach to relational database virtualization • Addresses issues of scalability in relational databases • A parallel database architecture • Built on standard MySQL or MySQL variant databases • Horizontal Scalability • Elastic October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 29
  • 30. ParElastic: System Architecture ParElastic Architecture protected by US8214356, “Apparatus for elastic database processing with heterogeneous data” 10/7/2013 Flex Your Database | ParElastic ® Database Virtualization Engine 30
  • 31. Data Distribution: How it works • User data is “distributed” across multiple storage nodes • Queries are executed in parallel by some [or all] nodes • Multiple distribution models supported • • • • Range Hash Broadcast Random • ParElastic guarantees co-location and query execution October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 31
  • 32. Storage Elasticity: How it works • A “generational scheme” • Storage Nodes added over time • Each creates a new “generation” • Unnecessary to migrate large amounts of data • A key drawback with “sharding” that requires “resharding” Storage Elasticity protected by US8478790, US8386532 and other patents. October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 32
  • 33. ParElastic: How It Works 10/7/2013 Flex Your Database | ParElastic ® Database Virtualization Engine 33
  • 34. ParElastic: Simple query processing example SELECT COUNT(*) FROM CUSTOMER; count(*) -------2771 (1 row affected) PROVISION 1 DYNAMIC NODE ON DYNAMIC NODE CREATE TEMP TABLE T1 ( C INT ); ON ALL STORAGE NODES SELECT COUNT(*) FROM CUSTOMER AND REDISTRIBUTE TO T1 ON DYNAMIC NODE SELECT SUM(C) FROM T1; 10/7/2013 Flex Your Database | ParElastic ® Database Virtualization Engine 34
  • 35. ParElastic Performance Benefits • Connection Scalability • ParElastic Tier Elasticity; have more or less ParElastic servers • Storage / Data Volume Scalability • Add ParElastic Persistent Nodes as data volumes increase • Multiple machines working together • Workloads are variable • Compute Node Elasticity; have more or less as required • Databases and Tenants [SaaS applications] • ParElastic Adaptive Multi-tenancy ™ • No application change • Queries processed by, data stored on standard MySQL! 10/7/2013 Flex Your Database | ParElastic ® Database Virtualization Engine 35
  • 36. ParElastic Multi-Tenancy October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 36
  • 37. ParElastic Concurrency [1] October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 37
  • 38. ParElastic Concurrency [2] October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 38
  • 39. ParElastic data “ingest” One Million rows/s! 15 Storage Nodes, 2 ParElastic Servers Tests conducted in Amazon Cloud. Native MySQL testing on m1.xlarge server, standard MySQL, standard EBS volumes. Test driver was a c1.xlarge server to provide sufficient CPU head-room to generate load. ParElastic run with 5 and 15 persistent storage nodes identically configured, m1.xlarge, standard MySQL, standard EBS Volumes. 15 node test employed two c1.xlarge test drivers. Best ParElastic performance was with 10 threads, 10 persistent storage nodes and an insert batch size of 5,000 tuples per insert batch. Best native MySQL performance was with 2 threads and a batch size of 10,000 tuples per insert batch. October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 39
  • 40. What’s the ParElastic Overhead? Query Time 15.72ms Test Client Machine 1 Query Time 17.03ms ParElastic overhead ~ 1.31ms Network RTT 0.35ms Machine 1 ParElastic Machine 2 mysqld mysqld Machine 2 Machine 3 October 3, 2013 Test Client mysqld … Scalability and the database tier | NYC MySQL Meetup Machine 4 40
  • 41. Characterizing ParElastic Performance • A “fixed cost”, the overhead per query • A “variable cost” for query processing • Consider this example, a simple “COUNT” query. October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 41
  • 42. Some things to keep in mind • Horizontal Scale-Out benefits from • Being “stateless”, or at least having less state • Adhering to a truly “shared nothing” approach • Horizontal Scale-Out is impeded by • Complex or Shared “State” • Things that violate the “shared nothing” paradigm October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 42
  • 43. What is ParElastic? • An approach to relational database virtualization • "A Hypervisor for the Database Tier" • Scale out database capacity across many servers • Effectively handle workloads too big for one server • Share this pool of database among many applications • Efficiently allocate database capacity to workload • An elastic, multi-tenant, parallel database architecture • Built on standard MySQL or MySQL variant databases • Horizontal Scalability • Elastic October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 43
  • 44. Some target markets • Database Virtualization – “Hypervisor for the Database” • Reduce capex and simplify administration for development and test • SaaS Enablement • Simplified deployment of SaaS applications using multitenancy • High Volume Database Applications • High traffic websites, (e.g. social, ecommerce, on-line games) • High speed data ingest (e.g. click tracking, sensor arrays, mobile) October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 45
  • 45. Where do I get ParElastic? October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 46
  • 46. Getting ParElastic • For Evaluations • Available at no charge on Amazon Marketplace • Preconfigured for evaluation purposes; not performance testing • Runs completely on a single EC2 instance • For Larger Configurations • • • • Contact ParElastic Email: info@parelastic.com Twitter: @parelastic Web: http://www.parelastic.com October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 47
  • 47. Getting ParElastic • On the Amazon AWS Marketplace (aws.amazon.com/marketplace) • Quick start guide and simple (two-step) setup wizard provided. October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 48
  • 48. Conclusion October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 49
  • 49. Conclusion • Database Scalability is a very real problem • The Cloud has put a very complicated wrinkle in it • The problem was seen before with commodity servers • Virtualization was able to address this problem • Several “hacks” have been proposed • Not really solutions, just hacks • ParElastic is a database virtualization solution • Based on standard relational databases • Provides benefits of horizontal scalability and multi-tenancy • ParElastic is available for evaluation on many platforms • Free evaluation also available on Amazon Marketplace October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 50
  • 50. Contacting ParElastic • Look us up online – http://www.parelastic.com • Watch an explainer video – http://www.parelastic.com/video • Contact us – Email: info@parelastic.com October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 51
  • 51. Q&A October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 52
  • 52. Image Credits • Moore’s Law • • Hercules slays the Hydra • • Wikipedia [http://commons.wikimedia.org/wiki/File%3AHercules_slaying_the_Hydra.jpg] CPU History • • Wikipedia [http://commons.wikimedia.org/wiki/File%3ATransistor_Count_and_Moore's_Law_-_2011.svg] Phillip E. Ross, “Why CPU Frequency Stalled” [http://spectrum.ieee.org/computing/hardware/why-cpu-frequency-stalled] Herding Cats • Image from [http://wodongatafe.wordpress.com/2011/05/27/herding-cats-or-facilitating-a-webinar-whats-the-difference/] October 3, 2013 Scalability and the database tier | NYC MySQL Meetup 53