Parallel Analytics as a Service
Petrie Wong, Andy He, Eric Lo
Department of Computing
The Hong Kong Polytechnic University
{cskfwong, cszahe, ericlo}@comp.polyu.edu.hk
Motivation
Massively Parallel Processing Relational Database Systems (MPPDBs)
o SQL interface
o easy to scale out
o fast query time
o expensive
 hardware cost, operation cost, and software license
e.g., Vertica, Microsoft PDW, Greenplum, ...
about USD 15K per core or USD 50K per TB of data
Parallel Analytics as a Service, Petrie Wong, Andy He, Eric Lo, The Hong Kong Polytechnic University
Motivation
● MPPDB-as-a-Service
o service provider
 consolidates tenants onto fewer machines
● achieving a lower operation cost
o tenants
 enjoy parallel analytics at a lower price
MPPDB as a service: Example
● Shivnath: requests a 64-node PDB
● Reynold Xin: requests a 128-node PDB
● Eric: requests a 32-node PDB
● Donald Kossmann: ...
● We: service provider
o have 1,000 nodes
● Problems to Solve:
o Tenant Consolidation
 Performance (SLA Guarantee P)
 High Availability (Replication Factor R)
● P% (e.g., 99.9%) of the time, a tenant finishes its queries without feeling any performance delay
o Shivnath's Q1 finishes in 100 seconds (in isolation)
o after consolidation: Q1 still finishes in 100 seconds
o After Consolidation
 Query Routing
 Monitoring, Elastic Scaling, and Re-consolidation
Virtual Machine vs. Shared-process
Virtual Machine*
● offers hard isolation
● incurs significant redundancy in resources
o multiple copies of the DBMS
Shared-process
● achieves higher consolidation effectiveness
● previous research
o single DB
o OLTP workloads
● this paper
o MPPDB
o OLAP workloads
* We believe Amazon's Redshift is based on VMs
Previous Works (DaaS) vs. MPPDBaaS

DaaS
● OLTP workload
o queries touch small data
● SLA
o throughput (e.g., 100 tps) before and after consolidation is the same
● many tenants
o small data, single node
● after consolidation, many tenants active together? SLA OK!

MPPDBaaS
● OLAP workload
o I/O-intensive
● SLA
o query latency before and after consolidation is the same
● many tenants
o "big" data, multiple nodes
● after consolidation, many tenants active together? Easily violates the SLA!
Requirements of an MPPDBaaS
R1. Support 10,000+ tenants and machine nodes
R2. Offer high-availability services
o replicate tenants' data
R3. Offer query-latency-based performance SLA guarantees
o needs performance prediction techniques
R4. Support general tenants
o a tenant's query templates may not be known in advance
o conflicts with R3
 because most performance prediction techniques require query templates to be given
Our Solution: Tenant-Driven Design
● the active-tenant ratio in real multi-tenant environments is low
o in IBM's DaaS: ~10% active at the same time
● so come up with a deployment plan that
o lets each active tenant exclusively use an MPPDB
B. Reinwald. Multitenancy. University of Washington and Microsoft Research Summer Institute, 2010.
Tenant Driven Design: A Toy Example
● 10 tenants: T1, T2, ..., T10

Tenant           T1   T2   T3   T4   T5   ...  T10  Total
Requested Nodes   6    6    5    5    2   ...    2     34

● R = 3, so TDD creates 3 MPPDBs
● # of nodes in each MPPDB = the max # of nodes requested by any tenant

Group    MPPDB0     MPPDB1     MPPDB2
Nodes    6          6          6
Hosting  T1 - T10   T1 - T10   T1 - T10

TDD property:
replication factor = A (# of active tenants in peak hour)
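The sizing rule above can be sketched in a few lines (illustrative code; `tdd_plan` is a made-up name, not from the paper):

```python
# Sketch of the TDD sizing rule in the toy example: with replication
# factor R, TDD creates R identical MPPDBs, each sized to the largest
# single tenant request, and every MPPDB hosts all tenants.

def tdd_plan(requested_nodes, r):
    """Return (nodes per MPPDB, total nodes) for a TDD deployment."""
    per_mppdb = max(requested_nodes)   # the tenant requesting the most nodes
    return per_mppdb, per_mppdb * r    # R replicas of that MPPDB

# Toy example: T1..T10 request 6, 6, 5, 5, 2, ..., 2 nodes; R = 3.
requests = [6, 6, 5, 5, 2, 2, 2, 2, 2, 2]
per_mppdb, total = tdd_plan(requests, r=3)
# per_mppdb == 6, total == 18 (vs. 34 nodes if every tenant ran in isolation)
```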
Tenant Driven Design
Query Routing - Example
[Figure: a 00:00 - 24:00 GMT timeline in which tenants T1, T2, and T5 issue queries Q1 - Q7; each active tenant is routed to its own MPPDB among MPPDB0, MPPDB1, and MPPDB2]
Routing Principle:
Let a tenant exclusively use a MPPDB!
TDD's SLA Guarantee: no matter what kind of query
● an ad-hoc query (a new query seen for the first time)
● a linear or non-linear scale-out query with template given
whether submitted sequentially (interactive queries) or in a batch,
at any multi-programming level,
the SLAs of a maximum of A (here A = 3) active tenants are met.
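The routing principle can be sketched as follows (a minimal illustration; `Router` and its methods are invented names, not the paper's implementation):

```python
# Minimal sketch of TDD query routing: each active tenant gets exclusive
# use of one MPPDB; with A MPPDBs, the SLAs of up to A concurrently
# active tenants can be met. (Illustrative, not the paper's code.)

class Router:
    def __init__(self, num_mppdbs):
        self.free = list(range(num_mppdbs))  # every MPPDB hosts all tenants
        self.assigned = {}                   # tenant -> MPPDB currently in use

    def route(self, tenant):
        """Return the MPPDB to which this tenant's queries are sent."""
        if tenant not in self.assigned:
            if not self.free:                # more than A tenants active
                raise RuntimeError("no exclusive MPPDB available")
            self.assigned[tenant] = self.free.pop(0)
        return self.assigned[tenant]

    def release(self, tenant):               # tenant becomes inactive
        self.free.append(self.assigned.pop(tenant))

r = Router(num_mppdbs=3)
# T1, T5, T2 become active: each is routed to its own MPPDB.
m1, m5, m2 = r.route("T1"), r.route("T5"), r.route("T2")
assert len({m1, m5, m2}) == 3
```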
Scaling up TDD
A Realistic Example
● 5,000 tenants
● on average 10% of tenants active
● = 500 active tenants
TDD property:
replication factor = 500 (# of active tenants in peak hour)
TDD guarantee:
the SLAs of a maximum of 500 active tenants are met.
500+ replicas!!!
Scaling up TDD: Solution
● further divide the tenants into groups
● each group uses its own TDD
● considerations
o minimize the total # of nodes used
o keep the replication factor reasonable
 # of active tenants in each tenant group is low
e.g., only 3 tenants active together
e.g., divide 5,000 tenants into 300 groups
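A minimal sketch of the grouping idea. This is a deliberately simplified single-dimension pass, NOT the paper's 2-Step Grouping algorithm; `group_tenants`, `p`, and `max_active` are illustrative names, and the paper's multi-dimensional constraints yield different group counts (e.g., the 300 groups above):

```python
# Hedged sketch: assume each tenant is active with probability p, and cap
# the expected number of concurrently active tenants per group so that
# each group's TDD needs only a small replication factor.

def group_tenants(tenants, p=0.1, max_active=3):
    """Split tenants into groups whose expected # of active tenants <= max_active."""
    cap = round(max_active / p)        # tenants per group (e.g., 3 / 0.1 = 30)
    groups = []
    for t in tenants:
        if not groups or len(groups[-1]) == cap:
            groups.append([])          # open a new group (a new TDD)
        groups[-1].append(t)
    return groups

groups = group_tenants([f"T{i}" for i in range(5000)])
# 5,000 tenants in groups of 30 -> 167 groups under this toy model
```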
Scaling up TDD
● Formulate as a Largest Item Vector Bin Packing Problem with Fuzzy Capacity
● Devise an effective heuristic solution called the 2-Step Grouping algorithm
Thrifty - System Architecture
Experimental Results -- Overview
● Setup: MulTe-like
● 5,000 tenants
● 2 - 32 nodes per tenant
● 200 GB to 3.2 TB per tenant
~89,300 nodes requested (without replicas)
after consolidation: ~15,000 nodes only, already including high availability (3 replicas!)
saves 83.3% of nodes!!!
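A quick sanity check of the savings figure, using the node counts above:

```python
# Quick check of the reported saving (numbers as stated on the slide).
requested = 89_300       # nodes requested by tenants, without replicas
consolidated = 15_000    # nodes used after consolidation (incl. 3 replicas)
savings = 1 - consolidated / requested
# savings is about 0.832, i.e., roughly the reported 83.3% node saving
```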
Consolidation Effectiveness
[Chart: consolidation effectiveness of 86.5% and 79.3%]
Conclusions
Thrifty - a Prototype of MPPDB as a Service
● Shared-Process Consolidation
● provides High Availability
● guarantees Query Performance SLAs
● supports General Tenants
Thank You
any questions?
Tenant-Driven Design (TDD)
● Cluster Design
o divide the machine nodes in the cluster into A groups
 each group of nodes deploys a separate MPPDB
● Tenant Placement
o each MPPDB hosts all tenants
● Query Routing
o route a tenant's query to an MPPDB that contains its data
TDD's property:
enforces a replication factor of A for each tenant
Consolidation Effectiveness -
under Higher Active Tenant Ratio
Heuristic Algorithm -
2-Step Grouping
2-Step Grouping vs. Standard Heuristics
● achieves a higher consolidation ratio in all dimensions
● returns results in only a few hours for 10k tenants
Lightweight Elastic Scaling
● Reactive approach
o if the SLA compliance is lower than 99.9% based on the past 24 hours of statistics, the system triggers elastic scaling
o a new MPPDB is brought online, and the tenant Tx that was most active in the past 24 hours is deployed to it
o all of Tx's queries are routed to the new MPPDB once it is online
● Example
o three 10-node MPPDBs serve 20 tenants
o the SLA guarantee drops from 99.95% to 99.0%
 solution: start a new 10-node MPPDB
● costs 30 minutes for 1 TB of data (with parallel loading)
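The reactive trigger can be sketched as a simple threshold check (illustrative; `should_scale` is a made-up name, not the system's API):

```python
# Hedged sketch of the reactive elastic-scaling trigger: queries_ok and
# queries_total summarize the past 24 hours of query statistics.

def should_scale(queries_ok, queries_total, target=0.999):
    """Trigger scaling when measured SLA compliance drops below target."""
    if queries_total == 0:
        return False                  # no evidence yet, no action
    return queries_ok / queries_total < target

# Example from the slide: guarantee drops from 99.95% to 99.0%.
assert not should_scale(9995, 10000)  # 99.95% >= 99.9%: no action
assert should_scale(9900, 10000)      # 99.0% < 99.9%: start a new MPPDB
```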
Lightweight Elastic Scaling
[Chart: query latency over time, with two elastic-scaling trigger/finish events marked]
Manual Tuning
A system administrator may instead use the parameter U (the number of nodes in an MPPDB) to tune manually.
Use case
● a tiny time percentage of possible SLA violations
o three 10-node MPPDBs
o serve 20 tenants
o the SLA guarantee drops from 99.9% to 99.8%
 solution 1: start a new 10-node MPPDB (+10 nodes)
 solution 2: grow each MPPDB from 10 nodes to 11 nodes (+3 nodes)
● shorter query latency
solution 2 saves 7 nodes (17.5%)
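The node accounting in the use case above, as a quick check (illustrative arithmetic, assuming the +3 comes from adding one node to each of the three MPPDBs):

```python
# Use case: three 10-node MPPDBs, and two ways to add capacity.
sol1 = 10                  # solution 1: start a whole new 10-node MPPDB
sol2 = 1 * 3               # solution 2: grow each of the 3 MPPDBs by 1 node
saved = sol1 - sol2        # solution 2 uses 7 fewer nodes
assert saved == 7
assert saved / (3 * 10 + sol1) == 0.175   # the 17.5% saving on the slide
```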
Linear and Non-linear Scale-out Queries - Concurrent Execution
● consolidation opportunity by merging clusters
● query template required
[Figure: SLA comparison under concurrent execution]
Contributions
● Thrifty
o a prototype implementation of MPPDBaaS
o Tenant-Driven Design (TDD)
 smartly organizes the tenants into groups
 each group has R MPPDBs
 each MPPDB hosts a group of tenants
 prove that TDD solves R2 to R4 all at once!
o scale TDD to 10,000+ tenants and machine nodes (R1)
 cast the problem as a new variant of the vector bin packing problem
 provide an efficient heuristic algorithm
[R = replication factor]
R2. high availability
R3. query latency SLA
R4. generality
Replications
● Vary R
BabyOno dropshipping via API with DroFx.pptx
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 

Parallel Analytics as a Service

  • 1. Parallel Analytics as a Service
       Petrie Wong, Andy He, Eric Lo
       Department of Computing, The Hong Kong Polytechnic University
       {cskfwong, cszahe, ericlo}@comp.polyu.edu.hk
  • 2. Motivation
       Massively Parallel Processing Relational Database Systems (e.g., Vertica, Microsoft PDW, Greenplum, ...):
       o SQL interface
       o easy to scale out
       o fast query time
       o expensive: hardware cost, operation cost, and software license
         (about USD 15K per core or USD 50K per TB of data)
  • 3. Motivation
       ● MPPDB-as-a-Service
         o service provider: consolidates tenants onto fewer machines,
           achieving a lower operation cost
         o tenants: enjoy parallel analytics at a lower price
  • 4. MPPDB as a Service: Example
       ● Shivnath requests a 64-node PDB; Reynold Xin requests a 128-node PDB;
         Eric requests a 32-node PDB; Donald Kossmann: ...
       ● We, the service provider, have 1,000 nodes
       ● Problems to solve:
         o Tenant consolidation
           - Performance (SLA guarantee P): P% (e.g., 99.9%) of the time, a
             tenant finishes its queries without feeling any performance delay.
             Example: Shivnath's Q1 finishes in 100 seconds in isolation;
             after consolidation, Q1 should still finish in 100 seconds.
           - High availability (replication factor R)
         o After consolidation
           - Query routing
           - Monitoring, elastic scaling, and re-consolidation
  • 5. Virtual Machine vs. Shared-Process
       Virtual machine*:
       ● offers hard isolation
       ● incurs significant resource redundancy (e.g., multiple copies of the DBMS)
       Shared-process:
       ● obtains higher consolidation effectiveness
       ● previous research: single-node DBs, OLTP workloads
       ● this paper: MPPDBs, OLAP workloads
       * We believe Amazon's RedShift is based on VMs.
  • 6. Previous Work (DaaS) vs. MPPDBaaS
       DaaS:
       ● OLTP workload: queries touch small data
       ● many tenants, each with small data on a single node
       ● SLA: throughput (e.g., 100 tps) before and after consolidation is the same
       ● many tenants active together after consolidation? SLA OK!
       MPPDBaaS:
       ● OLAP workload: I/O-intensive
       ● many tenants, each with "big" data on multiple nodes
       ● SLA: query latency before and after consolidation is the same
       ● many tenants active together after consolidation? Easily violates the SLA!
  • 7. Requirements of an MPPDBaaS
       R1. Support 10,000+ tenants and machine nodes
       R2. Offer high-availability service: replicate tenants' data
       R3. Offer query-latency-based performance SLA guarantees:
           needs performance prediction techniques
       R4. Support general tenants: a tenant's query templates may not be known
           in advance, which voids R3, because most performance prediction
           techniques require the query templates to be given
  • 8. Our Solution: Tenant-Driven Design
       ● Observation: the active-tenant ratio in real multi-tenant environments
         is low (in IBM's DaaS, ~10% of tenants are active at the same time)
       ● Idea: come up with a deployment plan that lets each active tenant
         exclusively use an MPPDB
       B. Reinwald. Multitenancy. University of Washington and Microsoft
       Research Summer Institute, 2010.
  • 9. Tenant-Driven Design: A Toy Example
       ● 10 tenants: T1, T2, ..., T10

         Tenant           T1  T2  T3  T4  T5  ... T10 | Total
         Requested Nodes   6   6   5   5   2  ...   2 |    34

       ● With R = 3, TDD creates 3 MPPDBs
       ● The number of nodes of each MPPDB = the maximum number of nodes
         requested by any tenant

         Group    MPPDB0    MPPDB1    MPPDB2
         Nodes    6         6         6
         Hosting  T1 - T10  T1 - T10  T1 - T10

       TDD property: replication factor = A (# of tenants active in the peak hour)
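The sizing rule on this slide can be sketched in a few lines. This is our own illustration (function and variable names are ours, not Thrifty's), assuming A, the number of tenants active in the peak hour, is given:

```python
# Minimal sketch of the Tenant-Driven Design (TDD) sizing rule from the slide.
# Assumption: A = number of tenants concurrently active in the peak hour.

def tdd_plan(requested_nodes, A):
    """Return (num_mppdbs, nodes_per_mppdb, total_nodes) for a TDD deployment.

    TDD creates A identical MPPDBs; each is sized to the largest single
    tenant request, and every MPPDB hosts a replica of every tenant.
    """
    nodes_per_mppdb = max(requested_nodes)  # the largest tenant dictates the size
    return A, nodes_per_mppdb, A * nodes_per_mppdb

# The 10-tenant toy example (T1..T10 request 34 nodes in total):
requests = [6, 6, 5, 5, 2, 2, 2, 2, 2, 2]
print(tdd_plan(requests, A=3))  # → (3, 6, 18): 18 nodes instead of 34
```

Note how consolidation falls out of the design: 3 MPPDBs of 6 nodes each (18 nodes) serve tenants who collectively requested 34.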
  • 10. Tenant-Driven Design: Query Routing Example
        ● Routing principle: let a tenant exclusively use one MPPDB!
        ● Example (A = 3): between 00:00 and 24:00 GMT, T1's queries Q1-Q4 are
          routed to one MPPDB while T2's queries Q5-Q7 are routed to another
          (timeline diagram over MPPDB0, MPPDB1, MPPDB2)
        ● TDD's SLA guarantee: no matter what query
          o ad-hoc (a new query seen for the first time) or with a given template
          o linear or non-linear scale-out
          o submitted sequentially (interactive queries) or in a batch at any
            multi-programming level
          the SLAs of a maximum of A active tenants are met.
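The routing principle above can be sketched as a tiny dispatcher. This is a hedged illustration (class and method names are ours): an active tenant is pinned to a free MPPDB, and only when all MPPDBs are busy does a query fall back to MPPDB0, which matches the fallback described in the manual-tuning notes later:

```python
# Sketch of TDD's routing principle: each active tenant exclusively uses
# one MPPDB. Names are ours, not Thrifty's API.

class Router:
    def __init__(self, num_mppdbs):
        self.free = set(range(num_mppdbs))   # MPPDB ids not serving anyone
        self.assigned = {}                   # tenant -> MPPDB id

    def route(self, tenant):
        """Return the MPPDB id that should execute this tenant's query."""
        if tenant in self.assigned:          # already pinned: keep exclusivity
            return self.assigned[tenant]
        if self.free:                        # pin the tenant to a free MPPDB
            mppdb = self.free.pop()
            self.assigned[tenant] = mppdb
            return mppdb
        return 0                             # all busy: fall back to MPPDB0

    def release(self, tenant):
        """The tenant became inactive; free its MPPDB for others."""
        self.free.add(self.assigned.pop(tenant))

r = Router(num_mppdbs=3)
print(r.route("T1"), r.route("T2"))  # two active tenants, two distinct MPPDBs
```

With A = 3 MPPDBs, up to three concurrently active tenants each get a dedicated MPPDB, which is exactly why the SLAs of a maximum of A active tenants are met.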
  • 11. Scaling up TDD: A Realistic Example
        ● 5,000 tenants; on average 10% of tenants are active → 500 active tenants
        ● TDD guarantee: the SLAs of a maximum of 500 active tenants are met
        ● TDD property: replication factor = 500 (the # of tenants active in
          the peak hour) → 500+ replicas!!!
  • 12. Scaling up TDD: Solution
        ● Further divide the tenants into groups (e.g., divide 5,000 tenants
          into 300 groups); each group uses one TDD
        ● Considerations:
          o minimize the total # of nodes used
          o keep a reasonable replication factor: the # of active tenants in
            each tenant group is low (e.g., only 3 tenants active together)
  • 13. Scaling up TDD
        ● Formulated as the Largest Item Vector Bin Packing Problem with
          Fuzzy Capacity
        ● Devised an effective heuristic called the 2-Step Grouping algorithm
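The slide does not spell out 2-Step Grouping itself. As a rough illustration of the grouping objective only, here is a first-fit greedy baseline, not the paper's algorithm; the per-tenant activity-probability model and all names are our assumptions:

```python
# NOT the paper's 2-Step Grouping: a first-fit greedy baseline illustrating
# the objective "keep the expected number of concurrently active tenants in
# each group at or below A", so each group's TDD needs only A replicas.

def greedy_group(activity, A):
    """activity: {tenant: probability of being active at a given moment}.
    Returns tenant groups whose expected concurrent activity is <= A."""
    groups, loads = [], []   # loads[i] = expected active tenants in group i
    # Place the most active tenants first, first-fit style.
    for tenant, p in sorted(activity.items(), key=lambda kv: -kv[1]):
        for i, load in enumerate(loads):
            if load + p <= A:            # group still has headroom
                groups[i].append(tenant)
                loads[i] += p
                break
        else:                            # no existing group fits: open a new one
            groups.append([tenant])
            loads.append(p)
    return groups

acts = {f"T{i}": 0.25 for i in range(1, 61)}  # 60 tenants, each 25% active
print(len(greedy_group(acts, A=3)))           # → 5 groups of 12 tenants
```

The real problem is harder (each tenant is a vector of node and activity demands, and capacities are fuzzy), which is why the paper casts it as a new bin-packing variant rather than using a plain greedy pass like this one.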
  • 14. Thrifty: System Architecture (architecture diagram)
  • 15. Experimental Results: Overview
        ● Setup: MulTe-like; 5,000 tenants; 2-32 nodes each; 200 GB to 3.2 TB
          of data per tenant
        ● ~89,300 nodes requested (without replicas)
        ● ~15,000 nodes after consolidation, already including high
          availability (3 replicas!) → saves 83.3% of the nodes!
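The headline saving can be recomputed directly from the two node counts on the slide:

```python
# Consolidation saving recomputed from the slide's figures.
requested = 89_300   # nodes requested by the 5,000 tenants, without replicas
deployed  = 15_000   # nodes after consolidation, including 3 replicas
print(round(1 - deployed / requested, 3))  # → 0.832, i.e., ~83.3% fewer nodes
```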
  • 16. Consolidation Effectiveness (charts: 86.5%, 79.3%)
  • 17. Conclusions
        Thrifty, a prototype of MPPDB-as-a-Service:
        ● shared-process consolidation
        ● provides high availability
        ● guarantees query-performance SLAs
        ● supports general tenants
  • 18. Thank you. Any questions?
  • 19. Tenant-Driven Design (TDD)
        ● Cluster design: divide the machine nodes in the cluster into A
          groups; each group of nodes deploys a separate MPPDB
        ● Tenant placement: each MPPDB hosts all tenants
        ● Query routing: route a tenant's query to an MPPDB that contains
          its data
        TDD property: enforces a replication factor of A for each tenant
  • 20. Consolidation Effectiveness under a Higher Active-Tenant Ratio (chart)
  • 21. Heuristic Algorithm: 2-Step Grouping (diagram)
  • 22. 2-Step Grouping vs. Standard Heuristics
        ● achieves a higher consolidation ratio in all dimensions
        ● produces a result in only a few hours for 10K tenants
  • 23. Lightweight Elastic Scaling
        ● Reactive approach
          o if the SLA guarantee falls below 99.9% based on the past 24 hours
            of statistics, the system triggers elastic scaling
          o a new MPPDB is brought online, and the tenant Tx that was most
            active in the past 24 hours is deployed to it
          o once the new MPPDB is online, all of Tx's queries are routed to it
        ● Example
          o three 10-node MPPDBs serve 20 tenants
          o the SLA guarantee drops from 99.95% to 99.0%
          o solution: start a new 10-node MPPDB
        ● costs 30 minutes for 1 TB of data (with parallel loading)
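The reactive trigger described above can be sketched as a sliding-window check. This is our illustration, not Thrifty's implementation; the per-minute sampling granularity and all names are assumptions:

```python
# Hedged sketch of the reactive trigger for lightweight elastic scaling.
# RT-TTP here = fraction of the 24-hour window in which every active tenant
# had an MPPDB to itself (an "SLA-clean" minute).

from collections import deque

class ScalingMonitor:
    def __init__(self, sla_guarantee=0.999, window=24 * 60):
        self.sla = sla_guarantee
        self.samples = deque(maxlen=window)  # 1 sample/minute, 24-hour window

    def record(self, sla_clean_minute):
        """Record one per-minute observation (True = SLA-clean minute)."""
        self.samples.append(bool(sla_clean_minute))

    def should_scale(self):
        """Trigger scaling when the windowed RT-TTP drops below the guarantee."""
        if not self.samples:
            return False
        rt_ttp = sum(self.samples) / len(self.samples)
        return rt_ttp < self.sla

m = ScalingMonitor()
for _ in range(1438):
    m.record(True)
m.record(False)          # 2 bad minutes of 1440 → RT-TTP ≈ 99.86% < 99.9%
m.record(False)
print(m.should_scale())  # → True
```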
  • 24. Lightweight Elastic Scaling (timeline chart: trigger ... finish)
  • 25. Manual Tuning
        ● A system administrator may instead use the parameter U (the number
          of nodes in a MPPDB) to manually tune the SLA guarantee
        ● Use case: a tiny time percentage of possible SLA violations
          o three 10-node MPPDBs serve 20 tenants
          o the SLA guarantee drops from 99.9% to 99.8%
          o solution 1: start a new 10-node MPPDB (+10 nodes)
          o solution 2: increase the 10-node MPPDBs to 11 nodes (+3 nodes),
            giving shorter query latency
        ● solution 2 saves 7 nodes (17.5%)
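The node arithmetic behind the 17.5% figure, recomputed from the slide's numbers:

```python
# Node-count arithmetic for the manual-tuning example (figures from the slide).
baseline  = 3 * 10         # three 10-node MPPDBs serving the tenant group
solution1 = baseline + 10  # add a whole new 10-node MPPDB
solution2 = baseline + 3   # add a few nodes instead (manual tuning via U)
saved = solution1 - solution2
print(saved, saved / solution1)  # → 7 0.175, i.e., 7 nodes (17.5%) saved
```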
  • 26. Linear and Non-Linear Scale-Out Queries: Concurrent Execution
        ● consolidation opportunity by merging clusters
        ● query template required
        (charts with SLA thresholds)
  • 27. Contributions
        ● Thrifty: a prototype implementation of MPPDBaaS
          o Tenant-Driven Design (TDD)
            - smartly organizes the tenants into groups
            - each group has R MPPDBs; each MPPDB hosts a group of tenants
            - we prove that TDD solves R2 to R4 in one stroke
              [R = replication factor; R2: high availability; R3: query-latency
              SLA; R4: generality]
          o Scaling TDD to 10,000+ tenants and machine nodes (R1):
            - cast the problem as a new variant of the vector bin packing
              problem
            - provide an efficient heuristic algorithm

Editor's Notes

  1. Recently, massively parallel processing relational database systems (MPPDBs) like Vertica, Microsoft SQL Server Parallel Data Warehouse, and Greenplum have gained much momentum in the big-data analytics market. These MPPDBs feature a SQL interface, fast data loading, easy scale-out, fast recovery, and short analytical query processing times, and have become attractive for companies with analytical tasks on hundreds of gigabytes to some tens of terabytes of data. From the viewpoint of tenants who rent the service, they can enjoy parallel analytics on their "big" data, but the hardware and operation costs (e.g., administration), and especially the software license (which costs about USD 15K per core or USD 50K per TB of data for a commercial MPPDB we know) they pay for a share of the service are likely to be much lower than running everything themselves. From the viewpoint of the service providers, they can achieve a lower total cost of ownership by consolidating the tenants onto a shared hardware infrastructure, so that the service can be run using far fewer machine nodes than the sum of the numbers of machine nodes requested by the tenants. When sharing resources among tenants, it is challenging to ensure the service level agreements (SLAs) of the tenants are met after consolidation. Ideally, a tenant that rents a 4-node MPPDB should have the feeling that its queries are really executed by an MPPDB that runs on four dedicated machine nodes. Therefore, a service provider that offers MPPDB-as-a-service should regard the query latency before consolidation as the performance SLA.
  2. One approach to offering database-as-a-service (DaaS) is based on the use of virtual machines (VMs) [24]. The VM approach offers hard isolation among tenants, so performance isolation among tenants can be easily achieved. However, the VM approach incurs significant redundancy on resources (e.g., multiple copies of the OS and DBMS) [8, 7]. Therefore, the recent trend for DaaS is to use the shared-process multi-tenancy model [13] (i.e., sharing the same database process among tenants). Relational Cloud [8, 7] and SQL Azure [4] use this model, so that a much higher consolidation effectiveness can be obtained. But so far they focus on tenants with transactional workloads.
  3. Transactional (OLTP) workloads generally touch a small amount of data per transaction. So OLTP-database-as-a-service using the shared-process model can still support a large number of concurrent query executions after consolidation [8, 7]. In contrast, analytical workloads are I/O-intensive, so concurrent query executions on the same database instance easily increase query latency and violate the SLA. In Figure 1a, the lines 2T-CON and 4T-CON show the speedup of Q1 (with respect to 1 node) when there are respectively two tenants and four tenants sharing the same database instance and the tenants submit Q1 together. We can see that Q1's performance is 2× slower (in the 2-tenant setting) and 4× slower (in the 4-tenant setting). That is, once the tenants submit queries together, none of their SLAs can be met. This is the challenge of using the shared-process multi-tenancy model to serve analytical workloads.
  4. R1 and R2 are two general cloud computing requirements. R3 is a DaaS requirement specific to analytical workloads. R4 is specific to MPPDBaaS, and R5 is necessary if we want to offer a real MPPDBaaS. These requirements together make the design and implementation of Thrifty very challenging. For example, sharing the database process among tenants (to enjoy a higher consolidation ratio) makes R3 challenging because there is no more hard isolation. While performance prediction techniques for concurrent analytical workloads [9, 1, 21, 16, 2] may seem to help on the surface, these techniques require knowledge of the query sets as input, which contradicts R5 above.
  5. (a) Tenant Activity Monitor: The Tenant Activity Monitor automatically collects the query logs of the deployed MPPDBs, derives the tenant activities, and summarizes the query characteristics of individual tenants. For example, it continuously monitors the active-tenant ratio of all tenants over the past 30 days. This information is passed to the Deployment Advisor for further processing, and is available to the system administrator for advanced system tuning.
     (b) Deployment Advisor: The Deployment Advisor takes as inputs the tenant activity statistics, the individual tenant information (e.g., number of nodes requested), a replication factor R (specified by a system administrator for high availability), and a performance SLA guarantee P (also specified by the system administrator, which ensures that, after consolidation, the tenants can still meet their SLA for P%, e.g., 99.9%, of the time). It returns as output a deployment plan. The deployment plan consists of two parts: cluster design and tenant placement. The cluster design details how the machine nodes in the cluster are arranged into groups. Each group of nodes then runs a single MPPDB instance. The tenant placement specifies which MPPDB instances a tenant should be deployed on; a tenant is deployed on R MPPDB instances. At run-time, the Deployment Advisor may fine-tune the deployment plan according to the live-updated tenant activities supplied by the Tenant Activity Monitor. Currently, Thrifty assumes all nodes in the cluster are identical in configuration (e.g., CPU, RAM).
     (c) Deployment Master: The Deployment Master follows the deployment plan devised by the Deployment Advisor to start the MPPDB instances and deploy the tenants onto them. It also switches off or hibernates nodes that are not listed in the deployment plan. The deployment is supposed to be static for days.
A (re)-consolidation process is expected to be executed periodically, because it is expected that new tenants register with, and existing tenants de-register from, the service.
     (d) Query Router: The Query Router accepts queries from tenants and routes the queries to the proper MPPDB instance according to the tenant placement.
The crux of Thrifty is to devise a good deployment plan that serves thousands of tenants using fewer machine nodes. Thrifty targets tenants with temporally skewed [19] analytical workloads, e.g., tenants from different time zones who mostly submit queries within a certain period (e.g., 9am to 5pm) rather than in a 24×7 fashion. That type of temporally skewed workload is not uncommon in today's data analytics market [5]. The target tenants are supposed to have up to only some tens of terabytes of data but hope to enjoy the parallel speedup offered by the expensive MPPDB products. These are what enable consolidation. Tenants that are always active and/or have more than terabytes of data can be detected by Thrifty and will be excluded from consolidation.
  6. At run-time, tenants' activities may deviate from the history. Consider a tenant-group having a TTP of 99.95% during tenant-grouping. At run-time, some tenants in that group may become more active than what the past tenant activities indicated, and that will result in a lower run-time TTP (RT-TTP). When the RT-TTP drops to, say, 98%, that implies there is 2% of the time in which the tenants are not exclusively served by dedicated MPPDBs and some of their queries may not meet the SLA. Thrifty handles this issue using a standard elastic scaling approach as in other cloud systems. The Tenant Activity Monitor monitors the RT-TTP of each tenant-group over a time window (e.g., 24 hours). When it detects that the RT-TTP of a tenant-group over the past 24 hours has just dropped below the performance SLA guarantee P, Thrifty calls the Deployment Advisor to take action. One possible action is to elastically scale up A by one, i.e., add one more MPPDB. In MPPDBaaS, however, scaling up A is not as simple and efficient as scaling resources (e.g., RAM) in a VM-based cloud environment. Specifically, it takes hours to start a new MPPDB and bulk load all tenants' data before it is ready to use. Table 1 shows the times of starting up a MPPDB and bulk loading tenant data into a MPPDB in our environment. We see that the data loading time dominates the times of starting the machines and creating the MPPDB instances, even though it already achieves a reasonable loading rate (about 1.2 GB/min). Take starting a 10-node MPPDB that contains 1 TB of tenant data as an example. After detecting that the RT-TTP of a tenant-group over the past 24 hours has dropped below the, say, 99.9% performance SLA guarantee threshold, Thrifty needs about 14.5 hours (50446s + 1779s) to prepare the new MPPDB. Assuming the RT-TTP of the tenant-group is consistently 98% during those 14.5 hours, that would result in about 17.4 minutes (14.5 hours × 2%) of concurrent query processing at MPPDB0.
Consider a reasonable time frame of, say, one month: Thrifty has about 43 minutes (1 month × 0.1%) of "grace period" for not meeting the SLA guarantee of a tenant group. So that grace period is barely enough to tolerate scaling A twice. Therefore, we look for a more lightweight approach to implement the elastic scale-up operation, such that we can tolerate more scale-up operations, or possibly other performance-influencing operations (e.g., node failures), within the grace period. Our lightweight approach is to identify the "over-active" tenant(s) in that tenant-group and add a new MPPDB to serve only those tenant(s), instead of adding a MPPDB that serves the whole tenant-group. We see from Table 1 that the data loading time scales linearly with the data size. As the data size of one tenant in a tenant-group must always be less than the data size of all tenants in that group, an elastic scale-up operation that starts a new MPPDB and loads data only for the over-active tenants is more efficient. Specifically, on detecting that the RT-TTP of a tenant-group over the past 24 hours has dropped below P, Thrifty runs an over-active-tenant-identification algorithm to identify the tenant(s) that are more active than the history indicated, and creates a new MPPDB to serve them. The over-active-tenant-identification algorithm is similar to the tenant-grouping algorithm presented in Algorithm 2, with the only change that it takes only the tenants of a particular tenant-group as input. It is expected that some tenant(s) cannot join the same tenant-group anymore; they are identified as over-active and their data are bulk loaded to a new MPPDB. When the new MPPDB is ready, the Deployment Advisor notifies the Query Router to route queries from the over-active tenant(s) to the new MPPDB. For any tenant-group that has ever gone through the above elastic scaling process, Thrifty adds the tenants in those tenant-groups to a re-consolidation list.
Those tenants, together with new tenants, over-active tenants, and tenants in tenant-groups with de-registered tenants, will be re-consolidated in the next (re)-consolidation cycle. Finally, tenants with regular bursts in activity (e.g., there are usually bursts near the end of a fiscal year) can be identified by Thrifty's regular activity monitoring, and they will be excluded from consolidation before the bursts arrive. We regard our approach as a reactive approach to maintaining the SLA performance guarantee. A pessimistic approach is to use A + 1 MPPDBs upfront to serve a tenant-group, which is not cost-effective. A proactive approach is to predict at run-time whether the RT-TTP will soon drop below P and, if so, proactively trigger lightweight elastic scaling. That approach, however, is subject to prediction errors and spikes (e.g., a sharp drop of RT-TTP followed by a sharp rise) in tenant activities.
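The timing arithmetic in the note above (14.5 hours of preparation, 17.4 minutes of concurrent processing, and a ~43-minute monthly grace period) can be checked directly from its figures:

```python
# Worked arithmetic from the elastic-scaling discussion (figures from the text).
startup_s = 1779            # start machines + create the MPPDB instance
loading_s = 50446           # bulk load 1 TB of tenant data (~1.2 GB/min)
prep_hours = (startup_s + loading_s) / 3600
print(round(prep_hours, 1))              # → 14.5 hours to prepare the new MPPDB

concurrent_min = prep_hours * 60 * 0.02  # RT-TTP at 98% → 2% of that window
print(round(concurrent_min, 1))          # → 17.4 minutes of concurrent processing

grace_min = 30 * 24 * 60 * 0.001         # one month at a 99.9% guarantee
print(round(grace_min, 1))               # → 43.2 minutes of grace period
```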
  7. The elastic scaling operation of Thrifty automatically adds a MPPDB to serve the over-active tenants when the RT-TTP of a tenant-group over the past 24 hours stays below the SLA performance guarantee P. That MPPDB remains until the next re-consolidation cycle (to minimize the overhead of scaling up and down). A system administrator may override this action. Consider a performance SLA guarantee of 99.9% and a tenant-group served by three 10-node MPPDBs (A = R = 3). If there is a minor increase in the active-tenant ratio of that group, resulting in a RT-TTP of, say, 99.8% over the past 24 hours, and if the system administrator realizes from the Tenant Activity Monitor that the RT-TTP of that group is not continuously dropping but stays flat, she may think that Thrifty's elastic scaling, which starts a new 10-node MPPDB for the over-active tenants, is overkill (because there is only 99.9% − 99.8% = 0.1% of time with possible SLA violations). In situations like this, the system administrator may instead use the parameter U to manually tune the performance SLA guarantee of that tenant-group. Recall that the tenant-driven design reserves a parameter U to control the number of nodes used by MPPDB0 (Section 3). In our example above, the system administrator may override Thrifty's elastic scaling and instead increase U from 10 to, say, 12. Recall that the query routing algorithm of Thrifty routes a query to MPPDB0 if all MPPDBs are busy (Algorithm 1, Line 10). Whenever, in the 0.2% of time, the fourth (or the fifth, ...) active tenant submits a query, that query is routed to MPPDB0 for concurrent processing. So it is possible that the 99.9% SLAs can be met empirically by having a higher degree of parallelism and more computation power (U + 2) in MPPDB0 (like Point C in Figure 1b).
  8. The second consolidation opportunity can be observed by looking at Figure 1b, which shows the query latency of Q1. First, assume there are four tenants, each renting a 2-node MPPDB to process a TPC-H scale factor 100 dataset. The service provider basically requires a total of 4 × 2 = 8 separate nodes to host them. The SLA of TPC-H Q1 for each tenant is then A seconds, as illustrated in Figure 1b. What if the service provider uses a 6-node MPPDB to host all four tenants (i.e., every tenant partitions their data onto those six nodes)? Let us assume that only one of the four tenants is active (e.g., submits a Q1 instance) and the rest are inactive, i.e., not submitting any query. That active tenant then obtains the result of Q1 in B seconds, so the SLA is met. When two of the four tenants are active together and they concurrently submit an instance of Q1 to the 6-node MPPDB, each tenant obtains the result of Q1 in C seconds, so their SLAs are also met. The above shows a consolidation opportunity that is more specific to parallel analytical workloads.