SlideShare a Scribd company logo
1 of 28
Download to read offline
MIDDLEWARE SYSTEMS
RESEARCH GROUP
MSRG.ORG

Solving Big Data Challenges for
Enterprise Application
Performance Management
Tilmann Rabl, Mohammad Sadoghi, Hans-Arno Jacobsen
University of Toronto

Sergio Gomez-Villamor
Universitat Politecnica de Catalunya

Victor Muntes-Mulero*, Serge Mankowskii
CA Labs (Europe*)
Agenda


Application Performance Management



APM Benchmark



Benchmarked Systems



Test Results

Tilmann Rabl - VLDB 2012

8/26/2012
Motivation

Tilmann Rabl - VLDB 2012

8/26/2012
Enterprise System Architecture
SAP
Agent

Client
Client
Client

Identity
Manager

Agent

Web Server
Agent

Application
Server

Agent

Application
Server

Agent

Message
Queue

Database

Agent

Message
Broker

Agent

Agent

Web
Service
Agent

Application
Server

Client

Main Frame

3rd Party

Database

Agent

Measurements

Agent

Agent
Tilmann Rabl - VLDB 2012

Agent
8/26/2012
Application Performance
Management
SAP
Agent

Client
Client
Client

Identity
Manager

Agent

Web Server
Agent

Application
Server

Agent

Application
Server

Agent

Message
Queue

Database

Agent

Message
Broker

Agent

Agent

Web
Service
Agent

Application
Server

Client

Main Frame

3rd Party

Database

Agent

Measurements

Agent

Agent
Tilmann Rabl - VLDB 2012

Agent
8/26/2012
APM in Numbers


Nodes in an enterprise
system










10 sec






100B / event

Raw data


Up to 50K
Avg 10K

Reporting period

Data size


Metrics per node




100 – 10K



100MB / sec
355GB / h
2.8 PB / y

Event rate at storage


> 1M / sec

Tilmann Rabl - VLDB 2012

8/24/2012
APM Benchmark


Based on Yahoo! Cloud Serving Benchmark (YCSB)




Single table






CRUD operations
25 byte key
5 values (10 byte each)
75 byte / record

Five workloads



50 rows scan length
Tilmann Rabl - VLDB 2012

8/24/2012
Benchmarked Systems


6 systems











Categories









Cassandra
HBase
Project Voldemort
Redis
VoltDB
MySQL
Key-value stores
Extensible record stores
Scalable relational stores
Sharded systems
Main memory systems
Disk based systems

Chosen by


Previous results, popularity, maturity, availability

Tilmann Rabl - VLDB 2012

8/26/2012
Classification of Benchmarked
Systems

Disk-Based
Stores
Distributed
In-Memory Sharded
Stores
Key-value stores

Extensible Row
Stores

Scalable Relational
Stores

Tilmann Rabl - VLDB 2012

8/26/2012
Experimental Testbed
Two clusters
 Cluster M (memory-bound)








16 nodes (plus master node)
2x quad core CPU,16 GB RAM, 2x 74GB HDD (RAID 0)
10 million records per node (~700 MB raw)
128 connections per node (8 per core)

Cluster D (disk-bound)






24 nodes
2x dual core CPU, 4 GB RAM, 74 GB HDD
150 million records on 12 nodes (~10.5 GB raw)
8 connections per node
Tilmann Rabl - VLDB 2012

8/24/2012
Evaluation






Minimum 3 runs per workload and system
Fresh install for each run
10 minutes runtime
Up to 5 clients for 12 nodes




To make sure YCSB is no bottleneck

Maximum achievable throughput

Tilmann Rabl - VLDB 2012

8/24/2012
Workload W - Throughput







Cassandra dominates
Higher throughput for Cassandra and HBase
Lower throughput for other systems
Scalability not as good for all web stores
VoltDB best single node throughput
Tilmann Rabl - VLDB 2012

8/24/2012
Workload W – Latency Writes



Same latencies for




Cassandra,Voldemort,VoltDB,Redis, MySQL

HBase latency increased

Tilmann Rabl - VLDB 2012

8/24/2012
Workload W – Latency Reads



Same latency as for R for




Cassandra,Voldemort, Redis,VoltDB, HBase

HBase latency in second range

Tilmann Rabl - VLDB 2012

8/24/2012
Workload R - Throughput







95% reads, 5% writes
On a single node, main memory systems have best
performance
Linear scalability for web data stores
Sublinear scalability for sharded systems
Slow-down for VoltDB
Tilmann Rabl - VLDB 2012

8/24/2012
Workload RW - Throughput






50% reads, 50% writes
VoltDB highest single node throughput
HBase and Cassandra throughput increase
MySQL and Voldemort throughput reduction
Tilmann Rabl - VLDB 2012

8/24/2012
Workload RS – Latency Scans





HBase latency equal to reads in Workload R
MySQL latency very high due to full table scans
Similar but increased latency for


Cassandra, Redis,VoltDB
Tilmann Rabl - VLDB 2012

8/24/2012
Workload RWS - Throughput





25% reads, 25% scans, 50% writes
Cassandra, HBase, Redis,VoltDB gain performance
MySQL performance 20 – 4 ops / sec

Tilmann Rabl - VLDB 2012

8/24/2012
Bounded Throughput –
Write Latency






Workload R, normalized
Maximum throughput in previous tests 100%
Steadily decreasing for most systems
HBase not as stable (but below 0.1 ms)
Tilmann Rabl - VLDB 2012

8/24/2012
Bounded Throughput –
Read Latency






Redis and Voldemort slightly decrease
HBase two states
Cassandra decreases linearly
MySQL decreases and then stays constant
Tilmann Rabl - VLDB 2012

8/24/2012
Disk Usage






Raw data: 75 byte per record, 0.7GB/10M
Cassandra: 2.5GB / 10M
HBase: 7.5GB / 10M
No compression
Tilmann Rabl - VLDB 2012

8/24/2012
Cluster D Results – Throughput




Disk bound, 150M records on 8 nodes
More writes – more throughput




Cassandra: 26x
Hbase: 15x
Voldemort 3x
Tilmann Rabl - VLDB 2012

8/24/2012
Cluster D Results – Latency Writes






Low latency for writes for all systems
Relatively stable latency for writes
Lowest for HBase
Equal for Voldemort and Cassandra
Tilmann Rabl - VLDB 2012

8/24/2012
Cluster D Results – Latency Reads






Cassandra latency decreases with more writes
Voldemort latency low 5-20ms
HBase latency high (up to 200 ms)
Cassandra in between
Tilmann Rabl - VLDB 2012

8/24/2012
Lessons Learned


YCSB




Cassandra




Higher number of connections might lead to better results

MySQL




Jedis distribution uneven

Voldemort




Difficult setup, special JVM settings necessary

Redis




Optimal tokens for data distribution necessary

HBase




Client to host ratio 1:3 better 1:2

Eager scan evaluation

VoltDB


Synchronous communication slow on multiple nodes
Tilmann Rabl - VLDB 2012

8/24/2012
Conclusions I


Cassandra




HBase




Read and write latency stable at low level

Redis




Scales well, latency decreases with higher scales

Voldemort




Low write latency, high read latency, low throughput

Sharded MySQL




Winner in terms of scalability and throughput

Standalone has high throughput, sharded version does not scale well

VoltDB


High single node throughput, does not scale for synchronous access

Tilmann Rabl - VLDB 2012

8/24/2012
Conclusions II


Cassandra’s performance close to APM requirements




Additional improvements needed for reliably sustaining APM
workload

Future work



Benchmark impact of replication, compression
Monitor the benchmark runs using APM tool

Tilmann Rabl - VLDB 2012

8/26/2012
Thanks!

 Questions?



Contact: Tilmann Rabl
tilmann.rabl@utoronto.ca

Tilmann Rabl - VLDB 2012

8/24/2012

More Related Content

What's hot

Benchmark Showdown: Which Relational Database is the Fastest on AWS?
Benchmark Showdown: Which Relational Database is the Fastest on AWS?Benchmark Showdown: Which Relational Database is the Fastest on AWS?
Benchmark Showdown: Which Relational Database is the Fastest on AWS?Clustrix
 
Database Architecture & Scaling Strategies, in the Cloud & on the Rack
Database Architecture & Scaling Strategies, in the Cloud & on the Rack Database Architecture & Scaling Strategies, in the Cloud & on the Rack
Database Architecture & Scaling Strategies, in the Cloud & on the Rack Clustrix
 
Maximizing performance via tuning and optimization
Maximizing performance via tuning and optimizationMaximizing performance via tuning and optimization
Maximizing performance via tuning and optimizationMariaDB plc
 
RedisConf18 - Techniques for Synchronizing In-Memory Caches with Redis
RedisConf18 - Techniques for Synchronizing In-Memory Caches with RedisRedisConf18 - Techniques for Synchronizing In-Memory Caches with Redis
RedisConf18 - Techniques for Synchronizing In-Memory Caches with RedisRedis Labs
 
Introducing Venice - Strata NYC 2017
Introducing Venice - Strata NYC 2017Introducing Venice - Strata NYC 2017
Introducing Venice - Strata NYC 2017Felix GV
 
Fast Online Access to Massive Offline Data - SECR 2016
Fast Online Access to Massive Offline Data - SECR 2016Fast Online Access to Massive Offline Data - SECR 2016
Fast Online Access to Massive Offline Data - SECR 2016Felix GV
 
Distributed transactions
Distributed transactionsDistributed transactions
Distributed transactionsAritra Das
 
Scaling RDBMS on AWS- ClustrixDB @AWS Meetup 20160711
Scaling RDBMS on AWS- ClustrixDB @AWS Meetup 20160711Scaling RDBMS on AWS- ClustrixDB @AWS Meetup 20160711
Scaling RDBMS on AWS- ClustrixDB @AWS Meetup 20160711Dave Anselmi
 
hbaseconasia2017: Apache HBase at Netease
hbaseconasia2017: Apache HBase at Neteasehbaseconasia2017: Apache HBase at Netease
hbaseconasia2017: Apache HBase at NeteaseHBaseCon
 
Running database infrastructure on containers
Running database infrastructure on containersRunning database infrastructure on containers
Running database infrastructure on containersMariaDB plc
 
Date-tiered Compaction Policy for Time-series Data
Date-tiered Compaction Policy for Time-series DataDate-tiered Compaction Policy for Time-series Data
Date-tiered Compaction Policy for Time-series DataHBaseCon
 
Apache Flume - DataDayTexas
Apache Flume - DataDayTexasApache Flume - DataDayTexas
Apache Flume - DataDayTexasArvind Prabhakar
 
MariaDB Performance Tuning and Optimization
MariaDB Performance Tuning and OptimizationMariaDB Performance Tuning and Optimization
MariaDB Performance Tuning and OptimizationMariaDB plc
 
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...confluent
 

What's hot (15)

Benchmark Showdown: Which Relational Database is the Fastest on AWS?
Benchmark Showdown: Which Relational Database is the Fastest on AWS?Benchmark Showdown: Which Relational Database is the Fastest on AWS?
Benchmark Showdown: Which Relational Database is the Fastest on AWS?
 
Database Architecture & Scaling Strategies, in the Cloud & on the Rack
Database Architecture & Scaling Strategies, in the Cloud & on the Rack Database Architecture & Scaling Strategies, in the Cloud & on the Rack
Database Architecture & Scaling Strategies, in the Cloud & on the Rack
 
Maximizing performance via tuning and optimization
Maximizing performance via tuning and optimizationMaximizing performance via tuning and optimization
Maximizing performance via tuning and optimization
 
Real time database
Real time databaseReal time database
Real time database
 
RedisConf18 - Techniques for Synchronizing In-Memory Caches with Redis
RedisConf18 - Techniques for Synchronizing In-Memory Caches with RedisRedisConf18 - Techniques for Synchronizing In-Memory Caches with Redis
RedisConf18 - Techniques for Synchronizing In-Memory Caches with Redis
 
Introducing Venice - Strata NYC 2017
Introducing Venice - Strata NYC 2017Introducing Venice - Strata NYC 2017
Introducing Venice - Strata NYC 2017
 
Fast Online Access to Massive Offline Data - SECR 2016
Fast Online Access to Massive Offline Data - SECR 2016Fast Online Access to Massive Offline Data - SECR 2016
Fast Online Access to Massive Offline Data - SECR 2016
 
Distributed transactions
Distributed transactionsDistributed transactions
Distributed transactions
 
Scaling RDBMS on AWS- ClustrixDB @AWS Meetup 20160711
Scaling RDBMS on AWS- ClustrixDB @AWS Meetup 20160711Scaling RDBMS on AWS- ClustrixDB @AWS Meetup 20160711
Scaling RDBMS on AWS- ClustrixDB @AWS Meetup 20160711
 
hbaseconasia2017: Apache HBase at Netease
hbaseconasia2017: Apache HBase at Neteasehbaseconasia2017: Apache HBase at Netease
hbaseconasia2017: Apache HBase at Netease
 
Running database infrastructure on containers
Running database infrastructure on containersRunning database infrastructure on containers
Running database infrastructure on containers
 
Date-tiered Compaction Policy for Time-series Data
Date-tiered Compaction Policy for Time-series DataDate-tiered Compaction Policy for Time-series Data
Date-tiered Compaction Policy for Time-series Data
 
Apache Flume - DataDayTexas
Apache Flume - DataDayTexasApache Flume - DataDayTexas
Apache Flume - DataDayTexas
 
MariaDB Performance Tuning and Optimization
MariaDB Performance Tuning and OptimizationMariaDB Performance Tuning and Optimization
MariaDB Performance Tuning and Optimization
 
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
 

Viewers also liked

RingLead: Accelerating Revenue Through Smart Data
RingLead: Accelerating Revenue Through Smart DataRingLead: Accelerating Revenue Through Smart Data
RingLead: Accelerating Revenue Through Smart DataRingLead
 
Data Decay: Why Your CRM Data SUCKS
Data Decay: Why Your CRM Data SUCKSData Decay: Why Your CRM Data SUCKS
Data Decay: Why Your CRM Data SUCKSRingLead
 
Supermicro High Performance Enterprise Hadoop Infrastructure
Supermicro High Performance Enterprise Hadoop InfrastructureSupermicro High Performance Enterprise Hadoop Infrastructure
Supermicro High Performance Enterprise Hadoop Infrastructuretempledf
 
Composite Information Server
Composite Information ServerComposite Information Server
Composite Information Servertempledf
 
Pervasive DataRush
Pervasive DataRushPervasive DataRush
Pervasive DataRushtempledf
 
How Intronis Reduced the Data Import Process With RingLead
How Intronis Reduced the Data Import Process With RingLead How Intronis Reduced the Data Import Process With RingLead
How Intronis Reduced the Data Import Process With RingLead RingLead
 
How to create self-service analytics tool from activity logs garbage
How to create self-service analytics tool from activity logs garbageHow to create self-service analytics tool from activity logs garbage
How to create self-service analytics tool from activity logs garbageAnton Anokhin
 
Business driven BI - Self-service Techniques
Business driven BI - Self-service TechniquesBusiness driven BI - Self-service Techniques
Business driven BI - Self-service TechniquesEckerson Group
 
Datameer Analytics Solution
Datameer Analytics SolutionDatameer Analytics Solution
Datameer Analytics Solutiontempledf
 
Dimension Data – Enabling the Journey to the Cloud: Real Examples
Dimension Data – Enabling the Journey to the Cloud: Real ExamplesDimension Data – Enabling the Journey to the Cloud: Real Examples
Dimension Data – Enabling the Journey to the Cloud: Real Examplesitnewsafrica
 
Data Rehab Series: Automating Taxonomy
Data Rehab Series: Automating TaxonomyData Rehab Series: Automating Taxonomy
Data Rehab Series: Automating TaxonomyRingLead
 
Run IT as Business Meetup self-service BI
Run IT as Business Meetup self-service BIRun IT as Business Meetup self-service BI
Run IT as Business Meetup self-service BIMark Wu
 
Self Serve Training in SAP
Self Serve Training in SAPSelf Serve Training in SAP
Self Serve Training in SAPCristin Merritt
 
Tableau Administrators User Group - Data Governance
Tableau Administrators User Group - Data GovernanceTableau Administrators User Group - Data Governance
Tableau Administrators User Group - Data GovernanceMark Wu
 
Best Practices for Big Data Analytics with Machine Learning by Datameer
Best Practices for Big Data Analytics with Machine Learning by DatameerBest Practices for Big Data Analytics with Machine Learning by Datameer
Best Practices for Big Data Analytics with Machine Learning by DatameerDatameer
 
Enabling Self Service Business Intelligence using Excel
Enabling Self Service Business Intelligenceusing ExcelEnabling Self Service Business Intelligenceusing Excel
Enabling Self Service Business Intelligence using ExcelAlan Koo
 

Viewers also liked (18)

RingLead: Accelerating Revenue Through Smart Data
RingLead: Accelerating Revenue Through Smart DataRingLead: Accelerating Revenue Through Smart Data
RingLead: Accelerating Revenue Through Smart Data
 
Data Decay: Why Your CRM Data SUCKS
Data Decay: Why Your CRM Data SUCKSData Decay: Why Your CRM Data SUCKS
Data Decay: Why Your CRM Data SUCKS
 
Supermicro High Performance Enterprise Hadoop Infrastructure
Supermicro High Performance Enterprise Hadoop InfrastructureSupermicro High Performance Enterprise Hadoop Infrastructure
Supermicro High Performance Enterprise Hadoop Infrastructure
 
Composite Information Server
Composite Information ServerComposite Information Server
Composite Information Server
 
Pervasive DataRush
Pervasive DataRushPervasive DataRush
Pervasive DataRush
 
How Intronis Reduced the Data Import Process With RingLead
How Intronis Reduced the Data Import Process With RingLead How Intronis Reduced the Data Import Process With RingLead
How Intronis Reduced the Data Import Process With RingLead
 
How to create self-service analytics tool from activity logs garbage
How to create self-service analytics tool from activity logs garbageHow to create self-service analytics tool from activity logs garbage
How to create self-service analytics tool from activity logs garbage
 
Business driven BI - Self-service Techniques
Business driven BI - Self-service TechniquesBusiness driven BI - Self-service Techniques
Business driven BI - Self-service Techniques
 
Datameer Analytics Solution
Datameer Analytics SolutionDatameer Analytics Solution
Datameer Analytics Solution
 
Learning-based Data Cleaning
Learning-based Data CleaningLearning-based Data Cleaning
Learning-based Data Cleaning
 
Dimension Data – Enabling the Journey to the Cloud: Real Examples
Dimension Data – Enabling the Journey to the Cloud: Real ExamplesDimension Data – Enabling the Journey to the Cloud: Real Examples
Dimension Data – Enabling the Journey to the Cloud: Real Examples
 
Data Rehab Series: Automating Taxonomy
Data Rehab Series: Automating TaxonomyData Rehab Series: Automating Taxonomy
Data Rehab Series: Automating Taxonomy
 
Run IT as Business Meetup self-service BI
Run IT as Business Meetup self-service BIRun IT as Business Meetup self-service BI
Run IT as Business Meetup self-service BI
 
Self Serve Training in SAP
Self Serve Training in SAPSelf Serve Training in SAP
Self Serve Training in SAP
 
Tableau Administrators User Group - Data Governance
Tableau Administrators User Group - Data GovernanceTableau Administrators User Group - Data Governance
Tableau Administrators User Group - Data Governance
 
Best Practices for Big Data Analytics with Machine Learning by Datameer
Best Practices for Big Data Analytics with Machine Learning by DatameerBest Practices for Big Data Analytics with Machine Learning by Datameer
Best Practices for Big Data Analytics with Machine Learning by Datameer
 
Enabling Self Service Business Intelligence using Excel
Enabling Self Service Business Intelligenceusing ExcelEnabling Self Service Business Intelligenceusing Excel
Enabling Self Service Business Intelligence using Excel
 
Self-Service Analytics on Hadoop: Lessons Learned
Self-Service Analytics on Hadoop: Lessons LearnedSelf-Service Analytics on Hadoop: Lessons Learned
Self-Service Analytics on Hadoop: Lessons Learned
 

Similar to Solving Big Data Challenges for Enterprise Application Performance Management

Track 2, session 4, data protection and disaster recovery with riverbed
Track 2, session 4, data protection and disaster recovery with riverbedTrack 2, session 4, data protection and disaster recovery with riverbed
Track 2, session 4, data protection and disaster recovery with riverbedEMC Forum India
 
How to Manage Scale-Out Environments with MariaDB MaxScale
How to Manage Scale-Out Environments with MariaDB MaxScaleHow to Manage Scale-Out Environments with MariaDB MaxScale
How to Manage Scale-Out Environments with MariaDB MaxScaleMariaDB plc
 
Percona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWSPercona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWSPythian
 
PoC: Using a Group Communication System to improve MySQL Replication HA
PoC: Using a Group Communication System to improve MySQL Replication HAPoC: Using a Group Communication System to improve MySQL Replication HA
PoC: Using a Group Communication System to improve MySQL Replication HAUlf Wendel
 
AWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon RedshiftAWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon RedshiftAmazon Web Services
 
Bhupeshbansal bigdata
Bhupeshbansal bigdata Bhupeshbansal bigdata
Bhupeshbansal bigdata Bhupesh Bansal
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon RedshiftAmazon Web Services
 
Minnebar 2013 - Scaling with Cassandra
Minnebar 2013 - Scaling with CassandraMinnebar 2013 - Scaling with Cassandra
Minnebar 2013 - Scaling with CassandraJeff Bollinger
 
Redis vs. MongoDB: Comparing In-Memory Databases with Percona Memory Engine
Redis vs. MongoDB: Comparing In-Memory Databases with Percona Memory EngineRedis vs. MongoDB: Comparing In-Memory Databases with Percona Memory Engine
Redis vs. MongoDB: Comparing In-Memory Databases with Percona Memory EngineScaleGrid.io
 
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010Bhupesh Bansal
 
Hadoop and Voldemort @ LinkedIn
Hadoop and Voldemort @ LinkedInHadoop and Voldemort @ LinkedIn
Hadoop and Voldemort @ LinkedInHadoop User Group
 
DB2 for z/O S Data Sharing
DB2 for z/O S  Data  SharingDB2 for z/O S  Data  Sharing
DB2 for z/O S Data SharingSurekha Parekh
 
Beyond relational database - Building high performance websites using Redis a...
Beyond relational database - Building high performance websites using Redis a...Beyond relational database - Building high performance websites using Redis a...
Beyond relational database - Building high performance websites using Redis a...Dinh Pham
 
Mumps the Internet scale database
Mumps the Internet scale databaseMumps the Internet scale database
Mumps the Internet scale databasegeorge.james
 
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014Amazon Web Services
 
Inside Apache SystemML by Frederick Reiss
Inside Apache SystemML by Frederick ReissInside Apache SystemML by Frederick Reiss
Inside Apache SystemML by Frederick ReissSpark Summit
 
Tech Talk Series, Part 4: How do you achieve high availability in a MySQL env...
Tech Talk Series, Part 4: How do you achieve high availability in a MySQL env...Tech Talk Series, Part 4: How do you achieve high availability in a MySQL env...
Tech Talk Series, Part 4: How do you achieve high availability in a MySQL env...Clustrix
 
Spinnaker VLDB 2011
Spinnaker VLDB 2011Spinnaker VLDB 2011
Spinnaker VLDB 2011sandeep_tata
 

Similar to Solving Big Data Challenges for Enterprise Application Performance Management (20)

Track 2, session 4, data protection and disaster recovery with riverbed
Track 2, session 4, data protection and disaster recovery with riverbedTrack 2, session 4, data protection and disaster recovery with riverbed
Track 2, session 4, data protection and disaster recovery with riverbed
 
How to Manage Scale-Out Environments with MariaDB MaxScale
How to Manage Scale-Out Environments with MariaDB MaxScaleHow to Manage Scale-Out Environments with MariaDB MaxScale
How to Manage Scale-Out Environments with MariaDB MaxScale
 
Percona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWSPercona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWS
 
Greenplum feature
Greenplum featureGreenplum feature
Greenplum feature
 
PoC: Using a Group Communication System to improve MySQL Replication HA
PoC: Using a Group Communication System to improve MySQL Replication HAPoC: Using a Group Communication System to improve MySQL Replication HA
PoC: Using a Group Communication System to improve MySQL Replication HA
 
AWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon RedshiftAWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon Redshift
 
Bhupeshbansal bigdata
Bhupeshbansal bigdata Bhupeshbansal bigdata
Bhupeshbansal bigdata
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Minnebar 2013 - Scaling with Cassandra
Minnebar 2013 - Scaling with CassandraMinnebar 2013 - Scaling with Cassandra
Minnebar 2013 - Scaling with Cassandra
 
Redis vs. MongoDB: Comparing In-Memory Databases with Percona Memory Engine
Redis vs. MongoDB: Comparing In-Memory Databases with Percona Memory EngineRedis vs. MongoDB: Comparing In-Memory Databases with Percona Memory Engine
Redis vs. MongoDB: Comparing In-Memory Databases with Percona Memory Engine
 
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
 
Hadoop and Voldemort @ LinkedIn
Hadoop and Voldemort @ LinkedInHadoop and Voldemort @ LinkedIn
Hadoop and Voldemort @ LinkedIn
 
DB2 for z/O S Data Sharing
DB2 for z/O S  Data  SharingDB2 for z/O S  Data  Sharing
DB2 for z/O S Data Sharing
 
Beyond relational database - Building high performance websites using Redis a...
Beyond relational database - Building high performance websites using Redis a...Beyond relational database - Building high performance websites using Redis a...
Beyond relational database - Building high performance websites using Redis a...
 
Oracle RAC 12c Overview
Oracle RAC 12c OverviewOracle RAC 12c Overview
Oracle RAC 12c Overview
 
Mumps the Internet scale database
Mumps the Internet scale databaseMumps the Internet scale database
Mumps the Internet scale database
 
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
 
Inside Apache SystemML by Frederick Reiss
Inside Apache SystemML by Frederick ReissInside Apache SystemML by Frederick Reiss
Inside Apache SystemML by Frederick Reiss
 
Tech Talk Series, Part 4: How do you achieve high availability in a MySQL env...
Tech Talk Series, Part 4: How do you achieve high availability in a MySQL env...Tech Talk Series, Part 4: How do you achieve high availability in a MySQL env...
Tech Talk Series, Part 4: How do you achieve high availability in a MySQL env...
 
Spinnaker VLDB 2011
Spinnaker VLDB 2011Spinnaker VLDB 2011
Spinnaker VLDB 2011
 

More from Tilmann Rabl

TPC-DI - The First Industry Benchmark for Data Integration
TPC-DI - The First Industry Benchmark for Data IntegrationTPC-DI - The First Industry Benchmark for Data Integration
TPC-DI - The First Industry Benchmark for Data IntegrationTilmann Rabl
 
Crafting bigdatabenchmarks
Crafting bigdatabenchmarksCrafting bigdatabenchmarks
Crafting bigdatabenchmarksTilmann Rabl
 
Big Data Benchmarking Tutorial
Big Data Benchmarking TutorialBig Data Benchmarking Tutorial
Big Data Benchmarking TutorialTilmann Rabl
 
A BigBench Implementation in the Hadoop Ecosystem
A BigBench Implementation in the Hadoop EcosystemA BigBench Implementation in the Hadoop Ecosystem
A BigBench Implementation in the Hadoop EcosystemTilmann Rabl
 
MADES - A Multi-Layered, Adaptive, Distributed Event Store
MADES - A Multi-Layered, Adaptive, Distributed Event StoreMADES - A Multi-Layered, Adaptive, Distributed Event Store
MADES - A Multi-Layered, Adaptive, Distributed Event StoreTilmann Rabl
 
CaSSanDra: An SSD Boosted Key-Value Store
CaSSanDra: An SSD Boosted Key-Value StoreCaSSanDra: An SSD Boosted Key-Value Store
CaSSanDra: An SSD Boosted Key-Value StoreTilmann Rabl
 
Rapid Development of Data Generators Using Meta Generators in PDGF
Rapid Development of Data Generators Using Meta Generators in PDGFRapid Development of Data Generators Using Meta Generators in PDGF
Rapid Development of Data Generators Using Meta Generators in PDGFTilmann Rabl
 

More from Tilmann Rabl (7)

TPC-DI - The First Industry Benchmark for Data Integration
TPC-DI - The First Industry Benchmark for Data IntegrationTPC-DI - The First Industry Benchmark for Data Integration
TPC-DI - The First Industry Benchmark for Data Integration
 
Crafting bigdatabenchmarks
Crafting bigdatabenchmarksCrafting bigdatabenchmarks
Crafting bigdatabenchmarks
 
Big Data Benchmarking Tutorial
Big Data Benchmarking TutorialBig Data Benchmarking Tutorial
Big Data Benchmarking Tutorial
 
A BigBench Implementation in the Hadoop Ecosystem
A BigBench Implementation in the Hadoop EcosystemA BigBench Implementation in the Hadoop Ecosystem
A BigBench Implementation in the Hadoop Ecosystem
 
MADES - A Multi-Layered, Adaptive, Distributed Event Store
MADES - A Multi-Layered, Adaptive, Distributed Event StoreMADES - A Multi-Layered, Adaptive, Distributed Event Store
MADES - A Multi-Layered, Adaptive, Distributed Event Store
 
CaSSanDra: An SSD Boosted Key-Value Store
CaSSanDra: An SSD Boosted Key-Value StoreCaSSanDra: An SSD Boosted Key-Value Store
CaSSanDra: An SSD Boosted Key-Value Store
 
Rapid Development of Data Generators Using Meta Generators in PDGF
Rapid Development of Data Generators Using Meta Generators in PDGFRapid Development of Data Generators Using Meta Generators in PDGF
Rapid Development of Data Generators Using Meta Generators in PDGF
 

Recently uploaded

unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 

Solving Big Data Challenges for Enterprise Application Performance Management

  • 1. MIDDLEWARE SYSTEMS RESEARCH GROUP MSRG.ORG Solving Big Data Challenges for Enterprise Application Performance Management Tilmann Rabl, Mohammad Sadoghi, Hans-Arno Jacobsen University of Toronto Sergio Gomez-Villamor Universitat Politecnica de Catalunya Victor Muntes-Mulero*, Serge Mankowskii CA Labs (Europe*)
  • 2. Agenda  Application Performance Management  APM Benchmark  Benchmarked Systems  Test Results Tilmann Rabl - VLDB 2012 8/26/2012
  • 3. Motivation Tilmann Rabl - VLDB 2012 8/26/2012
  • 4. Enterprise System Architecture SAP Agent Client Client Client Identity Manager Agent Web Server Agent Application Server Agent Application Server Agent Message Queue Database Agent Message Broker Agent Agent Web Service Agent Application Server Client Main Frame 3rd Party Database Agent Measurements Agent Agent Tilmann Rabl - VLDB 2012 Agent 8/26/2012
  • 6. APM in Numbers  Nodes in an enterprise system      10 sec    100B / event Raw data  Up to 50K Avg 10K Reporting period Data size  Metrics per node   100 – 10K  100MB / sec 355GB / h 2.8 PB / y Event rate at storage  > 1M / sec Tilmann Rabl - VLDB 2012 8/24/2012
  • 7. APM Benchmark  Based on Yahoo! Cloud Serving Benchmark (YCSB)   Single table     CRUD operations 25 byte key 5 values (10 byte each) 75 byte / record Five workloads  50 rows scan length Tilmann Rabl - VLDB 2012 8/24/2012
  • 8. Benchmarked Systems  6 systems        Categories        Cassandra HBase Project Voldemort Redis VoltDB MySQL Key-value stores Extensible record stores Scalable relational stores Sharded systems Main memory systems Disk based systems Chosen by  Previous results, popularity, maturity, availability Tilmann Rabl - VLDB 2012 8/26/2012
  • 9. Classification of Benchmarked Systems Disk-Based Stores Distributed In-Memory Sharded Stores Key-value stores Extensible Row Stores Scalable Relational Stores Tilmann Rabl - VLDB 2012 8/26/2012
  • 10. Experimental Testbed Two clusters  Cluster M (memory-bound)      16 nodes (plus master node) 2x quad core CPU,16 GB RAM, 2x 74GB HDD (RAID 0) 10 million records per node (~700 MB raw) 128 connections per node (8 per core) Cluster D (disk-bound)     24 nodes 2x dual core CPU, 4 GB RAM, 74 GB HDD 150 million records on 12 nodes (~10.5 GB raw) 8 connections per node Tilmann Rabl - VLDB 2012 8/24/2012
  • 11. Evaluation     Minimum 3 runs per workload and system Fresh install for each run 10 minutes runtime Up to 5 clients for 12 nodes   To make sure YCSB is no bottleneck Maximum achievable throughput Tilmann Rabl - VLDB 2012 8/24/2012
  • 12. Workload W - Throughput      Cassandra dominates Higher throughput for Cassandra and HBase Lower throughput for other systems Scalability not as good for all web stores VoltDB best single node throughput Tilmann Rabl - VLDB 2012 8/24/2012
  • 13. Workload W – Latency Writes  Same latencies for   Cassandra,Voldemort,VoltDB,Redis, MySQL HBase latency increased Tilmann Rabl - VLDB 2012 8/24/2012
  • 14. Workload W – Latency Reads  Same latency as for R for   Cassandra,Voldemort, Redis,VoltDB, HBase HBase latency in second range Tilmann Rabl - VLDB 2012 8/24/2012
  • 15. Workload R - Throughput      95% reads, 5% writes On a single node, main memory systems have best performance Linear scalability for web data stores Sublinear scalability for sharded systems Slow-down for VoltDB Tilmann Rabl - VLDB 2012 8/24/2012
  • 16. Workload RW - Throughput     50% reads, 50% writes VoltDB highest single node throughput HBase and Cassandra throughput increase MySQL and Voldemort throughput reduction Tilmann Rabl - VLDB 2012 8/24/2012
  • 17. Workload RS – Latency Scans    HBase latency equal to reads in Workload R MySQL latency very high due to full table scans Similar but increased latency for  Cassandra, Redis,VoltDB Tilmann Rabl - VLDB 2012 8/24/2012
  • 18. Workload RWS - Throughput    25% reads, 25% scans, 50% writes Cassandra, HBase, Redis,VoltDB gain performance MySQL performance 20 – 4 ops / sec Tilmann Rabl - VLDB 2012 8/24/2012
  • 19. Bounded Throughput – Write Latency     Workload R, normalized Maximum throughput in previous tests 100% Steadily decreasing for most systems HBase not as stable (but below 0.1 ms) Tilmann Rabl - VLDB 2012 8/24/2012
  • 20. Bounded Throughput – Read Latency     Redis and Voldemort slightly decrease HBase two states Cassandra decreases linearly MySQL decreases and then stays constant Tilmann Rabl - VLDB 2012 8/24/2012
  • 21. Disk Usage     Raw data: 75 byte per record, 0.7GB/10M Cassandra: 2.5GB / 10M HBase: 7.5GB / 10M No compression Tilmann Rabl - VLDB 2012 8/24/2012
  • 22. Cluster D Results – Throughput   Disk bound, 150M records on 8 nodes More writes – more throughput    Cassandra: 26x Hbase: 15x Voldemort 3x Tilmann Rabl - VLDB 2012 8/24/2012
  • 23. Cluster D Results – Latency Writes     Low latency for writes for all systems Relatively stable latency for writes Lowest for HBase Equal for Voldemort and Cassandra Tilmann Rabl - VLDB 2012 8/24/2012
  • 24. Cluster D Results – Latency Reads     Cassandra latency decreases with more writes Voldemort latency low 5-20ms HBase latency high (up to 200 ms) Cassandra in between Tilmann Rabl - VLDB 2012 8/24/2012
  • 25. Lessons Learned  YCSB   Cassandra   Higher number of connections might lead to better results MySQL   Jedis distribution uneven Voldemort   Difficult setup, special JVM settings necessary Redis   Optimal tokens for data distribution necessary HBase   Client to host ratio 1:3 better 1:2 Eager scan evaluation VoltDB  Synchronous communication slow on multiple nodes Tilmann Rabl - VLDB 2012 8/24/2012
  • 26. Conclusions I  Cassandra   HBase   Read and write latency stable at low level Redis   Scales well, latency decreases with higher scales Voldemort   Low write latency, high read latency, low throughput Sharded MySQL   Winner in terms of scalability and throughput Standalone has high throughput, sharded version does not scale well VoltDB  High single node throughput, does not scale for synchronous access Tilmann Rabl - VLDB 2012 8/24/2012
  • 27. Conclusions II  Cassandra’s performance close to APM requirements   Additional improvements needed for reliably sustaining APM workload Future work   Benchmark impact of replication, compression Monitor the benchmark runs using APM tool Tilmann Rabl - VLDB 2012 8/26/2012
  • 28. Thanks!  Questions?  Contact: Tilmann Rabl tilmann.rabl@utoronto.ca Tilmann Rabl - VLDB 2012 8/24/2012