CassandraMeetup-0225-updated

Confidential and Proprietary
Cassandra at Zoosk
Wei Zhu
Principal Platform Engineer
Feb 25, 2015

Outline
● About Zoosk/me
● Why Zoosk chose Cassandra
● Two use cases
● Production setup
● Things to watch out for
● Future plans

Zoosk
● Founded in 2007
● Zoosk is a leading online dating company
● Over 33 million searchable members
● As the #1 grossing online dating app in the Apple App Store, Zoosk is a
market leader in mobile dating.
● Available in over 80 countries and translated into 25 languages

Wei Zhu
● Platform Engineer
● Developed Zoosk Internal API (ZIA), a Java and PHP RESTful service
architecture
● Implemented a few services using Cassandra as storage
● Other stuff

Why do we want to move away from MySQL?
● MySQL's traditional master-slave architecture supports only one write
master (with n-1 slaves). We are using MHA, which requires master-slave.
● Manual sharding is really painful when data grows rapidly.
● Management overhead is high.

Why Cassandra?
● Had bad experience with Mongo
– Memory consumption
– Stability
● Riak
– read-before-write is a no-no.
– Riak favors reads more than writes
– Riak with Bitcask has more demand for memory

Highlights of Cassandra
● Minimal Administration.
● No Single Point of Failure.
● Handles failure gracefully; Cassandra is crash-only.
● Scales Horizontally.
● Writes are durable.
● Consistency is tunable as needed on reads and writes.
● Schema is flexible, can be updated live.
● Replication is easy, Rack and Datacenter aware.

Benchmark
● Friends table, 2.7B friend relations in MySQL db.
● Created data for 6 Million users, based on the published Facebook
friend distribution.
– Number of friends from 6 – 5000.
– Average 490 friends.
– Total of 2.94 B relations.
– ~700 G of data
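The numbers above can be sanity-checked with a little arithmetic. This is only a back-of-the-envelope sketch: it reads "~700 G" as 700 decimal gigabytes, which is an assumption, and the slide does not say whether that figure is before or after replication.

```python
# Back-of-the-envelope check of the synthetic benchmark data set.
USERS = 6_000_000          # users generated for the benchmark (from the slide)
AVG_FRIENDS = 490          # average friends per user (from the slide)
DATA_BYTES = 700 * 10**9   # "~700 G" read as 700 GB (assumption: decimal GB)


def total_relations(users: int, avg_friends: int) -> int:
    """Total friend relations = users x average friends per user."""
    return users * avg_friends


def bytes_per_relation(data_bytes: int, relations: int) -> float:
    """Average on-disk footprint of one relation."""
    return data_bytes / relations


relations = total_relations(USERS, AVG_FRIENDS)
print(f"{relations / 1e9:.2f} B relations")                              # 2.94 B relations
print(f"~{bytes_per_relation(DATA_BYTES, relations):.0f} bytes per relation")  # ~238 bytes per relation
```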

Benchmark numbers (out-of-the-box settings)
● We ran it for only a couple of hours, since at that time we didn't know
what compaction/repair can do to you.
● Three-node Dell C1100 cluster, RF = 3
– Dual L5640 CPUs (6-core, 2.13 GHz), 72 GB memory (18 x 4 GB), 4 x
100 GB SLC SSDs (or MET-MLC)
Unit: ms, RL: read latency, WL: write latency

[Architecture diagram. Components shown: HTTP POST (JSON) requests, load balancers (LB), Apache, ZIA Service Layer instances (Tomcat, with Jersey hosting the ZIA business logic), Memcache, Hector or the CQL Java Driver, and the Cassandra cluster.]

Friends in MySQL
Friends table:
id  user_id              friend_user_id
1   1231069955177344716  1231070367578097419
2   1231070367578097419  1231069955177344716
3   1231069955177344716  1231070505050586151
4   1231070505050586151  1231069955177344716

Users table:
id                   first_name  last_name
1231069955177344716  Mary        Smith
1231070367578097419  James       Brown
1231070505050586151  Robert      Wilson

Cassandra Schema
● The column name is a composite of fname + lname + user_id:
create column family friends
with comparator = 'CompositeType(UTF8Type, UTF8Type, LongType)'
and key_validation_class = 'LongType'
and compaction_strategy = 'LeveledCompactionStrategy';
● Data is denormalized, which makes updates a bit complicated. (What if a
user decides to change their name?)
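To make the update problem concrete, here is a minimal in-memory sketch of what a name change implies under this schema. It is not Zoosk's code: the `Friends` dict and the `rename_user` helper are hypothetical stand-ins for the column family, and the sketch assumes friendships are stored in both directions, as in the MySQL table.

```python
from typing import Dict, Tuple

# Composite column name, mirroring CompositeType(UTF8Type, UTF8Type, LongType).
ColumnName = Tuple[str, str, int]            # (first_name, last_name, user_id)
# Hypothetical stand-in for the friends column family: row key -> {column -> value}.
Friends = Dict[int, Dict[ColumnName, str]]


def rename_user(friends: Friends, user_id: int, new_first: str, new_last: str) -> int:
    """Rewrite every composite column that embeds user_id's name.

    A column name cannot be updated in place, so each affected row needs a
    delete of the old column plus an insert of the new one. Returns the
    number of columns rewritten.
    """
    touched = 0
    # The user's own row lists their friends, i.e. the rows that embed the user's name.
    for (_, _, friend_id) in list(friends.get(user_id, {})):
        row = friends.get(friend_id, {})
        for old in [c for c in row if c[2] == user_id]:
            value = row.pop(old)                          # delete the old column
            row[(new_first, new_last, user_id)] = value   # insert the renamed column
            touched += 1
    return touched


MARY, JAMES, ROBERT = 1231069955177344716, 1231070367578097419, 1231070505050586151
friends: Friends = {
    MARY: {("James", "Brown", JAMES): "{}", ("Robert", "Wilson", ROBERT): "{}"},
    JAMES: {("Mary", "Smith", MARY): "{}"},
    ROBERT: {("Mary", "Smith", MARY): "{}"},
}
print(rename_user(friends, MARY, "Mary", "Jones"))  # prints 2
```

One rename fans out into one delete plus one insert per friend, so a user with thousands of friends triggers thousands of writes.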

Data in Cassandra
Row key: 1231069955177344716
  Column: James:Brown:1231070367578097419
  Value:  {"s":5,"r":0,"c":3,"l":1343346668000,"m":0,"ct":1343346668000,"i":106}
  Column: Robert:Wilson:1231070505050586151
  Value:  {"s":7,"r":0,"c":2,"l":1343346410000,"m":0,"ct":1343346410000,"i":10}
Row key: 1231070367578097419
  Column: Mary:Smith:1231069955177344716
  Value:  {"s":5,"r":0,"c":1,"l":1343346668000,"m":0,"ct":1343346668000,"i":106}
Row key: 1231070505050586151
  Column: Mary:Smith:1231069955177344716
  Value:  {"s":7,"r":0,"c":1,"l":1343346410000,"m":0,"ct":1343346410000,"i":10}

Persistent Notification Services

Cassandra schema for notification
create column family notifications
with column_type = 'Standard'
and comparator = 'CompositeType(TimeUUIDType, UTF8Type)'
and default_validation_class = 'UTF8Type'
and key_validation_class = 'LongType';
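The composite comparator is what keeps a user's notifications time-ordered, with the fields of one notification next to each other. The sketch below approximates that ordering in plain Python by sorting on the TimeUUID timestamp and then on the field name. `sort_key` is a hypothetical helper, and the real comparator also breaks timestamp ties on the remaining UUID bytes, which this sketch ignores.

```python
import uuid
from typing import List, Tuple

# (TimeUUID, field name), like CompositeType(TimeUUIDType, UTF8Type).
Column = Tuple[uuid.UUID, str]


def sort_key(col: Column) -> Tuple[int, str]:
    """Approximate comparator order: TimeUUID timestamp first, then the UTF-8 name."""
    tuuid, name = col
    return (tuuid.time, name)


# Column names taken from the sample row in this deck, deliberately shuffled.
cols: List[Column] = [
    (uuid.UUID("60f8ff80-6938-1744-9ebc-008cfa0ea5e8"), "items"),
    (uuid.UUID("8c6e8800-f687-172c-aa11-008cfa0410fc"), "items"),
    (uuid.UUID("60f8ff80-6938-1744-9ebc-008cfa0ea5e8"), "is_viewed"),
    (uuid.UUID("8c6e8800-f687-172c-aa11-008cfa0410fc"), "is_viewed"),
]

for tuuid, name in sorted(cols, key=sort_key):
    print(f"{tuuid}:{name}")
```

With this layout, reading one row returns a user's notifications in time order, and the is_viewed flag can be written later as a separate column under the same TimeUUID.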

Data in Cassandra for notifications
[default@zoosk] get notifications[2752669903264728509];
=> (column=8c6e8800-f687-172c-aa11-008cfa0410fc:is_viewed, value=1, timestamp=1423879093429000, ttl=1814400)
=> (column=8c6e8800-f687-172c-aa11-008cfa0410fc:items,
value={"app_id":1,"type":502,"time":1422992424,"author_zid":"02752669903264728509","payload":"{"t":"3","an":"XXXXX","agd":"f","apr":{"t":1,"s":1,"d":0,"o":0}}"}, timestamp=1422992424656001, ttl=1814400)
=> (column=60f8ff80-6938-1744-9ebc-008cfa0ea5e8:is_viewed, value=1, timestamp=1423879093429001, ttl=1814400)
=> (column=60f8ff80-6938-1744-9ebc-008cfa0ea5e8:items,
value={"app_id":1,"type":511,"time":1423652427,"author_zid":"02752669903264728509","payload":"{"t":"3","an":"YYYYY","agd":"f","apr":
=> (column=f1161080-fd82-174c-9ebc-008cfa0ea5e8:items,
value={"app_id":1,"type":511,"time":1423893912,"author_zid":"02752669903264728509","payload":"{"t":"3","an":"ZZZZZZZ","apr":
=> (column=6adb4f80-e667-1751-9ebc-008cfa0ea5e8:items,
value={"app_id":1,"type":511,"time":1424032109,"author_zid":"02752669903264728509","payload":"{"t":"3","an":"AAAAAAA","agd":"f","apr":
=> (column=e9801700-f4b8-1754-9ebc-008cfa0ea5e8:items,
value={"app_id":1,"type":511,"time":1424118125,"author_zid":"02752669903264728509","payload":"{"t":"3","an":"BBBBBBB","agd":"f","apr":
=> (column=18093480-9978-175a-920e-008cfa0e9778:items,
value={"app_id":1,"type":511,"time":1424276977,"author_zid":"02752669903264728509","payload":"{"t":"3","an":"CCCCCCC","agd":"f","apr":
=> (column=30d1b180-5d73-175b-9ebc-008cfa0ea5e8:items,
value={"app_id":1,"type":509,"time":1424298525,"author_zid":"02752669903264728509","payload":"{"t":"4","an":"DDDDDDDD","agd":"f","apr":{"t":1,"s":1,"d":0,"o":0},"exp":1424381325}"}, timestamp=1424298525775001, ttl=82800)
=> (column=e8385d00-3f6e-175c-920e-008cfa0e9778:items,
value={"app_id":1,"type":511,"time":1424323372,"author_zid":"02752669903264728509","payload":"{"t":"3","an":"EEEEEE","agd":"f",:

Persistent Notification Services

Data in Cassandra for notifications
=> (column=f1161080-fd82-174c-9ebc-008cfa0ea5e8:is_viewed, value=1, timestamp=1424378452228000, ttl=1814400)
=> (column=f1161080-fd82-174c-9ebc-008cfa0ea5e8:items,
value={"app_id":1,"type":511,"time":1423893912,"author_zid":"02752669903264728509","payload":"{"t":"3","an":"ZZZZZZZ","apr":
=> (column=6adb4f80-e667-1751-9ebc-008cfa0ea5e8:is_viewed, value=1, timestamp=1424378449318000, ttl=1814400)
=> (column=6adb4f80-e667-1751-9ebc-008cfa0ea5e8:items,
value={"app_id":1,"type":511,"time":1424032109,"author_zid":"02752669903264728509","payload":"{"t":"3","an":"AAAAAAA","agd":"f","apr":{"t":1,"s":1,"d":0,"o":0}}"}, timestamp=1424032109051001, ttl=1814400)
=> (column=e9801700-f4b8-1754-9ebc-008cfa0ea5e8:is_viewed, value=1, timestamp=1424378447794000, ttl=1814400)
=> (column=e9801700-f4b8-1754-9ebc-008cfa0ea5e8:items,
value={"app_id":1,"type":511,"time":1424118125,"author_zid":"02752669903264728509","payload":"{"t":"3","an":"BBBBBBB","agd":"f","apr":{"t":1,"s":1,"d":0,"o":0}}"}, timestamp=1424118125862001, ttl=1814400)
=> (column=18093480-9978-175a-920e-008cfa0e9778:is_viewed, value=1, timestamp=1424378445009000, ttl=1814400)
=> (column=18093480-9978-175a-920e-008cfa0e9778:items,
value={"app_id":1,"type":511,"time":1424276977,"author_zid":"02752669903264728509","payload":"{"t":"3","an":"CCCCCCC","agd":"f","apr":{"t":1,"s":1,"d":0,"o":0}}"}, timestamp=1424276977453001, ttl=1814400)
=> (column=30d1b180-5d73-175b-9ebc-008cfa0ea5e8:is_viewed, value=1, timestamp=1424378443164000, ttl=82800)
=> (column=30d1b180-5d73-175b-9ebc-008cfa0ea5e8:items,
value={"app_id":1,"type":509,"time":1424298525,"author_zid":"02752669903264728509","payload":"{"t":"4","an":"DDDDDDDD","agd":"f","apr":{"t":1,"s":1,"d":0,"o":0},"exp":1424381325}"}, timestamp=1424298525775001, ttl=82800)
=> (column=e8385d00-3f6e-175c-920e-008cfa0e9778:is_viewed, value=1, timestamp=1424378409297000, ttl=1814400)
=> (column=e8385d00-3f6e-175c-920e-008cfa0e9778:items,
value={"app_id":1,"type":511,"time":1424323372,"author_zid":"02752669903264728509","payload":"{"t":"3","an":"EEEEEE","agd":"f",:
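The ttl values in the output are easier to read once converted from seconds. The short sketch below does that conversion and also checks one observation from the sample data: for the type-509 item, the payload's "exp" minus its "time" equals the column TTL. That the TTL is derived from "exp" is an inference from this one sample, not something the slides state.

```python
from datetime import timedelta

DEFAULT_TTL = 1_814_400   # ttl on most columns in the sample output
SHORT_TTL = 82_800        # ttl on the type-509 notification


def describe(ttl_seconds: int) -> timedelta:
    """Convert a TTL in seconds into a readable duration."""
    return timedelta(seconds=ttl_seconds)


print(describe(DEFAULT_TTL))   # 21 days, 0:00:00
print(describe(SHORT_TTL))     # 23:00:00

# "time" and "exp" copied from the type-509 item in the sample output.
item_time, item_exp = 1424298525, 1424381325
print(item_exp - item_time == SHORT_TTL)   # True
```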

Production Setup
– Persistent Notifications: 5 nodes, single DC, RF = 3
• Cassandra 1.1.6
• SSD
• Powerful machines (used to be MySQL servers): 74 GB RAM, 24 cores
• Cassandra runs with an 8 GB heap
• 30 GB of data per node
• 250 writes per second
• 70 reads per second
• Write latency: < 0.02 ms
• Read latency: < 2 ms

Production Setup
– All the rest: 14 nodes, 2 DCs, RF = {DC1:3, DC2:3}
• Active-backup
• Cassandra 2.0.8
• Less powerful machines: 32 GB RAM, 2 cores
• Very little usage for now
• Cassandra runs with an 8 GB heap
• Consistency level is set to LOCAL_QUORUM
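For reference, a quorum is floor(RF / 2) + 1 replicas, and LOCAL_QUORUM applies that to the coordinator's own datacenter only. The sketch below works through the {DC1:3, DC2:3} setting; the helper names are illustrative and not part of any driver API.

```python
from typing import Dict


def quorum(replication_factor: int) -> int:
    """Replicas that must respond for a quorum: floor(RF / 2) + 1."""
    return replication_factor // 2 + 1


def local_quorum(replication: Dict[str, int], local_dc: str) -> int:
    """LOCAL_QUORUM only counts replicas in the coordinator's datacenter."""
    return quorum(replication[local_dc])


replication = {"DC1": 3, "DC2": 3}
needed = local_quorum(replication, "DC1")
print(needed)                              # 2 replicas in DC1 must respond
print(replication["DC1"] - needed)         # 1 replica in DC1 may be down
print(quorum(sum(replication.values())))   # 4 replicas for a cross-DC QUORUM
```

LOCAL_QUORUM keeps requests from waiting on the remote datacenter, which suits an active-backup layout.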

Compaction Strategy
● We chose Leveled Compaction because:
– It requires less disk space (theoretically)
– It requires more I/O, but we have SSD
– We have TTL, so compaction is important
● Things to watch out for
– SSTable size defaulted to 5 MB in versions prior to 1.2.9, which is way
too small.
– It defaults to 160 MB in later versions,
https://issues.apache.org/jira/browse/CASSANDRA-5727
– To set the SSTable size on C* 2.x:
ALTER TABLE test
WITH compaction = {'class': 'LeveledCompactionStrategy',
'sstable_size_in_mb': 256};
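To see why 5 MB is too small, it helps to count how many SSTables a node ends up with at each setting. A rough sketch, using the 30 GB per node figure from the notifications cluster and ignoring compression and partially filled SSTables:

```python
import math


def sstable_count(data_gb: float, sstable_size_mb: int) -> int:
    """Approximate number of SSTables needed to hold data_gb at a fixed SSTable size."""
    return math.ceil(data_gb * 1024 / sstable_size_mb)


for size_mb in (5, 160, 256):
    print(f"{size_mb:>3} MB -> {sstable_count(30, size_mb)} sstables")
# prints:
#   5 MB -> 6144 sstables
# 160 MB -> 192 sstables
# 256 MB -> 120 sstables
```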

Repair
● The hard requirement on routine repair frequency is gc_grace_seconds
(10 days by default): every node must be repaired within that window.
● Things to watch out for
– Use -pr (nodetool repair -pr)
– Schedule repair wisely
– Watch your disk (even with LCS, disk usage can double during repair)
– Watch your performance metrics
– nodetool setcompactionthroughput
– nodetool setstreamthroughput
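One way to "schedule repair wisely" is to stagger one `nodetool repair -pr` per node across a cycle that is shorter than gc_grace_seconds. The sketch below only computes start offsets. The node names and the 7-day cycle are assumptions, and it does not account for how long each repair actually takes.

```python
from datetime import timedelta
from typing import Dict, List

GC_GRACE = timedelta(days=10)   # default gc_grace_seconds (864000 s)


def repair_offsets(nodes: List[str], cycle: timedelta) -> Dict[str, timedelta]:
    """Spread one `nodetool repair -pr` per node evenly across `cycle`.

    The cycle must be shorter than gc_grace_seconds so that every node is
    repaired before tombstones become eligible for garbage collection.
    """
    if cycle >= GC_GRACE:
        raise ValueError("repair cycle must be shorter than gc_grace_seconds")
    step = cycle / len(nodes)
    return {node: step * i for i, node in enumerate(nodes)}


# Hypothetical 5-node cluster repaired on a 7-day cycle.
nodes = [f"node{i}" for i in range(1, 6)]
offsets = repair_offsets(nodes, timedelta(days=7))
for node, offset in offsets.items():
    print(f"{node}: start repair -pr at +{offset}")
```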

Repair impacts performance

Cluster setup choice
● Big cluster with less powerful machines
– It's easier to scale with vnodes
– Less administrative overhead
– More nodes mean more frequent node failures, but C* is very
resilient to node failure
● Small clusters with more powerful machines
– Can be tuned specifically for each use case
– Self-contained per service, so an outage has less impact
● We are moving to a single big cluster with less powerful machines
● We plan to bring more services to Cassandra
