A short talk on how Cassandra deals with various failure modes. It discusses replication and consistency levels, and how they can be used to survive many kinds of failure, ending with an explanation of the recovery mechanisms: repair, hinted handoff and read repair.
4. Failures are the norm
• With more than a few nodes, something goes wrong all the time
• Don’t want to be down all the time
5. Failure causes
• Hardware failure
• Bug
• Power
• Natural disaster
6. Failure modes
• Data centre failure
• Node failure
• Disk failure
7. Failure modes
• Data centre failure
• Node failure
• Disk failure
• Temporary
• Permanent
8. Failure modes
• Network failure
• One node
• Network partition
• Whole data centre
10. Failure modes
• Want a system that can tolerate all the above failures
• Make assumptions about probabilities of multiple events
• Be careful when assuming independence
11. Solutions
• Do nothing
• Make boxes bullet proof
• Replication
13. How to maintain availability in the presence of failure?
14. Replication
• Buy cheap nodes and cheap disks
• Store multiple copies of the data
• Don’t care if some disappear
15. Replication
• What about consistency?
• What if I can’t tolerate out-of-date reads?
• How to restore a replica?
16. RF and CL
• Replication factor
• How many copies
• How much failure can be tolerated
• Consistency Level
• How many nodes must be contactable for an operation to succeed
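A minimal sketch of where these two knobs live, assuming the DataStax Python driver and a hypothetical demo keyspace and users table (names are illustrative, not from the talk): replication factor is a property of the keyspace, while consistency level is chosen per operation.

from cassandra.cluster import Cluster
from cassandra import ConsistencyLevel
from cassandra.query import SimpleStatement

session = Cluster(["127.0.0.1"]).connect()

# Replication factor: how many copies of each row the keyspace keeps.
session.execute(
    "CREATE KEYSPACE IF NOT EXISTS demo WITH replication = "
    "{'class': 'SimpleStrategy', 'replication_factor': 3}"
)

# Consistency level: how many of those replicas must answer this request.
read = SimpleStatement(
    "SELECT name FROM demo.users WHERE id = %s",
    consistency_level=ConsistencyLevel.QUORUM,
)
session.execute(read, (42,))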
17. Simple example
• Replication factor 3
• Uniform network topology
• Read and write at CL.QUORUM
• Strong consistency
• Available if any one node is down
• Can recover if any two nodes fail
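Worked numbers for this example: QUORUM means floor(3/2) + 1 = 2 replicas. A write touches at least 2 of the 3 copies and a read consults at least 2, so the two sets always overlap in at least one replica (2 + 2 > 3), which is where the strong consistency comes from. With one node down, 2 replicas are still reachable and QUORUM operations keep succeeding; with two nodes down, QUORUM requests fail, but one copy of the data survives and the other replicas can be rebuilt from it.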
18. In general
• RF N, reads and writes at CL.QUORUM
• Available if ceil(N/2)-1 nodes fail
• Can recover if N-1 nodes fail
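The same arithmetic in a few lines of Python (function names are mine, purely illustrative):

def quorum(rf):
    # Replicas that must respond for a QUORUM read or write.
    return rf // 2 + 1

def failures_while_available(rf):
    # Nodes that can be down with QUORUM operations still succeeding;
    # equals ceil(rf/2) - 1, as on the slide.
    return rf - quorum(rf)

def failures_while_recoverable(rf):
    # Nodes that can be lost while at least one copy of the data survives.
    return rf - 1

for rf in (3, 5):
    print(rf, quorum(rf), failures_while_available(rf), failures_while_recoverable(rf))
# RF 3 -> quorum 2, available through 1 failure, recoverable through 2
# RF 5 -> quorum 3, available through 2 failures, recoverable through 4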
19. Multi data centre
• Cassandra knows location of hosts
• Through the snitch
• Can ensure replicas in each DC
• NetworkTopologyStrategy
• => can cope with whole DC failure
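A sketch of the multi data centre case, again assuming the Python driver and made-up data centre names DC1 and DC2: NetworkTopologyStrategy takes a replica count per data centre, placing copies according to the locations the snitch reports.

from cassandra.cluster import Cluster

session = Cluster(["127.0.0.1"]).connect()

# Three replicas in each data centre; losing a whole DC still leaves a full set.
session.execute(
    "CREATE KEYSPACE IF NOT EXISTS demo_multi_dc WITH replication = "
    "{'class': 'NetworkTopologyStrategy', 'DC1': 3, 'DC2': 3}"
)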
23. Automatic processes
• Eventually move replicas towards consistency
• The ‘eventual’ in ‘eventual consistency’
24. Hinted Handoff
• Hints
• Stored on any node
• When a node is temporarily unavailable
• Delivered when the node comes back
• Can use CL.ANY
• Writes not immediately readable
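A sketch of a CL.ANY write with the Python driver (the table is hypothetical): the write succeeds as soon as a hint is stored somewhere, even if no replica for the row is currently up, which is why it may not be readable until the hint has been delivered.

from cassandra.cluster import Cluster
from cassandra import ConsistencyLevel
from cassandra.query import SimpleStatement

session = Cluster(["127.0.0.1"]).connect()

# ANY: a stored hint counts as success, so this can succeed with every replica
# for the row down; reads may not see the value until handoff completes.
write = SimpleStatement(
    "INSERT INTO demo.users (id, name) VALUES (%s, %s)",
    consistency_level=ConsistencyLevel.ANY,
)
session.execute(write, (42, "alice"))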
25. Read Repair
• Since we have just done a read, we might as well repair any out-of-date copies
• Compare the values and update any that are out of sync
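A toy illustration of the idea (not Cassandra's implementation): compare the copies returned by the replicas, keep the one with the newest timestamp, and write it back to any replica that returned something older.

from dataclasses import dataclass

@dataclass
class Copy:
    value: str
    timestamp: int  # Cassandra resolves conflicts by write timestamp

def read_with_repair(replicas):
    """Return the newest value and push it back to any stale replica."""
    newest = max(replicas.values(), key=lambda c: c.timestamp)
    for node, copy in replicas.items():
        if copy.timestamp < newest.timestamp:
            replicas[node] = newest  # in Cassandra, an internal write to the stale node
    return newest.value

copies = {
    "node1": Copy("v2", 200),
    "node2": Copy("v1", 100),  # out-of-date copy
    "node3": Copy("v2", 200),
}
print(read_with_repair(copies))  # "v2"; node2 now holds the newest copy too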
27. Repair: method
• Ensures a node is up to date
• Run ‘nodetool -h <node> repair’
• Reads through entire data on the node
• Builds a Merkle tree
• Compares with replicas
• Streams differences
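A toy sketch of the Merkle tree comparison (illustrative only, nothing like the real code): each replica hashes its data range up into a tree; if the roots match, the ranges are consistent, and if not, the mismatching subtrees pinpoint which ranges have to be streamed.

import hashlib

def merkle_root(rows):
    # Hash each row, then hash pairs of hashes until one root remains.
    level = [hashlib.sha256(r).digest() for r in rows]
    while len(level) > 1:
        if len(level) % 2:
            level.append(level[-1])  # duplicate the last hash on odd levels
        level = [hashlib.sha256(a + b).digest()
                 for a, b in zip(level[0::2], level[1::2])]
    return level[0]

local   = [b"row1", b"row2", b"row3", b"row4"]
replica = [b"row1", b"row2", b"ROW3", b"row4"]  # one divergent row

print(merkle_root(local) == merkle_root(replica))  # False: something must be streamed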
28. Repair: when
• After node has been down a long time
• After increasing replication factor
• Every 10 days to ensure tombstones are propagated
• Can be used to restore a failed node
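A sketch of running that on a schedule, assuming nodetool is on the PATH and with a made-up host address; the 10-day figure on the slide comes from the default gc_grace_seconds (864000 seconds), the window after which tombstones become eligible for collection.

import subprocess

NODE_HOST = "10.0.0.1"  # hypothetical node address

# Same command as on the earlier slide; schedule it (e.g. from cron) so every
# node is repaired at least once per gc_grace_seconds.
subprocess.run(["nodetool", "-h", NODE_HOST, "repair"], check=True)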
29. Replace a node: method
• Bootstrap new node with <old_token>-1
• Tell existing nodes old node is dead
• nodetool remove
30. Replace a node: when
• Complete node failure
• Cannot replace failed disk
• Corruption
31. Restore from backup: method
• Stop Cassandra on the node
• Copy SSTables from backup
• Restart Cassandra
• May take a while reading indexes
32. Restore from backup: when
• Disk failure
• with no RAID rebuild available
• Operator error
• Corruption
• Hacker