Fault tolerant mechanisms in Big Data


A brief overview of fault-tolerance mechanisms in Big Data systems such as Google File System (GFS), Amazon Dynamo, Bigtable, Hadoop MapReduce, and Facebook's Cassandra, along with a description of an existing fault-tolerant model.


Fault-tolerant mechanisms in Big Data
Karan Pardeshi

Agenda
• Introduction
• Distributed fault-tolerant mechanisms in Big Data
• Current model
• Use of features to build a better model
• Future work

Introduction
• Cloud computing is everywhere.
• Advantages
  • Cost-efficient
  • Virtually unlimited storage
  • Seamless access
• Importance of fault tolerance
  • Mass outage at Amazon Web Services
    • A zone was down for an entire day!
  • Time-critical systems
    • Rocket on a mission
    • Bank applications

Fault-tolerant mechanisms in Distributed Systems
• Google File System (GFS)
  • Focused on storage
  • Replication mechanism
    • Replicas are placed on different machines on different racks, N = 3.
  • Shadow masters support the primary master
    • Read access
  • Checksums for data reliability
    • CRC
• Amazon Dynamo
  • Focused on high availability
  • Uses vector clocks
    • For semantic reconciliation
  • Hinted handoff
  • Merkle trees
    • To detect and correct inconsistencies between replicas
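The vector-clock idea listed under Dynamo can be illustrated with a minimal sketch. It is not Dynamo's implementation: the function names and the node ids used in the usage note are hypothetical, and a clock is assumed to be a plain mapping from node id to counter. Two versions are concurrent when neither clock dominates the other, which is the case that needs reconciliation.

```python
from typing import Dict

VectorClock = Dict[str, int]  # node id -> update counter (assumed representation)


def increment(clock: VectorClock, node: str) -> VectorClock:
    """Return a copy of `clock` with `node`'s counter advanced by one."""
    updated = dict(clock)
    updated[node] = updated.get(node, 0) + 1
    return updated


def descends(a: VectorClock, b: VectorClock) -> bool:
    """True if version `a` has seen every update that version `b` has seen."""
    return all(a.get(node, 0) >= count for node, count in b.items())


def compare(a: VectorClock, b: VectorClock) -> str:
    """Classify two versions as equal, ordered, or concurrent."""
    if a == b:
        return "equal"
    if descends(a, b):
        return "a_newer"
    if descends(b, a):
        return "b_newer"
    return "concurrent"  # neither dominates: the versions must be reconciled
```

For example, if a version written twice through a hypothetical node `sx` is then updated independently through `sy` and `sz`, `compare` reports the two results as `"concurrent"`.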

Fault-tolerant mechanisms in Distributed Systems (continued)
• Facebook's Cassandra
  • Accrual failure detection mechanism with a gossip-based protocol
    • First of its kind
    • Probabilistic failure rate estimator
• ZooKeeper
  • Group of workstations acting as servers
  • One master; the other servers provide service in coordination with the master
  • High availability
• Bigtable
  • Works on top of GFS
  • Chubby service: metadata storage
    • Heart of Bigtable
    • Primary coordinator of Bigtable
  • Data persistence
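The accrual failure detection listed under Cassandra can be sketched as follows. Instead of a binary up/down verdict, the detector outputs a suspicion level (often called phi) that grows the longer a heartbeat is overdue. This sketch assumes heartbeat inter-arrival times follow an exponential distribution; the class name, the window size, and the timestamps passed in are hypothetical and not taken from Cassandra's code.

```python
import math
from collections import deque
from typing import Deque, Optional


class AccrualFailureDetector:
    """Tracks heartbeat arrival times and reports a suspicion level (phi)."""

    def __init__(self, window: int = 100) -> None:
        self.intervals: Deque[float] = deque(maxlen=window)  # recent gaps, seconds
        self.last_heartbeat: Optional[float] = None

    def heartbeat(self, now: float) -> None:
        """Record a heartbeat that arrived at time `now` (seconds)."""
        if self.last_heartbeat is not None:
            self.intervals.append(now - self.last_heartbeat)
        self.last_heartbeat = now

    def phi(self, now: float) -> float:
        """Return -log10 of P(the next heartbeat arrives later than `now`)."""
        if not self.intervals or self.last_heartbeat is None:
            return 0.0  # not enough history to suspect anything
        mean = sum(self.intervals) / len(self.intervals)
        if mean <= 0:
            return 0.0
        elapsed = now - self.last_heartbeat
        # Exponential model (assumption): P(later) = exp(-elapsed / mean),
        # so -log10(P) = elapsed / (mean * ln 10).
        return elapsed / (mean * math.log(10))
```

A caller would compare `phi(now)` against a threshold of its own choosing; a higher threshold means fewer false suspicions but slower detection.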

Fault-tolerant mechanisms in Distributed Systems (continued)
• MapReduce
  • Classic master-slave configuration
    • Example: Hadoop
  • Re-execution of the entire operation
    • If any operation terminates partway through
  • Operational even if some workers fail
  • Efficient load balancing
• HDFS
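The re-execution behaviour listed under MapReduce can be sketched with a toy master loop. This is an illustration only, not Hadoop's scheduler: `WorkerFailure`, `run_with_reexecution`, and the round-robin worker choice are all hypothetical. A task that dies partway through is restarted from scratch on another worker, so the job completes as long as some worker stays healthy.

```python
from typing import Callable, Dict, List, Sequence


class WorkerFailure(Exception):
    """Raised by a (hypothetical) worker that dies while running a task."""


def run_with_reexecution(
    tasks: Sequence[int],
    workers: List[Callable[[int], int]],
    max_attempts: int = 3,
) -> Dict[int, int]:
    """Run every task, restarting a failed task from scratch on the next worker."""
    results: Dict[int, int] = {}
    for index, task in enumerate(tasks):
        for attempt in range(max_attempts):
            # Round-robin choice spreads tasks across workers.
            worker = workers[(index + attempt) % len(workers)]
            try:
                results[task] = worker(task)  # the whole task is re-run, not resumed
                break
            except WorkerFailure:
                continue  # the master reschedules the task on another worker
        else:
            raise RuntimeError(f"task {task} failed {max_attempts} times")
    return results
```

With one healthy and one failing worker, every task still produces a result; only when all attempts fail does the sketch give up and raise.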

Existing Fault-tolerant Model for Cloud Computing
• Proposed by Anjali Meshram, A.S. Sambare, S.D. Zade
• Input is passed to all VMs
• Accepter
  • Tests the algorithm's result on every VM.
• Timer
  • Monitors the time constraint for each VM
• Reliability Assessor (RA)
  • Starts with a reliability of 100% for every VM
  • Reliability is recalculated from the time each VM takes to produce each result
• Decision Maker
  • Selects the output of the node with the highest reliability.
  • Raises a failure if a node's reliability falls below the minimum, and the node is removed.
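The interplay of the Reliability Assessor and the Decision Maker above can be sketched as below. The slides do not give the actual update formula, so the multiplicative `reward` and `penalty` factors, the `minimum` threshold, and all names here are assumptions made purely for illustration. A result of `None` stands for an output rejected by the acceptance test.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class VmResult:
    output: Optional[int]  # None models a result rejected by the acceptance test
    elapsed: float         # seconds the VM took to answer


def acceptable(res: VmResult, deadline: float) -> bool:
    """A result counts only if it passed the test and met the time constraint."""
    return res.output is not None and res.elapsed <= deadline


def assess(reliability: Dict[str, float], results: Dict[str, VmResult],
           deadline: float, reward: float = 1.1, penalty: float = 0.5) -> None:
    """Update each VM's reliability in place (assumed multiplicative rule)."""
    for vm, res in results.items():
        factor = reward if acceptable(res, deadline) else penalty
        reliability[vm] = min(1.0, reliability[vm] * factor)  # capped at 100%


def decide(reliability: Dict[str, float], results: Dict[str, VmResult],
           deadline: float, minimum: float = 0.3) -> int:
    """Remove VMs below `minimum`, then return the most reliable valid output."""
    for vm in [v for v, r in reliability.items() if r < minimum]:
        del reliability[vm]  # the node is removed from the pool
    candidates = [vm for vm in reliability
                  if vm in results and acceptable(results[vm], deadline)]
    if not candidates:
        raise RuntimeError("no VM produced an acceptable result in time")
    best = max(candidates, key=lambda vm: reliability[vm])
    return results[best].output
```

Each round would call `assess` and then `decide`; a VM that repeatedly misses the deadline sees its reliability shrink until it drops below `minimum` and is removed.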

Features that can be combined to create a new Fault-tolerant Model
• Master node
  • Coordinator
  • Built on the ZooKeeper service
  • Each job is carried out on three different nodes
• Accrual fault detectors
  • Probabilistic failure value
  • Measured from responses to pings from the master
• Decision Maker
  • Selects the majority vote to produce the final output
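The majority-vote Decision Maker proposed above could look like the following sketch, assuming each job runs on three nodes and their outputs are directly comparable values. The function name and the choice to raise an error when no strict majority exists are assumptions, not part of the proposal.

```python
from collections import Counter
from typing import Hashable, Sequence


def majority_vote(outputs: Sequence[Hashable]) -> Hashable:
    """Return the output produced by a strict majority of replicas."""
    if not outputs:
        raise ValueError("no replica outputs to vote on")
    value, count = Counter(outputs).most_common(1)[0]
    if count * 2 <= len(outputs):
        raise RuntimeError("no majority among replica outputs")
    return value
```

With three replicas, one faulty node is outvoted by the other two; if all three disagree, the sketch reports a failure instead of picking an arbitrary output.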

Future Work
• Develop a better and more robust fault-tolerant model using the features described in the earlier slides.


