SlideShare uma empresa Scribd logo
1 de 20
HDFS 
HADOOP DISTRIBUTED FILE SYSTEM
What is HDFS 
HDFS is a Filesystem designed for storing very large files running on 
cluster of commodity hardware 
It provides fault-tolerant storage layer for Hadoop and other sub-projects 
It is a distributed file system that provides high-throughput access to 
application data 
It stores data reliably even in case of hardware faliure
HDFS Architecture 
There are two types of nodes, works in master-slaves pattern 
◦ Master (or called as Namenode) – It manages all the slave nodes and assign 
work to slaves 
◦ Slave (or called as Datanode) – Actual worker nodes, which does actual task
HDFS Architecture
Distributed storage 
Data is distributed across the cluster of nodes 
Data is splitted into smaller pieces and stored across multiple nodes of 
the cluster 
In this way it provides a way to Map-Reduce to process subset of large 
data parallely on multiple nodes 
Distributed storage also provides fault-tolerance capability
Namenode Datanode 
Two Daemons Runs for HDFS (for storage) 
◦ NameNode 
◦ This daemon runs on Master 
◦ Namenode stores all the metadata like filename, path, number of blocks, blockIds, block 
locations, number of replicas, slave related config, etc. 
◦ DataNode 
◦ This daemon runs on all the slaves 
◦ Datanodes stores the actual data
Namenode Datanode
Blocks 
Files are splitted into number of blocks 
Unlike 1 KB block size of OS filesystem, HDFS’s default block size is 64 
MB (which can be increased according to the requirements) 
It saves disk seek time, and other advantage is in the case of processing 
(1 Mapper processes 1 block at a time)
Blocks
Replication 
In HDFS all the data blocks are replicated across the cluster of nodes 
Ideally one replica is present at one geographical location 
The default replication factor is 3, which can be changed according to 
the requirements. 
We can change replication factor to desired value by editing 
configuration files
High Availability 
HDFS provides high availability of data by creating multiple replicas of 
data blocks 
If a network link or node or some hardware goes down data will be 
available from some other path 
If a hardware goes down data can be accessed from other nodes or 
other paths as we have replicated data (default 3 replicas)
Reliable Data Storage 
Data is stored quite reliably in HDFS, this feature is related to High 
Availability. 
Since data is highly available due to replication, hence if a node or few 
nodes or some hardware crashes, the data will be stored reliably. 
We can again balance the cluster by making the replication factor to 
desirable value (by running few commands) 
In other words we can say Hadoop (HDFS) is fault tolerant
Reliable Data Storage
Scalability 
Two ways to scale the Filesystem 
◦ Add more disks on the nodes of the cluster 
◦ For this we need to edit the configuration files and add corresponding entries of newly added 
disks 
◦ Horizontal Scaling 
◦ Another option is to add more nodes in the cluster 
◦ We can add n number of nodes in the cluster on the fly (without any downtime)
File Access & other Operations 
Programming 
CLI (Command Line Interface) 
We can perform almost all the operations which we can do on local 
filesystem 
HDFS provides different access rights rwx to ugo 
We can also browse the filesystem by browser (http://Master-IP:50070)
Read a File 
To read a file from HDFS, client needs to interact with Namenode, as 
namenode is single place where all the the metadata is stored 
Namenode provides the address of the slaves where file is stored 
Client will interact with respective datanodes to read the file 
Namenode also provide a token to the client which it shows to data 
node for authentication
Read a File
Write a File 
To write a file into HDFS, client needs to interact with namenode. 
Namenode provides the address of the slave on which client will start 
writing the data 
As soon as client finishes writing the block, the slave starts copying the 
block to another slave which in tern copy the block to another slave (3 
replicas by default) 
After required replicas are created then it will send the acknowledge to 
the client 
Authentication process (same as file read)
Write a File
Thank You

Mais conteúdo relacionado

Mais procurados

Hadoop at a glance
Hadoop at a glanceHadoop at a glance
Hadoop at a glanceTan Tran
 
Масштабируемость Hadoop в Facebook. Дмитрий Мольков, Facebook
Масштабируемость Hadoop в Facebook. Дмитрий Мольков, FacebookМасштабируемость Hadoop в Facebook. Дмитрий Мольков, Facebook
Масштабируемость Hadoop в Facebook. Дмитрий Мольков, Facebookyaevents
 
Hadoop Interacting with HDFS
Hadoop Interacting with HDFSHadoop Interacting with HDFS
Hadoop Interacting with HDFSApache Apex
 
Snapshot in Hadoop Distributed File System
Snapshot in Hadoop Distributed File SystemSnapshot in Hadoop Distributed File System
Snapshot in Hadoop Distributed File SystemBhavesh Padharia
 
Hadoop HDFS Detailed Introduction
Hadoop HDFS Detailed IntroductionHadoop HDFS Detailed Introduction
Hadoop HDFS Detailed IntroductionHanborq Inc.
 
HDFS User Reference
HDFS User ReferenceHDFS User Reference
HDFS User ReferenceBiju Nair
 
Hadoop HDFS NameNode HA
Hadoop HDFS NameNode HAHadoop HDFS NameNode HA
Hadoop HDFS NameNode HAHanborq Inc.
 
Coordinating Metadata Replication: Survival Strategy for Distributed Systems
Coordinating Metadata Replication: Survival Strategy for Distributed SystemsCoordinating Metadata Replication: Survival Strategy for Distributed Systems
Coordinating Metadata Replication: Survival Strategy for Distributed SystemsKonstantin V. Shvachko
 
HDFS Trunncate: Evolving Beyond Write-Once Semantics
HDFS Trunncate: Evolving Beyond Write-Once SemanticsHDFS Trunncate: Evolving Beyond Write-Once Semantics
HDFS Trunncate: Evolving Beyond Write-Once SemanticsDataWorks Summit
 
Introduction to HDFS and MapReduce
Introduction to HDFS and MapReduceIntroduction to HDFS and MapReduce
Introduction to HDFS and MapReduceUday Vakalapudi
 

Mais procurados (20)

Distributed file systems dfs
Distributed file systems   dfsDistributed file systems   dfs
Distributed file systems dfs
 
Hadoop at a glance
Hadoop at a glanceHadoop at a glance
Hadoop at a glance
 
Масштабируемость Hadoop в Facebook. Дмитрий Мольков, Facebook
Масштабируемость Hadoop в Facebook. Дмитрий Мольков, FacebookМасштабируемость Hadoop в Facebook. Дмитрий Мольков, Facebook
Масштабируемость Hadoop в Facebook. Дмитрий Мольков, Facebook
 
HDFS Design Principles
HDFS Design PrinciplesHDFS Design Principles
HDFS Design Principles
 
Hadoop Interacting with HDFS
Hadoop Interacting with HDFSHadoop Interacting with HDFS
Hadoop Interacting with HDFS
 
Interacting with hdfs
Interacting with hdfsInteracting with hdfs
Interacting with hdfs
 
Hadoop HDFS
Hadoop HDFSHadoop HDFS
Hadoop HDFS
 
Snapshot in Hadoop Distributed File System
Snapshot in Hadoop Distributed File SystemSnapshot in Hadoop Distributed File System
Snapshot in Hadoop Distributed File System
 
March 2011 HUG: HDFS Federation
March 2011 HUG: HDFS FederationMarch 2011 HUG: HDFS Federation
March 2011 HUG: HDFS Federation
 
Hadoop and HDFS
Hadoop and HDFSHadoop and HDFS
Hadoop and HDFS
 
Hadoop introduction
Hadoop introductionHadoop introduction
Hadoop introduction
 
Hadoop HDFS Detailed Introduction
Hadoop HDFS Detailed IntroductionHadoop HDFS Detailed Introduction
Hadoop HDFS Detailed Introduction
 
HDFS User Reference
HDFS User ReferenceHDFS User Reference
HDFS User Reference
 
Hadoop HDFS NameNode HA
Hadoop HDFS NameNode HAHadoop HDFS NameNode HA
Hadoop HDFS NameNode HA
 
Coordinating Metadata Replication: Survival Strategy for Distributed Systems
Coordinating Metadata Replication: Survival Strategy for Distributed SystemsCoordinating Metadata Replication: Survival Strategy for Distributed Systems
Coordinating Metadata Replication: Survival Strategy for Distributed Systems
 
HDFS Trunncate: Evolving Beyond Write-Once Semantics
HDFS Trunncate: Evolving Beyond Write-Once SemanticsHDFS Trunncate: Evolving Beyond Write-Once Semantics
HDFS Trunncate: Evolving Beyond Write-Once Semantics
 
Hadoop Introduction
Hadoop IntroductionHadoop Introduction
Hadoop Introduction
 
Introduction to HDFS and MapReduce
Introduction to HDFS and MapReduceIntroduction to HDFS and MapReduce
Introduction to HDFS and MapReduce
 
Hadoop HDFS Concepts
Hadoop HDFS ConceptsHadoop HDFS Concepts
Hadoop HDFS Concepts
 
Hadoop HDFS Concepts
Hadoop HDFS ConceptsHadoop HDFS Concepts
Hadoop HDFS Concepts
 

Destaque

Distributed File Systems
Distributed File SystemsDistributed File Systems
Distributed File Systemsawesomesos
 
Distributed File Systems: An Overview
Distributed File Systems: An OverviewDistributed File Systems: An Overview
Distributed File Systems: An OverviewAnant Narayanan
 
Chapter 8 distributed file systems
Chapter 8 distributed file systemsChapter 8 distributed file systems
Chapter 8 distributed file systemsAbDul ThaYyal
 
Chapter 17 - Distributed File Systems
Chapter 17 - Distributed File SystemsChapter 17 - Distributed File Systems
Chapter 17 - Distributed File SystemsWayne Jones Jnr
 
Unico Riv Maya
Unico Riv MayaUnico Riv Maya
Unico Riv Mayachglat
 
Royalton Punta Cana
Royalton Punta Cana Royalton Punta Cana
Royalton Punta Cana chglat
 
Друзья-редакторы. Как СМИ становятся социальными
Друзья-редакторы. Как СМИ становятся социальнымиДрузья-редакторы. Как СМИ становятся социальными
Друзья-редакторы. Как СМИ становятся социальнымиVsevolod Pulya
 
Livingwaterspresents
LivingwaterspresentsLivingwaterspresents
LivingwaterspresentsRunner00
 
Sono Innamorato Di Te
Sono Innamorato Di TeSono Innamorato Di Te
Sono Innamorato Di Tedargula70
 
Secrets Playa Mujeres
Secrets Playa MujeresSecrets Playa Mujeres
Secrets Playa Mujereschglat
 
Spencer Bahamas Option
Spencer Bahamas OptionSpencer Bahamas Option
Spencer Bahamas Optionchglat
 
Ocsid test
Ocsid testOcsid test
Ocsid testenhee_92
 
Креативный. 3-хлетие. планы на ближайшее время
Креативный. 3-хлетие. планы на ближайшее времяКреативный. 3-хлетие. планы на ближайшее время
Креативный. 3-хлетие. планы на ближайшее времяAlexandr Mikhailov
 
Servant plan II
Servant plan IIServant plan II
Servant plan IImgiiicm
 

Destaque (20)

Distributed File Systems
Distributed File SystemsDistributed File Systems
Distributed File Systems
 
Distributed File Systems: An Overview
Distributed File Systems: An OverviewDistributed File Systems: An Overview
Distributed File Systems: An Overview
 
Chapter 8 distributed file systems
Chapter 8 distributed file systemsChapter 8 distributed file systems
Chapter 8 distributed file systems
 
Chapter 17 - Distributed File Systems
Chapter 17 - Distributed File SystemsChapter 17 - Distributed File Systems
Chapter 17 - Distributed File Systems
 
Unico Riv Maya
Unico Riv MayaUnico Riv Maya
Unico Riv Maya
 
Royalton Punta Cana
Royalton Punta Cana Royalton Punta Cana
Royalton Punta Cana
 
Друзья-редакторы. Как СМИ становятся социальными
Друзья-редакторы. Как СМИ становятся социальнымиДрузья-редакторы. Как СМИ становятся социальными
Друзья-редакторы. Как СМИ становятся социальными
 
R meetup talk
R meetup talkR meetup talk
R meetup talk
 
24 7
24 724 7
24 7
 
Livingwaterspresents
LivingwaterspresentsLivingwaterspresents
Livingwaterspresents
 
Sono Innamorato Di Te
Sono Innamorato Di TeSono Innamorato Di Te
Sono Innamorato Di Te
 
Secrets Playa Mujeres
Secrets Playa MujeresSecrets Playa Mujeres
Secrets Playa Mujeres
 
Kauai
KauaiKauai
Kauai
 
DB - Leadership experiences
DB -  Leadership experiencesDB -  Leadership experiences
DB - Leadership experiences
 
2013 10 31 halloween
2013 10 31 halloween2013 10 31 halloween
2013 10 31 halloween
 
Yasmin2
Yasmin2Yasmin2
Yasmin2
 
Spencer Bahamas Option
Spencer Bahamas OptionSpencer Bahamas Option
Spencer Bahamas Option
 
Ocsid test
Ocsid testOcsid test
Ocsid test
 
Креативный. 3-хлетие. планы на ближайшее время
Креативный. 3-хлетие. планы на ближайшее времяКреативный. 3-хлетие. планы на ближайшее время
Креативный. 3-хлетие. планы на ближайшее время
 
Servant plan II
Servant plan IIServant plan II
Servant plan II
 

Semelhante a Hdfs

Introduction_to_HDFS sun.pptx
Introduction_to_HDFS sun.pptxIntroduction_to_HDFS sun.pptx
Introduction_to_HDFS sun.pptxsunithachphd
 
big data hadoop technonolgy for storing and processing data
big data hadoop technonolgy for storing and processing databig data hadoop technonolgy for storing and processing data
big data hadoop technonolgy for storing and processing datapreetik9044
 
Hadoop distributed file system
Hadoop distributed file systemHadoop distributed file system
Hadoop distributed file systemsrikanthhadoop
 
Hadoop HDFS Architeture and Design
Hadoop HDFS Architeture and DesignHadoop HDFS Architeture and Design
Hadoop HDFS Architeture and Designsudhakara st
 
Introduction to Hadoop Distributed File System(HDFS).pptx
Introduction to Hadoop Distributed File System(HDFS).pptxIntroduction to Hadoop Distributed File System(HDFS).pptx
Introduction to Hadoop Distributed File System(HDFS).pptxSakthiVinoth78
 
Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...
Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...
Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...Simplilearn
 
Apache Hadoop In Theory And Practice
Apache Hadoop In Theory And PracticeApache Hadoop In Theory And Practice
Apache Hadoop In Theory And PracticeAdam Kawa
 
Big data with HDFS and Mapreduce
Big data  with HDFS and MapreduceBig data  with HDFS and Mapreduce
Big data with HDFS and Mapreducesenthil0809
 
Hadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data AnalyticsHadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data AnalyticsDrPDShebaKeziaMalarc
 
Big data interview questions and answers
Big data interview questions and answersBig data interview questions and answers
Big data interview questions and answersKalyan Hadoop
 
Hadoop Distributed File System
Hadoop Distributed File SystemHadoop Distributed File System
Hadoop Distributed File SystemMilad Sobhkhiz
 
Big Data Reverse Knowledge Transfer.pptx
Big Data Reverse Knowledge Transfer.pptxBig Data Reverse Knowledge Transfer.pptx
Big Data Reverse Knowledge Transfer.pptxssuser8c3ea7
 

Semelhante a Hdfs (20)

Introduction_to_HDFS sun.pptx
Introduction_to_HDFS sun.pptxIntroduction_to_HDFS sun.pptx
Introduction_to_HDFS sun.pptx
 
module 2.pptx
module 2.pptxmodule 2.pptx
module 2.pptx
 
Introduction to HDFS
Introduction to HDFSIntroduction to HDFS
Introduction to HDFS
 
big data hadoop technonolgy for storing and processing data
big data hadoop technonolgy for storing and processing databig data hadoop technonolgy for storing and processing data
big data hadoop technonolgy for storing and processing data
 
Hadoop distributed file system
Hadoop distributed file systemHadoop distributed file system
Hadoop distributed file system
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop HDFS Architeture and Design
Hadoop HDFS Architeture and DesignHadoop HDFS Architeture and Design
Hadoop HDFS Architeture and Design
 
Introduction to Hadoop Distributed File System(HDFS).pptx
Introduction to Hadoop Distributed File System(HDFS).pptxIntroduction to Hadoop Distributed File System(HDFS).pptx
Introduction to Hadoop Distributed File System(HDFS).pptx
 
Unit 1
Unit 1Unit 1
Unit 1
 
Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...
Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...
Hadoop Interview Questions And Answers Part-1 | Big Data Interview Questions ...
 
Apache Hadoop In Theory And Practice
Apache Hadoop In Theory And PracticeApache Hadoop In Theory And Practice
Apache Hadoop In Theory And Practice
 
Big data with HDFS and Mapreduce
Big data  with HDFS and MapreduceBig data  with HDFS and Mapreduce
Big data with HDFS and Mapreduce
 
Hadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data AnalyticsHadoop Distributed File System for Big Data Analytics
Hadoop Distributed File System for Big Data Analytics
 
Big data interview questions and answers
Big data interview questions and answersBig data interview questions and answers
Big data interview questions and answers
 
Hadoop Distributed File System
Hadoop Distributed File SystemHadoop Distributed File System
Hadoop Distributed File System
 
HADOOP.pptx
HADOOP.pptxHADOOP.pptx
HADOOP.pptx
 
Hadoop data management
Hadoop data managementHadoop data management
Hadoop data management
 
Hadoop
HadoopHadoop
Hadoop
 
Big Data Reverse Knowledge Transfer.pptx
Big Data Reverse Knowledge Transfer.pptxBig Data Reverse Knowledge Transfer.pptx
Big Data Reverse Knowledge Transfer.pptx
 

Mais de Chirag Ahuja

Deploy hadoop cluster
Deploy hadoop clusterDeploy hadoop cluster
Deploy hadoop clusterChirag Ahuja
 
Word count example in hadoop mapreduce using java
Word count example in hadoop mapreduce using javaWord count example in hadoop mapreduce using java
Word count example in hadoop mapreduce using javaChirag Ahuja
 
Big data introduction
Big data introductionBig data introduction
Big data introductionChirag Ahuja
 
Hive : WareHousing Over hadoop
Hive :  WareHousing Over hadoopHive :  WareHousing Over hadoop
Hive : WareHousing Over hadoopChirag Ahuja
 
Mapreduce advanced
Mapreduce advancedMapreduce advanced
Mapreduce advancedChirag Ahuja
 
Hadoop introduction
Hadoop introductionHadoop introduction
Hadoop introductionChirag Ahuja
 

Mais de Chirag Ahuja (10)

Deploy hadoop cluster
Deploy hadoop clusterDeploy hadoop cluster
Deploy hadoop cluster
 
Word count example in hadoop mapreduce using java
Word count example in hadoop mapreduce using javaWord count example in hadoop mapreduce using java
Word count example in hadoop mapreduce using java
 
Big data introduction
Big data introductionBig data introduction
Big data introduction
 
Flume
FlumeFlume
Flume
 
Hbase
HbaseHbase
Hbase
 
Pig
PigPig
Pig
 
Hive : WareHousing Over hadoop
Hive :  WareHousing Over hadoopHive :  WareHousing Over hadoop
Hive : WareHousing Over hadoop
 
Mapreduce advanced
Mapreduce advancedMapreduce advanced
Mapreduce advanced
 
MapReduce basic
MapReduce basicMapReduce basic
MapReduce basic
 
Hadoop introduction
Hadoop introductionHadoop introduction
Hadoop introduction
 

Último

BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 

Último (20)

BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 

Hdfs

  • 2. What is HDFS HDFS is a Filesystem designed for storing very large files running on cluster of commodity hardware It provides fault-tolerant storage layer for Hadoop and other sub-projects It is a distributed file system that provides high-throughput access to application data It stores data reliably even in case of hardware faliure
  • 3. HDFS Architecture There are two types of nodes, works in master-slaves pattern ◦ Master (or called as Namenode) – It manages all the slave nodes and assign work to slaves ◦ Slave (or called as Datanode) – Actual worker nodes, which does actual task
  • 5. Distributed storage Data is distributed across the cluster of nodes Data is splitted into smaller pieces and stored across multiple nodes of the cluster In this way it provides a way to Map-Reduce to process subset of large data parallely on multiple nodes Distributed storage also provides fault-tolerance capability
  • 6. Namenode Datanode Two Daemons Runs for HDFS (for storage) ◦ NameNode ◦ This daemon runs on Master ◦ Namenode stores all the metadata like filename, path, number of blocks, blockIds, block locations, number of replicas, slave related config, etc. ◦ DataNode ◦ This daemon runs on all the slaves ◦ Datanodes stores the actual data
  • 8. Blocks Files are splitted into number of blocks Unlike 1 KB block size of OS filesystem, HDFS’s default block size is 64 MB (which can be increased according to the requirements) It saves disk seek time, and other advantage is in the case of processing (1 Mapper processes 1 block at a time)
  • 10. Replication In HDFS all the data blocks are replicated across the cluster of nodes Ideally one replica is present at one geographical location The default replication factor is 3, which can be changed according to the requirements. We can change replication factor to desired value by editing configuration files
  • 11. High Availability HDFS provides high availability of data by creating multiple replicas of data blocks If a network link or node or some hardware goes down data will be available from some other path If a hardware goes down data can be accessed from other nodes or other paths as we have replicated data (default 3 replicas)
  • 12. Reliable Data Storage Data is stored quite reliably in HDFS, this feature is related to High Availability. Since data is highly available due to replication, hence if a node or few nodes or some hardware crashes, the data will be stored reliably. We can again balance the cluster by making the replication factor to desirable value (by running few commands) In other words we can say Hadoop (HDFS) is fault tolerant
  • 14. Scalability Two ways to scale the Filesystem ◦ Add more disks on the nodes of the cluster ◦ For this we need to edit the configuration files and add corresponding entries of newly added disks ◦ Horizontal Scaling ◦ Another option is to add more nodes in the cluster ◦ We can add n number of nodes in the cluster on the fly (without any downtime)
  • 15. File Access & other Operations Programming CLI (Command Line Interface) We can perform almost all the operations which we can do on local filesystem HDFS provides different access rights rwx to ugo We can also browse the filesystem by browser (http://Master-IP:50070)
  • 16. Read a File To read a file from HDFS, client needs to interact with Namenode, as namenode is single place where all the the metadata is stored Namenode provides the address of the slaves where file is stored Client will interact with respective datanodes to read the file Namenode also provide a token to the client which it shows to data node for authentication
  • 18. Write a File To write a file into HDFS, client needs to interact with namenode. Namenode provides the address of the slave on which client will start writing the data As soon as client finishes writing the block, the slave starts copying the block to another slave which in tern copy the block to another slave (3 replicas by default) After required replicas are created then it will send the acknowledge to the client Authentication process (same as file read)