SlideShare uma empresa Scribd logo
1 de 31
Baixar para ler offline
NAMENODE AND DATANODE COUPLING
FOR A POWER-PROPORTIONAL
HADOOP DISTRIBUTED FILE SYSTEM
Hieu Hanh Le, Satoshi Hikida and Haruo Yokota
Tokyo Institute of Technology
Appeared in DASFAA 2013
The 18th International Conference on Database
Systems for Advanced Applications (Wuhan, China)
1
 Background
 Research Motivation
 Goal and Approach
 Proposals
 Experimental Evaluation
 Conclusion
Agenda
2
Background
 Hadoop Distributed File System (HDFS) is widely
used as data storage for applications in the Cloud
 Commercial Off-the-self-based system
 Support MapReduce framework
 Good scalability
 Utilize a huge number of DataNodes to store huge amount
of data requested by data-intensive applications
 Expand the power consumption of storage system
 Power-aware file systems are moving towards
power-proportional design
3
[Background]
Power-proportional Storage System
 System should consume energy in proportion to
amount of work performed [Barroso and Holzle, 2007]
 Set system’s operation to multiple gears containing
different number of data nodes
 Made possible by data placement methods
4
High Gear
Node
1
Node
2
D2
Node
3
D3D1
Node
4
D4
Low Gear
Node
1
Node
4
Node
3
Node
2
D2 D3D1 D4
D1 D4
migration
Research Motivation
5
 Gear-shifting is vital in power-proportional system
 The system needs to reflect updated data that was
modified in a lower gear to guarantee the higher
performance
 Re-transfer the updated data according to the data
placement
 The inefficient gear-shifting process in current methods
for the HDFS [Rabbit, Sierra]
 Bottleneck in metadata access
 High communication cost among nodes
Rabbit: Robust and Flexible Power-proportional Storage, ACM SOCC 2010
Sierra: Practical Power-proportionality for Data Center Storage, ACM EuroSys 2011
Gear-shifting in current HDFS-based methods [1/10]
6
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Write Dataset
D = {D1, D2, D3, D4}
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Gear Up
Eg: Rabbit, Sierra
D1
D2 D3
D4
D2 D3
D1 D4
Low Gear High Gear
Gear-shifting in current HDFS-based methods [2/10]
7
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Write Dataset
D = {D1, D2, D3, D4}
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Gear Up
Eg: Rabbit, Sierra
1. Access metadata to
identify updated blocks
Congestion
D1
D2 D3
D4
D2 D3
D1 D4
Low Gear High Gear
Gear-shifting in current HDFS-based methods [3/10]
8
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Write Dataset
D = {D1, D2, D3, D4}
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Gear Up
2. Transfer updated
blocks
Eg: Rabbit, Sierra
Congestion
D1
D2 D3
D4
D2 D3
D1 D4
2.1 Command
issuance
2.2 Transfer
block
Low Gear High Gear
1. Access metadata to
identify updated blocks
Gear-shifting in current HDFS-based methods [4/10]
9
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Write Dataset
D = {D1, D2, D3, D4}
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Gear Up
2. Transfer updated
blocks
Eg: Rabbit, Sierra
Congestion
D1
D2 D3
D4
D2 D3
D1 D4
2.1 Command
issuance
2.2 Transfer
block
Low Gear High Gear
1. Access metadata to
identify updated blocks
Gear-shifting in current HDFS-based methods [5/10]
10
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Write Dataset
D = {D1, D2, D3, D4}
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Gear Up
2. Transfer updated
blocks
Eg: Rabbit, Sierra
Congestion
D1
D2 D3
D4
D2 D3
D1 D4
2.1 Command
issuance
2.2 Transfer
block
Low Gear High Gear
1. Access metadata to
identify updated blocks
Gear-shifting in current HDFS-based methods [6/10]
11
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Write Dataset
D = {D1, D2, D3, D4}
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Gear Up
2. Transfer updated
blocks
Eg: Rabbit, Sierra
Congestion
D1
D2 D3
D4
D2 D3
D1 D4
2.1 Command
issuance
2.2 Transfer
block
Low Gear High Gear
1. Access metadata to
identify updated blocks
Gear-shifting in current HDFS-based methods [7/10]
12
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Write Dataset
D = {D1, D2, D3, D4}
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Gear Up
2. Transfer updated
blocks
Eg: Rabbit, Sierra
Congestion
D1
D2 D3
D4
D2 D3
D1 D4
2.1 Command
issuance
2.2 Transfer
block
Low Gear High Gear
1. Access metadata to
identify updated blocks
Gear-shifting in current HDFS-based methods [8/10]
13
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Write Dataset
D = {D1, D2, D3, D4}
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Gear Up
2. Transfer updated
blocks
Eg: Rabbit, Sierra
Congestion
D1
D2 D3
D4
D2 D3
D1 D4
2.1 Command
issuance
2.2 Transfer
block
Low Gear High Gear
1. Access metadata to
identify updated blocks
Gear-shifting in current HDFS-based methods [9/10]
14
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Write Dataset
D = {D1, D2, D3, D4}
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Gear Up
2. Transfer updated
blocks
Eg: Rabbit, Sierra
Congestion
D1
D2 D3
D4
D2 D3
D1 D4D1
2.1 Command
issuance
2.2 Transfer
block
Low Gear High Gear
1. Access metadata to
identify updated blocks
Gear-shifting in current HDFS-based methods [10/10]
15
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Write Dataset
D = {D1, D2, D3, D4}
Data
Node1
Data
Node4
Data
Node2
Data
Node3
Name
Node
Gear Up
2. Transfer updated
blocks
Eg: Rabbit, Sierra
Sequentially
(1 block/connection)
Congestion
InefficiencyD1
D2 D3
D4
D2 D3
D1 D4D1 D4
2.1 Command
issuance
2.2 Transfer
block
Low Gear High Gear
1. Access metadata to
identify updated blocks
Goal and Approach
 Goal
 Propose a novel architecture for efficient gear-shifting for
power-proportional HDFS
 Approach
 Utilize distributed metadata management (MDM)
 Eliminate the bottleneck of the centralized MDM
 Coupling NameNode and DataNode (NDCouplingHDFS)
 Localize the range of updated blocks maintained by metadata
management
 Reduce the communication cost among nodes
 Enable multiple blocks transfer to improve the efficiency in
HDFS
16
[Proposals]
Distributed MDM
 Distribute MDM to multiple nodes to decentralize the load during
gear-shiftings
 Require a distributed MDM that is update conscious
 The MDM is transferred when the system shifts gears
 Low cost of search/insert/delete operations
 Inefficient distributed hash table based method
 For each transferred file, the hash function is needed to be applied
 Efficient range based method
 For a range of files, all the metadata can be transferred within a limited
structure transverses
 Apply two range-based methods
 Each node statically maintains a separate subnamespace
(Static Directory Partition-SDP)
 Parallel index technique with well concurrency control (Fat-Btree) [*]
17
[*] A Concurrency Control Protocol for Parallel B-tree structure without
latch-coupling for explosively growing digital content, EDBT 2008
[Proposals]
NDCouplingHDFS with Distributed MDM
 Each node maintains a subnamespace of the whole
namspace of the system
 The mapping information [Node, Range] is managed by
Distributed MDM
18
Data
Management
Distributed
MDM
ND1
Distributed
MDM
Data
Management
ND2
Distributed
MDM
Data
Management
ND3
Distributed
MDM
Data
Management
ND4
2. Forward request to
responsible nodes
3. Serve the request
and return the results
1. Send
request of 25
4. Return results
A NDCoulingHDFS
node
ND1: [1, 10]
ND2: [11,20]
ND3: [21, 30]
ND4: [31,~]
[Proposals]
Efficient Gear-shifting [1/6]19
Data
Management
Distributed
MDM
Distributed
MDM
Data
Management
Distributed
MDM
Data
Management
Distributed
MDM
Data
Management
A DB C
A1 B1 C1 D1
WOL
Log
WOL
Log
A
A1
D
D1
<File, Temp Node, Intended Node>
Reactivated Reactivated
A1
B1 C1
D1A1 D1
 The process is
distributed to
multiple nodes
 The command
issuance from
Disitributed MDM
and Data
Management is
locally performed
 Updated blocks
are transferred in
batch way
(multiple blocks
per connection)
[Proposals]
Efficient Gear-shifting [2/6]20
Data
Management
Distributed
MDM
Distributed
MDM
Data
Management
Distributed
MDM
Data
Management
Distributed
MDM
Data
Management
A DB C
A1 B1 C1 D1
WOL
Log
WOL
Log
A
A1
D
D1
<File, Temp Node, Intended Node>
Reactivated Reactivated
A1
B1 C1
D1A1 D1
1. Transfer updated
metadata
1. Transfer updated
metadata
 The process is
distributed to
multiple nodes
 The command
issuance from
Disitributed MDM
and Data
Management is
locally performed
 Updated blocks
are transferred in
batch way
(multiple blocks
per connection)
[Proposals]
Efficient Gear-shifting [3/6]21
Data
Management
Distributed
MDM
Distributed
MDM
Data
Management
Distributed
MDM
Data
Management
Distributed
MDM
Data
Management
A DB C
A1 B1 C1 D1
WOL
Log
WOL
Log
A
A1
D
D1
<File, Temp Node, Intended Node>
Reactivated Reactivated
A1
B1 C1
D1A1 D1
1. Transfer updated
metadata
1. Transfer updated
metadata
 The process is
distributed to
multiple nodes
 The command
issuance from
Disitributed MDM
and Data
Management is
locally performed
 Updated blocks
are transferred in
batch way
(multiple blocks
per connection)
2. Command issuance 2. Command issuance
[Proposals]
Efficient Gear-shifting [4/6]22
Data
Management
Distributed
MDM
Distributed
MDM
Data
Management
Distributed
MDM
Data
Management
Distributed
MDM
Data
Management
A DB C
A1 B1 C1 D1
WOL
Log
WOL
Log
A
A1
D
D1
<File, Temp Node, Intended Node>
Reactivated Reactivated
A1
B1 C1
D1A1 D1
1. Transfer updated
metadata
1. Transfer updated
metadata
 The process is
distributed to
multiple nodes
 The command
issuance from
Disitributed MDM
and Data
Management is
locally performed
 Updated blocks
are transferred in
batch way
(multiple blocks
per connection)
2. Command issuance 2. Command issuance
3. Transfer blocks3. Transfer blocks
[Proposals]
Efficient Gear-shifting [5/6]23
Data
Management
Distributed
MDM
Distributed
MDM
Data
Management
Distributed
MDM
Data
Management
Distributed
MDM
Data
Management
A DB C
A1 B1 C1 D1
WOL
Log
WOL
Log
A
A1
D
D1
<File, Temp Node, Intended Node>
Reactivated Reactivated
A1
B1 C1
D1A1 D1
1. Transfer updated
metadata
1. Transfer updated
metadata
 The process is
distributed to
multiple nodes
 The command
issuance from
Disitributed MDM
and Data
Management is
locally performed
 Updated blocks
are transferred in
batch way
(multiple blocks
per connection)
2. Command issuance 2. Command issuance
3. Transfer blocks3. Transfer blocks
4. Updated metadata4. Updated metadata
[Proposals]
Efficient Gear-shifting [6/6]24
Data
Management
Distributed
MDM
Distributed
MDM
Data
Management
Distributed
MDM
Data
Management
Distributed
MDM
Data
Management
A DB C
A1 B1 C1 D1
WOL
Log
WOL
Log
A
A1
D
D1
<File, Temp Node, Intended Node>
Reactivated Reactivated
A1
B1 C1
D1A1 D1
1. Transfer updated
metadata
1. Transfer updated
metadata
 The process is
distributed to
multiple nodes
 The command
issuance from
Disitributed MDM
and Data
Management is
locally performed
 Updated blocks
are transferred in
batch way
(multiple blocks
per connection)
2. Command issuance 2. Command issuance
3. Transfer blocks3. Transfer blocks
4. Updated metadata4. Updated metadata
Parallelism
Reduce
network cost
Efficient block
transfer
Experiment Evaluation
 Experiment 1
 Verify the effectiveness of proposals in gear-shifting
process by comparing with the normal HDFS
 Updated block reflection is the major cost
 Coupling architecture, batch block transferring
 Experiment 2
 Evaluate the effectiveness of distributed index
technique to NDCouplingHDFS
 SDP and Fat-Btree through changing the number of nodes
25
[Experiment 1]
Validity of NDCouplingHDFS in Gear-shifting
26
Updated Data Reflection
# Gears 2
# Active nodes at Low Gear 8
# Active nodes at High
Gear
16
# files 16000
File size 1MB
HDFS
Version 0.20.2
Maximum number of
transferred blocks
100
Heartbeat interval 1s
 Compare the execution time of updated data
reflection the NDCouplingHDFS with the normal
HDFS based on five configurations
 Combinations of architecture, distributed MDM (SDP,
Fat-Btree), command issuance, block transfer
 Environment
0
5
10
15
20
25
30
35
40
45
0
10
20
30
40
50
60
70
NormalHDFS SSS SBS SBB FBB
Execution time
Number of communication connections
[commnand issuance]
[Experiment 1]
Experimental Results27
46%41%
Configuration Normal
HDFS
SSS SBS SBB FBB
Architecture HDFS Coupling Coupling Coupling Coupling
MDM Central SDP SDP SDP Fat-Btree
Command
issuance
Sequential Sequential Batch Batch Batch
Block
transference
Sequential Sequential Sequential Batch Batch
Coupling architecture and
Batch block transferring highly
effected the performance
[s]
[Experiment 2]
Scalability of metadata operations
 Evaluate SDP vs. Fat-Btree
 Change the number of files and number of nodes
28
Machine
# 1, 2, 4, 8
CPU TM8600 1.0GHz
Memory DRAM 4GB
NIC 1000Mb/s
OS Linux 3.0 64bit
Java JDK-1.7.0
Fat-Btree
Fanout 16
Control
Concurrency
LCFB [Yoshihara, 2007]
Workload
#files 3000
File size 1MB
 Fat-Btree gained better scalability when the number of
nodes increases
 The read throughput scaled well due to better search cost and
concurrency control
 The efficiency in write throughput is limited due to the
synchronization cost in updating tree structure
[Experiment 2]
Experimental Results29
0
50
100
150
200
250
300
350
1 2 4 8
SDP
Fat-Btree
0
5
10
15
20
25
30
1 2 4 8
SDP
FBT
ReadThroughput[operation/s]
WriteThroughput[operations/s]
A transaction: open/create metadata
and read/write files
Conclusion
 Proposed NDCouplingHDFS for efficient gear-shifting in
power-proportional HDFS
 Significantly reduced at most 46% the execution time of
reflecting updated data compare with the normal HDFS
 Coupling architecture and batch block transferring
 Improved the IO performance by applying distributed
index technique to NDCouplingHDFS
 NDCouplingHDFS
 Maintains supporting MapReduce
 Exptected to achieve real power-proportionality including
power consumption of metadata management
30
NameNode and DataNode Coupling for a
Power-proportional Hadoop Distributed File System
Thank you for your attention!31

Mais conteúdo relacionado

Mais procurados

Introduction to HDFS and MapReduce
Introduction to HDFS and MapReduceIntroduction to HDFS and MapReduce
Introduction to HDFS and MapReduceUday Vakalapudi
 
co-Hadoop: Data co-location on Hadoop.
co-Hadoop: Data co-location on Hadoop.co-Hadoop: Data co-location on Hadoop.
co-Hadoop: Data co-location on Hadoop.Yousef Fadila
 
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons LearnedHadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons LearnedDataWorks Summit
 
Fundamental of Big Data with Hadoop and Hive
Fundamental of Big Data with Hadoop and HiveFundamental of Big Data with Hadoop and Hive
Fundamental of Big Data with Hadoop and HiveSharjeel Imtiaz
 
Dynamic Namespace Partitioning with Giraffa File System
Dynamic Namespace Partitioning with Giraffa File SystemDynamic Namespace Partitioning with Giraffa File System
Dynamic Namespace Partitioning with Giraffa File SystemDataWorks Summit
 
Design and Research of Hadoop Distributed Cluster Based on Raspberry
Design and Research of Hadoop Distributed Cluster Based on RaspberryDesign and Research of Hadoop Distributed Cluster Based on Raspberry
Design and Research of Hadoop Distributed Cluster Based on RaspberryIJRESJOURNAL
 
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...IRJET Journal
 
HDFS for Geographically Distributed File System
HDFS for Geographically Distributed File SystemHDFS for Geographically Distributed File System
HDFS for Geographically Distributed File SystemKonstantin V. Shvachko
 
WBDB 2014 Benchmarking Virtualized Hadoop Clusters
WBDB 2014 Benchmarking Virtualized Hadoop ClustersWBDB 2014 Benchmarking Virtualized Hadoop Clusters
WBDB 2014 Benchmarking Virtualized Hadoop Clusterst_ivanov
 
Distribute Storage System May-2014
Distribute Storage System May-2014Distribute Storage System May-2014
Distribute Storage System May-2014Công Lợi Dương
 
A 3 dimensional data model in hbase for large time-series dataset-20120915
A 3 dimensional data model in hbase for large time-series dataset-20120915A 3 dimensional data model in hbase for large time-series dataset-20120915
A 3 dimensional data model in hbase for large time-series dataset-20120915Dan Han
 
Query Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning PaperQuery Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning PaperDBOnto
 

Mais procurados (20)

Hadoop HDFS Concepts
Hadoop HDFS ConceptsHadoop HDFS Concepts
Hadoop HDFS Concepts
 
Introduction to HDFS and MapReduce
Introduction to HDFS and MapReduceIntroduction to HDFS and MapReduce
Introduction to HDFS and MapReduce
 
co-Hadoop: Data co-location on Hadoop.
co-Hadoop: Data co-location on Hadoop.co-Hadoop: Data co-location on Hadoop.
co-Hadoop: Data co-location on Hadoop.
 
H04502048051
H04502048051H04502048051
H04502048051
 
Hadoop Architecture
Hadoop ArchitectureHadoop Architecture
Hadoop Architecture
 
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons LearnedHadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
 
Fundamental of Big Data with Hadoop and Hive
Fundamental of Big Data with Hadoop and HiveFundamental of Big Data with Hadoop and Hive
Fundamental of Big Data with Hadoop and Hive
 
Dynamic Namespace Partitioning with Giraffa File System
Dynamic Namespace Partitioning with Giraffa File SystemDynamic Namespace Partitioning with Giraffa File System
Dynamic Namespace Partitioning with Giraffa File System
 
Design and Research of Hadoop Distributed Cluster Based on Raspberry
Design and Research of Hadoop Distributed Cluster Based on RaspberryDesign and Research of Hadoop Distributed Cluster Based on Raspberry
Design and Research of Hadoop Distributed Cluster Based on Raspberry
 
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...
 
HDFS for Geographically Distributed File System
HDFS for Geographically Distributed File SystemHDFS for Geographically Distributed File System
HDFS for Geographically Distributed File System
 
Hadoop HDFS Concepts
Hadoop HDFS ConceptsHadoop HDFS Concepts
Hadoop HDFS Concepts
 
Nomads
NomadsNomads
Nomads
 
WBDB 2014 Benchmarking Virtualized Hadoop Clusters
WBDB 2014 Benchmarking Virtualized Hadoop ClustersWBDB 2014 Benchmarking Virtualized Hadoop Clusters
WBDB 2014 Benchmarking Virtualized Hadoop Clusters
 
Data Analytics using MATLAB and HDF5
Data Analytics using MATLAB and HDF5Data Analytics using MATLAB and HDF5
Data Analytics using MATLAB and HDF5
 
Matlab, Big Data, and HDF Server
Matlab, Big Data, and HDF ServerMatlab, Big Data, and HDF Server
Matlab, Big Data, and HDF Server
 
IJET-V3I2P24
IJET-V3I2P24IJET-V3I2P24
IJET-V3I2P24
 
Distribute Storage System May-2014
Distribute Storage System May-2014Distribute Storage System May-2014
Distribute Storage System May-2014
 
A 3 dimensional data model in hbase for large time-series dataset-20120915
A 3 dimensional data model in hbase for large time-series dataset-20120915A 3 dimensional data model in hbase for large time-series dataset-20120915
A 3 dimensional data model in hbase for large time-series dataset-20120915
 
Query Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning PaperQuery Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning Paper
 

Semelhante a NameNode and DataNode Coupling for a Power-proportional Hadoop Distributed File System

IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...
IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...
IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...IRJET Journal
 
Fredrick Ishengoma - HDFS+- Erasure Coding Based Hadoop Distributed File System
Fredrick Ishengoma -  HDFS+- Erasure Coding Based Hadoop Distributed File SystemFredrick Ishengoma -  HDFS+- Erasure Coding Based Hadoop Distributed File System
Fredrick Ishengoma - HDFS+- Erasure Coding Based Hadoop Distributed File SystemFredrick Ishengoma
 
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...Simplilearn
 
Introduction to hadoop and hdfs
Introduction to hadoop and hdfsIntroduction to hadoop and hdfs
Introduction to hadoop and hdfsshrey mehrotra
 
Big data | Hadoop | components of hadoop |Rahul Gulab Sing
Big data | Hadoop | components of hadoop |Rahul Gulab SingBig data | Hadoop | components of hadoop |Rahul Gulab Sing
Big data | Hadoop | components of hadoop |Rahul Gulab SingRahul Singh
 
Presentation sreenu dwh-services
Presentation sreenu dwh-servicesPresentation sreenu dwh-services
Presentation sreenu dwh-servicesSreenu Musham
 
Data center disaster recovery.ppt
Data center disaster recovery.ppt Data center disaster recovery.ppt
Data center disaster recovery.ppt omalreda
 
Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...
Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...
Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...The HDF-EOS Tools and Information Center
 
Hadoop File system (HDFS)
Hadoop File system (HDFS)Hadoop File system (HDFS)
Hadoop File system (HDFS)Prashant Gupta
 
My Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataMy Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataRobert Grossman
 
Geo distributed parallelization pacts in map reduce
Geo distributed parallelization pacts in map reduceGeo distributed parallelization pacts in map reduce
Geo distributed parallelization pacts in map reduceeSAT Publishing House
 
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...Robert Grossman
 
Hadoop training in bangalore-kellytechnologies
Hadoop training in bangalore-kellytechnologiesHadoop training in bangalore-kellytechnologies
Hadoop training in bangalore-kellytechnologiesappaji intelhunt
 

Semelhante a NameNode and DataNode Coupling for a Power-proportional Hadoop Distributed File System (20)

Introduction to HDFS
Introduction to HDFSIntroduction to HDFS
Introduction to HDFS
 
IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...
IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...
IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...
 
Fredrick Ishengoma - HDFS+- Erasure Coding Based Hadoop Distributed File System
Fredrick Ishengoma -  HDFS+- Erasure Coding Based Hadoop Distributed File SystemFredrick Ishengoma -  HDFS+- Erasure Coding Based Hadoop Distributed File System
Fredrick Ishengoma - HDFS+- Erasure Coding Based Hadoop Distributed File System
 
Hadoop paper
Hadoop paperHadoop paper
Hadoop paper
 
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
 
Introduction to hadoop and hdfs
Introduction to hadoop and hdfsIntroduction to hadoop and hdfs
Introduction to hadoop and hdfs
 
Big data | Hadoop | components of hadoop |Rahul Gulab Sing
Big data | Hadoop | components of hadoop |Rahul Gulab SingBig data | Hadoop | components of hadoop |Rahul Gulab Sing
Big data | Hadoop | components of hadoop |Rahul Gulab Sing
 
Presentation sreenu dwh-services
Presentation sreenu dwh-servicesPresentation sreenu dwh-services
Presentation sreenu dwh-services
 
Data center disaster recovery.ppt
Data center disaster recovery.ppt Data center disaster recovery.ppt
Data center disaster recovery.ppt
 
Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...
Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...
Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...
 
Hadoop File system (HDFS)
Hadoop File system (HDFS)Hadoop File system (HDFS)
Hadoop File system (HDFS)
 
Unit-3.pptx
Unit-3.pptxUnit-3.pptx
Unit-3.pptx
 
My Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataMy Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big Data
 
Geo distributed parallelization pacts in map reduce
Geo distributed parallelization pacts in map reduceGeo distributed parallelization pacts in map reduce
Geo distributed parallelization pacts in map reduce
 
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
 
Unit 1
Unit 1Unit 1
Unit 1
 
Understanding Hadoop
Understanding HadoopUnderstanding Hadoop
Understanding Hadoop
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop training in bangalore-kellytechnologies
Hadoop training in bangalore-kellytechnologiesHadoop training in bangalore-kellytechnologies
Hadoop training in bangalore-kellytechnologies
 
Distributed storage system
Distributed storage systemDistributed storage system
Distributed storage system
 

Último

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 

Último (20)

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 

NameNode and DataNode Coupling for a Power-proportional Hadoop Distributed File System

  • 1. NAMENODE AND DATANODE COUPLING FOR A POWER-PROPORTIONAL HADOOP DISTRIBUTED FILE SYSTEM Hieu Hanh Le, Satoshi Hikida and Haruo Yokota Tokyo Institute of Technology Appeared in DASFAA 2013 The 18th International Conference on Database Systems for Advanced Applications (Wuhan, China) 1
  • 2.  Background  Research Motivation  Goal and Approach  Proposals  Experimental Evaluation  Conclusion Agenda 2
  • 3. Background  Hadoop Distributed File System (HDFS) is widely used as data storage for applications in the Cloud  Commercial Off-the-self-based system  Support MapReduce framework  Good scalability  Utilize a huge number of DataNodes to store huge amount of data requested by data-intensive applications  Expand the power consumption of storage system  Power-aware file systems are moving towards power-proportional design 3
  • 4. [Background] Power-proportional Storage System  System should consume energy in proportion to amount of work performed [Barroso and Holzle, 2007]  Set system’s operation to multiple gears containing different number of data nodes  Made possible by data placement methods 4 High Gear Node 1 Node 2 D2 Node 3 D3D1 Node 4 D4 Low Gear Node 1 Node 4 Node 3 Node 2 D2 D3D1 D4 D1 D4 migration
  • 5. Research Motivation 5  Gear-shifting is vital in power-proportional system  The system needs to reflect updated data that was modified in a lower gear to guarantee the higher performance  Re-transfer the updated data according to the data placement  The inefficient gear-shifting process in current methods for the HDFS [Rabbit, Sierra]  Bottleneck in metadata access  High communication cost among nodes Rabbit: Robust and Flexible Power-proportional Storage, ACM SOCC 2010 Sierra: Practical Power-proportionality for Data Center Storage, ACM EuroSys 2011
  • 6. Gear-shifting in current HDFS-based methods [1/10] 6 Data Node1 Data Node4 Data Node2 Data Node3 Name Node Write Dataset D = {D1, D2, D3, D4} Data Node1 Data Node4 Data Node2 Data Node3 Name Node Gear Up Eg: Rabbit, Sierra D1 D2 D3 D4 D2 D3 D1 D4 Low Gear High Gear
  • 7. Gear-shifting in current HDFS-based methods [2/10] 7 Data Node1 Data Node4 Data Node2 Data Node3 Name Node Write Dataset D = {D1, D2, D3, D4} Data Node1 Data Node4 Data Node2 Data Node3 Name Node Gear Up Eg: Rabbit, Sierra 1. Access metadata to identify updated blocks Congestion D1 D2 D3 D4 D2 D3 D1 D4 Low Gear High Gear
  • 8. Gear-shifting in current HDFS-based methods [3/10] 8 Data Node1 Data Node4 Data Node2 Data Node3 Name Node Write Dataset D = {D1, D2, D3, D4} Data Node1 Data Node4 Data Node2 Data Node3 Name Node Gear Up 2. Transfer updated blocks Eg: Rabbit, Sierra Congestion D1 D2 D3 D4 D2 D3 D1 D4 2.1 Command issuance 2.2 Transfer block Low Gear High Gear 1. Access metadata to identify updated blocks
  • 9. Gear-shifting in current HDFS-based methods [4/10] 9 Data Node1 Data Node4 Data Node2 Data Node3 Name Node Write Dataset D = {D1, D2, D3, D4} Data Node1 Data Node4 Data Node2 Data Node3 Name Node Gear Up 2. Transfer updated blocks Eg: Rabbit, Sierra Congestion D1 D2 D3 D4 D2 D3 D1 D4 2.1 Command issuance 2.2 Transfer block Low Gear High Gear 1. Access metadata to identify updated blocks
  • 10. Gear-shifting in current HDFS-based methods [5/10] 10 Data Node1 Data Node4 Data Node2 Data Node3 Name Node Write Dataset D = {D1, D2, D3, D4} Data Node1 Data Node4 Data Node2 Data Node3 Name Node Gear Up 2. Transfer updated blocks Eg: Rabbit, Sierra Congestion D1 D2 D3 D4 D2 D3 D1 D4 2.1 Command issuance 2.2 Transfer block Low Gear High Gear 1. Access metadata to identify updated blocks
  • 11. Gear-shifting in current HDFS-based methods [6/10] 11 Data Node1 Data Node4 Data Node2 Data Node3 Name Node Write Dataset D = {D1, D2, D3, D4} Data Node1 Data Node4 Data Node2 Data Node3 Name Node Gear Up 2. Transfer updated blocks Eg: Rabbit, Sierra Congestion D1 D2 D3 D4 D2 D3 D1 D4 2.1 Command issuance 2.2 Transfer block Low Gear High Gear 1. Access metadata to identify updated blocks
  • 12. Gear-shifting in current HDFS-based methods [7/10] 12 Data Node1 Data Node4 Data Node2 Data Node3 Name Node Write Dataset D = {D1, D2, D3, D4} Data Node1 Data Node4 Data Node2 Data Node3 Name Node Gear Up 2. Transfer updated blocks Eg: Rabbit, Sierra Congestion D1 D2 D3 D4 D2 D3 D1 D4 2.1 Command issuance 2.2 Transfer block Low Gear High Gear 1. Access metadata to identify updated blocks
  • 13. Gear-shifting in current HDFS-based methods [8/10] 13 Data Node1 Data Node4 Data Node2 Data Node3 Name Node Write Dataset D = {D1, D2, D3, D4} Data Node1 Data Node4 Data Node2 Data Node3 Name Node Gear Up 2. Transfer updated blocks Eg: Rabbit, Sierra Congestion D1 D2 D3 D4 D2 D3 D1 D4 2.1 Command issuance 2.2 Transfer block Low Gear High Gear 1. Access metadata to identify updated blocks
  • 14. Gear-shifting in current HDFS-based methods [9/10] 14 Data Node1 Data Node4 Data Node2 Data Node3 Name Node Write Dataset D = {D1, D2, D3, D4} Data Node1 Data Node4 Data Node2 Data Node3 Name Node Gear Up 2. Transfer updated blocks Eg: Rabbit, Sierra Congestion D1 D2 D3 D4 D2 D3 D1 D4D1 2.1 Command issuance 2.2 Transfer block Low Gear High Gear 1. Access metadata to identify updated blocks
  • 15. Gear-shifting in current HDFS-based methods [10/10] 15 Data Node1 Data Node4 Data Node2 Data Node3 Name Node Write Dataset D = {D1, D2, D3, D4} Data Node1 Data Node4 Data Node2 Data Node3 Name Node Gear Up 2. Transfer updated blocks Eg: Rabbit, Sierra Sequentially (1 block/connection) Congestion InefficiencyD1 D2 D3 D4 D2 D3 D1 D4D1 D4 2.1 Command issuance 2.2 Transfer block Low Gear High Gear 1. Access metadata to identify updated blocks
  • 16. Goal and Approach  Goal  Propose a novel architecture for efficient gear-shifting for power-proportional HDFS  Approach  Utilize distributed metadata management (MDM)  Eliminate the bottleneck of the centralized MDM  Coupling NameNode and DataNode (NDCouplingHDFS)  Localize the range of updated blocks maintained by metadata management  Reduce the communication cost among nodes  Enable multiple blocks transfer to improve the efficiency in HDFS 16
  • 17. [Proposals] Distributed MDM  Distribute MDM to multiple nodes to decentralize the load during gear-shiftings  Require a distributed MDM that is update conscious  The MDM is transferred when the system shifts gears  Low cost of search/insert/delete operations  Inefficient distributed hash table based method  For each transferred file, the hash function is needed to be applied  Efficient range based method  For a range of files, all the metadata can be transferred within a limited structure transverses  Apply two range-based methods  Each node statically maintains a separate subnamespace (Static Directory Partition-SDP)  Parallel index technique with well concurrency control (Fat-Btree) [*] 17 [*] A Concurrency Control Protocol for Parallel B-tree structure without latch-coupling for explosively growing digital content, EDBT 2008
  • 18. [Proposals] NDCouplingHDFS with Distributed MDM  Each node maintains a subnamespace of the whole namspace of the system  The mapping information [Node, Range] is managed by Distributed MDM 18 Data Management Distributed MDM ND1 Distributed MDM Data Management ND2 Distributed MDM Data Management ND3 Distributed MDM Data Management ND4 2. Forward request to responsible nodes 3. Serve the request and return the results 1. Send request of 25 4. Return results A NDCoulingHDFS node ND1: [1, 10] ND2: [11,20] ND3: [21, 30] ND4: [31,~]
  • 19. [Proposals] Efficient Gear-shifting [1/6]19 Data Management Distributed MDM Distributed MDM Data Management Distributed MDM Data Management Distributed MDM Data Management A DB C A1 B1 C1 D1 WOL Log WOL Log A A1 D D1 <File, Temp Node, Intended Node> Reactivated Reactivated A1 B1 C1 D1A1 D1  The process is distributed to multiple nodes  The command issuance from Disitributed MDM and Data Management is locally performed  Updated blocks are transferred in batch way (multiple blocks per connection)
  • 20. [Proposals] Efficient Gear-shifting [2/6]20 Data Management Distributed MDM Distributed MDM Data Management Distributed MDM Data Management Distributed MDM Data Management A DB C A1 B1 C1 D1 WOL Log WOL Log A A1 D D1 <File, Temp Node, Intended Node> Reactivated Reactivated A1 B1 C1 D1A1 D1 1. Transfer updated metadata 1. Transfer updated metadata  The process is distributed to multiple nodes  The command issuance from Disitributed MDM and Data Management is locally performed  Updated blocks are transferred in batch way (multiple blocks per connection)
  • 21. [Proposals] Efficient Gear-shifting [3/6]21 Data Management Distributed MDM Distributed MDM Data Management Distributed MDM Data Management Distributed MDM Data Management A DB C A1 B1 C1 D1 WOL Log WOL Log A A1 D D1 <File, Temp Node, Intended Node> Reactivated Reactivated A1 B1 C1 D1A1 D1 1. Transfer updated metadata 1. Transfer updated metadata  The process is distributed to multiple nodes  The command issuance from Disitributed MDM and Data Management is locally performed  Updated blocks are transferred in batch way (multiple blocks per connection) 2. Command issuance 2. Command issuance
  • 22. [Proposals] Efficient Gear-shifting [4/6]22 Data Management Distributed MDM Distributed MDM Data Management Distributed MDM Data Management Distributed MDM Data Management A DB C A1 B1 C1 D1 WOL Log WOL Log A A1 D D1 <File, Temp Node, Intended Node> Reactivated Reactivated A1 B1 C1 D1A1 D1 1. Transfer updated metadata 1. Transfer updated metadata  The process is distributed to multiple nodes  The command issuance from Disitributed MDM and Data Management is locally performed  Updated blocks are transferred in batch way (multiple blocks per connection) 2. Command issuance 2. Command issuance 3. Transfer blocks3. Transfer blocks
  • 23. [Proposals] Efficient Gear-shifting [5/6]23 Data Management Distributed MDM Distributed MDM Data Management Distributed MDM Data Management Distributed MDM Data Management A DB C A1 B1 C1 D1 WOL Log WOL Log A A1 D D1 <File, Temp Node, Intended Node> Reactivated Reactivated A1 B1 C1 D1A1 D1 1. Transfer updated metadata 1. Transfer updated metadata  The process is distributed to multiple nodes  The command issuance from Disitributed MDM and Data Management is locally performed  Updated blocks are transferred in batch way (multiple blocks per connection) 2. Command issuance 2. Command issuance 3. Transfer blocks3. Transfer blocks 4. Updated metadata4. Updated metadata
  • 24. [Proposals] Efficient Gear-shifting [6/6]24 Data Management Distributed MDM Distributed MDM Data Management Distributed MDM Data Management Distributed MDM Data Management A DB C A1 B1 C1 D1 WOL Log WOL Log A A1 D D1 <File, Temp Node, Intended Node> Reactivated Reactivated A1 B1 C1 D1A1 D1 1. Transfer updated metadata 1. Transfer updated metadata  The process is distributed to multiple nodes  The command issuance from Disitributed MDM and Data Management is locally performed  Updated blocks are transferred in batch way (multiple blocks per connection) 2. Command issuance 2. Command issuance 3. Transfer blocks3. Transfer blocks 4. Updated metadata4. Updated metadata Parallelism Reduce network cost Efficient block transfer
  • 25. Experiment Evaluation  Experiment 1  Verify the effectiveness of proposals in gear-shifting process by comparing with the normal HDFS  Updated block reflection is the major cost  Coupling architecture, batch block transferring  Experiment 2  Evaluate the effectiveness of distributed index technique to NDCouplingHDFS  SDP and Fat-Btree through changing the number of nodes 25
  • 26. [Experiment 1] Validity of NDCouplingHDFS in Gear-shifting 26 Updated Data Reflection # Gears 2 # Active nodes at Low Gear 8 # Active nodes at High Gear 16 # files 16000 File size 1MB HDFS Version 0.20.2 Maximum number of transferred blocks 100 Heartbeat interval 1s  Compare the execution time of updated data reflection the NDCouplingHDFS with the normal HDFS based on five configurations  Combinations of architecture, distributed MDM (SDP, Fat-Btree), command issuance, block transfer  Environment
  • 27. 0 5 10 15 20 25 30 35 40 45 0 10 20 30 40 50 60 70 NormalHDFS SSS SBS SBB FBB Execution time Number of communication connections [commnand issuance] [Experiment 1] Experimental Results27 46%41% Configuration Normal HDFS SSS SBS SBB FBB Architecture HDFS Coupling Coupling Coupling Coupling MDM Central SDP SDP SDP Fat-Btree Command issuance Sequential Sequential Batch Batch Batch Block transference Sequential Sequential Sequential Batch Batch Coupling architecture and Batch block transferring highly effected the performance [s]
  • 28. [Experiment 2] Scalability of metadata operations  Evaluate SDP vs. Fat-Btree  Change the number of files and number of nodes 28 Machine # 1, 2, 4, 8 CPU TM8600 1.0GHz Memory DRAM 4GB NIC 1000Mb/s OS Linux 3.0 64bit Java JDK-1.7.0 Fat-Btree Fanout 16 Control Concurrency LCFB [Yoshihara, 2007] Workload #files 3000 File size 1MB
  • 29.  Fat-Btree gained better scalability when the number of nodes increases  The read throughput scaled well due to better search cost and concurrency control  The efficiency in write throughput is limited due to the synchronization cost in updating tree structure [Experiment 2] Experimental Results29 0 50 100 150 200 250 300 350 1 2 4 8 SDP Fat-Btree 0 5 10 15 20 25 30 1 2 4 8 SDP FBT ReadThroughput[operation/s] WriteThroughput[operations/s] A transaction: open/create metadata and read/write files
  • 30. Conclusion  Proposed NDCouplingHDFS for efficient gear-shifting in power-proportional HDFS  Significantly reduced at most 46% the execution time of reflecting updated data compare with the normal HDFS  Coupling architecture and batch block transferring  Improved the IO performance by applying distributed index technique to NDCouplingHDFS  NDCouplingHDFS  Maintains supporting MapReduce  Exptected to achieve real power-proportionality including power consumption of metadata management 30
  • 31. NameNode and DataNode Coupling for a Power-proportional Hadoop Distributed File System Thank you for your attention!31