HDFS Blocks

The document discusses the differences between HDFS and network attached storage. HDFS is the primary storage system for Hadoop that stores very large files across a cluster of commodity hardware. In contrast, NAS provides file-level data storage on dedicated hardware. HDFS distributes blocks across all machines in a cluster, while NAS stores data separately on its own hardware. HDFS is designed to work with MapReduce to move computation to data, which NAS does not support as well.

Uploaded by

sharan kommi


Previous year last question

HDFS blocks are large compared to disk blocks in order to minimize the cost of
seeks. If a file were split into many small blocks, a large share of the read
time would go to seeking (locating the start of each block) rather than to
transferring data. Many small blocks are also a burden on the NameNode
(master), because the NameNode stores the metadata for every block.
If the block is large enough, the time it takes to transfer the data from the
disk is significantly longer than the time to seek to the start of the block.
Thus, transferring a large file made of multiple blocks operates at close to
the disk transfer rate.
Typically, each block is processed by its own Mapper. So, with small blocks,
there will be a large number of Mappers, each processing only a little data,
which is inefficient.
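The seek-versus-transfer argument can be checked with a little arithmetic. The sketch below is only illustrative: the 10 ms seek time, the 100 MB/s transfer rate, the 4 KB and 128 MB block sizes, and the function names seek_overhead and block_count are assumptions chosen for the example, not values taken from these notes.

```python
import math

# Assumed, illustrative disk characteristics (not from the notes).
SEEK_TIME_MS = 10.0                   # time to seek to the start of a block
TRANSFER_RATE_KB_S = 100 * 1024       # 100 MB/s expressed in KB/s


def seek_overhead(block_size_kb, seek_time_ms=SEEK_TIME_MS,
                  transfer_rate_kb_s=TRANSFER_RATE_KB_S):
    """Fraction of the time to read one block that is spent seeking."""
    transfer_time_ms = block_size_kb / transfer_rate_kb_s * 1000.0
    return seek_time_ms / (seek_time_ms + transfer_time_ms)


def block_count(file_size_kb, block_size_kb):
    """Number of blocks (and so block metadata entries) a file needs."""
    return math.ceil(file_size_kb / block_size_kb)


if __name__ == "__main__":
    file_size_kb = 1024 * 1024        # a 1 GB file
    for label, size_kb in (("4 KB", 4), ("128 MB", 128 * 1024)):
        print(f"{label:>6} blocks: "
              f"{block_count(file_size_kb, size_kb)} blocks, "
              f"{seek_overhead(size_kb):.1%} of read time spent seeking")
```

Under these assumptions, the small block size leaves almost all of the read time in seeks and produces hundreds of thousands of blocks for the NameNode to track (and, with one Mapper per block, as many map tasks), while the large block size keeps the seek share under one percent.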

Difference between HDFS and network-attached storage (NAS)

1) HDFS is the primary storage system of Hadoop. It is designed to store
very large files on a cluster of commodity hardware.
Network-attached storage (NAS) is a file-level computer data storage
server. NAS provides data access to a heterogeneous group of clients.

2) HDFS distributes blocks across all the machines in a Hadoop cluster.
NAS stores data on dedicated hardware.

3) HDFS is designed to work with the MapReduce framework. In the MapReduce
framework, computation moves to the data instead of data moving to the
computation.
NAS is not suitable for MapReduce, as it stores data separately from the
computation.
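
The idea of moving computation to the data can be sketched as a toy scheduling rule. This is a simplified illustration only, not Hadoop's actual scheduler: the function pick_node and the node names are hypothetical, and real clusters also weigh rack locality and resource availability.

```python
def pick_node(replica_nodes, free_nodes):
    """Choose where to run the task for one block.

    replica_nodes: nodes that store a copy of the block (in preference order).
    free_nodes:    set of nodes that currently have capacity for a task.
    Returns (node, is_local); is_local is False when the block would have
    to be read over the network, as with storage kept apart from compute.
    """
    if not free_nodes:
        raise ValueError("no free node available")
    # Prefer a node that already holds the block: computation moves to data.
    for node in replica_nodes:
        if node in free_nodes:
            return node, True
    # Otherwise fall back to any free node; the data must move instead.
    return sorted(free_nodes)[0], False


if __name__ == "__main__":
    print(pick_node(["n2", "n3"], {"n1", "n3"}))  # local replica on n3
    print(pick_node(["n2"], {"n1", "n4"}))        # no local replica free
```

With NAS, every task is in the second situation, because the data always sits on separate dedicated hardware.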
September 20, 2018 at 4:03 pm (#5730)

DataFlair Team
1) NAS stands for network-attached storage, a file-level computer data
storage server connected to a computer network that provides data access to
a heterogeneous group of clients.
HDFS stands for Hadoop Distributed File System, a Java-based file system
that provides scalable and reliable data storage and is designed to span
large clusters of commodity hardware.

2) In HDFS, data blocks are distributed across the local drives of all
machines in a cluster, whereas in NAS data is stored on a dedicated server.

3) HDFS runs on commodity hardware, which is cost-effective, whereas NAS is
a high-end storage device, which is expensive.

4) HDFS includes features like rack awareness and data locality, which make
it more scalable and effective than NAS.
