Big Data Technologies: MapReduce and Hadoop
Big Data technologies like MapReduce and Hadoop are
foundational for processing and analyzing vast amounts
of data. Here’s an overview of both:
Hadoop
Hadoop is an open-source framework that allows for the
distributed storage and processing of large datasets
across clusters of computers. Its main components
include:
1. Hadoop Distributed File System (HDFS): A
distributed file system that stores data in
replicated blocks across multiple machines,
providing high-throughput access to application data.
2. YARN (Yet Another Resource Negotiator): The
resource management layer of Hadoop, which
schedules and allocates resources across various
applications running on the cluster.
3. Hadoop Common: The libraries and utilities that
support the other Hadoop modules.
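To make the storage model concrete, here is a minimal Python sketch, not real Hadoop code, of how HDFS-style storage splits a file into fixed-size blocks and replicates each block across nodes. The block size, node names, and round-robin placement are illustrative assumptions; real HDFS uses much larger blocks (128 MB by default) and rack-aware placement.

```python
from itertools import cycle, islice

BLOCK_SIZE = 8      # illustrative only; HDFS defaults to 128 MB
REPLICATION = 3     # HDFS's default replication factor
NODES = ["node1", "node2", "node3", "node4"]  # hypothetical cluster

def split_into_blocks(data, block_size=BLOCK_SIZE):
    """Split raw bytes into fixed-size blocks, as HDFS does with large files."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]

def place_blocks(blocks, nodes=NODES, replication=REPLICATION):
    """Assign each block to `replication` nodes (simple round-robin here;
    real HDFS placement is rack-aware)."""
    placement = {}
    node_cycle = cycle(nodes)
    for idx, _ in enumerate(blocks):
        placement[idx] = list(islice(node_cycle, replication))
    return placement

blocks = split_into_blocks(b"a" * 20)   # a 20-byte "file" -> 3 blocks
placement = place_blocks(blocks)
print(len(blocks), placement)
```

Because every block lives on several nodes, the loss of any single machine leaves all data still readable, which is the basis of the fault tolerance discussed below.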
MapReduce
MapReduce is a programming model and processing
engine used within the Hadoop ecosystem to handle
large-scale data processing tasks. It consists of two
main functions:
1. Map: This phase takes input data, processes it,
and transforms it into a set of key-value pairs. Each
mapper operates in parallel across the distributed
data.
2. Reduce: After an intermediate shuffle-and-sort
phase groups the mappers' output by key, each
reducer merges the values for its keys into a
smaller set of results. Reducers also run in
parallel.
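The two phases can be sketched with the classic word-count example. This is a single-process Python simulation of the programming model, not Hadoop's Java API; the function names and the explicit shuffle step are illustrative.

```python
from collections import defaultdict

def map_phase(document):
    """Map: emit a (word, 1) key-value pair for every word in the input."""
    for word in document.split():
        yield (word.lower(), 1)

def shuffle(pairs):
    """Shuffle/sort: group values by key, as the framework does between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    """Reduce: merge all values for one key into a single result."""
    return (key, sum(values))

documents = ["hadoop stores data", "hadoop processes data"]
pairs = [pair for doc in documents for pair in map_phase(doc)]
counts = dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
print(counts)  # {'hadoop': 2, 'stores': 1, 'data': 2, 'processes': 1}
```

In a real cluster, each document would be mapped on the node that stores it, and each key's values would be routed over the network to one reducer; the logic, however, is exactly this.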
Key Benefits
- Scalability: Both technologies scale
horizontally, meaning you can add more nodes to
handle increased data loads.
- Fault Tolerance: Hadoop is designed to handle
hardware failures gracefully, redistributing tasks
and data as necessary.
- Flexibility: They can process a variety of data
types, including structured, semi-structured, and
unstructured data.
Use Cases
- Data Warehousing: Storing and processing large
datasets for business intelligence.
- Log Analysis: Processing logs from various
systems to gain insights.
- Machine Learning: Training models on large
datasets using frameworks like Apache Mahout or
Spark.
Ecosystem
Hadoop has a rich ecosystem of tools that enhance its
capabilities, including:
- Apache Hive: Data warehouse software that
provides SQL-like (HiveQL) querying over data
stored in Hadoop.
- Apache Pig: A high-level platform whose Pig
Latin scripts compile to jobs that run on Hadoop.
- Apache HBase: A NoSQL database that runs on
top of HDFS, offering real-time read/write access
to large datasets.
Conclusion
MapReduce and Hadoop are integral to the Big Data
landscape, enabling organizations to efficiently process
and analyze massive datasets. Their open-source
nature and extensibility through various tools make
them popular choices for data-driven projects.