Practical – 4
Aim: Hadoop installation as single node cluster and multi node cluster.
Pre-requisite:
OS: UBUNTU 14.04 LTS
FRAMEWORK: Hadoop 2.7.3
JAVA VERSION: 1.7.0_131
Single node Cluster:
Steps:
1. Check whether the Linux repository service is working:
Gcet@gfl1-5:~$ sudo apt-get update
Ign http://extras.ubuntu.com trusty InRelease
Ign http://in.archive.ubuntu.com trusty InRelease
Get:1 http://extras.ubuntu.com trusty Release.gpg [72 B]
Hit http://in.archive.ubuntu.com trusty/universe Translation-en
Ign http://in.archive.ubuntu.com trusty/main Translation-en_IN
Ign http://in.archive.ubuntu.com trusty/multiverse Translation-en_IN
Ign http://in.archive.ubuntu.com trusty/restricted Translation-en_IN
Ign http://in.archive.ubuntu.com trusty/universe Translation-en_IN
Fetched 4,302 kB in 40s (107 kB/s)
Reading package lists... Done
2. Check the Java version:
Gcet@gfl1-5:~$ java -version
java version "1.7.0_131"
OpenJDK Runtime Environment (IcedTea 2.6.9) (7u131-2.6.9-0ubuntu0.14.04.2)
OpenJDK Server VM (build 24.131-b00, mixed mode)
3. Download Hadoop from the Apache Hadoop site (hadoop.apache.org) and extract it as
under:
Gcet@gfl1-5:~$ tar -xvf hadoop-2.7.3.tar.gz
hadoop-2.7.3/share/hadoop/tools/lib/hadoop-extras-2.7.3.jar
hadoop-2.7.3/share/hadoop/tools/lib/asm-3.2.jar
hadoop-2.7.3/include/
hadoop-2.7.3/include/hdfs.h
hadoop-2.7.3/include/Pipes.hh
hadoop-2.7.3/include/TemplateFactory.hh
hadoop-2.7.3/include/StringUtils.hh
hadoop-2.7.3/include/SerialUtils.hh
hadoop-2.7.3/LICENSE.txt
hadoop-2.7.3/NOTICE.txt
hadoop-2.7.3/README.txt
Gcet@gfl1-5:~$ sudo mv /home/Gcet/Downloads/hadoop-2.7.3 /usr/local/hadoop
4. Check whether Hadoop is working properly using the command below:
Gcet@gfl1-5:~$ /usr/local/hadoop/hadoop-2.7.3/bin/hadoop
classpath            prints the class path needed to get the Hadoop jar and the required libraries
credential           interact with credential providers
daemonlog            get/set the log level for each daemon
trace                view and modify Hadoop tracing settings
Most commands print help when invoked w/o parameters.
5. Install openssl, ssh and rsync:
Gcet@gfl1-5:~$ sudo apt-get install openssl
[sudo] password for Gcet:
Reading package lists... Done
Building dependency tree
Reading state information... Done
openssl is already the newest version.
0 upgraded, 0 newly installed, 0 to remove and 460 not upgraded.
Gcet@gfl1-5:~$ sudo apt-get install ssh
Reading package lists... Done
Building dependency tree
Reading state information... Done
ssh is already the newest version.
0 upgraded, 0 newly installed, 0 to remove and 460 not upgraded.
Gcet@gfl1-5:~$ sudo apt-get install rsync
Reading package lists... Done
Building dependency tree
Reading state information... Done
rsync is already the newest version.
0 upgraded, 0 newly installed, 0 to remove and 460 not upgraded.
6. Set the environment variable for Java:
Gcet@gfl1-5:~$ export JAVA_HOME=/usr/lib/jvm/java-1.7.0-openjdk-i386
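The export above only lasts for the current shell session. To make it persistent, the same line can be appended to ~/.bashrc (a sketch; the JDK path matches the OpenJDK 7 install shown above):

```shell
# Append JAVA_HOME to ~/.bashrc so it is set in every new shell,
# then reload the file so it takes effect in the current session.
echo 'export JAVA_HOME=/usr/lib/jvm/java-1.7.0-openjdk-i386' >> ~/.bashrc
. ~/.bashrc
echo "$JAVA_HOME"
```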
7. Running the examples on a single system requires input and output directories:
Gcet@gfl1-5:~$ mkdir input1
Gcet@gfl1-5:~$ mkdir output1
8. Now copy an XML file from the Hadoop configuration folder to the input folder:
Gcet@gfl1-5:~$ cp /usr/local/hadoop/hadoop-2.7.3/etc/hadoop/capacity-scheduler.xml /home/Gcet/Desktop/input1
9. Now run the Hadoop examples:
Gcet@gfl1-5:~$ /usr/local/hadoop/hadoop-2.7.3/bin/hadoop jar /usr/local/hadoop/hadoop-2.7.3/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.7.3.jar grep /home/Gcet/Desktop/input1/ /home/Gcet/output1/output1 'principal[.]*'
17/07/26 15:15:51 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your
platform... using builtin-java classes where applicable
17/07/26 15:15:52 INFO Configuration.deprecation: session.id is deprecated. Instead, use
dfs.metrics.session-id
17/07/26 15:15:52 INFO jvm.JvmMetrics: Initializing JVM Metrics with
processName=JobTracker, sessionId=
17/07/26 15:15:52 INFO input.FileInputFormat: Total input paths to process : 2
17/07/26 15:15:52 INFO mapreduce.JobSubmitter: number of splits:2
17/07/26 15:15:53 INFO mapreduce.JobSubmitter: Submitting tokens for job:
job_local1790612813_0001
17/07/26 15:15:53 INFO mapreduce.Job: The url to track the job: http://localhost:8080/
17/07/26 15:15:53 INFO mapreduce.Job: Running job: job_local1790612813_0001
17/07/26 15:15:53 INFO mapred.LocalJobRunner: OutputCommitter set in config null
17/07/26 15:15:53 INFO output.FileOutputCommitter: File Output Committer Algorithm version
is 1
17/07/26 15:15:53 INFO mapred.LocalJobRunner: OutputCommitter is
org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter
17/07/26 15:15:53 INFO mapred.LocalJobRunner: Waiting for map tasks
17/07/26 15:15:53 INFO mapred.LocalJobRunner: Starting task:
attempt_local1790612813_0001_m_000000_0
17/07/26 15:15:53 INFO output.FileOutputCommitter: File Output Committer Algorithm version
is 1
17/07/26 15:15:53 INFO mapred.Task: Using ResourceCalculatorProcessTree : [ ]
...
17/07/26 15:15:55 INFO mapreduce.Job: Job job_local192240145_0002 running in uber mode :
false
17/07/26 15:15:55 INFO mapreduce.Job: map 100% reduce 100%
17/07/26 15:15:55 INFO mapreduce.Job: Job job_local192240145_0002 completed successfully
17/07/26 15:15:55 INFO mapreduce.Job: Counters: 30
File System Counters
FILE: Number of bytes read=1195494
FILE: Number of bytes written=2315812
FILE: Number of read operations=0
FILE: Number of large read operations=0
FILE: Number of write operations=0
Map-Reduce Framework
Map input records=0
Spilled Records=0
Shuffled Maps =1
GC time elapsed (ms)=10
Total committed heap usage (bytes)=854065152
Shuffle BAD_ID=0
CONNECTION=0
IO_ERROR=0
WRONG_LENGTH=0
WRONG_MAP=0
WRONG_REDUCE=0
File Input Format Counters
Bytes Read=98
File Output Format Counters
Bytes Written=8
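The grep example counts matches of the given regular expression in the input files and writes the result to the output directory passed on the command line. Its effect can be previewed with ordinary grep, and the result inspected afterwards (illustrative; paths as created in the steps above, and part-r-00000 is the standard MapReduce output file name):

```shell
# Plain grep gives the same match count the MapReduce grep job computes
# for a single local input file.
grep -o 'principal' /home/Gcet/Desktop/input1/capacity-scheduler.xml | wc -l

# The job itself writes its counts to a part file in the output directory.
cat /home/Gcet/output1/output1/part-r-00000
```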
Multi node cluster:
Steps:
We have two machines (master and slave) with IP:
Master IP: 192.168.56.102
Slave IP: 192.168.56.103
STEP 1: Check the IP address of all machines.
Command: ip addr show (you can use the ifconfig command as well)
STEP 2: Disable the firewall restrictions. On Ubuntu:
Command: sudo ufw disable
(On RHEL/CentOS the equivalents are service iptables stop and sudo chkconfig iptables off.)
STEP 3: Open hosts file to add master and data node with their respective IP addresses.
Command: sudo nano /etc/hosts
Same properties will be displayed in the master and slave hosts files.
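The screenshot of the hosts file is not reproduced here; with the IP addresses listed above, the added entries would look like:

```
192.168.56.102 master
192.168.56.103 slave
```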
STEP 4: Restart the SSH service.
Command: sudo service ssh restart
(On RHEL/CentOS the service is named sshd: service sshd restart.)
STEP 5: Create the SSH key on the master node. (Press the Enter key when it asks you for a filename in which to
save the key.)
Command: ssh-keygen -t rsa -P ""
STEP 6: Copy the generated ssh key to master node’s authorized keys.
Command: cat $HOME/.ssh/id_rsa.pub >> $HOME/.ssh/authorized_keys
STEP 7: Copy the master node’s ssh key to slave’s authorized keys.
Command: ssh-copy-id -i $HOME/.ssh/id_rsa.pub edureka@slave
STEP 8: Download the Java 8 package (jdk-8u101-linux-i586.tar.gz is used here) and save the file in your home directory.
STEP 9: Extract the Java Tar File on all nodes.
Command: tar -xvf jdk-8u101-linux-i586.tar.gz
STEP 10: Download the Hadoop 2.7.3 Package on all nodes.
Command: wget https://archive.apache.org/dist/hadoop/core/hadoop-2.7.3/hadoop-2.7.3.tar.gz
STEP 11: Extract the Hadoop tar File on all nodes.
Command: tar -xvf hadoop-2.7.3.tar.gz
STEP 12: Add the Hadoop and Java paths to the bash file (.bashrc) on all nodes.
Open the .bashrc file and add the Hadoop and Java paths:
Command: sudo gedit .bashrc
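The screenshot of the added lines is not reproduced here; a plausible set of additions, assuming Hadoop and the JDK were extracted into the home directory as in steps 9 to 11, is:

```shell
# Illustrative .bashrc additions -- adjust the paths to your actual extract locations.
export HADOOP_HOME=/home/edureka/hadoop-2.7.3
export JAVA_HOME=/home/edureka/jdk1.8.0_101
export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin:$JAVA_HOME/bin
```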
Then, save the bash file and close it.
For applying all these changes to the current Terminal, execute the source command.
Command: source .bashrc
To make sure that Java and Hadoop have been properly installed on your system and can be
accessed through the Terminal, execute the java -version and hadoop version commands.
Command: java -version
Command: hadoop version
Now edit the configuration files in hadoop-2.7.3/etc/hadoop directory.
STEP 13: Create a masters file and edit it as follows on both the master and slave machines:
Command: sudo gedit masters
STEP 14: Edit the slaves file on the master machine as follows:
Command: sudo gedit /home/edureka/hadoop-2.7.3/etc/hadoop/slaves
STEP 15: Edit the slaves file on the slave machine as follows:
Command: sudo gedit /home/edureka/hadoop-2.7.3/etc/hadoop/slaves
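The screenshots of these files are not reproduced here. With the hostnames used in this setup, plausible contents are: the masters file contains the single line master on both machines; the slaves file on each machine lists every host that should run a DataNode and NodeManager (include master itself only if it should also store data). For example, the slaves file on the master could contain:

```
master
slave
```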
STEP 16: Edit core-site.xml on both master and slave machines as follows:
Command: sudo gedit /home/edureka/hadoop-2.7.3/etc/hadoop/core-site.xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
  <property>
    <name>fs.default.name</name>
    <value>hdfs://master:9000</value>
  </property>
</configuration>
STEP 17: Edit hdfs-site.xml on the master as follows:
Command: sudo gedit /home/edureka/hadoop-2.7.3/etc/hadoop/hdfs-site.xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
  <property>
    <name>dfs.replication</name>
    <value>2</value>
  </property>
  <property>
    <name>dfs.permissions</name>
    <value>false</value>
  </property>
  <property>
    <name>dfs.namenode.name.dir</name>
    <value>/home/edureka/hadoop-2.7.3/namenode</value>
  </property>
  <property>
    <name>dfs.datanode.data.dir</name>
    <value>/home/edureka/hadoop-2.7.3/datanode</value>
  </property>
</configuration>
STEP 18: Edit hdfs-site.xml on slave machine as follows:
Command: sudo gedit /home/edureka/hadoop-2.7.3/etc/hadoop/hdfs-site.xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
  <property>
    <name>dfs.replication</name>
    <value>2</value>
  </property>
  <property>
    <name>dfs.permissions</name>
    <value>false</value>
  </property>
  <property>
    <name>dfs.datanode.data.dir</name>
    <value>/home/edureka/hadoop-2.7.3/datanode</value>
  </property>
</configuration>
STEP 19: Copy mapred-site.xml from the template in the configuration folder and then edit mapred-site.xml on both
master and slave machines as follows:
Command: cp mapred-site.xml.template mapred-site.xml
Command: sudo gedit /home/edureka/hadoop-2.7.3/etc/hadoop/mapred-site.xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
  <property>
    <name>mapreduce.framework.name</name>
    <value>yarn</value>
  </property>
</configuration>
STEP 20: Edit yarn-site.xml on both master and slave machines as follows:
Command: sudo gedit /home/edureka/hadoop-2.7.3/etc/hadoop/yarn-site.xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
  <property>
    <name>yarn.nodemanager.aux-services</name>
    <value>mapreduce_shuffle</value>
  </property>
  <property>
    <name>yarn.nodemanager.aux-services.mapreduce_shuffle.class</name>
    <value>org.apache.hadoop.mapred.ShuffleHandler</value>
  </property>
</configuration>
STEP 21: Format the namenode (Only on master machine).
Command: hadoop namenode -format
STEP 22: Start all daemons (Only on master machine).
Command: ./sbin/start-all.sh
STEP 23: Check the daemons running on both the master and slave machines.
Command: jps
On the master you should typically see NameNode, SecondaryNameNode and ResourceManager (plus DataNode and NodeManager if the master is also listed in the slaves file); on the slave, DataNode and NodeManager.
Finally, open a browser on the master machine and go to master:50070/dfshealth.html; this brings up the
NameNode web interface. Scroll down to the number of live nodes: if it is 2, you have successfully set up a
multi-node Hadoop cluster. If it is not 2, you have probably missed one of the steps above; go back, verify
the configurations, and correct any issues.