1.
Map Reduce program to count the number of occurrences of each word in a
given input text.
driver.java
package wordcount;
import java.io. *;
import java.util.*;
import org.apache.hadoop.mapred.*;
import org.apache.hadoop.io.*;
import org.apache.hadoop.fs.Path;
/**
 * Driver for the word-count MapReduce job (classic org.apache.hadoop.mapred API).
 *
 * Usage: hadoop jar wordcount.jar &lt;input path&gt; &lt;output path&gt;
 *
 * Wires the mapper and reducer classes into a JobConf, sets the output
 * key/value types emitted by both phases, and submits the job synchronously.
 */
public class driver
{
    public static void main(String[] args) throws IOException
    {
        // Fail fast with a usage message instead of an ArrayIndexOutOfBoundsException
        // when the input/output paths are missing.
        if (args.length < 2) {
            System.err.println("Usage: driver <input path> <output path>");
            System.exit(2);
        }
        JobConf conf = new JobConf(driver.class);
        // Name the job so it is identifiable in the JobTracker/YARN UI.
        conf.setJobName("wordcount");
        conf.setMapperClass(mapper.class);
        // The reducer is associative and commutative (integer sum), so it can
        // also run as a combiner to cut down shuffle traffic.
        conf.setCombinerClass(reducer.class);
        conf.setReducerClass(reducer.class);
        // Key/value types emitted by the map and reduce phases: (word, count).
        conf.setOutputKeyClass(Text.class);
        conf.setOutputValueClass(IntWritable.class);
        FileInputFormat.addInputPath(conf, new Path(args[0]));
        // Output directory must not already exist, or the job will fail.
        FileOutputFormat.setOutputPath(conf, new Path(args[1]));
        // Submit and block until the job completes.
        JobClient.runJob(conf);
    }
}
mapper.java
package wordcount;
import java.io.*;
import java.util.*;
import org.apache.hadoop.mapred.*;
import org.apache.hadoop.io.*;
/**
 * Map phase of word count: for every whitespace-separated token in an input
 * line, emits the pair (token, 1).
 *
 * Input:  (byte offset of the line, line text)
 * Output: (word, 1)
 */
public class mapper extends MapReduceBase implements Mapper<LongWritable, Text, Text,
IntWritable> {
    // Constant count of 1, shared across all emissions to avoid reallocation.
    private static final IntWritable ONE = new IntWritable(1);
    // Reused holder for the current token; Hadoop copies it on collect.
    private final Text currentWord = new Text();

    /**
     * Tokenizes one line of input and emits (word, 1) for each token.
     *
     * @param key      byte offset of the line within the input split (unused)
     * @param value    the line of text to tokenize
     * @param output   collector receiving the (word, 1) pairs
     * @param reporter progress reporter (unused)
     * @throws IOException if the collector fails
     */
    public void map(LongWritable key, Text value, OutputCollector<Text, IntWritable> output,
Reporter reporter)
            throws IOException {
        // StringTokenizer splits on the default delimiters (space, tab, newline, etc.).
        for (StringTokenizer tokens = new StringTokenizer(value.toString());
                tokens.hasMoreTokens(); ) {
            currentWord.set(tokens.nextToken());
            output.collect(currentWord, ONE);
        }
    }
}
reducer.java
package wordcount;
import java.io.*;
import java.util.*;
import org.apache.hadoop.mapred.*;
import org.apache.hadoop.io.*;
/**
 * Reduce phase of word count: sums all the partial counts received for a
 * single word and emits (word, total).
 *
 * Input:  (word, [1, 1, ...])
 * Output: (word, total occurrences)
 */
public class reducer extends MapReduceBase implements Reducer<Text, IntWritable, Text,
IntWritable> {
    /**
     * Accumulates the counts for one key and emits the total.
     *
     * @param key      the word being counted
     * @param values   iterator over the per-occurrence counts for this word
     * @param output   collector receiving the (word, total) pair
     * @param reporter progress reporter (unused)
     * @throws IOException if the collector fails
     */
    public void reduce(Text key, Iterator<IntWritable> values, OutputCollector<Text, IntWritable>
output,
            Reporter reporter) throws IOException {
        int total = 0;
        // Drain the iterator, adding each partial count to the running total.
        for (Iterator<IntWritable> it = values; it.hasNext(); ) {
            total += it.next().get();
        }
        output.collect(key, new IntWritable(total));
    }
}
Steps to run
1. Create a New File named Bash.sh
2. Copy the Below code and Paste inside Bash.sh and save that File.
export JAVA_HOME=$(readlink -f $(which javac) | awk 'BEGIN {FS="/bin"} {print $1}')
export PATH=$(echo $PATH):$(pwd)/bin
export CLASSPATH=$(hadoop classpath)
3. Execute the Bash.sh file with the following command: source Bash.sh
4. Verify that the JAVA_HOME variable is set to the Java installation path and that the
PATH variable includes your Hadoop folder.
If any previous PATH entry points to an old Hadoop folder, remove it from the .bashrc file.
5. Verify that Hadoop is installed by executing the hadoop command. If the command prints
usage information about Hadoop, then Hadoop is successfully installed.
6. Create a folder named wordcount (matching the package name) and move into that folder.
7. Make the driver.java , mapper.java and reducer.java files.
8. Compile all java files (driver.java mapper.java reducer.java)
javac -d . *.java
9. Set driver class in manifest
echo Main-Class: wordcount.driver > Manifest.txt
10. Create an executable jar file
jar cfm wordcount.jar Manifest.txt wordcount/*.class
11. Create the input file input.txt:
echo "hello good morning, hello have a nice day" > input.txt
12. Run the jar file
hadoop jar wordcount.jar input.txt output
13. To see the Output
cat output/*