1. Program to Perform Basic Word Analysis Using Natural Language Processing (NLP)
Aim
To perform basic word analysis using Natural Language Processing (NLP) techniques in Python
with the NLTK library.
Algorithm
Step 1: Start the program.
Step 2: Import the required modules from the NLTK library:
word_tokenize, pos_tag, PorterStemmer, WordNetLemmatizer.
Step 3: Download the required NLTK datasets:
- punkt (for tokenization)
- averaged_perceptron_tagger (for POS tagging)
- wordnet (for lemmatization)
Step 4: Define a sample text for analysis.
Step 5: Tokenize the text into words.
Step 6: Perform POS tagging on the tokens.
Step 7: Apply stemming to each token.
Step 8: Apply lemmatization to each token.
Step 9: Display the tokens, POS tags, stems, and lemmas.
Step 10: End the program.
PROGRAM:
import nltk
from nltk.tokenize import word_tokenize
from nltk.tag import pos_tag
from nltk.stem import PorterStemmer
from nltk.stem import WordNetLemmatizer
# Download necessary NLTK data
nltk.download('punkt')
nltk.download('averaged_perceptron_tagger')
nltk.download('wordnet')
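# Note: depending on the NLTK version, additional resources such as 'punkt_tab' may also need to be downloaded for tokenization.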
# Sample text for word analysis
text = "The quick brown fox jumps over the lazy dog."
# Step 1: Tokenization
tokens = word_tokenize(text)
print("Tokens:", tokens)
# Step 2: POS Tagging
pos_tags = pos_tag(tokens)
print("\nPOS Tags:", pos_tags)
# Step 3: Stemming
stemmer = PorterStemmer()
stems = [stemmer.stem(word) for word in tokens]
print("\nStems:", stems)
# Step 4: Lemmatization
lemmatizer = WordNetLemmatizer()
lemmas = [lemmatizer.lemmatize(word, pos='v') for word in tokens]
print("\nLemmas:", lemmas)
OUTPUT:
Tokens: ['The', 'quick', 'brown', 'fox', 'jumps', 'over', 'the', 'lazy', 'dog', '.']
POS Tags: [('The', 'DT'), ('quick', 'JJ'), ('brown', 'NN'), ('fox', 'NN'), ('jumps', 'VBZ'),
('over', 'IN'), ('the', 'DT'), ('lazy', 'JJ'), ('dog', 'NN'), ('.', '.')]
Stems: ['the', 'quick', 'brown', 'fox', 'jump', 'over', 'the', 'lazi', 'dog', '.']
Lemmas: ['The', 'quick', 'brown', 'fox', 'jump', 'over', 'the', 'lazy', 'dog', '.']
Result:
Thus, the Python program to perform basic word analysis using NLP techniques was executed
and verified successfully.
2. Program to Generate a Random Word Using a Given Corpus
Aim
To write a Python program that generates a random word of a given length using a predefined set
of characters (corpus).
Algorithm
Step 1: Start the program.
Step 2: Import the random module.
Step 3: Define a sample corpus containing lowercase English alphabets.
Step 4: Define a function generate_word(length) to create a random word:
4.1: Use a loop to select random characters from the corpus.
4.2: Join them into a single string.
4.3: Return the generated word.
Step 5: Call the function with a specified length (e.g., 6).
Step 6: Display the generated word.
Step 7: End the program.
Program
import random
# Sample corpus of characters
corpus = "abcdefghijklmnopqrstuvwxyz"
# Function to generate a new word
def generate_word(length):
    word = "".join(random.choice(corpus) for _ in range(length))
    return word
# Generate a word of length 6
new_word = generate_word(6)
print("Generated Word:", new_word)
Output
Generated Word: tnwaey
Result
Thus, the Python program to generate a random word from a given corpus was executed and
verified successfully.
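The same idea extends from a character corpus to a word-level corpus, picking whole words instead of single characters. The sketch below is illustrative only; the word list and the name generate_sentence are hypothetical examples, not part of the recorded program.
import random
# Hypothetical word-level corpus; any list of words would work
word_corpus = ["data", "model", "token", "corpus", "parse", "learn"]
def generate_sentence(n_words):
    # Pick n_words random entries from the corpus and join them with spaces
    return " ".join(random.choice(word_corpus) for _ in range(n_words))
print("Generated Sentence:", generate_sentence(4))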
3. Program to Perform Morphological Analysis Using Stemming and Lemmatization
AIM
To perform morphological analysis in Natural Language Processing (NLP) using stemming and
lemmatization, two common techniques for analysing the structure of words and reducing them
to their base forms.
ALGORITHM
Step 1: Start the program.
Step 2: Import PorterStemmer and WordNetLemmatizer from the nltk.stem module.
Step 3: Download the WordNet dataset for lemmatization.
Step 4: Create a list of words for morphological analysis.
Step 5: Initialise the stemmer and lemmatizer objects.
Step 6: For each word in the list:
6.1: Find the stem using stemmer.stem(word).
6.2: Find the lemma using lemmatizer.lemmatize(word, pos='v').
6.3: Display the original word, stem, and lemma in a tabular format.
Step 7: End the program.
PROGRAM
import nltk
from nltk.stem import PorterStemmer
from nltk.stem import WordNetLemmatizer
# Download necessary NLTK data
nltk.download('wordnet')
# Sample list of words for morphological analysis
words = ["running", "jumps", "easily", "fairly", "happier"]
# Initialize the stemmer and lemmatizer
stemmer = PorterStemmer()
lemmatizer = WordNetLemmatizer()
# Perform stemming and lemmatization
print(f"{'Word':<10} {'Stem':<10} {'Lemma':<10}")
for word in words:
    stem = stemmer.stem(word)
    lemma = lemmatizer.lemmatize(word, pos='v')  # 'v' for verb
    print(f"{word:<10} {stem:<10} {lemma:<10}")
OUTPUT
Word       Stem       Lemma
running    run        run
jumps      jump       jump
easily     easili     easily
fairly     fairli     fairly
happier    happier    happier
RESULT
Thus, the Python program to perform morphological analysis using stemming and lemmatization
was executed and verified successfully.
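Note that the lemma returned by lemmatize() depends on the pos argument: 'happier' is unchanged above because it was looked up as a verb. A minimal sketch, assuming the same WordNet data downloaded in the program, is shown below.
from nltk.stem import WordNetLemmatizer
lemmatizer = WordNetLemmatizer()
# Looked up as an adjective, the comparative form typically reduces to its base word
print(lemmatizer.lemmatize("happier", pos='a'))  # expected output: happy
# Looked up as a verb (the setting used in the program), it is returned unchanged
print(lemmatizer.lemmatize("happier", pos='v'))  # expected output: happier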