NLP Program 1

The document outlines two experiments in Natural Language Processing (NLP). The first experiment performs word analysis with NLTK: tokenization, part-of-speech tagging, stemming, and lemmatization. The second experiment generates new words character by character, starting from a uniform random baseline. Sample outputs are provided for both experiments.


EXP NO: 1 WORD ANALYSIS

AIM:

The aim of this program is to perform basic word analysis using Natural Language Processing (NLP) techniques: tokenization, part-of-speech (POS) tagging, stemming, and lemmatization with NLTK.

PROGRAM:

import nltk
from nltk.tokenize import word_tokenize
from nltk.tag import pos_tag
from nltk.stem import PorterStemmer, WordNetLemmatizer

# Download the NLTK data needed for tokenization, tagging, and lemmatization.
# On newer NLTK releases these resources may also be named 'punkt_tab' and
# 'averaged_perceptron_tagger_eng'.
nltk.download('punkt')
nltk.download('averaged_perceptron_tagger')
nltk.download('wordnet')

# Sample text for word analysis
text = "The quick brown fox jumps over the lazy dog."

# Step 1: Tokenization - splitting the text into individual words
tokens = word_tokenize(text)
print("Tokens:", tokens)

# Step 2: Part-of-Speech (POS) tagging - assigning a POS tag to each token
pos_tags = pos_tag(tokens)
print("\nPOS Tags:", pos_tags)

# Step 3: Stemming - reducing words to their root form
stemmer = PorterStemmer()
stems = [stemmer.stem(word) for word in tokens]
print("\nStems:", stems)

# Step 4: Lemmatization - reducing words to their base (dictionary) form.
# pos='v' treats every token as a verb: a simplification that correctly
# lemmatizes 'jumps' but leaves non-verbs unchanged.
lemmatizer = WordNetLemmatizer()
lemmas = [lemmatizer.lemmatize(word, pos='v') for word in tokens]
print("\nLemmas:", lemmas)

OUTPUT:

Tokens: ['The', 'quick', 'brown', 'fox', 'jumps', 'over', 'the', 'lazy', 'dog', '.']

POS Tags: [('The', 'DT'), ('quick', 'JJ'), ('brown', 'NN'), ('fox', 'NN'), ('jumps', 'VBZ'), ('over', 'IN'), ('the', 'DT'), ('lazy', 'JJ'), ('dog', 'NN'), ('.', '.')]

Stems: ['the', 'quick', 'brown', 'fox', 'jump', 'over', 'the', 'lazi', 'dog', '.']

Lemmas: ['The', 'quick', 'brown', 'fox', 'jump', 'over', 'the', 'lazy', 'dog', '.']

RESULTS:

Running the program on the sample sentence "The quick brown fox jumps over the lazy dog." produces the output shown above. Stemming crudely strips suffixes ('jumps' becomes 'jump', but 'lazy' becomes the non-word 'lazi'), while lemmatization with pos='v' returns valid dictionary forms, changing only the verb 'jumps' to 'jump'.
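As a possible extension (not part of the original experiment), the lemmatizer can reuse the POS tags from Step 2 instead of assuming every token is a verb. The sketch below assumes the lemmatizer and pos_tags variables from the program above; the pos_to_wordnet helper is a hypothetical name for a mapping from Penn Treebank tags to WordNet POS constants.

from nltk.corpus import wordnet

# Hypothetical helper: map a Penn Treebank tag to a WordNet POS constant.
def pos_to_wordnet(tag):
    if tag.startswith('J'):
        return wordnet.ADJ
    if tag.startswith('V'):
        return wordnet.VERB
    if tag.startswith('R'):
        return wordnet.ADV
    return wordnet.NOUN  # default to noun for everything else

# Lemmatize each token with its own tag instead of a blanket pos='v'.
lemmas_tagged = [lemmatizer.lemmatize(word, pos_to_wordnet(tag))
                 for word, tag in pos_tags]
print("POS-aware Lemmas:", lemmas_tagged)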

EXP NO: 2 WORD GENERATION

AIM:

The aim of this program is to generate new words or text sequences from a given character set, starting with a simple uniform-random baseline that can be extended into a character-level n-gram model.

PROGRAM:

import random

# Character inventory to sample from (the lowercase English alphabet)
corpus = "abcdefghijklmnopqrstuvwxyz"

# Generate a word by sampling characters uniformly at random
def generate_word(length):
    return "".join(random.choice(corpus) for _ in range(length))

# Generate a word of length 6
new_word = generate_word(6)
print("Generated Word:", new_word)

OUTPUT:

Generated Word: tnwaey

RESULTS:

This program demonstrates the mechanics of assembling new words character by character. Because every character is drawn uniformly at random, it does not yet predict which characters tend to follow which; the bigram sketch below adds that prediction step.
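To match the stated aim, here is a minimal sketch of a character-level bigram (2-gram) model. The training_words list and the generate_ngram_word function are illustrative additions, not part of the original program; any word list would work as training data.

import random
from collections import defaultdict

# Illustrative training data (hypothetical; any word list works).
training_words = ["banana", "bandana", "cabana", "canal", "band"]

# Count which characters follow which in the training words.
# '^' marks the start of a word and '$' marks the end.
transitions = defaultdict(list)
for w in training_words:
    chars = "^" + w + "$"
    for a, b in zip(chars, chars[1:]):
        transitions[a].append(b)

# Generate a word by repeatedly sampling the next character
# from those observed after the current one.
def generate_ngram_word(max_length=10):
    word, current = "", "^"
    for _ in range(max_length):
        nxt = random.choice(transitions[current])
        if nxt == "$":
            break
        word += nxt
        current = nxt
    return word

print("Generated Word:", generate_ngram_word())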
