Bansilal Ramnath Agarwal Charitable Trust's
Vishwakarma Institute of Information Technology
Department of
Artificial Intelligence and Data Science
Student Name: Vaishnavi Lawate
Class: TY Division: A Roll No: 371032
Semester: 6th Academic Year: 2024-25
Subject Name & Code: Natural Language Processing
Title of Assignment: 6) Implement CKY algorithm for deep parsing
Aim: Implement the CKY algorithm for deep parsing.
Background Theory:
Introduction
Parsing is a fundamental task in Natural Language Processing (NLP), where the structure of
a sentence is analyzed based on grammar rules. The CKY algorithm (Cocke–Kasami–
Younger) is a well-known parsing algorithm for context-free grammars, especially in
Chomsky Normal Form (CNF). When extended with probabilities, it becomes the
Probabilistic CKY (PCKY) algorithm, which is used to find the most likely parse tree of a
sentence.
CKY Algorithm
What is CKY?
The CKY (Cocke–Kasami–Younger) algorithm is a bottom-up dynamic programming
algorithm used to parse a string and determine whether it can be generated by a context-free
grammar (CFG) in CNF.
Note: Chomsky Normal Form (CNF) is a format where every production rule is either:
• A → BC (where B and C are non-terminal symbols)
• A → a (where a is a terminal symbol)
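Because every CNF rule has one of these two shapes, a grammar can be stored as two simple lookup tables. The sketch below is an illustrative, hand-coded representation (the variable names and dictionary layout are this sketch's own, not from any library):

```python
# Hypothetical Python representation of a CNF grammar: binary rules
# (A -> B C) and lexical rules (A -> 'a') live in separate tables.
binary_rules = {
    ("NP", "VP"): ["S"],   # S -> NP VP
    ("Det", "N"): ["NP"],  # NP -> Det N
    ("V", "NP"): ["VP"],   # VP -> V NP
}
lexical_rules = {
    "the": ["Det"],        # Det -> 'the'
    "dog": ["N"],          # N -> 'dog'
    "chased": ["V"],       # V -> 'chased'
}

# Each lookup answers: "which non-terminals can produce this pair/word?"
print(binary_rules[("Det", "N")])  # non-terminals deriving the pair Det N
print(lexical_rules["dog"])        # non-terminals deriving the word 'dog'
```

Keeping the rules keyed by their right-hand side makes the CKY combination step a constant-time dictionary lookup.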
Working of CKY
Given a string of words w1, w2, ..., wn and a grammar in CNF:
1. Initialize a 2D table (n x n) where each cell [i][j] stores the set of non-terminal
symbols that can generate the substring from wi to wj.
2. Fill the diagonal: For each word wi, add all non-terminals A such that A → wi is a
production rule.
3. Bottom-up filling: For substrings longer than 1, check for all splits k between i and j,
and for all pairs of non-terminals (B in [i][k], C in [k+1][j]), add A to [i][j] if there's a
rule A → BC.
4. Check the top cell: If the start symbol S is in [0][n-1], then the string can be
generated by the grammar.
Example
Given:
• Grammar in CNF:
o S → NP VP
o NP → Det N
o VP → V NP
o Det → 'the'
o N → 'dog' | 'cat'
o V → 'chased'
• Input: "the dog chased the cat"
CKY fills a table where each cell holds possible non-terminals that derive the substring. At
the end, it checks if 'S' (start symbol) is in the topmost cell.
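The four steps above can be sketched as a small recognizer. This is a minimal illustration using the example grammar, with the rule tables hand-coded as dictionaries (the function and variable names are this sketch's own):

```python
# Minimal CKY recognizer: returns True iff the grammar derives the sentence.
def cky_recognize(words, binary, lexical, start="S"):
    n = len(words)
    # Step 1: table[i][j] holds non-terminals deriving words[i..j] (inclusive)
    table = [[set() for _ in range(n)] for _ in range(n)]
    # Step 2: fill the diagonal from lexical rules A -> w_i
    for i, w in enumerate(words):
        table[i][i] = set(lexical.get(w, []))
    # Step 3: bottom-up over span lengths, trying every split point k
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length - 1
            for k in range(i, j):
                for B in table[i][k]:
                    for C in table[k + 1][j]:
                        for A in binary.get((B, C), []):
                            table[i][j].add(A)
    # Step 4: accept iff the start symbol spans the whole string
    return start in table[0][n - 1]

binary = {("NP", "VP"): ["S"], ("Det", "N"): ["NP"], ("V", "NP"): ["VP"]}
lexical = {"the": ["Det"], "dog": ["N"], "cat": ["N"], "chased": ["V"]}

print(cky_recognize("the dog chased the cat".split(), binary, lexical))  # True
print(cky_recognize("dog the chased".split(), binary, lexical))          # False
```

The triple loop over span length, start position, and split point is exactly what gives the O(n³) factor in the complexity analysis below.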
Time and Space Complexity
• Time Complexity: O(n³ * |G|), where n is the number of words, and |G| is the number
of grammar rules.
• Space Complexity: O(n²)
Probabilistic CKY (PCKY)
What is Probabilistic CKY?
Probabilistic CKY extends CKY using Probabilistic Context-Free Grammar (PCFG), where
each production rule has a probability.
Goal: Instead of just checking if a sentence can be generated, PCKY finds the most probable
parse tree based on the given probabilities.
How PCFG Works
In a PCFG, each rule has a probability:
• S → NP VP [1.0]
• NP → Det N [0.6], NP → 'dog' [0.4]
• Det → 'the' [0.8]
• N → 'cat' [0.5], N → 'dog' [0.5]
The probability of a parse tree is the product of all rule probabilities used in the derivation.
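For instance, using the rules listed above, the probability of the NP subtree covering "the cat" is the product of the three rules used in its derivation (a small arithmetic check, not library code):

```python
# NP over "the cat" uses: NP -> Det N [0.6], Det -> 'the' [0.8], N -> 'cat' [0.5]
p_np = 0.6 * 0.8 * 0.5
print(round(p_np, 2))  # 0.24
```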
Working of Probabilistic CKY
• Similar table-filling as CKY, but each cell also stores the maximum probability for
each non-terminal.
• Instead of just storing a non-terminal, each cell stores a tuple (Non-terminal,
Probability, Backpointer).
• When combining rules (A → B C), compute the probability:
P(A) = P(B in left) * P(C in right) * P(A → B C)
• At the end, choose the parse tree with the highest probability.
Parse Tree Reconstruction
Using the backpointers stored in each cell, we can trace back to construct the most probable
parse tree.
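The PCKY table-filling and backpointer reconstruction described above can be sketched as follows. The grammar, probabilities, and all function names here are illustrative and hand-coded for this example, not a library API:

```python
# Sketch of probabilistic CKY (Viterbi parsing) with backpointers.
# Rule tables map right-hand sides to (left-hand side, probability) pairs.
binary = {("Det", "N"): [("NP", 0.6)], ("V", "NP"): [("VP", 1.0)],
          ("NP", "VP"): [("S", 1.0)]}
lexical = {"the": [("Det", 0.8)], "cat": [("N", 0.5)],
           "dog": [("N", 0.5)], "chased": [("V", 1.0)]}

def pcky(words, start="S"):
    n = len(words)
    # best[i][j][A] = (max probability, backpointer) for A over words[i..j]
    best = [[dict() for _ in range(n)] for _ in range(n)]
    for i, w in enumerate(words):
        for A, p in lexical.get(w, []):
            best[i][i][A] = (p, w)                      # leaf: word backpointer
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length - 1
            for k in range(i, j):
                for B, (pb, _) in best[i][k].items():
                    for C, (pc, _) in best[k + 1][j].items():
                        for A, pr in binary.get((B, C), []):
                            p = pb * pc * pr            # P(B) * P(C) * P(A -> B C)
                            if p > best[i][j].get(A, (0.0, None))[0]:
                                best[i][j][A] = (p, (k, B, C))
    return best

def build_tree(best, i, j, A):
    # Follow backpointers to reconstruct the most probable parse tree.
    _, bp = best[i][j][A]
    if isinstance(bp, str):                             # leaf cell
        return (A, bp)
    k, B, C = bp
    return (A, build_tree(best, i, k, B), build_tree(best, k + 1, j, C))

words = "the dog chased the cat".split()
best = pcky(words)
prob, _ = best[0][len(words) - 1]["S"]
print(build_tree(best, 0, len(words) - 1, "S"))
print(round(prob, 4))  # 0.0576
```

Here the two NP subtrees each have probability 0.24, so the full tree scores 0.24 × 0.24 × 1.0 = 0.0576.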
Differences Between CKY and PCKY
Feature       CKY                            Probabilistic CKY (PCKY)
Input         CFG in CNF                     PCFG in CNF
Output        Whether the sentence can be    Most probable parse tree
              generated
Data stored   Set of non-terminals           Non-terminals + probabilities + backpointers
Usage         Parsing/validating structure   Probabilistic parsing, language modeling
Output tree   Any valid parse tree           Most probable parse tree
CODE AND OUTPUT:
from nltk import CFG
from nltk.parse.chart import ChartParser

# Define a simple context-free grammar in CNF
grammar = CFG.fromstring("""
S -> NP VP
VP -> V NP
NP -> Det N
Det -> 'the' | 'a'
N -> 'dog' | 'cat' | 'ball'
V -> 'chased' | 'saw'
""")

# Create a parser
parser = ChartParser(grammar)

# Sentences to parse
sentences = [
    "the dog chased a cat",
    "a cat saw the ball",
    "the dog saw a dog",
    "a dog chased the ball",
    "the cat saw a cat"
]

# Parse and print each tree
for sentence in sentences:
    print(f'\nParsing sentence: "{sentence}"')
    tokens = sentence.split()
    trees = list(parser.parse(tokens))
    for tree in trees:
        tree.pretty_print()
Parsing sentence: "the dog chased a cat"
                 S
      ___________|_____
     |                 VP
     |            _____|____
     NP          |          NP
  ___|___        |       ___|___
 Det     N       V      Det     N
  |      |       |       |      |
 the    dog   chased     a     cat

Parsing sentence: "a cat saw the ball"
               S
      _________|____
     |              VP
     |          ____|____
     NP        |         NP
  ___|___      |      ___|____
 Det     N     V     Det      N
  |      |     |      |       |
  a     cat   saw    the     ball

Parsing sentence: "the dog saw a dog"
               S
      _________|____
     |              VP
     |          ____|____
     NP        |         NP
  ___|___      |      ___|___
 Det     N     V     Det     N
  |      |     |      |      |
 the    dog   saw     a     dog

Parsing sentence: "a dog chased the ball"
                S
      __________|_____
     |                VP
     |           _____|_____
     NP         |           NP
  ___|___       |        ___|____
 Det     N      V       Det      N
  |      |      |        |       |
  a     dog  chased     the     ball

Parsing sentence: "the cat saw a cat"
               S
      _________|____
     |              VP
     |          ____|____
     NP        |         NP
  ___|___      |      ___|___
 Det     N     V     Det     N
  |      |     |      |      |
 the    cat   saw     a     cat
Conclusion
The CKY algorithm provides a powerful technique for parsing sentences using context-free
grammars. Its probabilistic extension, the PCKY algorithm, enables more intelligent parsing by
selecting the most likely parse tree. Together, they form a foundational concept in computational
linguistics and natural language understanding.