
Unit - II

What is Syntactic Processing?

Syntactic processing is the process of analyzing the grammatical structure of a sentence to
understand its meaning. This involves identifying the different parts of speech in a sentence,
such as nouns, verbs, adjectives, and adverbs, and how they relate to each other in order to
give proper meaning to the sentence. Let’s start with an example to understand Syntactic
Processing:

 Washington, D.C. is the capital of the United States of America.


 Is the United States of America the of Washington, D.C. capital.

If we observe closely, both sentences contain the same set of words, but only the first one is
grammatically correct and conveys a proper meaning. If we approach both sentences with
lexical processing techniques alone, we cannot tell the difference between them. This is where
syntactic processing techniques come in: they help us understand the relationships between the
individual words in a sentence.
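To make this concrete, the short Python sketch below compares a grammatical sentence with a scrambled version of it as bags of words. The sentences are toy examples; the point is that a purely lexical representation assigns both exactly the same features.

```python
# A purely lexical (bag-of-words) view cannot distinguish a grammatical
# sentence from a scrambled one: both contain the same multiset of words.

from collections import Counter

def bag_of_words(sentence):
    # Lowercase each word and strip sentence-final punctuation.
    return Counter(w.strip(".").lower() for w in sentence.split())

s1 = "The cat sat on the mat."
s2 = "Mat the on sat the cat."

print(bag_of_words(s1) == bag_of_words(s2))  # True: lexically identical
```

Only a representation that captures word order and structure, i.e. syntactic processing, can tell the two apart.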

Difference between Lexical Processing and Syntactic Processing

Lexical processing aims at data cleaning and feature extraction, using techniques such
as lemmatization, stopword removal, and spelling correction. In syntactic processing, by
contrast, the aim is to understand the role played by each word in the sentence and the
relationships among the words, and to parse the grammatical structure of the sentence to
arrive at its proper meaning.

How Does Syntactic Processing Work?

To understand how syntactic processing works, let's again start with an example. For
example, consider the sentence “The cat sat on the mat.” Syntactic processing would involve
identifying important components in the sentence such as “cat” as a noun, “sat” as a verb,
“on” as a preposition, and “mat” as a noun. It would also involve understanding that “cat” is
the subject of the sentence and “mat” is the object.

Syntactic processing involves a series of steps, including tokenization, part-of-speech tagging,
parsing, and semantic analysis. Tokenization is the process of breaking up a sentence into
individual words or tokens. Part-of-speech (PoS) tagging involves identifying the part of
speech of each token. Parsing is the process of analyzing the grammatical structure of a
sentence, including identifying the subject, verb, and object. The semantic analysis involves
understanding the meaning of the sentence in context.

There are several different techniques used in syntactic processing, including rule-based
methods, statistical methods, and machine learning algorithms. Each technique has its own
strengths and weaknesses, and the choice of technique depends on the specific task and the
available data.

Why is Syntactic Processing Important in NLP?


Syntactic processing is a crucial component of many NLP tasks, including machine
translation, sentiment analysis, and question answering. Without accurate syntactic
processing, it is difficult for computers to understand the underlying meaning of human
language. Syntactic processing also plays an important role in text generation, such as in
chatbots or automated content creation.

Syntactic Structure and its Components


Syntactic structure refers to the arrangement of words or phrases to form grammatically
correct sentences in a language. It involves several components that organize and govern the
way words come together to convey meaning. The fundamental components of syntactic
structure include:
Phrases: Phrases are groups of words functioning as a single unit within a sentence. They can
be noun phrases (NP), verb phrases (VP), prepositional phrases (PP), etc.

Words/Word Classes: Words are the basic building blocks of language. Different word
classes (parts of speech) include nouns, verbs, adjectives, adverbs, prepositions, conjunctions,
determiners, etc. Each word class has its own role and function within a sentence.

Constituents: Constituents are smaller units within a sentence that form larger structures. For
instance, in the sentence "The cat chased the mouse," "the cat" and "chased the mouse" are
constituents that make up the larger sentence.

Syntax Rules: These are the rules or principles that dictate the acceptable arrangement of
words to form grammatically correct sentences in a language. They govern how words can
combine to create phrases and sentences.

Syntax Trees/Parse Trees: These graphical representations illustrate the hierarchical
structure of a sentence. They show how different constituents (phrases) are nested within one
another to form a complete sentence, with the main elements branching out into smaller units.

Syntax refers to the set of rules, principles, and processes that govern the structure of
sentences in a natural language. One basic description of syntax is how different units such as
subjects, verbs, nouns, noun phrases, etc. are sequenced in a sentence.
Some of the syntactic categories of a natural language are as follows:

 Sentence(S)
 Noun Phrase(NP)
 Determiner(Det)
 Verb Phrase(VP)
 Prepositional Phrase(PP)
 Verb(V)
 Noun(N)
Syntax Tree:
A Syntax tree or a parse tree is a tree representation of different syntactic categories of a
sentence. It helps us to understand the syntactical structure of a sentence.
Example:
The syntax tree for the sentence given below is as follows:
I drive a car to my college.
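Since the tree diagram reproduces most easily in text form, the sketch below encodes one plausible analysis of the sentence as nested Python tuples of the form (label, children...) and prints it with indentation. The bracketing chosen is illustrative; other analyses are possible.

```python
# One plausible syntax tree for "I drive a car to my college",
# encoded as nested (label, children...) tuples.

tree = ("S",
        ("NP", ("N", "I")),
        ("VP",
         ("V", "drive"),
         ("NP", ("Det", "a"), ("N", "car")),
         ("PP", ("P", "to"),
          ("NP", ("Det", "my"), ("N", "college")))))

def show(node, indent=0):
    # Leaves are (label, word); internal nodes are (label, child, ...).
    if len(node) == 2 and isinstance(node[1], str):
        print("  " * indent + f"{node[0]} -> {node[1]}")
    else:
        print("  " * indent + node[0])
        for child in node[1:]:
            show(child, indent + 1)

show(tree)
```

Reading the leaves from left to right recovers the original sentence, which is exactly the property a parse tree must have.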

Clauses: Clauses are units that contain a subject and a predicate. They can be independent
(complete sentences) or dependent (incomplete sentences that rely on an independent clause
to make complete sense).

Understanding the syntactic structure of a language helps in analyzing sentences, identifying
grammatical patterns, and constructing sentences that convey intended meanings effectively.
By understanding the grammatical structure of a sentence, computers can generate more
natural and fluent textual content.
What is Grammar?
Grammar is defined as the set of rules for forming well-structured sentences. Grammar plays
an essential role in describing the syntactic structure of well-formed sentences. In simple
words, grammar denotes the syntactical rules used for conversation in natural languages. The
theory of formal languages is applicable not only here but also in computer science, mainly in
programming languages and data structures.
Chomsky's hierarchy encompasses different types of grammars, but the most relevant type
for natural language processing (NLP) is the Context-Free Grammar (CFG), which falls under
Type 2 in Chomsky's classification.

Context-Free Grammars (CFGs) have significant relevance in NLP for modeling the syntax or
structure of natural languages. Here's how CFGs relate to natural language:

1. Syntax Modeling: CFGs are used to describe the syntax of natural languages by
defining rules that specify how valid sentences can be formed from constituents like
nouns, verbs, adjectives, etc. These grammars help capture the hierarchical structure of
sentences in a language.
2. Phrase Structure: CFGs define the phrase structure of sentences, breaking them
down into constituents such as noun phrases (NP), verb phrases (VP), prepositional
phrases (PP), etc. These constituents are formed by recursive rules defined in the
grammar.
3. Parsing: CFGs are crucial in parsing natural language sentences. Parsing involves
analyzing the syntactic structure of sentences according to the rules specified in the
grammar. Techniques like top-down and bottom-up parsing algorithms use CFGs to
generate parse trees for sentences.
4. Formal Representation: CFGs formalize the rules governing the arrangement of
words in a sentence. These rules dictate how words and phrases can be combined to
form grammatically correct sentences.
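As a small illustration of points 1 and 2, a CFG can be written directly as Python data, with each non-terminal mapping to its possible expansions; recursively expanding from the start symbol S then generates grammatical strings. The grammar below is a toy, not a model of real English.

```python
# A toy CFG as a dict: each non-terminal maps to a list of possible
# right-hand sides. Symbols with no rule are terminals (words).

import random

GRAMMAR = {
    "S":   [["NP", "VP"]],
    "NP":  [["Det", "N"]],
    "VP":  [["V", "NP"]],
    "Det": [["the"], ["a"]],
    "N":   [["man"], ["burger"]],
    "V":   [["ate"]],
}

def generate(symbol, rng=random):
    # Terminals (no rule in the grammar) are emitted as-is.
    if symbol not in GRAMMAR:
        return [symbol]
    words = []
    for sym in rng.choice(GRAMMAR[symbol]):
        words += generate(sym, rng)
    return words

print(" ".join(generate("S")))  # e.g. "the man ate a burger"
```

Every string this grammar generates has the same Det N V Det N shape, which is the hierarchical phrase structure the rules encode.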

Example: generating a parse tree using grammar rules


Sentence: The man ate the burger
Parse Tree: (S (NP (Det The) (N man)) (VP (V ate) (NP (Det the) (N burger))))

Parsing that starts from the sentence (start) symbol S and expands downward toward the
words is called top-down parsing.

The rules used above are called production rules. In natural language processing (NLP), a
production rule, also known as a rewrite rule or a grammar rule, describes how symbols in a
formal grammar can be replaced by other symbols or sequences of symbols. These rules
define the structure and syntax of a language, providing guidelines for generating valid
sentences or phrases.

Formally, a production rule consists of two parts:


1. Left-hand side (LHS): This is the symbol or non-terminal on the left side of the rule.
It represents a syntactic category or a symbol that can be expanded or replaced
according to the rule.
2. Right-hand side (RHS): This is the sequence of symbols on the right side of the rule.
It consists of terminals (actual words or symbols representing the basic units of the
language) and/or non-terminals (symbols that can be further expanded by other
production rules).
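The following sketch shows production rules in action: starting from S, the leftmost non-terminal is repeatedly replaced by the right-hand side of one of its rules. The sequence of rule applications is fixed here so that the derivation yields the example sentence.

```python
# A leftmost derivation driven by production rules (LHS -> RHS).

def leftmost_rewrite(sent, lhs, rhs):
    # Replace the leftmost occurrence of non-terminal `lhs` with `rhs`.
    i = sent.index(lhs)
    return sent[:i] + rhs + sent[i + 1:]

# Scripted rule applications deriving "the man ate the burger".
derivation = [
    ("S",   ["NP", "VP"]),
    ("NP",  ["Det", "N"]),
    ("Det", ["the"]),
    ("N",   ["man"]),
    ("VP",  ["V", "NP"]),
    ("V",   ["ate"]),
    ("NP",  ["Det", "N"]),
    ("Det", ["the"]),
    ("N",   ["burger"]),
]

sent = ["S"]
for lhs, rhs in derivation:
    sent = leftmost_rewrite(sent, lhs, rhs)
    print(" ".join(sent))  # each line is one step of the derivation
```

Each printed line is a sentential form; the last one contains only terminals, so the derivation is complete.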

Bottom-Up Parsing:
The essence of bottom-up parsing lies in starting with individual words or tokens and
gradually constructing larger syntactic units by applying grammar rules until the entire
input is successfully parsed.

Fig: Bottom up parsing

It involves the process of analyzing and constructing the structure of sentences or phrases by
starting from the individual words or tokens and working upwards to form higher-level
constituents based on the rules defined in a given grammar.

This approach is often used with context-free grammars (CFGs) or other formal grammars
and employs parsing techniques that iteratively build constituents from the bottom (individual
words) to the top (complete sentence or phrase).
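A minimal shift-reduce sketch of this idea: words are shifted onto a stack and reduced whenever the top of the stack matches the right-hand side of a rule. Greedy reduction happens to work for this toy grammar; real parsers need a strategy for resolving shift/reduce conflicts.

```python
# Bottom-up (shift-reduce) parsing over a toy grammar: shift words,
# then reduce while the top of the stack matches some rule's RHS.

RULES = [
    (("Det", "N"), "NP"),
    (("V", "NP"), "VP"),
    (("NP", "VP"), "S"),
]
LEXICON = {"the": "Det", "man": "N", "burger": "N", "ate": "V"}

def shift_reduce(words):
    stack = []
    for w in words:
        stack.append(LEXICON[w])   # shift: tag and push the next word
        reduced = True
        while reduced:             # reduce as long as any rule applies
            reduced = False
            for rhs, lhs in RULES:
                n = len(rhs)
                if tuple(stack[-n:]) == rhs:
                    stack[-n:] = [lhs]  # replace RHS with its LHS
                    reduced = True
    return stack

print(shift_reduce("the man ate the burger".split()))  # ['S']
```

A successful parse leaves exactly the start symbol S on the stack, meaning the whole input was assembled into one sentence constituent from the bottom up.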

Mathematically, a grammar G can be written as a 4-tuple (N, T, S, P) where,


N or VN = set of non-terminal symbols, or variables.
T or ∑ = set of terminal symbols.
S = Start symbol where S ∈ N
P = Production rules for Terminals as well as Non-terminals.
Each production has the form α → β, where α and β are strings over VN ∪ ∑ and at least one
symbol of α belongs to VN.
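The 4-tuple definition can be written directly as data. The sketch below uses the toy grammar from the earlier example, and `check` verifies the stated conditions: S ∈ N, and every production's left-hand side contains at least one non-terminal.

```python
# A grammar G = (N, T, S, P) as plain Python data.

N = {"S", "NP", "VP", "Det", "N", "V"}          # non-terminals
T = {"the", "a", "man", "burger", "ate"}        # terminals
S = "S"                                         # start symbol
P = [                                           # productions (alpha, beta)
    (("S",),   ("NP", "VP")),
    (("NP",),  ("Det", "N")),
    (("VP",),  ("V", "NP")),
    (("Det",), ("the",)),
    (("Det",), ("a",)),
    (("N",),   ("man",)),
    (("N",),   ("burger",)),
    (("V",),   ("ate",)),
]

def check(N, T, S, P):
    assert S in N, "start symbol must be a non-terminal"
    for alpha, beta in P:
        assert any(sym in N for sym in alpha), \
            "every LHS must contain at least one non-terminal"
        assert all(sym in N or sym in T for sym in alpha + beta), \
            "all symbols must come from N union T"
    return True

print(check(N, T, S, P))  # True
```

Because every left-hand side here is a single non-terminal, this particular grammar is context-free (Type 2 in Chomsky's hierarchy).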

Toward efficient parsing

Efficient parsing in Natural Language Processing (NLP) is crucial for various language
understanding tasks. Here are some ways to achieve efficiency in parsing within NLP:

Optimized Algorithms: Employ parsing algorithms tailored for NLP tasks. Techniques like
transition-based parsing (e.g., Shift-Reduce parsers) or chart-based parsing (e.g., CYK) can
efficiently parse sentences.
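As an illustration of chart-based parsing, the sketch below implements the CYK recognizer for a toy grammar in Chomsky Normal Form (every rule is either A → B C or A → word). The chart records, for each span of the sentence, the non-terminals that can derive it; the input is grammatical iff the start symbol covers the whole sentence.

```python
# CYK recognition over a toy CNF grammar.

BINARY = {("NP", "VP"): {"S"}, ("Det", "N"): {"NP"}, ("V", "NP"): {"VP"}}
UNARY = {"the": {"Det"}, "man": {"N"}, "burger": {"N"}, "ate": {"V"}}

def cyk(words, start="S"):
    n = len(words)
    # chart[i][j] = set of non-terminals deriving words[i..j] inclusive
    chart = [[set() for _ in range(n)] for _ in range(n)]
    for i, w in enumerate(words):
        chart[i][i] = set(UNARY.get(w, ()))
    for length in range(2, n + 1):          # span length
        for i in range(n - length + 1):
            j = i + length - 1
            for k in range(i, j):           # split point
                for B in chart[i][k]:
                    for C in chart[k + 1][j]:
                        chart[i][j] |= BINARY.get((B, C), set())
    return start in chart[0][n - 1]

print(cyk("the man ate the burger".split()))   # True
print(cyk("ate the the man burger".split()))   # False
```

CYK runs in O(n³·|G|) time via dynamic programming, which is what makes it a standard example of efficient chart-based parsing.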

Dependency Parsing: Utilize dependency parsers that focus on relationships between words
rather than phrase structure. Dependency parsing often leads to faster and simpler parsing.
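A dependency analysis can be represented very simply as head pointers, which is part of why dependency parsing is fast. No parser is run in the sketch below; the arcs are a hand-written analysis of "The cat sat on the mat", and the relation labels depend on the annotation scheme used.

```python
# A dependency parse as data: each token records its head (0 = root).
# (index, word, head index, relation) -- labels are scheme-dependent.
deps = [
    (1, "the", 2, "det"),
    (2, "cat", 3, "nsubj"),
    (3, "sat", 0, "root"),
    (4, "on",  3, "prep"),
    (5, "the", 6, "det"),
    (6, "mat", 4, "pobj"),
]

def head_word(deps, idx):
    # Return the word of the head of token `idx` ("ROOT" for the root).
    head = next(h for i, _, h, _ in deps if i == idx)
    if head == 0:
        return "ROOT"
    return next(w for i, w, _, _ in deps if i == head)

print(head_word(deps, 2))  # the head of "cat" is "sat"
```

Because the whole analysis is just one head pointer per word, dependency structures are more compact than full phrase-structure trees.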

Neural Network Models: Leverage neural network architectures for parsing, such as graph-
based parsers or transformer models (e.g., BERT, GPT) that excel in handling contextual
information and have shown efficiency in various parsing tasks.

Incremental Parsing Models: Use models that allow for incremental parsing, enabling real-
time analysis and faster processing of incoming language input.

Domain-Specific Parsers: Develop parsers specifically tailored for certain domains or types
of text. These parsers can focus on the specific linguistic patterns prevalent in those domains,
leading to faster and more accurate parsing.

Parallel Processing: Employ parallel computing techniques to process multiple sentences
concurrently, speeding up parsing, especially in large-scale NLP tasks.

Feature Engineering and Selection: Optimize feature sets used in parsing models to reduce
computational overhead. Feature selection and dimensionality reduction techniques can
streamline parsing without sacrificing accuracy.

Language-Specific Optimizations: Implement language-specific optimizations that leverage
the characteristics and structures inherent in certain languages. This includes
language-specific rules or techniques that can expedite parsing.

Incremental Model Updates: For applications where the parsing model needs to adapt to
new data continuously, incremental learning techniques can be employed to update the model
efficiently without retraining from scratch.

Hybrid Approaches: Combine different parsing techniques or models to take advantage of
the strengths of each, creating hybrid systems that are both efficient and accurate.

Efficient parsing in NLP is essential for tasks like information extraction, sentiment analysis,
machine translation, and question answering. Balancing accuracy and speed is often a
challenge, and the choice of parsing method depends on the specific requirements of the NLP
application.
