NLP Techknowledge
NLP Techknowledge
MODULE 5
CHAPTER
Pragmatic and
5 Discourse Processing
Syllabus
5.1 Discourse Reference Resolution, Reference Phenomena, Syntactic & Semantic constraint
on coherence; Anaphora Resolution using Hobbs and Cantering Algorithm.
5.2 Self-Leaming topics: Discourse segmentation, Conference resolution.
Chapter Ends.
Peamsene eseusees
Nature Language rocessing (MU - Sem 7- omp)
(Pragmatic&DiscourseProc.)...Pageno. (5-22
5.1 PRAGMATICANALYSIS
A step in the intormation extraction from text is pragmatic analysis. In particular, its
the section on analysing a group of text structures to determine their true meaning or
the intended meaning.
The study of pragmatics, a branch of linguistics and semiotics, focuses on how context
affects meaning.
In contrast to semantics, which examines meaning that is "coded" or conventional in a
particular language, pragmatics investigates how the transmission of meaning
depends not only on the speaker's and listener's structural and linguistic knowledge
grammar, lexicon, etc.), but also on the context of the utterance, any prior knowledge
about those involved, the speaker's implied intent, and other elements.
For example, the semantic analysis meaning of the sentence "The Soldier fought like
a lion" yields the meaning that the soldier fought using the paws or bit using teeth!
Which not the intended meaning. The intended meaning here is to highlight the
analysis.
5.2 DISCOURSEANALYsIS
events that take place at
Til now our attention has mostly been drawn to linguistic
often consists of collocated, connected
the word or sentence level. Of course, language
unrelated utterances. Such a collection of
groupings of sentences rather than single,
Sentences is referred to as a discourse.
refers to. Discourse analysis
discourse in linguistics
anguage in use is what the word
entals understanding social
ne practise of analysing texts or languages that
Dealing with morphemes, tenses,
n-grams,
eractions and text interpretation.
can be part of discourse analysis.
layouts, and other elements
guistic features, page
One definition of discourse is a series of sentences.
5.3 REFERENCERESOLUTION
in language. Think about the
occurrences
iscourse level is rich with
Example : The name John' and he' in passage (5.1) are referring expressions,
and John' the person is their referent.
2. Corefer
If the same entity has been referred by two referring expressions then it is called
as Corefer.
Example: The name John' and he' are corefer in passage (5.1) as both are
3. Antecedent
There is also a name for a referring expression that grants permission for the use
of another, similar to how mentioning the name John' permits John' the person
to be afterwards referred to by the pronoun he'.
Reference to an entity that has been already introduced into the discourseis
called anaphora,and the referring expression used is said to be
anaphoric.
Example: In passage (5.1), the pronouns 'he' and it are anaphoric. And, the name
John' is anaphora.
(MU-New Syllabus w.ef academic year 22-23) (M7-83) Tech-Neo Publications...ASACHIN SHAH Venu
!!!i l
l
D
il
Natural Language Processing (MU
Sem 7- Comp) (Pragmatic& Discourse Proc.)...Pageno. (5-5)
5.4REFERENCE PHENOMENA
wwww.w.
"Some' Acura Integras were being unloaded at the local dealership today. (6.3)
There may be a
specific/non-specificambiguitysince the er'
indefinite determiner
doesn't say whether the speaker can has
identify the object. Example (5.2) only
the specific reading since the speaker is specificallythinking of the Integra that
she saw.
SH
NaturalLanguage Processing
(MU Sem 7-Comp) Pragmatic&DiscourseProc.)...Pageno. (5-5)
can
Solutionto the problem: In some instances,a followingreferencingstatement
be used to distinguish between the readings; if the expression is definite, the reading
15
is particular(I hope they still have that option), and if it is indefinite, the reading
generic (I hope they have a car I like).
There are several exceptions to this rule, such as when definite expressions are
modal circumstances (I'll park it in my garage), this is
acceptable in particular
compatible with the nonspecific reading.
reference"when,
context (and is therefore
been mentioned in the discourse
) it has already
represented in the discourse model),
uniqueness.
ii) the description itself implies the object's
Example
white and needed to be washed. (5.6)
I saw an Acura Integra today. The Integra' was
in the US. (5.7)
The Indianapolis 500' is the most popular car race
500 was an Integra. (6.8)
The fastest car in the Indianapolis from the
referent may be determined
(6.6) showcases an example where the
Conversationcontext.
the hearer's set of
referent is identifiablefrom
.1) showcases an example where the
beliefs.
the referent is inherently
unique.
showcases an example where
model or the hearer's collection of
to note that
the discourse
e It is important definite noun in a sentence. In the
in order to refer to a
Odviews must be accessible
representation of the referent.
model
evokes a discourse
d instance, it also
3. Pronouns
reterence, as seen in example (5.9).
P
*Onominalization is another type of definite
needed to be washed. (5.9)
white and
Acura Integra today. It
was
*9aW an
restrictions
than plete definite
complet noun phrase
definite noun phrases,
Pronominal reference has more
activation or
salience for the referent
for the referent in the
the
necessitating
essitating a high level of
discourse model._
MJ-New Sylabus w.e.f academic year 22-23) (M7-83)
Tech-NeoPublicatione.ASACHIN SHAH Venture
Natural Language Processing (MU- Sem 7-Comp) (Pragmatic& DiscourseProc.)..Pageno. (5-7
While definite noun phrases can frequently refer further back, pronounsns tyni
typically
(but not always) refer to entities that were introduced no more than one or t
sentences before to the current discourse. This is illustrated by the difieron
ence
between sentences (5.10d) and (5.10d').
(5.10)
(a) John went to Bob's party, and parked next to a beautiful Acura Integra.
Before he bought it, John checked over the Integra very carefully. (5.11)
5. Names
of persons,
Names prevalent type of referring expression and include names
are a
(5.15)
Woodhouse' certainly had not done him justice.
(a) Miss
from Amazon;
(6) International Business Machines' sought patent compensation
BM had previouslysued other companies
expression types,
Ater describing several referring
referent kinds Inferables
we will now focus on a few intriguing
more
reference resolution problem
aE make the
challenging
1. Inferrables Discontinous sets
refers
expression
In certain instances, a referring
rather than
one
to an entity that has been implied Generics
in the text.
that has been xpressly
summoned
Inferrables.
Such referentsare called as which
Fig 54.2:Typesof Referents
Reference Resolution
Complicate the
specified by inferrables. Take into account the recipe's possible follow-ups (a-c) to
sentence (5.17).
(5.17)
Mix the flour, butter, and water.
between) can all be used to refer to the outcome of the acts in the first line, but
2. Discontinuous Sets
is
Generally,'they' and them' are examples of a plural referring expressionthat
sometimes used to refer to groups of items that are invoked collectively. Other
and
examples include another plural referring expression (their Acuras)
conjoined noun phrases (John and Mary), as shown in the following example,
John and Mary love their Acuras. They drive them all the time. (6.18)
However, plural references may also apply to groups of things that the texts
discontinuous phrases have referred to, as in the following example,
John has an Acura, and Mary has a Mazda. They drive them all the time. (519)
.Here, they' refer to John and Mary, and the Acura and the Mazda are referred to
as them' in this sentence. Also, take note that the second line in this instance
will often be read pairwise or correspondingly, meaning that John drives tne
Acura and Mary drives the Mazda rather than both of them driving botn
automobiles.
(MU-New Syllabus w.e.f academic year 22-23) (M7-83) Tech-Neo Publications...A SACHIN SHAH Venture
Language Proce
cessing (MU
Natural
Sem 7- Comp) (Pragmatic &DiscourseProc.)...Page no.(5-10)
3. Generics
Plural Unspecified
Singular
Them You
She, Her, He, Him, His,
It | We, Us, They,
(5.24)
John has three Acuras. Tt is red.
new
are correct whereas, (5.23) and (5.24) are a
(5.22)
Here, the examples (5.21) and be associated with plural and the terr
term
as the
they' should
term
rect statements statements violate the constraints.
These to
0uld be associated with singular.
Three different person tenses are recognised in English: first, second, and third.
Also, due to case agreement, various pronoun forms may be necessary when used
in the subject position (nominative case, e.g, he, she, or they), object position
(accusative case, e g.. him, her, or they), and genitive position (genitive case, e.g.,
his Acura, her Acura, their Acura).
Table No. 5.5.2 depicts a division of pronoun categories according to person and
according to case agreement.
Table 5.5.2: Division of Pronoun Categories according to Person and case agreement
Example
You and I have Acuras. We' love them. (5.25)
John and Mary have Acuras. They' love them. (5.26)
John and Mary have Acuras. 'We' love them. (Where, We = John and Mary) 6.27)
You and I have Acuras. "They' love them. (Where, They = You and D (5.28)
Here, the examples (5.25) and (5.26) follow the person and case agreement constraints
whereas, (5.27) and (5.28) do not follow these constraints.
3. Gender Agreement
The gender implied by the referring statement must likewise be accepted by the
referents. English third person pronouns differentiate between the genders of male,
female, and nonpersonal, although unlike many other languages, the first two only apply
to living objects. Table No. 5.5.3 illustrate the pronouns under these gender categories.
Table 5.5.3: Pronouns under Gender Categories
(MU-New Syllabusw.e.f academic year 22-23) (M7-83) Tech-Neo Publications...A SACHIN SHAH Venture
Natural
Language Processing(MU Sen 7-Comp) (Pragmatic&DiscourseProc.)..Page
-
no. (5-12)
Example
John has an Acura. "He' is attractive. (Where, he = John, not the Acura) (5.29)
John has an Acura. Tt is attractive. (Where, it = the Acura, not John) 5.30)
4. Syntactic Constraints
Example
to the constraints indicated
The pronouns in all of the followingsentences are subject
in brackets.
John bought 'himself a new Acura. (himself =John) (5.31)
same item.
generally, would be able to corefer
or
example, a nonreflexive pronoun like "him,
with the subject of the preceding sentence as it does in example (5.36), but it is not
coreterential he' in
POSsible in example (5.37) due to the presence ot the pronoun the
second clause.
Bill bought 'him' a new Acura. (him =
John) (6.36)
John wanted a new car.
new Acura. (he =
John, him T.
He' bought 'him'
a *
6.37)
The criteria discussed above simplify the problem too much in a number of ware
they do not apply in many situations. In truth, the facts become extremely comnl.
plex
And
upon closer examination. In fact, it seems implausible that mere syntactic relations
ns
evidence.
could adequately explain all of the
appear in the same syntactic arrangements, tho
For example, even though they
reflexive himself and the nonreflexive "him in sentences18.43) and (18.44)
respectively, can both relate to the subject John.
John set the pamphlets about Acuras next to himself. (himself = John) (5.38)
5. Selectional Restrictions
Referents may be dropped due to selectional restrictions that a verb imposes on
its arguments, as in the following example (5.40).
John parked his Acura in the garage. He had driven it around for hours. (5.40)
Here, the term it might be compared toeither the Acura or the garage. The
direct object of the verb 'drive', however, must be a vehicle that can be driven,
such as a car, truck, or bus, and cannot be a garage. As a result, 'the Acura' is the
It is
only potential referent because the pronoun appears as the object of drive.
feasible that a real-world NLP system might have a set of selectional constraints
that were very extensive for the verbs in its lexicon.
Venture
Jack and Jill went up the hill, To fetch a pail of water. Jack fell down and broke "his'
crown, And Jill came tumbling after. (5.44)
Here, Jack, Jill, hill, water, and crown are the potential resolution candidates for
pronoun 'his'.
word his' was followed by the noun crown'. The Hobbs algorithm's initial
states that the referent search must always be confined to the left of the
presumption
target, eliminating the crown.
hill potentialreferents?
If so, are Jill, water, or
this property.
embed intelligence (using the Hobbs algorithm) in
explore how we can
Now, let's the task of pronoun resolution now that
common sense
to accomplish
0Dots that lack resolve pronouns.
humans digest text and
we have
a better grasp of how
It is when resolving
portant to note that, the previo
previous sections. The pronoun to be
s in the
mentioned
syntactic straints as
sentences to and
up to and including the current
of the
solved and the
the syntactic parse
Hobbs algorithm.
the
e Serveas the input to Tech-Neo Publications..ASACHIN SHAH Venture
New Syllabusv 22-23) (M7-83)
w.e.f academic year
Natural Language Processing(MU -Sem 7-Comp) (Pragmatic&DiscourseProc.)..Pageno. (5-15
Fig 5.6.1 details the Hobbs algorithm.
Construota parse tree and ldentfy the
pronoun for oo-reference reeolution
www.w w.w
If X
is highest 'S ifXle an NP If Xis an
nodein sentence
Satisfied
No
with
Answer?
Yes
Answer
Fig 5.6.1 :Hobbs Algorithm
Example
Let's look at twosentences:
(5.45)
Sentence 1(S1):Jackis an engineer.
Sentence 2 (S2): Jill likes him.
(MU-New Syllabus w.e.f academicyear
22-23) (M7-83) Tech-NeoPublications...ASACHIN SHAH
Venu
Language
Natural Processing(MU Sem 7-Comp) -
S1
NP P
NP
Jack
Jill
NP
NP
S
likes PRP
Det
an
engineer him
2. As there is no path to the left of p', we again go back from NP' to VP to s2,
Cr(U1)member mentionedi
nterchangeable
highest-ranking
the
because he' is referent of it,
bU) by definition For each potential
referent for he').
(Since he' is the only possible
would be as follows if we
2 we
The assignments
the ensuing transitions are compared.
CrU3):1John, Ford)
Cp (U3)John
C U3): John
Result: Continue (Cp (U3)=©%(U3)=C%(U2))
.If we assume he refers to Bob, then Bob is Cb{U3) and the assignments would be:
C (Us):Bob, Ford)
Cp (Us): Bob
Cb Ug): Bob
Result: Smooth-Shift (C(U3)=©% (Ua); C%(U3)6=Ch(U2))
Since a Continue is preferred to a Smooth-Shift per Rule 2, John is correctly taken t
be the referent.
the
the preferences for repeated mentions, recency, and grammatical roles. Since
resulting transition type determines the final reference assignments, the
grammatical role hierarchy only indirectly influences salience.
In particular,ifthe former results in a more
highly ranked transition, a referent
low-ranking grammatical role will be preferred to one in a more highly ra
nked
position.
(MU-New Syllabus w.e.f academic year 22-23) (M7-83)
Tech-Neo Publications..ASACHINSHAH
Venture
Natural Languag Processing(MU Sem 7-Comp) (Pragmatic &DiscourseProc.)..Pageno.(5-20)
Consequently, the centering algorith1m can mistakenly resolve a pronoun to a
The third sentence's subject pronoun, "he," will be assigned to Bob by the centering
method because Bob is Ch(U2), whereas John will be assigned if John is C(U2),
resulting in a Smooth-Shift relation.
On the other hand, John will be accurately designated as the referent by the Hobbs
The like the algorithm, needs
Hobbs both
algorithm. centering algorithm,
morphological gender detectors anda complete syntactic parse.
As a model of entity coherence, centering theory also has implications for other
discourse applications, such as summarization.
Chapter Ends..
DOO