
NATURAL LANGUAGE PROCESSING

(IT3EA06)
- Dr. Kush Bhushanwar
UNIT - III
 Speech Processing: Speech and Phonetics,
 Vocal Organ,
 Phonological Rules and Transducer,
 Probabilistic Models,
 Spelling Error,
 Bayesian Method to Spelling,
 Minimum Edit Distance,
 Bayesian Method of Pronunciation Variation
SPEECH PROCESSING
 Speech processing and language technology involve many
specialized concepts and terms.
 To understand how different speech synthesis and analysis
methods work, we need some knowledge of speech
production, articulatory phonetics, and related
terminology.
 The study of the pronunciation of words is part of the field
of phonetics, the study of the speech sounds used in the
languages of the world.
 Phonetics focuses on the production and classification of
the world’s speech sounds.
TYPES OF PHONETICS
 The production of speech involves the interaction of
different vocal organs, for example the lips, tongue,
and teeth, to produce particular sounds.
 Articulatory phonetics: The study of how phones are
produced, as the various organs in the mouth, throat, and
nose modify the airflow from the lungs.
 Acoustic phonetics: The study of the physical properties
of the sound waves created by the vocal organs (duration,
frequency, etc.).
 Auditory phonetics: The examination of how speech
sounds are perceived and identified by the hearer’s ear and
brain.
THE VOCAL ORGANS
 Most sounds in human languages are produced by
expelling air from the lungs through the windpipe
(technically the trachea) and then out the mouth or
nose.
 As it passes through the trachea, the air passes
through the larynx, commonly known as the Adam’s
apple or voicebox.
 The larynx contains two small folds of muscle, the
vocal folds (often referred to non-technically as the
vocal cords) which can be moved together or apart.
 The space between these two folds is called the
glottis.
 Sounds made with the vocal folds together and
vibrating are called voiced; sounds made without
this vocal cord vibration are called unvoiced or
voiceless.
 Voiced sounds include [b], [d], [g], [v], [z], and all the
English vowels, among others.
 Unvoiced sounds include [p], [t], [k], [f], [s], and
others.
PHONEME
 A phoneme is the smallest, indivisible, contrastive
(significant) unit of sound in a language: replacing it
with another sound results in a change of meaning.
 Therefore, a phoneme cannot be subdivided into a smaller
unit.
 "Can" can be broken into /k/ + /æ/ + /n/, but these phonemes
cannot be further broken into smaller units.
ALLOPHONE
 An allophone is one of the variant pronunciations of a
phoneme.
 More technically, allophones are the phonetic variants of a
phoneme, determined by phonetic circumstances such as the
type of word, the morpheme, or the position the sound
occupies.
 Ashby and Maidment (2005:189) define an allophone as a
positional (initial, middle, final) variant of a phoneme due
to the position it occupies in a word, which calls for a slight
difference in pronunciation.
EXAMPLE
 The phoneme /t/ and some of its variants
(allophones):
 /th/ aspirated, as in teach
 /t/ unaspirated, as in water
 /tn/ nasalized, as in tin
 /tw/ labialized, as in twice
 /t-/ unreleased, as in great
BASIC TERMINOLOGIES
 We can represent the pronunciation of words in terms of units
called phones.
 The standard system for representing phones is the
International Phonetic Alphabet or IPA.
 An alternative English-only transcription system that uses ASCII
letters is the ARPAbet. For example, the word cat is [kæt]
in the IPA and [K AE T] in the ARPAbet.
 A phoneme is a generalization or abstraction over different
phonetic realizations. Allophonic rules express how a phoneme
is realized in a given context.
PHONOLOGICAL RULE
 The relationship between a phoneme and its allophones is often captured
by writing a phonological rule.
 Here is the phonological rule for dentalization in the traditional notation
of Chomsky and Halle (1968), rendered schematically:

/t/ → [t̪] / __ [+dental]
(a /t/ is realized as dental [t̪] when followed by a dental consonant)

 In this notation, the surface allophone appears to the right of the arrow,
and the phonetic environment is indicated by the symbols surrounding the
underbar ( _ ).
 These rules resemble the rules of two-level morphology, but since they
don't use multiple types of rewrite arrows, such a rule is ambiguous between
an obligatory and an optional reading. Here is a version of the flapping rule:

/t/ → [dx] / V́ __ V
PHONOLOGICAL RULES AND TRANSDUCERS
 There are several different models of computational phonology that use
finite automata in various ways to realize phonological rules.
 As a first example, the figure below shows a transducer that models the
application of the simplified flapping rule in (3.1):

(3.1) /t/ → [dx] / V́ __ V
(a /t/ is realized as the flap [dx] between a stressed vowel and an
unstressed vowel)

[Figure: Transducer for English Flapping]

 In the figure, the ARPAbet symbol 'dx' indicates a flap, the 'other'
symbol means 'any feasible pair not used elsewhere in the transducer',
and '@' means 'any symbol not used elsewhere on any arc'.
PROBABILISTIC MODELS
 Probabilistic models are an essential component of machine
learning, which aims to learn patterns from data and make
predictions on new, unseen data.
 They are statistical models that capture the inherent
uncertainty in data and incorporate it into their
predictions.
 Probabilistic models are used in various applications such
as image and speech recognition, natural language
processing, and recommendation systems.
 Examples: Bayesian networks, Gaussian mixture models,
hidden Markov models, and probabilistic graphical models.
TYPES OF PROBABILISTIC MODELS
o Generative models
o Discriminative models
o Graphical models
GENERATIVE MODELS:
 Generative models aim to model the joint distribution of
the input and output variables.
 These models generate new data based on the probability
distribution of the original dataset.
 Generative models are powerful because they can generate
new data that resembles the training data.
 They can be used for tasks such as image and speech
synthesis, language translation, and text generation.
DISCRIMINATIVE MODELS:
 The discriminative model aims to model the conditional
distribution of the output variable given the input variable.
 They learn a decision boundary that separates the different
classes of the output variable.
 Discriminative models are useful when the focus is on
making accurate predictions rather than generating new
data.
 They can be used for tasks such as image recognition,
speech recognition, and sentiment analysis.
GRAPHICAL MODELS:
 These models use graphical representations to show the
conditional dependence between variables.
 They are commonly used for tasks such as image
recognition, natural language processing, and causal
inference.
NAIVE BAYES ALGORITHM IN PROBABILISTIC
MODELS
 Naive Bayes is a probabilistic algorithm that is used for
classification problems.
 It is based on the Bayes theorem of probability and
assumes that the features are conditionally independent of
each other given the class.
 It is also known as a probabilistic classifier. The Naive
Bayes Algorithm comes under supervised learning and is
mainly used to solve classification problems.
 For example, you may not be able to identify a bird from a
single attribute such as its color, as there are many birds
with similar attributes; Naive Bayes combines the evidence
from all of the features instead.
NAIVE BAYES ALGORITHM
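
In standard form, the Naive Bayes decision rule chooses the class y that
maximizes the prior times the product of the per-feature likelihoods:

y* = argmax_y [ P(y) · Π_i P(x_i | y) ]

Below is a minimal Python sketch of this decision rule with add-one (Laplace)
smoothing; the tiny training set and the spam/ham labels are invented purely
for illustration:

# Minimal sketch of the Naive Bayes decision rule
#   y_hat = argmax_y P(y) * prod_i P(x_i | y)
# The training counts below are invented purely for illustration.
from collections import Counter
import math

train = [
    ("spam", "win money now"),
    ("spam", "win a prize now"),
    ("ham",  "meeting at noon"),
    ("ham",  "lunch at noon tomorrow"),
]

class_counts = Counter(label for label, _ in train)
word_counts = {label: Counter() for label in class_counts}
vocab = set()
for label, text in train:
    for word in text.split():
        word_counts[label][word] += 1
        vocab.add(word)

def log_posterior(label, words):
    # log P(y) + sum_i log P(x_i | y), with add-one (Laplace) smoothing
    total = sum(word_counts[label].values())
    score = math.log(class_counts[label] / len(train))
    for w in words:
        score += math.log((word_counts[label][w] + 1) / (total + len(vocab)))
    return score

def classify(text):
    words = text.split()
    return max(class_counts, key=lambda y: log_posterior(y, words))

print(classify("win a prize"))    # spam
print(classify("lunch meeting"))  # ham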
BAYESIAN PROBABILITY
 Bayesian probability allows us to calculate conditional
probabilities.
 It enables the use of partial knowledge for calculating the
probability of the occurrence of a specific event.
 This approach is used for developing models for
prediction and classification problems, such as Naive Bayes.
 The Bayesian rule is used in probability theory for
computing conditional probabilities.
 What is important is that you can determine not just whether
the evidence will impact the probability of an event occurring,
but the exact probability itself.
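
As a small worked example of the rule P(A|B) = P(B|A) · P(A) / P(B), the
Python snippet below computes a posterior from a prior and likelihoods; all
of the numbers are invented for illustration:

# Worked example of Bayes' rule: P(A|B) = P(B|A) * P(A) / P(B).
# Numbers are invented for illustration: a test that is 90% sensitive
# and 95% specific, for a condition with 1% prevalence.
p_a = 0.01                      # prior P(A)
p_b_given_a = 0.90              # likelihood P(B|A)
p_b_given_not_a = 0.05          # false-positive rate P(B|~A)

p_b = p_b_given_a * p_a + p_b_given_not_a * (1 - p_a)   # total probability
p_a_given_b = p_b_given_a * p_a / p_b                   # posterior

print(round(p_a_given_b, 3))    # ~0.154: evidence raises the 1% prior to ~15%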
SPELLING ERROR
 A spellchecker points to spelling errors and
possibly suggests alternatives.
 An auto-corrector usually goes a step further and
automatically picks the most likely word.
 If the correct word has already been typed, it is
retained.
 So, in practice, an autocorrector is a bit more
aggressive than a spellchecker, but this is more
of an implementation detail; most tools allow you to
configure the behavior.
TYPES OF SPELLING ERRORS
 Non-word Errors: These are the most common type
of errors. You either miss a few keystrokes or let your
fingers hurtle a bit longer.
 E.g., typing langage when you meant language,
or hurryu when you meant hurry.
 Real Word Errors: If you have fat fingers,
sometimes instead of creating a non-word, you end up
creating a real word, but one you didn't intend.
 E.g., typing buckled when you meant bucked. Or your
fingers are a tad wonky, and you type in three when you
meant there.
TYPES OF SPELLING ERRORS CONT.
 Cognitive Errors: The previous two types of
errors result not from ignorance of a word or its
correct spelling; cognitive errors, by contrast, do
occur due to those factors.
 The words piece and peace are homophones
(they sound the same), so you may not be sure which
one is which.
 Sometimes you're quite sure about your spellings,
even though a few grammar nazis claim you're not.
TYPES OF SPELLING ERRORS CONT.
 Short forms/Slang/Lingo: These are possibly
not even spelling errors. Maybe u r just being
kewl (cool). Or you are trying hard to fit
everything within a text message or a tweet and
must commit a spelling sin.
 Intentional Typos: Well, because you are
clever. You type in teh and pwned and zomg
carefully and frown if they get autocorrected.
BAYESIAN METHOD TO SPELLING
 In this approach, Bayes' theorem is used to compute the
probability that the intended word is w when the typist has
typed x:

P(w|x) = P(x|w) · P(w) / P(x)

 x: the input word the typist has typed (the observed data)
 w: the word they meant to type (the parameter to estimate)
 This is called the posterior probability of w being the intended
word.
 Then, the word in the dictionary with the highest posterior
probability is chosen as the intended word:

ŵ = argmax_w P(w|x)
BAYESIAN METHOD TO SPELLING CONT.
 However, it would be computationally inefficient and too expensive
to check every w over the whole dictionary.
 So, we generate a set of candidates for any input word x,
which we call C, and do the maximization over the set C:

ŵ = argmax_{w ∈ C} P(x|w) · P(w) / P(x)

 And since P(x) is a normalizing constant (independent of w), we
can drop it from the maximized function without any harm.
 We will also rank candidates according to the log-posterior instead
of the posterior probability:

ŵ = argmax_{w ∈ C} [ log P(x|w) + log P(w) ]
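
The whole pipeline fits in a few lines of Python. The sketch below, in the
style of Peter Norvig's well-known toy speller, uses a tiny invented corpus
for P(w), a candidate set C of known words within one edit of x, and, as a
simplifying assumption, a uniform error model P(x|w), so the ranking reduces
to log P(w):

# Minimal sketch of the Bayesian (noisy-channel) speller described above.
# The tiny corpus, the uniform error model P(x|w), and the edits1()
# candidate generator are all simplifying assumptions for illustration.
from collections import Counter
import math

CORPUS = "the language of the speech and the language processing".split()
COUNTS = Counter(CORPUS)
TOTAL = sum(COUNTS.values())
LETTERS = "abcdefghijklmnopqrstuvwxyz"

def edits1(x):
    """All strings one insert/delete/replace/transpose away from x."""
    splits = [(x[:i], x[i:]) for i in range(len(x) + 1)]
    deletes = [a + b[1:] for a, b in splits if b]
    transposes = [a + b[1] + b[0] + b[2:] for a, b in splits if len(b) > 1]
    replaces = [a + c + b[1:] for a, b in splits if b for c in LETTERS]
    inserts = [a + c + b for a, b in splits for c in LETTERS]
    return set(deletes + transposes + replaces + inserts)

def correct(x):
    # Candidate set C: known words within one edit of x (plus x itself).
    candidates = [w for w in edits1(x) | {x} if w in COUNTS]
    if not candidates:
        return x
    # With a uniform error model, argmax of log P(x|w) + log P(w)
    # reduces to picking the candidate with the highest log P(w).
    return max(candidates, key=lambda w: math.log(COUNTS[w] / TOTAL))

print(correct("langage"))   # -> language
print(correct("teh"))       # -> the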
DAMERAU-LEVENSHTEIN EDIT DISTANCE
MINIMUM EDIT DISTANCE
 The minimum edit distance between two strings,
where the allowed edits are:
 Insertion
 Deletion
 Replace
 Transposition of two adjacent letters
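
A minimal dynamic-programming sketch of this (Damerau-Levenshtein) distance
in Python is shown below; the unit cost for every edit type is an
illustrative assumption, since weighted costs are also common:

# Minimal sketch of Damerau-Levenshtein distance by dynamic programming,
# counting insertions, deletions, replacements, and transpositions of
# two adjacent letters, each with cost 1.
def damerau_levenshtein(s, t):
    m, n = len(s), len(t)
    # d[i][j] = distance between s[:i] and t[:j]
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        d[i][0] = i          # delete all of s[:i]
    for j in range(n + 1):
        d[0][j] = j          # insert all of t[:j]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if s[i - 1] == t[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,        # deletion
                          d[i][j - 1] + 1,        # insertion
                          d[i - 1][j - 1] + cost) # replacement / match
            if (i > 1 and j > 1 and s[i - 1] == t[j - 2]
                    and s[i - 2] == t[j - 1]):
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)  # transposition
    return d[m][n]

print(damerau_levenshtein("teh", "the"))          # 1 (one transposition)
print(damerau_levenshtein("langage", "language")) # 1 (one insertion)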
