Natural Language Processing: by Dr. Parminder Kaur

Uploaded by

Riya jain

Natural Language Processing

By
Dr. Parminder Kaur
What is NLP?
• Natural Language Processing (NLP)
– Computers use (analyze, understand,
generate) natural language
– A somewhat applied field
• Computational Linguistics (CL)
– Computational aspects of the human
language faculty
– More theoretical
Goals of NLP
• Scientific Goal
– Identify the computational machinery
needed for an agent to exhibit various
forms of linguistic behavior
• Engineering Goal
– Design, implement, and test systems
that process natural languages for
practical applications
Applications
• speech processing: get flight information or book
a hotel over the phone
• information extraction: discover names of people
and events they participate in, from a document
• machine translation: translate a document from
one human language into another
• question answering: find answers to natural
language questions in a text collection or
database
• summarization: generate a short biography of
Noam Chomsky from one or more news articles
General Themes
• Ambiguity of Language
• Language as a formal system
• Computation with human language
• Rule-based vs. Statistical Methods
• The need for efficiency
Topic Ideas
1. Text to Speech – artificial voices
2. Speech Recognition – understanding
3. Textual Analysis – readability
4. Plagiarism Detection – candidate selection
5. Intelligent Agents – machine interaction
Text to Speech – artificial voice
• Text Input
• Break text into phonemes
– Match phonemes to voice elements
– Concatenate voice elements
– Manipulate pitch and spacing
• Output results
• Research question: How can a human voice be
used to produce an artificial voice?
• Model Talker - opportunities for active, hands-on
research
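The pipeline above can be sketched in a few lines. This is only an illustrative toy, not how Model Talker works: the two-word lexicon, the ARPAbet-style phoneme symbols, the `unit_*.wav` file names and the 80 ms duration are all hypothetical, and "manipulate spacing" is reduced to scaling durations by a speaking rate.

```python
# Hypothetical mini pronunciation lexicon (ARPAbet-style symbols).
# A real system would use a full dictionary plus letter-to-sound rules.
LEXICON = {
    "hello": ["HH", "AH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}

# Hypothetical voice elements: each phoneme maps to a recorded unit,
# represented here only by a file name and a duration in milliseconds.
VOICE_UNITS = {p: (f"unit_{p}.wav", 80) for phones in LEXICON.values() for p in phones}


def text_to_phonemes(text):
    """Break text into phonemes by looking each word up in the lexicon."""
    phonemes = []
    for word in text.lower().split():
        word = word.strip(".,!?")
        if word not in LEXICON:
            raise KeyError(f"no pronunciation for {word!r}")
        phonemes.extend(LEXICON[word])
    return phonemes


def synthesize(text, rate=1.0):
    """Match phonemes to voice elements and concatenate them.

    Returns a list of (unit_file, duration_ms); durations are divided by
    `rate`, so rate=2.0 halves the spacing.
    """
    return [
        (VOICE_UNITS[p][0], round(VOICE_UNITS[p][1] / rate))
        for p in text_to_phonemes(text)
    ]
```

Calling `synthesize("Hello, world!")` yields eight units in order, one per phoneme; an audio back end would then play or join the corresponding recordings.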
Speech Recognition
• Spoken Input
• Identify words and phonemes in speech
– Generate text for recognized word parts
– Concatenate text elements
– Perform spelling, grammar and context checking
• Output results
• Research question: How can speech recognition
assist a deaf student taking notes in class?
• VUST – Villanova University Speech Transcriber
Textual Analysis - Readability
• Text Input
• Analyze text & estimate “readability”
– Grade level of writing
– Consistency of writing
– Appropriateness for a given education level
• Output results
• Research question: How can a computer
analyze text and measure readability?
• Opportunities for hands-on research
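One common way to estimate the grade level of a text is the Flesch-Kincaid grade formula, 0.39 × (words per sentence) + 11.8 × (syllables per word) − 15.59. The slides do not name a specific measure, so the choice of formula here is an assumption, and the syllable counter is a rough vowel-group heuristic rather than a dictionary lookup.

```python
import re


def count_syllables(word):
    """Rough heuristic: count runs of vowels, with a minimum of one."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def flesch_kincaid_grade(text):
    """Estimate the U.S. school grade level needed to read `text`."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    return (0.39 * len(words) / len(sentences)
            + 11.8 * syllables / len(words)
            - 15.59)
```

Short sentences built from one-syllable words score low (even below zero), while long sentences with polysyllabic vocabulary score high, which matches the "grade level of writing" idea above.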
Plagiarism Detection
• Text Input
• Analyze text & locate “candidates”
– Find one or more passages that might be plagiarized
– Algorithm tries to do what a teacher does
– Search on Internet for candidate matches
• Output results
• Research question: What algorithms work like
humans when finding plagiarism?
• Experimental CS research
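Candidate selection can be approximated by measuring how many word sequences a passage shares with a known source. The sketch below uses word trigrams and a containment score; the n-gram size, the 0.5 threshold and the function names are illustrative assumptions, not a method taken from the slides.

```python
import re


def shingles(text, n=3):
    """Return the set of word n-grams in `text` (case and punctuation ignored)."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def containment(passage, source, n=3):
    """Fraction of the passage's n-grams that also occur in the source."""
    passage_grams = shingles(passage, n)
    if not passage_grams:
        return 0.0
    return len(passage_grams & shingles(source, n)) / len(passage_grams)


def find_candidates(passages, source, n=3, threshold=0.5):
    """Return (index, score) for each passage whose score meets the threshold."""
    scored = ((i, containment(p, source, n)) for i, p in enumerate(passages))
    return [(i, round(score, 2)) for i, score in scored if score >= threshold]
```

A teacher-like workflow would run `find_candidates` over the paragraphs of an essay against each retrieved web page and then review only the flagged passages by hand.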
Intelligent Agents
• Example: ELIZA
• AIML: Artificial Intelligence Markup Language
• Human types something
• Computer parses, “understands”, and generates
response
• Response is viewed by human
• Research question: How can computers
“understand” and “generate” human writing?
• Also good area for experimentation
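An ELIZA-style agent "understands" input only by matching patterns and filling in response templates. The rules below are made up for illustration and are far simpler than the original ELIZA script (for example, there is no pronoun swapping).

```python
import re

# Hypothetical rule set: (pattern, response template) pairs, tried in order.
RULES = [
    (re.compile(r"\bI need (.+)", re.I), "Why do you need {0}?"),
    (re.compile(r"\bI am (.+)", re.I), "How long have you been {0}?"),
    (re.compile(r"\b(?:mother|father)\b", re.I), "Tell me more about your family."),
]
DEFAULT = "Please go on."


def respond(utterance):
    """Parse the input with the first matching rule and generate a response."""
    text = utterance.strip().rstrip(".!?")
    for pattern, template in RULES:
        match = pattern.search(text)
        if match:
            return template.format(*match.groups())
    return DEFAULT
```

For example, `respond("I need a holiday.")` echoes the captured phrase back as a question, and any input that matches no rule falls through to the default prompt.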
Digital Image Processing
• The images in previous slides are digital
(now), but they are NOT the result of DIP
• Digital Image Processing is
– Processing digital images by a digital
computer
• DIP requires a digital computer and other
supporting technologies (e.g., data
storage, display and transmission)
Photography
Motion Pictures
Law Enforcement and Biometrics
Remote Sensing
Hurricane Andrew, taken by NOAA GOES
America at night (Nov. 27, 2000)
Thermal Images
Operate in the infrared frequency range
Human body disperses heat (red pixels)
Different colors indicate varying temperatures
Medical Diagnostics
Operate in the X-ray frequency range
Examples: chest, head
PET and Astronomy
Operate in the gamma-ray frequency range
Cygnus Loop in the constellation of Cygnus
Positron Emission Tomography
Cartoon Pictures (Non-photorealistic)
Synthetic Images in Gaming
Age of Empires III by Ensemble Studios
Virtual Reality (Photorealistic)
Speech Recognition in AI
• AI-based speech recognition has made it
possible for computers to understand and
recognize human speech, enabling
frictionless interaction between humans
and machines.
How does Speech
Recognition in AI Work?
• Recording: The device's built-in microphone captures the user's voice, which is
stored as an audio signal.
• Sampling: A sound wave is continuous, but computers work with discrete data. The
signal is therefore converted to discrete values by measuring it at a fixed rate, the
sampling rate (sampling frequency).
• Transforming to Frequency Domain: The audio signal is converted from its time-domain
representation (amplitude as a function of time) to its frequency-domain
representation (energy as a function of frequency). This stage matters because much of
the information in speech is easier to examine in the frequency domain.
• Information Extraction from Audio: This stage is the foundation of every speech
recognition system. The audio is transformed into feature vectors using extraction
methods such as PLP (perceptual linear prediction) and MFCC (mel-frequency cepstral
coefficients).
• Recognition of Extracted Information: Recognition is performed by pattern matching:
the extracted features are compared with predefined reference data. One of the most
popular pieces of software for this is the Google Speech API.
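The sampling, frequency-domain and pattern-matching stages can be illustrated on a synthetic tone. This is a minimal sketch under loud assumptions: a pure sine wave stands in for recorded speech, a naive discrete Fourier transform stands in for an optimized FFT, and the raw magnitude spectrum stands in for PLP or MFCC features. The function names and the template labels in the usage note are hypothetical.

```python
import cmath
import math


def sample_tone(freq_hz, sample_rate, n_samples):
    """Sample a continuous sine wave at the discrete instants t = k / sample_rate."""
    return [math.sin(2 * math.pi * freq_hz * k / sample_rate)
            for k in range(n_samples)]


def dft_magnitudes(samples):
    """Naive O(N^2) discrete Fourier transform; magnitudes of bins 0..N/2."""
    n = len(samples)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * b * k / n)
                for k, x in enumerate(samples)))
        for b in range(n // 2 + 1)
    ]


def dominant_frequency(samples, sample_rate):
    """Frequency (Hz) of the strongest bin in the magnitude spectrum."""
    mags = dft_magnitudes(samples)
    peak_bin = max(range(len(mags)), key=mags.__getitem__)
    return peak_bin * sample_rate / len(samples)


def recognize(features, templates):
    """Pattern matching: label of the stored template nearest to `features`."""
    return min(templates, key=lambda label: math.dist(features, templates[label]))
```

With 400 samples at 8000 Hz each frequency bin is 20 Hz wide, so a 440 Hz tone lands exactly on one bin. Storing the spectra of a 440 Hz and an 880 Hz tone as templates named, say, `"A4"` and `"A5"` lets `recognize` label a new tone by its nearest template.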
Techniques used in AI for
speech recognition:
• Hidden Markov Models (HMMs): HMMs are statistical models that
are widely used in speech recognition. They model the probability
distribution of speech sounds, and these models are then used to
match input speech to the most likely sequence of sounds.
• Deep Neural Networks (DNNs): DNNs are a type of machine
learning model used extensively in speech recognition. They use
a hierarchy of layers to model complex relationships between the
input speech and the corresponding text output.
• Convolutional Neural Networks (CNNs): CNNs are a type of
machine learning model commonly used in image recognition that
has also been applied to speech recognition. They apply filters
to input speech signals to identify relevant features.
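Matching input to "the most likely sequence of sounds" in an HMM is typically done with the Viterbi algorithm. The sketch below is a toy: the two phoneme states, the "low"/"high" acoustic observations and every probability are invented for illustration.

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return (probability, path) of the most likely hidden state sequence."""
    # best[s] = (probability of the best path ending in state s, that path)
    best = {s: (start_p[s] * emit_p[s][observations[0]], [s]) for s in states}
    for obs in observations[1:]:
        best = {
            s: max(
                ((prob * trans_p[prev][s] * emit_p[s][obs], path + [s])
                 for prev, (prob, path) in best.items()),
                key=lambda item: item[0],
            )
            for s in states
        }
    return max(best.values(), key=lambda item: item[0])


# Hypothetical toy model: a vowel-like state "AH" and a fricative-like state "S".
STATES = ("AH", "S")
START_P = {"AH": 0.6, "S": 0.4}
TRANS_P = {"AH": {"AH": 0.7, "S": 0.3}, "S": {"AH": 0.4, "S": 0.6}}
EMIT_P = {"AH": {"low": 0.8, "high": 0.2}, "S": {"low": 0.1, "high": 0.9}}

prob, path = viterbi(("low", "low", "high"), STATES, START_P, TRANS_P, EMIT_P)
```

For the three frames low, low, high the most likely path is AH, AH, S with probability 0.072576. Real recognizers work with log-probabilities to avoid underflow on long utterances.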
Some recent advancements in
speech recognition AI include:
• Transformer-based models: Transformer architectures, which underlie
NLP models such as BERT and GPT, have been highly successful in natural
language processing tasks and are now being applied to speech recognition.
• End-to-end models: End-to-end models are designed to directly map
speech signals to text, without the need for intermediate steps. These
models have shown promise in improving the accuracy and efficiency of
speech recognition AI.
• Multimodal models: Multimodal models combine speech recognition AI
with other modalities, such as vision or touch, to enable more natural and
intuitive interactions between humans and machines.
• Data augmentation: Data augmentation techniques, such as adding
background noise or changing the speaking rate, can be used to generate
more training data for speech recognition AI models, improving their
accuracy and robustness.
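Both augmentation techniques mentioned above can be sketched on a plain list of audio samples. This is a simplified illustration: the Gaussian noise model and the default noise level are assumptions, and the speed change uses plain linear-interpolation resampling, which also shifts pitch (production pipelines often use pitch-preserving time stretching instead).

```python
import random


def add_noise(samples, noise_level=0.05, seed=None):
    """Return a copy of `samples` with Gaussian background noise added."""
    rng = random.Random(seed)
    return [x + rng.gauss(0.0, noise_level) for x in samples]


def change_speed(samples, rate):
    """Resample by linear interpolation; rate > 1 speeds up (shorter output)."""
    if rate <= 0:
        raise ValueError("rate must be positive")
    if not samples:
        return []
    n_out = max(1, int(len(samples) / rate))
    out = []
    for i in range(n_out):
        pos = i * rate                        # fractional position in the input
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)    # clamp at the last sample
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out
```

Applying `add_noise` with different seeds and `change_speed` with rates such as 0.9 and 1.1 to each training utterance produces several variants per recording without collecting new data.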
