0% found this document useful (0 votes)

380 views8 pages

Natural Language Processing Handout

This document provides an overview of a course on natural language processing. It includes: 1) Course objectives focused on learning fundamental NLP concepts, techniques, and applications. 2) A modular content structure covering topics like n-gram language models, hidden Markov models, part-of-speech tagging, parsing, machine translation, semantic ontologies, question answering and dialogue systems. 3) Details of contact sessions covering these topics through lectures, readings from specified textbooks and references.

Uploaded by

sdfasd

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

380 views8 pages

Natural Language Processing Handout

Uploaded by

sdfasd

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

BIRLA INSTITUTE OF

TECHNOLOGY & SCIENCE, PILANI

WORK INTEGRATED LEARNING PROGRAMMES
COURSE HANDOUT

Part A: Content Design

Course Title Natural Language Processing

Course No(s)
Credit Units 3 units
Course Author Prof. Vijayalakshmi and Dr. Chetana Gavankar
Version No 3.0
Date 6th Jan 2020

Course Objectives
No Course Objective

CO1 To learn the fundamental concepts and techniques of natural language processing (NLP)

CO2 To learn computational properties of natural languages and the commonly used algorithms
for processing linguistic information

CO3 To apply NLP techniques in state of art applications

CO4 To learn implementation of NLP algorithms and techniques

Text Book(s)
T1 Speech and Language processing: An introduction to Natural Language Processing,
Computational Linguistics and speech Recognition by Daniel Jurafsky and James H.
Martin[3rd edition]

T2 Natural language understanding[2nd edition] by James Allen

Reference Book(s) & other resources
Handbook of Natural Language Processing, Second Edition—NitinIndurkhya, Fred J.
R1 Damerau, Fred J. Damerau
R2 Natural Language Processing with Python by Steven Bird, Ewan Klein, Edward Lopper

Modular Content Structure

1. Introduction to Natural Language Understanding

1.1 The Study of Language.
1.2 Applications of Natural Language Understanding.
1.3 Evaluating Language Understanding Systems.
1.4 The Different Levels of Language Analysis.
1.5 Representations and Understanding.
1.6 The Organization of Natural Language Understanding Systems.

2. N-gram Language Models

2.1 N-Grams
2.2 Evaluating Language Models
2.3 Generalization and Zeros
2.4 Smoothing
2.5 Kneser-Ney Smoothing
2.6 The Web and Stupid Backoff

3. Hidden Markov Models

3.1 Markov Chains
3.2 The Hidden Markov Model
3.3 Likelihood Computation: The Forward Algorithm
3.4 Decoding: The Viterbi Algorithm
3.5 HMM Training: The Forward-Backward Algorithm

4. Part-of-Speech Tagging
4.1 (Mostly) English Word Classes
4.2 The Penn Treebank Part-of-Speech Tag set
4.3 Part-of-Speech Tagging
4.4 HMM Part-of-Speech Tagging
4.5 Maximum Entropy Markov Models
4.6 Bidirectionality
4.7 Part-of-Speech Tagging for Morphological Rich Languages

5. Grammars and Parsing.

5.1 Grammars and Sentence Structure.
5.2 What Makes a Good Grammar
5.3 A Top-Down Parser.
5.4 A Bottom-Up Chart Parser.
5.5 Top-Down Chart Parsing.
5.6 Finite State Models and Morphological Processing.
5.7 Grammars and Logic Programming.
5.8 Parsing

6. Statistical Constituency Parsing

6.1 Probabilistic Context-Free Grammars
6.2 Probabilistic CKY Parsing of PCFGs
6.3 Ways to Learn PCFG Rule Probabilities
6.4 Problems with PCFGs
6.5 Improving PCFGs by Splitting Non-Terminals
6.6 Probabilistic Lexicalized CFGs
6.7 Probabilistic CCG Parsing
6.8 Evaluating Parsers

7. Word sense and word net

7.1 Word Senses
7.2 Relations between Senses
7.3 WordNet: A Database of Lexical Relations
7.4 Word Sense Disambiguation
7.5 Alternate WSD algorithms and Tasks
7.6 Using Thesauruses to Improve Embeddings
7.7 Word Sense Induction

8. Dependency Parsing
8.1 Dependency Relations
8.2 Dependency Formalisms
8.3 Dependency Treebanks
8.4 Transition-Based Dependency Parsing
8.5 Graph-Based Dependency Parsing
8.6 Evaluation

9. Statistical Machine translation

9.1 Introduction
9.2 Approaches
9.3 Language Models
9.4 Parallel Corpora
9.5 Word Alignment
9.6 Phrase Library
9.7 Translation Models.
9.8 Search Strategies

10. Semantic web ontology

10.1 Introduction
10.2 Ontology and Ontologies
10.3 Ontology Engineering
10.4 Ontology Learning
10.5 State of the Art

11. Question Answering

11.1 IR-based Factoid Question answering
11.2 Knowledge-based Question Answering
11.3 Using multiple information sources: IBM’s Watson
11.4 Evaluation of Factoid Answers
12 Dialogue Systems and Chatbots
12.1 Properties of Human Conversation
12.2 Chatbots
12.3 GUS: Simple Frame-based Dialogue Systems
12.4 The Dialogue-State Architecture
12.5 Evaluating Dialogue Systems
12.6 Dialogue System Design

13. Sentiment analysis

13.1 The Problem of Sentiment Analysis
13.2 Sentiment and Subjectivity Classification
13.3 Document-Level Sentiment Classification
13.4 Feature-Based Sentiment Analysis
13.5 Sentiment Analysis of Comparative Sentences

Learning Outcomes:

No Learning Outcomes

LO1 Should have a good understanding of the field of natural language processing.

LO2 Should have an algorithms and techniques used in this field.

LO3 Should also understand the how natural language processing is used in Machine
translation and Information extraction.

Part B: Contact Session Plan

Academic Term
Course Title Natural Language processing
Course No
Lead Instructor Dr. Chetana Gavankar
Course Contents

Contact List of Topic Title Topic # Text/Ref

session1 (from content structure in Part A) (from Book/external
content resource
structure in
Part A)

1 Introduction Chapter1 T2
 The Study of Language.
 Applications of Natural Language
Understanding.
 Evaluating Language Understanding Systems.
 The Different Levels of Language Analysis.
 Representations and Understanding.
 The Organization of Natural Language
Understanding Systems.

2 N-Grams Language models Chapter 3 T1

 Evaluating Language Models
 Generalization and Zeros
 Smoothing
 Kneser-Ney Smoothing
 The Web and Stupid Backoff

4 Hidden Markov Models Appendix T1

 Markov Chains chapter A
 The Hidden Markov Model
 Likelihood Computation: The Forward
Algorithm
 Decoding: The Viterbi Algorithm
 HMM Training: The Forward-Backward
Algorithm

5 Part-of-Speech Tagging Chapter8 T1

 (Mostly) English Word Classes
 The Penn Treebank Part-of-Speech Tag set
 Part-of-Speech Tagging
 HMM Part-of-Speech Tagging
 Maximum Entropy Markov Model
 Bidirectionality
 Part-of-Speech Tagging for Morphological Rich
Languages

6 Grammars and Parsing Chapter3 T2

 Grammars and Sentence Structure.
 What Makes a Good Grammar
 A Top-Down Parser.
 A Bottom-Up Chart Parser.
 Top-Down Chart Parsing.
 Finite State Models and Morphological
Processing.
 Grammars and Logic Programming.
 Parsing

7 Statistical Constituency Parsing Chapter 14 T1

 Probabilistic Context-Free Grammars
 Probabilistic CKY Parsing of PCFGs
 Ways to Learn PCFG Rule Probabilities
 Problems with PCFGs
 Improving PCFGs by Splitting Non-Terminals
 Probabilistic Lexicalized CFGs
 Probabilistic CCG Parsing
 Evaluating Parsers

8 Review of session 1 to session 7

9 Dependency Parsing Chapter15 T1

 Dependency Relations
 Dependency Formalisms
 Dependency Treebanks
 Transition-Based Dependency Parsing
 Graph-Based Dependency Parsing
 Evaluation

10 Implementation using NLTK R2,

Class Notes
 Part of speech tagging
 Build and draw parser tree
 Implement parsing algorithm
 Word sense disambiguation

11 Statistical Machine translation Chapter 17 R1

 Introduction
 Approaches
 Language Models
 Parallel Corpora
 Word Alignment
 Phrase Library
 Translation Models
 Search Strategies

12 Semantic web ontology Chapter 24 R1 and class

 Introduction notes
 Ontology and Ontologies
 Ontology Engineering
 Ontology Learning
 State of the Art

13 Question Answering Chapter 25 T1

 IR-based Factoid Question answering
 Knowledge-based Question Answering
 Using multiple information sources: IBM’s
Watson
 Evaluation of Factoid Answers

14 Dialogue Systems and Chatbots Chapter 26 T1

 Properties of Human Conversation
 Chatbots
 GUS: Simple Frame-based Dialogue Systems
 The Dialogue-State Architecture
 Evaluating Dialogue Systems
 Dialogue System Design

15 Sentiment analysis Chapter 26 R1

 The Problem of Sentiment Analysis
 Sentiment and Subjectivity Classification
 Document-Level Sentiment Classification
 Feature-Based Sentiment Analysis
 Sentiment Analysis of Comparative Sentences

16 Review of session 9 to session 15

Evaluation Scheme
Evaluation Name Type Weight Duration Day, Date, Session,
Component (Quiz, Lab, Project, (Open book, Time
Midterm exam, End Closed book,
semester exam, etc) Online, etc.)

EC – 1 Assignment Open book 20% To be announced

EC – 2 Mid-term Exam Closed book 30% 2 hours To be announced

EC – 3 End Semester Exam Open book 50% 2.5 hours To be announced

Note - Evaluation components can be tailored depending on the proposed model.

Important Information

Syllabus for Mid-Semester Test (Closed Book): Topics in Weeks 1-8 (1-18 Hours)
Syllabus for Comprehensive Exam (Open Book): All topics given in plan of study
Evaluation Guidelines:
1. EC-1 consists of either two Assignments or three Quizzes. Announcements regarding the same
will be made in a timely manner.
2. For Closed Book tests: No books or reference material of any kind will be permitted.
Laptops/Mobiles of any kind are not allowed. Exchange of any material is not allowed.
3. For Open Book exams: Use of prescribed and reference text books, in original (not photocopies)
is permitted. Class notes/slides as reference material in filed or bound form is permitted.
However, loose sheets of paper will not be allowed. Use of calculators is permitted in all exams.
Laptops/Mobiles of any kind are not allowed. Exchange of any material is not allowed.
4. If a student is unable to appear for the Regular Test/Exam due to genuine exigencies, the student
should follow the procedure to apply for the Make-Up Test/Exam. The genuineness of the
reason for absence in the Regular Exam shall be assessed prior to giving permission to appear
for the Make-up Exam. Make-Up Test/Exam will be conducted only at selected exam centres
on the dates to be announced later.
It shall be the responsibility of the individual student to be regular in maintaining the self-study schedule
as given in the course handout, attend the lectures, and take all the prescribed evaluation components
such as Assignment/Quiz, Mid-Semester Test and Comprehensive Exam according to the evaluation
scheme provided in the handout.

15CS421E - Natural Language Processing
No ratings yet
15CS421E - Natural Language Processing
2 pages
NLP Course for Students
No ratings yet
NLP Course for Students
25 pages
Natural Language Processing
No ratings yet
Natural Language Processing
5 pages
Natural Language Processing (NLP) With Python - Tutorial
No ratings yet
Natural Language Processing (NLP) With Python - Tutorial
72 pages
Speech Recognition Systems Guide
No ratings yet
Speech Recognition Systems Guide
13 pages
A Guide To Text Classification (NLP)
No ratings yet
A Guide To Text Classification (NLP)
17 pages
A New Approach To Parts of Speech Tagging in Malayalam
100% (1)
A New Approach To Parts of Speech Tagging in Malayalam
10 pages
1 Intro To NLP
100% (1)
1 Intro To NLP
46 pages
Intro to Topic Modeling
No ratings yet
Intro to Topic Modeling
120 pages
Real Time Indian Sign Language Recognition Using Deep LSTM Networks
No ratings yet
Real Time Indian Sign Language Recognition Using Deep LSTM Networks
7 pages
NLP Course for B.Tech Students
No ratings yet
NLP Course for B.Tech Students
206 pages
(Ebook) Speech and Language Processing: An Introduction To Natural Language Processing, Computational Linguistics, and Speech Recognition by Daniel Jurafsky, James H. Martin Download
100% (1)
(Ebook) Speech and Language Processing: An Introduction To Natural Language Processing, Computational Linguistics, and Speech Recognition by Daniel Jurafsky, James H. Martin Download
80 pages
(A) What Is Traditional Model of NLP?: Unit - 1
No ratings yet
(A) What Is Traditional Model of NLP?: Unit - 1
18 pages
Table of Content
No ratings yet
Table of Content
13 pages
Langauage Model
No ratings yet
Langauage Model
148 pages
Knowledge Graph 4 Paper
No ratings yet
Knowledge Graph 4 Paper
52 pages
Shivangi Tyagi (NLP Assignments)
No ratings yet
Shivangi Tyagi (NLP Assignments)
60 pages
Unit 3 Notes UDS23201J Query Processing
No ratings yet
Unit 3 Notes UDS23201J Query Processing
38 pages
Text Based Information Retrieval - Document Mining
No ratings yet
Text Based Information Retrieval - Document Mining
37 pages
NLP & Word Vectors: SVD and Word2Vec
No ratings yet
NLP & Word Vectors: SVD and Word2Vec
14 pages
IS 7118 Unit-9 Semantics
No ratings yet
IS 7118 Unit-9 Semantics
82 pages
024 KnowledgeRepresentationMethods (Puhr)
No ratings yet
024 KnowledgeRepresentationMethods (Puhr)
18 pages
Unit 4
No ratings yet
Unit 4
10 pages
NLP Python Intro 1-3
100% (1)
NLP Python Intro 1-3
79 pages
ML Module A7707 - Part1
No ratings yet
ML Module A7707 - Part1
48 pages
Langmodel PDF
0% (1)
Langmodel PDF
69 pages
NLP MQP Solved
No ratings yet
NLP MQP Solved
26 pages
Text Processing: Basics: Pawan Goyal
No ratings yet
Text Processing: Basics: Pawan Goyal
42 pages
Natural Language Processing
No ratings yet
Natural Language Processing
116 pages
AI-Natural Language Processing (NLP) - IJRASET
No ratings yet
AI-Natural Language Processing (NLP) - IJRASET
8 pages
A Survey On Multimodal Bidirectional Machine Learning Translation of Image and Natural Language Processing
No ratings yet
A Survey On Multimodal Bidirectional Machine Learning Translation of Image and Natural Language Processing
14 pages
Analysis of Statistical Parsing in Natural Language Processing
No ratings yet
Analysis of Statistical Parsing in Natural Language Processing
6 pages
NLP Notes
No ratings yet
NLP Notes
203 pages
Question Bank
No ratings yet
Question Bank
13 pages
NLP Unit1
No ratings yet
NLP Unit1
51 pages
NLP Unit I
No ratings yet
NLP Unit I
30 pages
Be Computer Engineering Semester 7 2023 May Dloc III Natural Language Processing Rev 2019 C Scheme
0% (1)
Be Computer Engineering Semester 7 2023 May Dloc III Natural Language Processing Rev 2019 C Scheme
2 pages
Unit 1
No ratings yet
Unit 1
99 pages
Linguistics & NLP: Morphology Basics
No ratings yet
Linguistics & NLP: Morphology Basics
14 pages
Text Normalization in NLP
No ratings yet
Text Normalization in NLP
29 pages
IS 7118 Unit1 Introduction
No ratings yet
IS 7118 Unit1 Introduction
58 pages
Chapter 6
100% (1)
Chapter 6
28 pages
Week 6: Introduction To Natural Language Processing
No ratings yet
Week 6: Introduction To Natural Language Processing
18 pages
Neuromorphic Computing
No ratings yet
Neuromorphic Computing
14 pages
Natural Language Processing-A Paninian Perspective
No ratings yet
Natural Language Processing-A Paninian Perspective
224 pages
Intro To NLP and Text Mining
No ratings yet
Intro To NLP and Text Mining
28 pages
NLP Feature Extraction Techniques
No ratings yet
NLP Feature Extraction Techniques
19 pages
6CS4 AI Unit-5
No ratings yet
6CS4 AI Unit-5
65 pages
Natural Language Processing
No ratings yet
Natural Language Processing
12 pages
Early Detection of Lung Cancer Using AI and ML
No ratings yet
Early Detection of Lung Cancer Using AI and ML
6 pages
Natural Language Processing-Wiki
No ratings yet
Natural Language Processing-Wiki
237 pages
DL Unit-V
100% (1)
DL Unit-V
8 pages
GloVe Word Vectors for CS Students
No ratings yet
GloVe Word Vectors for CS Students
24 pages
Google NLP: NLP (Natural Language Processing)
No ratings yet
Google NLP: NLP (Natural Language Processing)
8 pages
Natural Language processing-Regular-HO
No ratings yet
Natural Language processing-Regular-HO
10 pages
Natural Language Processing-Course Handout September 2022
No ratings yet
Natural Language Processing-Course Handout September 2022
8 pages
Ai in Natural Language Processing
No ratings yet
Ai in Natural Language Processing
4 pages
TSA Book
No ratings yet
TSA Book
154 pages
NLP for Computer Science Students
No ratings yet
NLP for Computer Science Students
16 pages
Al3501 - Teaching Content
No ratings yet
Al3501 - Teaching Content
3 pages
Stream Processing and Analytics Handout
No ratings yet
Stream Processing and Analytics Handout
8 pages
Probabilistic Graphical Model Handout
No ratings yet
Probabilistic Graphical Model Handout
6 pages
Info Retrieval Course Guide
No ratings yet
Info Retrieval Course Guide
5 pages
Graph Algorithms & Data Mining
No ratings yet
Graph Algorithms & Data Mining
7 pages
Engaging Teaching Strategies
No ratings yet
Engaging Teaching Strategies
8 pages
Talk About A Stimulus by (I) Responding To Wh-Questions
100% (1)
Talk About A Stimulus by (I) Responding To Wh-Questions
7 pages
Book 1 Teaching Guide
100% (2)
Book 1 Teaching Guide
21 pages
Summative Assessment Point 2
No ratings yet
Summative Assessment Point 2
11 pages
Spitzberg, Brian H. (2006) Preliminary Development of A Model and Measure of Computer-Mediated Communication (CMC) Competence PDF
No ratings yet
Spitzberg, Brian H. (2006) Preliminary Development of A Model and Measure of Computer-Mediated Communication (CMC) Competence PDF
38 pages
Hot Bread: © Oxford University Press. Permission Granted To Reproduce For Instructional Use
No ratings yet
Hot Bread: © Oxford University Press. Permission Granted To Reproduce For Instructional Use
3 pages
Cambridge C1 Writing Guide
No ratings yet
Cambridge C1 Writing Guide
4 pages
All Around 2 TB PDF
75% (4)
All Around 2 TB PDF
106 pages
Homebased Activity in Mapeh 1
No ratings yet
Homebased Activity in Mapeh 1
5 pages
Week 1 Learning Log
No ratings yet
Week 1 Learning Log
2 pages
Understanding The Purpose and Study of Psychology
No ratings yet
Understanding The Purpose and Study of Psychology
8 pages
Developmental Screening Test (DST)
75% (4)
Developmental Screening Test (DST)
2 pages
shareimprove this question: Matt E. Эллен
No ratings yet
shareimprove this question: Matt E. Эллен
4 pages
Paraphrasing: Incorrect: Some People Believe That Car Emissions Have A Largeimpact On The Environment
No ratings yet
Paraphrasing: Incorrect: Some People Believe That Car Emissions Have A Largeimpact On The Environment
4 pages
Essay in Yellow Paper
No ratings yet
Essay in Yellow Paper
2 pages
Ai in Business
No ratings yet
Ai in Business
19 pages
Foundation of Curriculum Final
90% (48)
Foundation of Curriculum Final
209 pages
Brain Activity: This Is A Sample Text. Insert Your Desired Text Here
No ratings yet
Brain Activity: This Is A Sample Text. Insert Your Desired Text Here
6 pages
Asme7312 Activity 2.1 & 2.2 ST10122214
100% (1)
Asme7312 Activity 2.1 & 2.2 ST10122214
26 pages
Washington DC Field Trip Mini-Unit Lesson Plans
No ratings yet
Washington DC Field Trip Mini-Unit Lesson Plans
5 pages
Matatag Cot
No ratings yet
Matatag Cot
16 pages
CV - DR Jamuna Rajeswaran
0% (1)
CV - DR Jamuna Rajeswaran
9 pages
IOP Conf. Series Journal of Physics Conf. Series
No ratings yet
IOP Conf. Series Journal of Physics Conf. Series
5 pages
Graduate Education Insights
No ratings yet
Graduate Education Insights
77 pages
Schweidtmann 2024 Nature Chemical Engineering Viewpoint
No ratings yet
Schweidtmann 2024 Nature Chemical Engineering Viewpoint
1 page
Research Methodology & Intellectual Property Rights: BRMK557
No ratings yet
Research Methodology & Intellectual Property Rights: BRMK557
57 pages
Peran Pemerintah dalam Pendidikan
No ratings yet
Peran Pemerintah dalam Pendidikan
12 pages
What Has Been Happening?: The Present Perfect Continuous Grammar Guide
No ratings yet
What Has Been Happening?: The Present Perfect Continuous Grammar Guide
12 pages
David Ausubel's Subsumption Theory
100% (1)
David Ausubel's Subsumption Theory
2 pages
Practice MCQ 3
No ratings yet
Practice MCQ 3
4 pages

Natural Language Processing Handout

Uploaded by

Natural Language Processing Handout

Uploaded by

BIRLA INSTITUTE OF

TECHNOLOGY & SCIENCE, PILANI

Part A: Content Design

Course Title Natural Language Processing

CO3 To apply NLP techniques in state of art applications

CO4 To learn implementation of NLP algorithms and techniques

T2 Natural language understanding[2nd edition] by James Allen

Modular Content Structure

1. Introduction to Natural Language Understanding

2. N-gram Language Models

3. Hidden Markov Models

5. Grammars and Parsing.

6. Statistical Constituency Parsing

7. Word sense and word net

9. Statistical Machine translation

10. Semantic web ontology

11. Question Answering

13. Sentiment analysis

LO2 Should have an algorithms and techniques used in this field.

Part B: Contact Session Plan

Contact List of Topic Title Topic # Text/Ref

2 N-Grams Language models Chapter 3 T1

4 Hidden Markov Models Appendix T1

5 Part-of-Speech Tagging Chapter8 T1

6 Grammars and Parsing Chapter3 T2

7 Statistical Constituency Parsing Chapter 14 T1

8 Review of session 1 to session 7

9 Dependency Parsing Chapter15 T1

10 Implementation using NLTK R2,

11 Statistical Machine translation Chapter 17 R1

12 Semantic web ontology Chapter 24 R1 and class

13 Question Answering Chapter 25 T1

14 Dialogue Systems and Chatbots Chapter 26 T1

15 Sentiment analysis Chapter 26 R1

16 Review of session 9 to session 15

EC – 1 Assignment Open book 20% To be announced

EC – 2 Mid-term Exam Closed book 30% 2 hours To be announced

EC – 3 End Semester Exam Open book 50% 2.5 hours To be announced

You might also like