[go: up one dir, main page]

0% found this document useful (0 votes)
56 views2 pages

Statistical Natural Language Processing

The course on Statistical Natural Language Processing focuses on developing algorithms and techniques for processing natural language using statistical learning methods. It covers topics such as language models, sequence labeling, applications like named entity recognition, and deep learning models including RNNs and Transformers. Prerequisites include knowledge in machine learning, linear algebra, probability, and proficiency in Python programming.

Uploaded by

aimlhod
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
56 views2 pages

Statistical Natural Language Processing

The course on Statistical Natural Language Processing focuses on developing algorithms and techniques for processing natural language using statistical learning methods. It covers topics such as language models, sequence labeling, applications like named entity recognition, and deep learning models including RNNs and Transformers. Prerequisites include knowledge in machine learning, linear algebra, probability, and proficiency in Python programming.

Uploaded by

aimlhod
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Statistical Natural Language Processing

L TP C
3 0 0 3

Pre-requisites

Must: Introduction to Machine Learning (CS771) or equivalent course, Proficiency in Linear


Algebra, Probability and Statistics, Proficiency in Python Programming
Desirable: Probabilistic Machine Learning (CS772), Topics in Probabilistic Modeling and
Inference (CS775), Deep Learning for Computer Vision (CS776)

Course Objectives:

Natural language (NL) refers to the language spoken/written by humans.


NL is the primary mode of communication for humans. With the growth of the world wide web,
data in the form of text has grown exponentially. It calls for the development of algorithms and
techniques for processing natural language for the automation and development of intelligent
machines. This course will primarily focus on understanding and developing linguistic
techniques, statistical learning algorithms and models for processing language. We will have a
statistical approach towards natural language processing, wherein we will learn how one could
develop natural language understanding models from statistical regularities in large corpora of
natural language texts while leveraging linguistics theories.

UNIT 1

Introduction to Natural Language (NL) : why is it hard to process NL, linguistics


fundamentals, etc. Language Models: n-grams, smoothing, class-based, brown clustering.

UNIT II

Sequence Labeling: HMM, MaxEnt, CRFs, related applications of these models e.g.
Part of Speech tagging, etc. Parsing: CFG, Lexicalized CFG, PCFGs, Dependency
parsing

UNIT III

Applications: Named Entity Recognition, Coreference Resolution, text classi cation,


toolkits e.g. Spacy, etc. Distributional Semantics: distributional hypothesis, vector space
models, etc.

UNIT IV
Distributed Representations: Neural Networks (NN), Backpropagation, Softmax,
Hierarchical Softmax Word Vectors: Feedforward NN, Word2Vec, GloVE,
Contextualization (ELMo etc.), Subword information (FastText, etc.)

UNIT V

Deep Models: RNNs, LSTMs, Attention, CNNs, applications in language, etc.


Sequence to Sequence models : machine translation and other applications
Transformers : BERT, transfer learning and applications

References

1. Speech and Language Processing, Daniel Jurafsky, James H.Martin,


2. Foundations of Statistical Natural Language Processing, CH Manning, H Schutze
3. Introduction to Natural Language Processing, Jacob Eisenstein
4. Natural Language Understanding, James Allen
5. There are no specific references, this course gleans information from a variety of sources
like books, research papers, other courses, etc. Relevant references would be suggested in
the lectures. Some of the frequent references are as follows:

You might also like