
Data Science Syllabus


Uploaded by

Khushbu Maurya
Copyright
© All Rights Reserved

INDUS INSTITUTE OF TECHNOLOGY & ENGINEERING

Constituent Institute of Indus University

Subject: Data Science

Program: B. Tech CE/CSE/IT Subject Code: CE0630 Semester: VI

Teaching Scheme (Hours per week):
Lecture: 3, Tutorial: 0, Practical: 2, Credits: 4

Examination Evaluation Scheme (Marks):
University Theory Examination: 40
University Practical Examination: 40
Continuous Internal Evaluation (CIE) - Theory: 60
Continuous Internal Evaluation (CIE) - Practical: 60
Total: 200

Course Outcome:

1. Learn the fundamentals of data analytics and the data science pipeline.
2. Learn how to scope the resources required for a data science project.
3. Apply principles of Data Science to the analysis of business problems.
4. Develop skills in data mining software to solve real-world problems.
5. Increase employability through cutting-edge tools and technologies for analyzing Big Data.

CONTENTS

UNIT-I
[12 Hours]
Introduction to data science:

Defining Data Science, What do data scientists do?, Data Science in Business, Use Cases for
Data Science, Data Science and Big Data, Data Science and Machine Learning
Data Science Process Overview – Defining goals – Retrieving data – Data preparation – Data
exploration – Data modeling – Presentation.
UNIT-II
[12 Hours]
Introduction to statistics:

What is statistics? Descriptive Statistics: Introduction, Population and sample, Types of variables,
Measures of central tendency, Measures of variability, Coefficient of variation, Skewness and
Kurtosis
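
As an illustration of the descriptive measures listed above, the following is a minimal sketch using only the Python standard library. The function name describe and the sample values are hypothetical, chosen for illustration. Skewness and kurtosis are computed here with the population (moment-based) formulas; other conventions with bias corrections exist and give slightly different numbers.

```python
import statistics


def describe(sample):
    """Return basic descriptive statistics for a list of numbers."""
    n = len(sample)
    mean = statistics.mean(sample)
    median = statistics.median(sample)
    stdev = statistics.stdev(sample)    # sample standard deviation (divides by n - 1)
    pstdev = statistics.pstdev(sample)  # population standard deviation (divides by n)

    # Coefficient of variation: spread relative to the mean.
    cv = stdev / mean

    # Third and fourth central moments, used for shape measures.
    m3 = sum((x - mean) ** 3 for x in sample) / n
    m4 = sum((x - mean) ** 4 for x in sample) / n
    skewness = m3 / pstdev ** 3
    excess_kurtosis = m4 / pstdev ** 4 - 3  # 0 for a normal distribution

    return {
        "mean": mean,
        "median": median,
        "stdev": stdev,
        "cv": cv,
        "skewness": skewness,
        "excess_kurtosis": excess_kurtosis,
    }


# Hypothetical sample, for illustration only.
marks = [2, 4, 4, 4, 5, 5, 7, 9]
summary = describe(marks)
print(summary)
```

A positive skewness value indicates a longer right tail; a negative excess kurtosis indicates lighter tails than a normal distribution.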

Inferential Statistics:

Normal distribution, Hypothesis testing, Central limit theorem, Confidence interval, t-test, Type I
and Type II errors
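
The inferential topics above can be sketched in a few lines of standard-library Python. This is an illustrative sketch under stated assumptions, not a prescribed method: the function names, the sample values, and the hypothesized mean mu0 are hypothetical. The confidence interval uses a normal (z) critical value, which relies on the central limit theorem and is only an approximation for small samples, where a t critical value would normally be used instead.

```python
import math
import statistics
from statistics import NormalDist


def one_sample_t_statistic(sample, mu0):
    """t statistic for testing H0: population mean equals mu0."""
    n = len(sample)
    standard_error = statistics.stdev(sample) / math.sqrt(n)
    return (statistics.mean(sample) - mu0) / standard_error


def normal_confidence_interval(sample, confidence=0.95):
    """Approximate confidence interval for the mean using a z critical value."""
    n = len(sample)
    mean = statistics.mean(sample)
    standard_error = statistics.stdev(sample) / math.sqrt(n)
    # Two-sided critical value, about 1.96 for 95% confidence.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    return mean - z * standard_error, mean + z * standard_error


# Hypothetical sample and null-hypothesis mean, for illustration only.
sample = [2, 4, 4, 4, 5, 5, 7, 9]
t_stat = one_sample_t_statistic(sample, mu0=4)
low, high = normal_confidence_interval(sample)
print(f"t = {t_stat:.3f}, 95% CI = ({low:.3f}, {high:.3f})")
```

The t statistic would then be compared against a critical value from the t distribution with n - 1 degrees of freedom. Rejecting a true null hypothesis is a Type I error, and failing to reject a false one is a Type II error.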

UNIT-III
[12 Hours]
Machine Learning Introduction and Concepts:

Machine learning – Modeling Process – Training model – Validating model – Predicting new
observations
Important machine learning terminologies, Types of machine learning algorithms, Supervised
learning algorithms: Types of supervised learning algorithms, Regression: Linear Regression,
Classification algorithms
Unsupervised learning algorithms: Clustering algorithms
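
The modeling process named in this unit (training, validating, predicting new observations) can be illustrated with simple linear regression. The sketch below fits a line by ordinary least squares in plain Python. The function names and the toy data are hypothetical: the data is generated without noise from y = 2x + 1 so the result is easy to check. In the practical sessions a library such as scikit-learn would typically take the place of the hand-written fit.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, x):
    """Predict y for a new observation x."""
    slope, intercept = model
    return slope * x + intercept


def mean_squared_error(model, xs, ys):
    """Average squared difference between predictions and true values."""
    return sum((predict(model, x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Hypothetical toy data generated from y = 2x + 1 (no noise).
xs = [0, 1, 2, 3, 4, 5, 6, 7]
ys = [2 * x + 1 for x in xs]

# Hold out the last two points for validation.
train_x, val_x = xs[:6], xs[6:]
train_y, val_y = ys[:6], ys[6:]

model = fit_line(train_x, train_y)                  # training
val_mse = mean_squared_error(model, val_x, val_y)   # validation
new_prediction = predict(model, 10)                 # predicting a new observation
print(model, val_mse, new_prediction)
```

Keeping the validation points out of the fit is what makes the validation error a fair estimate of how the model behaves on unseen data.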

UNIT-IV
[12 Hours]

Introduction to data visualization – Data visualization options – Filters – Python libraries for
visualization – Matplotlib – Seaborn
Data Science Ethics – Doing good data science – Owners of the data – Valuing different aspects
of privacy – Getting informed consent – The Five Cs – Diversity – Inclusion – Future Trends.

Course Outcome:
After completion of the course students will be able to:
1) Demonstrate knowledge of big data analytics.
2) Demonstrate the ability to think critically in making decisions based on data.
3) Interpret data, extract meaningful information, and assess findings.
4) Identify and analyze social, legal, and ethical issues in data science.
5) Choose and apply tools and methodologies to solve data science tasks.
6) Explore future trends in data.
Text Books:

1. Introducing Data Science, Davy Cielen, Arno D. B. Meysman, Mohamed Ali, Manning
Publications Co., 1st edition, 2016

2. An Introduction to Statistical Learning: with Applications in R, Gareth James, Daniela
Witten, Trevor Hastie, Robert Tibshirani, Springer, 1st edition, 2013

3. Ethics and Data Science, D J Patil, Hilary Mason, Mike Loukides, O’Reilly, 1st edition,
2018

Reference Books:

1. Machine Learning: A Probabilistic Perspective. Kevin P. Murphy.

LIST OF EXPERIMENTS

Each experiment is listed by serial number with its title, followed by its learning outcome.

1. Getting Started with Skills Network Labs
   Learning Outcome: To know the functionality and usage of the Skills Network Labs environment.

2. Getting Started with Jupyter Notebooks
   Learning Outcome: To know the functionality and usage of the Jupyter Notebook platform.

3. Getting Started with Apache Zeppelin Notebooks
   Learning Outcome: To know the functionality and usage of Apache Zeppelin Notebook.

4. Getting Started with RStudio IDE
   Learning Outcome: Introduction to RStudio and its usage in Machine Learning.

5. Data Analysis with Python
   Import data sets
   Clean and prepare data for analysis
   Manipulate pandas DataFrames
   Summarize data
   Build machine learning models using scikit-learn
   Build data pipelines
   Learning Outcome: To understand the concepts of machine learning and data preparation, and the use of pandas and scikit-learn for model building.

6. Data Visualization with Python
   Introduction to Visualization Tools
   Basic Visualization Tools
   Specialized Visualization Tools
   Creating Maps and Visualizing Geospatial Data
   Learning Outcome: To understand the field of data visualization and the tools used for visualization.

7. Advanced Visualization Tools
   Learning Outcome: To study and understand the functionalities of advanced visualization tools.