LESSON PLAN
Date: 07/07/2024
Sub Code & Name: CS3352 - FOUNDATIONS OF DATA SCIENCE
Name of the faculty: E DEEPIKA
Designation & Department: Asst. Prof., IT
Regulation: 2021
Branch: IT    Semester: III
SYLLABUS
CS3352 – FOUNDATIONS OF DATA SCIENCE
COURSE OBJECTIVES:
To understand the data science fundamentals and process.
To learn to describe the data for the data science process.
To learn to describe the relationship between data.
To utilize the Python libraries for Data Wrangling.
To present and interpret data using visualization libraries in Python.
UNIT I INTRODUCTION 9
Data Science: Benefits and uses – Facets of data – Data Science Process: Overview – Defining
research goals – Retrieving data – Data preparation – Exploratory data analysis – Build the model –
Presenting findings and building applications – Data Mining – Data Warehousing – Basic statistical
descriptions of data
UNIT II DESCRIBING DATA 9
Types of Data – Types of Variables – Describing Data with Tables and Graphs – Describing Data with
Averages – Describing Variability – Normal Distributions and Standard (z) Scores
UNIT III DESCRIBING RELATIONSHIPS 9
Correlation – Scatter plots – correlation coefficient for quantitative data – computational formula for
correlation coefficient – Regression – regression line – least squares regression line – Standard error of
estimate – interpretation of r² – multiple regression equations – regression towards the mean
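Illustration for Unit III: a minimal sketch of the computational formula for the correlation coefficient, the least squares regression line, r² and the standard error of estimate. The sample data, the function names, and the use of n - 2 in the standard error are assumptions made for this example and are not taken from the prescribed textbooks.

```python
import math


def pearson_r(x, y):
    """Correlation coefficient via the computational (raw-score) formula."""
    n = len(x)
    sum_x, sum_y = sum(x), sum(y)
    # Sums of squares and sum of products computed from raw scores.
    ss_x = sum(v * v for v in x) - sum_x ** 2 / n
    ss_y = sum(v * v for v in y) - sum_y ** 2 / n
    sp_xy = sum(a * b for a, b in zip(x, y)) - sum_x * sum_y / n
    return sp_xy / math.sqrt(ss_x * ss_y)


def least_squares_line(x, y):
    """Slope and intercept of the line minimising squared prediction error."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    ss_x = sum((v - mean_x) ** 2 for v in x)
    sp_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sp_xy / ss_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


def standard_error_of_estimate(x, y):
    """sqrt(SS_y * (1 - r^2) / (n - 2)); the n - 2 divisor is an assumed convention."""
    n = len(y)
    mean_y = sum(y) / n
    ss_y = sum((v - mean_y) ** 2 for v in y)
    r = pearson_r(x, y)
    return math.sqrt(ss_y * (1 - r ** 2) / (n - 2))


# Hypothetical paired observations (not from the course material).
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

r = pearson_r(x, y)
slope, intercept = least_squares_line(x, y)
r_squared = r ** 2  # proportion of variance in y predictable from x
see = standard_error_of_estimate(x, y)

print(f"r = {r:.3f}, r^2 = {r_squared:.3f}")
print(f"predicted y = {slope:.2f} * x + {intercept:.2f}")
print(f"standard error of estimate = {see:.3f}")
```

The raw-score form in `pearson_r` and the deviation form in `least_squares_line` are algebraically equivalent ways of computing the same sums; both are shown so either can be used on the board.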
UNIT IV PYTHON LIBRARIES FOR DATA WRANGLING 9
Basics of NumPy arrays – aggregations – computations on arrays – comparisons, masks, boolean logic
– fancy indexing – structured arrays – Data manipulation with Pandas – data indexing and selection –
operating on data – missing data – Hierarchical indexing – combining datasets – aggregation and
grouping – pivot tables
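Illustration for Unit IV: a short sketch touching several of the listed topics (aggregations, comparisons and masks, fancy indexing, missing data, grouping and pivot tables). The scores, section labels and column names are hypothetical values invented for this example; only standard NumPy and pandas calls are used.

```python
import numpy as np
import pandas as pd

# Hypothetical test scores for six students.
scores = np.array([72, 85, 90, 64, 58, 77])

# Aggregation over the whole array.
mean_score = scores.mean()

# Comparison produces a boolean mask; masks combine with | and &.
passed = scores >= 70
extreme = (scores > 85) | (scores < 60)
extreme_scores = scores[extreme]

# Fancy indexing: select elements by an array of positions.
top_two = scores[np.argsort(scores)[-2:]]

# The same kind of data in a DataFrame, with one missing score.
df = pd.DataFrame({
    "section": ["A", "A", "B", "B", "A", "B"],
    "test": ["T1", "T2", "T1", "T2", "T1", "T2"],
    "score": [72, 85, 90, np.nan, 58, 77],
})

# Missing data: mean() skips NaN by default; fillna replaces it.
filled = df["score"].fillna(df["score"].mean())

# Aggregation and grouping: mean score per section.
by_section = df.groupby("section")["score"].mean()

# Pivot table: sections as rows, tests as columns, mean score in each cell.
pivot = df.pivot_table(values="score", index="section",
                       columns="test", aggfunc="mean")

print(by_section)
print(pivot)
```

Structured arrays, hierarchical indexing and combining datasets are left out here to keep the sketch short.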
UNIT V DATA VISUALIZATION 9
Importing Matplotlib – Line plots – Scatter plots – visualizing errors – density and contour plots –
Histograms – legends – colors – subplots – text and annotation – customization – three-dimensional
plotting – Geographic Data with Basemap – Visualization with Seaborn.
TOTAL: 45 PERIODS
TEXTBOOKS:
1. David Cielen, Arno D. B. Meysman, and Mohamed Ali, “Introducing Data Science”, Manning
Publications, 2016. (Unit I)
2. Robert S. Witte and John S. Witte, “Statistics”, Eleventh Edition, Wiley Publications, 2017.
(Units II and III)
3. Jake VanderPlas, “Python Data Science Handbook”, O’Reilly, 2016. (Units IV and V)
REFERENCE:
1. Allen B. Downey, “Think Stats: Exploratory Data Analysis in Python”, Green Tea Press, 2014.
Total No. of hours as per syllabus : 45
Total No. of hours available as per calendar : 45
Hours (Cumulative) 09 18 27 36 45
Units 1 2 3 4 5
Course Instructor    HOD/IT    Faculty Chairman    Vice-Principal    Principal
LECTURE PLAN
UNIT I INTRODUCTION
Session No. | Topics To Be Covered | Reference Books | Mode Of Teaching
1 | Introduction to Data Science: benefits and uses | Text Book 1 | Board Work
2 | Facets of data, data science process overview | Text Book 1 | Board Work
3 | Defining research goals | Text Book 1 | Board Work
4 | Retrieving data and data preparation | Text Book 1 | Board Work
5 | Exploratory data analysis | Text Book 1 | Board Work
6 | Exploratory data analysis and building the model | Text Book 1 | Board Work
7 | Presenting findings and building applications | Text Book 1 | Board Work
8 | Data mining, data warehousing | Text Book 1 | Board Work
9 | Basic statistical descriptions of data | Text Book 1 | Board Work
TOTAL PERIODS : 09
PROPOSED DATE OF COMPLETION :
DEVIATION (IF ANY) :
CORRECTIVE MEASURES :
REFERENCES / WEBSITES :
UNIT II DESCRIBING DATA
Session No. | Topics To Be Covered | Reference Books | Mode Of Teaching
1 | Types of variables | Text Book 1 | Board Work
2 | Describing data with tables | Text Book 1 | Board Work
3 | Describing data with graphs | Text Book 1 | Board Work
4 | Describing data with averages | Text Book 1 | Board Work
5 | Describing data with averages | Text Book 1 | Board Work
6 | Describing variability | Text Book 1 | Board Work
7 | Normal distributions | Text Book 2 | Board Work
8 | DPT test conducted | Text Book 2 | Board Work
9 | Standard (z) scores | Text Book 2 | Board Work
TOTAL PERIODS : 09
PROPOSED DATE OF COMPLETION :
DEVIATION (IF ANY) :
CORRECTIVE MEASURES :
REFERENCES / WEBSITES :
UNIT III DESCRIBING RELATIONSHIPS
Session No. | Topics To Be Covered | Reference Books | Mode Of Teaching
1 | Describing relationships – correlation | Text Book 2 | Board Work
2 | Scatter plots – correlation coefficient for quantitative data | Text Book 2 | Board Work
3 | Computational formula for correlation coefficient | Text Book 2 | Board Work
4 | Regression – regression line | Text Book 2 | Board Work
5 | Least squares regression line | Text Book 2 | Board Work
6 | Standard error of estimate | Text Book 2 | Board Work
7 | Interpretation of r² | Text Book 2 | Board Work
8 | Multiple regression equations | Text Book 2 | Board Work
9 | Regression towards the mean | Text Book 2 | PPT
TOTAL PERIODS : 09
PROPOSED DATE OF COMPLETION :
DEVIATION (IF ANY) :
CORRECTIVE MEASURES :
REFERENCES / WEBSITES :
UNIT IV PYTHON LIBRARIES FOR DATA WRANGLING
Session No. | Topics To Be Covered | Reference Books | Mode Of Teaching
1 | Basics of NumPy arrays | Text Book 3 | Board Work
2 | Aggregations | Text Book 3 | Board Work
3 | Aggregations – computations on arrays – comparisons, masks | Text Book 3 | Board Work
4 | Computations on arrays – Boolean logic | Text Book 3 | PPT
5 | Fancy indexing – structured arrays | Text Book 3 | Board Work
6 | Data manipulation with Pandas – data indexing and selection | Text Book 3 | Board Work
7 | Operating on data, missing data | Text Book 3 | Board Work
8 | Hierarchical indexing – combining datasets | Text Book 3 | PPT
9 | Aggregation and grouping – pivot tables | Text Book 3 | Board Work
TOTAL PERIODS : 09
PROPOSED DATE OF COMPLETION :
DEVIATION (IF ANY) :
CORRECTIVE MEASURES :
REFERENCES / WEBSITES :
UNIT V DATA VISUALIZATION
Session No. | Topics To Be Covered | Reference Books | Mode Of Teaching
1 | Importing Matplotlib – line plots | Text Book 3 | Board Work
2 | Scatter plots – visualizing errors | Text Book 3 | PPT
3 | Density and contour plots – histograms – legends | Text Book 3 | Board Work
4 | Colors – subplots | Text Book 3 | Board Work
5 | Text and annotation | Text Book 3 | Board Work
6 | Customization – three-dimensional plotting | Text Book 3 | Board Work
7 | Geographic data with Basemap | Text Book 3 | Board Work
8 | Geographic data with Basemap – visualization with Seaborn | Text Book 3 | Board Work
9 | Visualization with Seaborn | Text Book 3 | PPT
TOTAL PERIODS : 09
PROPOSED DATE OF COMPLETION :
DEVIATION (IF ANY) :
CORRECTIVE MEASURES :
REFERENCES / WEBSITES :
LESSON PLAN
Session No. | Topics To Be Covered | Reference Books | Page No | Mode Of Teaching | Date
1 | Introduction to Data Science: benefits and uses | Text Book 1 | 1-4 | Board Work | 16.8.2024
2 | Facets of data, data science process overview | Text Book 1 | 13-21 | Board Work | 20.8.2024
3 | Defining research goals | Text Book 1 | 24-37 | Board Work | 21.8.2024
4 | Retrieving data and data preparation | Text Book 1 | 37-48 | Board Work | 21.8.2024
5 | Exploratory data analysis | Text Book 1 | 48-58 | Board Work | 27.8.2024
6 | Exploratory data analysis and building the model | Text Book 1 | 48-58 | Board Work | 28.8.2024
7 | Presenting findings and building applications | Text Book 1 | 48-58 | Board Work | 28.8.2024
8 | Data mining, data warehousing | Text Book 1 | 58-67 | Board Work | 30.8.2024
9 | Basic statistical descriptions of data | Text Book 1 | 97-105 | Board Work | 3.9.2024
10 | Types of variables | Text Book 1 | 68-77 | Board Work | 4.9.2024
11 | Describing data with tables | Text Book 1 | 78-84 | Board Work | 4.9.2024
12 | Describing data with graphs | Text Book 1 | 143-152 | Board Work | 5.9.2024
13 | Describing data with averages | Text Book 1 | 121-127 | Board Work | 6.9.2024
14 | Describing data with averages | Text Book 1 | 128-135 | Board Work | 10.9.2024
15 | Describing variability | Text Book 1 | 136-142 | Board Work | 11.9.2024
16 | Normal distributions | Text Book 2 | 143-152 | Board Work | 13.9.2024
17 | DPT test conducted | Text Book 2 | 171-180 | Board Work | 18.9.2024
18 | Standard (z) scores | Text Book 2 | 196-199 | Board Work | 18.9.2024
19 | Describing relationships – correlation | Text Book 2 | 205-209 | Board Work | 19.9.2024
20 | Scatter plots – correlation coefficient for quantitative data | Text Book 2 | 300-302 | Board Work | 20.9.2024
21 | Computational formula for correlation coefficient | Text Book 2 | 303-305 | Board Work | 24.9.2024
22 | Regression – regression line | Text Book 2 | 306-308 | Board Work | 25.9.2024
23 | Least squares regression line | Text Book 2 | 306-308 | Board Work | 26.9.2024
24 | Standard error of estimate | Text Book 2 | 309-315 | Board Work | 27.9.2024
25 | Interpretation of r² | Text Book 2 | 316-319 | Board Work | 28.9.2024
26 | Multiple regression equations | Text Book 2 | 316-319 | Board Work | 1.10.2024 & 7.10.2024
27 | Regression towards the mean | Text Book 2 | 320-328 | PPT | 8.10.2024
28 | Basics of NumPy arrays, aggregations | Text Book 3 | 355-365 | Board Work | 8.10.2024
29 | DPT test conducted | Text Book 3 | - | Board Work | 9.10.2024
30 | Aggregations – computations on arrays – comparisons, masks | Text Book 3 | - | Board Work | 9.10.2024
31 | Computations on arrays – Boolean logic | Text Book 3 | - | PPT | 18.10.2024
32 | Fancy indexing – structured arrays | Text Book 3 | - | Board Work | 21.10.2024
33 | Data manipulation with Pandas – data indexing and selection | Text Book 3 | - | Board Work | 22.10.2024
34 | Operating on data, missing data | Text Book 3 | 366-371 | Board Work | 23.10.2024
35 | Hierarchical indexing – combining datasets | Text Book 3 | 371-375 | PPT | 24.10.2024
36 | Aggregation and grouping – pivot tables | Text Book 3 | 371-375 | Board Work | 25.10.2024
37 | Importing Matplotlib – line plots | Text Book 3 | 385-387 | Board Work | 5.11.2024
38 | Scatter plots – visualizing errors | Text Book 3 | - | PPT | 6.11.2024
39 | Density and contour plots – histograms – legends | Text Book 3 | 19-20 | Board Work | 7.11.2024
40 | Colors – subplots | Text Book 3 | 20-22 | Board Work | 10.11.2024
41 | Text and annotation | Text Book 3 | 22-23 | Board Work | 12.11.2024
42 | Customization – three-dimensional plotting | Text Book 3 | 22-23 | Board Work | 12.11.2024
43 | Geographic data with Basemap | Text Book 3 | 22-23 | Board Work | 15.11.2024
44 | Geographic data with Basemap – visualization with Seaborn | Text Book 3 | 40-41 | Board Work | 15.11.2024
45 | Visualization with Seaborn | Text Book 3 | 74-80 | PPT | 15.11.2024