PDS Question Paper
___________
MARKS
Q.6 (a) List the types of plots that can be drawn using matplotlib. 03
(b) Write a Python program to read data from CSV files using pandas. 04
(c) Explain the pie chart with appropriate examples. 07
Q.8 (a) Define EDA. List the tasks that need to be carried out in EDA. 03
(b) How can hash functions be useful for solving data science problems? 04
(c) Define the regression problem. How can it be solved using Scikit-learn? 07
*************
Seat No.: ________ Enrolment No.___________
MARKS
Q.1 (a) What is the role of Python in data science? 03
(b) Differentiate between List and Tuple in Python. 04
(c) Explain the data science pipeline in detail. 07
Seat No.: ________ Enrolment No.___________
MARKS
Q.1 (a) Differentiate between the list and dictionary data types of Python by their 03
characteristics, with a brief example of each.
(b) What is meant by the slicing operation on strings in Python? Write an 04
example of slicing to fetch the first name and last name from the full name
of a person and display them.
(c) Which basic activities are performed as part of the data science 07
pipeline? Summarize and explain in brief.
Q.2 (a) What are the core competencies needed to become a data scientist? 03
Explain in brief.
(b) Compare and summarize the four different coding styles supported by 04
the Python language.
(c) Summarize the characteristics of NumPy, Pandas, Scikit-Learn and 07
matplotlib libraries along with their usage in brief.
OR
(c) What do you mean by prototyping? List the phases of the prototyping 07
and experimentation process and explain them in brief.
Q.3 (a) Compare NumPy and pandas on the basis of their characteristics 03
and usage.
(b) For what purpose is sampling used? Demonstrate random sampling 04
with an example.
(c) What is the need for streaming data? Explain data uploading and 07
data streaming with an example.
OR
Q.3 (a) How is XPath useful for the analysis of HTML data? Explain in brief. 03
(b) Define the term n-gram. Explain the TF-IDF technique. 04
(c) List the techniques for handling missing data. Explain the various 07
techniques with examples.
Q.4 (a) List the various types of graphs/charts available in the pyplot module of the matplotlib 03
library for data visualization. Explain any two of them in brief.
(b) What kind of data is analyzed with the bag-of-words model? Explain it with 04
an example.
(c) What do you mean by time series data? How can we plot it? Explain 07
with an example that plots a trend over time.
OR
Q.4 (a) Compare bar graph, box-plot and histogram with respect to their 03
applicability in data visualization.
(b) Define stemming. Explain the concept of stemming with an example. 04
(c) What is the use of a scatter plot in data visualization? Can we draw 07
a trendline in a scatter plot? Explain it with an example.
Q.5 (a) Define the term data wrangling. Explain the steps needed to perform 03
data wrangling.
(b) Why do we need to perform Z-score standardization in EDA? Justify it 04
with an example.
(c) What is the use of hash functions in EDA? Explain the various hashing 07
tricks with examples.
OR
Q.5 (a) What do you mean by Exploratory Data Analysis (EDA)? How is the t-test 03
useful for EDA?
(b) What do you mean by covariance? What is the importance of 04
covariance in data analysis? Explain it with an example.
(c) List the different ways of defining descriptive statistics for 07
numeric data. Explain them in brief.
*************
Seat No.: ________ Enrolment No.___________
MARKS
Q.1 (a) List the advantages of Python. 03
(b) Differentiate between NumPy and pandas. 04
(c) Explain Exploratory Data Analysis (EDA). 07
*************