100% found this document useful (1 vote)

21 views26 pages

Sample MCQ Questions

The document outlines the fundamentals of data science, including its purpose, benefits, and the essential data types and processes involved. It covers Python basics, control structures, functions, data structures, and libraries like NumPy, along with data collection, preprocessing, and exploratory data analysis. Additionally, it introduces descriptive statistics and key concepts related to data analysis.

Uploaded by

h6152462

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

21 views26 pages

Sample MCQ Questions

Uploaded by

h6152462

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 26

UNIT I

1. Need for Data Science

1. What is the primary purpose of data science?

A) Store data
B) Analyze and extract insights from data
C) Replace humans in decision-making
D) Develop hardware
Answer: B
2. Why has data science gained importance in recent years?
A) Increase in data availability
B) Decrease in computing power
C) Elimination of the internet
D) Reduced need for programming
Answer: A
3. Which industry heavily relies on data science for customer behavior analysis?
A) Banking
B) Retail
C) Healthcare
D) All of the above
Answer: D
4. What type of data is essential for data science?
A) Structured data only
B) Unstructured data only
C) Both structured and unstructured data
D) No data is required
Answer: C
5. Which of these roles is closely related to data science?
A) Web developer
B) Data analyst
C) Graphic designer
D) Network administrator
Answer: B

2. Benefits and Uses of Data Science

6. How does data science benefit organizations?

A) Reduces decision-making errors
B) Increases complexity in tasks
C) Replaces manual work with spreadsheets
D) Eliminates data collection needs
Answer: A
7. Which of these is NOT a use of data science?
A) Fraud detection
B) Data storage
C) Predictive modeling
D) Customer segmentation
Answer: B
8. What is one benefit of applying data science in healthcare?
A) Higher patient privacy violations
B) Personalized medicine recommendations
C) Increased medication costs
D) Reduced focus on patient care
Answer: B
9. Which sector uses data science for inventory management?
A) Retail
B) Transportation
C) Education
D) Law
Answer: A
10. What is an outcome of implementing data science in marketing?
A) Predicting customer churn
B) Increasing email spam
C) Lowering marketing efficiency
D) Decreasing customer satisfaction
Answer: A

3. Facets of Data

11. What are the facets of data?

A) Volume, Velocity, Variety, Veracity
B) Length, Width, Height
C) Shape, Texture, Density
D) None of the above
Answer: A
12. Which facet of data represents the speed at which data is generated?
A) Volume
B) Velocity
C) Variety
D) Veracity
Answer: B
13. What does "Variety" in data facets signify?
A) Quality of data
B) Different formats and types of data
C) Speed of data collection
D) Accuracy of data analysis
Answer: B
14. What is the challenge with the "Veracity" of data?
A) High cost of storing data
B) Inaccuracy and inconsistency of data
C) Too much data to analyze
D) Data moving too quickly
Answer: B
15. What does "Volume" in data refer to?
A) The size of data
B) The speed of data generation
C) The variety of data types
D) The reliability of data
Answer: A

4. Data Science Process

16. What is the first step in the data science process?

A) Data modeling
B) Data visualization
C) Problem definition
D) Model deployment
Answer: C
17. Which step involves cleaning and preprocessing data?
A) Data collection
B) Data preparation
C) Data analysis
D) Model evaluation
Answer: B
18. What does the data modeling step involve?
A) Generating hypotheses
B) Building algorithms to identify patterns
C) Visualizing insights
D) Collecting data
Answer: B
19. In which stage is data visualization used?
A) Preprocessing
B) Model building
C) Results interpretation
D) Data storage
Answer: C
20. What happens during the deployment phase?
A) The final model is put into production
B) Data is cleaned and structured
C) Features are engineered
D) Data is visualized
Answer: A

5. Basics of Python

21. Which file extension is used for Python scripts?

A) .py
B) .txt
C) .java
D) .exe
Answer: A
22. Which function is used to print output in Python?
A) output()
B) print()
C) display()
D) show()
Answer: B
23. How do you declare a variable in Python?
A) let x = 5
B) int x = 5
C) x = 5
D) declare x = 5
Answer: C
24. Which data type is mutable in Python?
A) Tuple
B) String
C) List
D) Integer
Answer: C
25. What is the result of 10 // 3 in Python?
A) 3.33
B) 3
C) 10.0
D) None
Answer: B

6. Setting Working Directory

26. Which library is commonly used to set the working directory in Python?
A) os
B) math
C) random
D) re
Answer: A
27. What function sets the working directory?
A) set_dir()
B) os.chdir()
C) os.getdir()
D) os.mkdir()
Answer: B
28. How do you check the current working directory?
A) os.checkdir()
B) os.getcwd()
C) os.curdir()
D) os.dir()
Answer: B

7. File Execution

29. How do you execute a Python script?

A) Run it in Notepad
B) Double-click the file
C) Use python script.py in the terminal
D) Compile it
Answer: C
30. Which IDE is commonly used for executing Python code?
A) Eclipse
B) PyCharm
C) IntelliJ
D) NetBeans
Answer: B

8. Variable Management

31. How do you delete a variable in Python?

A) del variable_name
B) remove variable_name
C) delete variable_name
D) clear variable_name
Answer: A
32. What is used to clear all variables in Python?
A) os.clear()
B) %reset
C) del all()
D) None of the above
Answer: B

9. Commenting Script Files

33. How do you write single-line comments in Python?

A) /* comment */
B) // comment
C) # comment
D) 
Answer: C

10. Data Types and Operators

34. Which is a numeric data type in Python?

A) int
B) str
C) list
D) dict
Answer: A
35. What does True and False evaluate to?
A) True
B) False
C) None
D) Error
Answer: B
36. What operator is used for exponentiation?
A) ^
B) **
C) %
D) //
Answer: B

UNIT II

1. Control Structures

1. What is the purpose of control structures in programming?

A) Organize data
B) Control the flow of execution
C) Store large data sets
D) None of the above
Answer: B
2. Which of these is a conditional control structure in Python?
A) for
B) if-else
C) while
D) break
Answer: B
3. What does the elif keyword represent in Python?
A) End of a loop
B) Else if
C) Initiates a loop
D) None of the above
Answer: B
4. Which of the following is NOT a valid control structure?
A) if
B) elif
C) switch
D) None of the above
Answer: C
5. What is the default control flow in Python?
A) Sequential execution
B) Parallel execution
C) Random execution
D) Iterative execution
Answer: A

2. Loops

6. Which keyword is used to terminate a loop prematurely?

A) pass
B) break
C) continue
D) stop
Answer: B
7. What is the purpose of the continue keyword in loops?
A) Ends the loop
B) Skips the current iteration and proceeds to the next
C) Stops execution completely
D) Executes the loop condition again
Answer: B
8. Which loop is best for iterating over a range of numbers?
A) while
B) for
C) do-while
D) None of the above
Answer: B
9. What is the output of the following code?

python
Copy code
for i in range(3):
print(i)

A) 1 2 3
B) 0 1 2
C) 0 1 2 3
D) None of the above
Answer: B

10. What happens when the else block is used with a loop?
A) Runs only if the loop executes at least once
B) Executes when the loop condition is false
C) Skips to the next loop iteration
D) Only works with while loops
Answer: B

3. Functions

11. Which keyword is used to define a function in Python?

A) func
B) function
C) def
D) define
Answer: C
12. What is the purpose of the return statement in functions?
A) To end a function
B) To pass back a value to the caller
C) To call another function
D) None of the above
Answer: B
13. Which of the following is NOT a valid function parameter type?
A) Positional
B) Keyword
C) Default
D) Constant
Answer: D
14. What is a lambda function in Python?
A) A function defined inside another function
B) An anonymous, inline function
C) A recursive function
D) None of the above
Answer: B
15. How do you call a function named my_func in Python?
A) call my_func()
B) my_func()
C) def my_func()
D) execute my_func()
Answer: B

4. Data Structures

16. Which data structure is mutable in Python?

A) List
B) Tuple
C) String
D) None of the above
Answer: A
17. How do you access the first element of a list named my_list?
A) my_list[0]
B) my_list(0)
C) my_list[1]
D) my_list.first()
Answer: A
18. What is a tuple?
A) An immutable list
B) A mutable dictionary
C) A mutable set
D) None of the above
Answer: A
19. What method is used to add an element to a set?
A) add()
B) append()
C) insert()
D) push()
Answer: A
20. Which of the following is a valid key type in a dictionary?
A) Integer
B) String
C) Tuple
D) All of the above
Answer: D
5. NumPy Library

21. What is the primary purpose of the NumPy library?

A) Data visualization
B) Numerical computing
C) Text processing
D) Web development
Answer: B
22. How do you import NumPy in Python?
A) import numpy as np
B) include numpy
C) require numpy
D) import np
Answer: A
23. Which function creates an array of zeros in NumPy?
A) zeros()
B) empty()
C) ones()
D) array()
Answer: A
24. What is the shape of the following NumPy array?

python
Copy code
np.array([[1, 2], [3, 4]])

A) (2, 2)
B) (1, 4)
C) (4,)
D) None of the above
Answer: A

25. What is the difference between a list and a NumPy array?

A) NumPy arrays are slower than lists
B) NumPy arrays support vectorized operations
C) Lists are immutable
D) None of the above
Answer: B

6. Data Collection and Types

26. What is primary data?

A) Data collected by someone else
B) Data collected firsthand
C) Data from online sources
D) None of the above
Answer: B
27. Which of these is an example of structured data?
A) Audio files
B) Spreadsheets
C) Videos
D) Images
Answer: B
28. What type of data is “age in years”?
A) Categorical
B) Numerical
C) Ordinal
D) None of the above
Answer: B
29. Which method is NOT used for data collection?
A) Surveys
B) Experiments
C) Data cleaning
D) Interviews
Answer: C
30. What is metadata?
A) Data about data
B) Processed data
C) Data stored in arrays
D) Data visualizations
Answer: A

7. Data Preprocessing

31. Which of the following is a step in data preprocessing?

A) Data cleaning
B) Data visualization
C) Data modeling
D) Model deployment
Answer: A
32. What is the purpose of feature scaling?
A) Normalize data range
B) Add more features
C) Increase the dataset size
D) None of the above
Answer: A
33. What does one-hot encoding do?
A) Handles missing values
B) Encodes categorical variables
C) Scales numerical features
D) None of the above
Answer: B

8. Exploratory Data Analysis (EDA)

34. What is the primary goal of EDA?

A) Build predictive models
B) Summarize and visualize data
C) Collect data
D) Scale data
Answer: B
35. Which library is commonly used for data visualization in Python?
A) NumPy
B) Matplotlib
C) os
D) random
Answer: B
36. What does a boxplot visualize?
A) Relationships between variables
B) Distribution and outliers
C) Missing values
D) Categorical data
Answer: B

Unit III

1. Descriptive Statistics

1. What is the purpose of descriptive statistics?

A) Predict future outcomes
B) Summarize and describe data
C) Test hypotheses
D) Explore relationships between variables
Answer: B
2. Which of the following is NOT a measure of central tendency?
A) Mean
B) Median
C) Mode
D) Standard Deviation
Answer: D
3. Which of the following measures dispersion in a dataset?
A) Mean
B) Range
C) Mode
D) Median
Answer: B
4. The interquartile range (IQR) is calculated as:
A) Q1 - Q3
B) Q3 - Q1
C) Mean - Median
D) Median - Mode
Answer: B
5. What does the term "outlier" refer to in a dataset?
A) The average value
B) Values significantly different from others
C) The middle value
D) A value that repeats often
Answer: B

2. Mean

6. How is the mean calculated?

A) Sum of all values divided by the number of values
B) Middle value in a dataset
C) Most frequent value
D) Difference between maximum and minimum values
Answer: A
7. Which of the following affects the mean?
A) Outliers
B) Median
C) Mode
D) None of the above
Answer: A
8. What is the mean of the dataset {2, 4, 6, 8}?
A) 4
B) 5
C) 6
D) 10
Answer: C
9. What is the mean of {5, 10, 15}?
A) 10
B) 15
C) 12.5
D) 11
Answer: A
10. If all values in a dataset are increased by 5, how does the mean change?
A) Increases by 5
B) Decreases by 5
C) Remains the same
D) Doubles
Answer: A

3. Standard Deviation

11. What does standard deviation measure?

A) Central tendency
B) Spread of data around the mean
C) Median
D) Mode
Answer: B
12. If the standard deviation is 0, what can be inferred?
A) Data is widely spread
B) All data points are equal
C) Data has many outliers
D) Data has no mean
Answer: B
13. What happens to standard deviation if all data points are increased by a
constant?
A) Increases by the same constant
B) Remains unchanged
C) Doubles
D) Becomes zero
Answer: B
14. What does a large standard deviation indicate?
A) Data points are close to the mean
B) Data points are widely spread
C) Data points are all identical
D) None of the above
Answer: B
15. Which of the following datasets has the largest standard deviation?
A) {5, 5, 5, 5}
B) {1, 5, 9}
C) {2, 4, 6, 8}
D) {10, 10, 10}
Answer: B

4. Skewness and Kurtosis

16. What does skewness measure?

A) The shape of the distribution
B) The spread of the data
C) The mean value
D) The correlation between variables
Answer: A
17. A positively skewed distribution has:
A) A longer tail on the left
B) A longer tail on the right
C) Equal tails on both sides
D) No tails
Answer: B
18. What does kurtosis measure?
A) Spread of data
B) Peakedness of a distribution
C) Average of data
D) Number of outliers
Answer: B
19. Which distribution has kurtosis greater than 3?
A) Normal distribution
B) Platykurtic distribution
C) Leptokurtic distribution
D) Mesokurtic distribution
Answer: C
20. What does a negative skewness indicate?
A) Symmetrical distribution
B) Longer tail on the left
C) Longer tail on the right
D) No skewness
Answer: B

5. Inferential Statistics

21. What is the primary goal of inferential statistics?

A) Summarize data
B) Make conclusions about a population based on a sample
C) Collect data
D) Identify outliers
Answer: B
22. What is the null hypothesis (H₀)?
A) The hypothesis being tested
B) The hypothesis assumed true unless evidence suggests otherwise
C) The hypothesis that always gets rejected
D) None of the above
Answer: B
23. Which test is used to compare the means of two independent groups?
A) Chi-square test
B) t-test
C) ANOVA
D) Regression analysis
Answer: B
24. What does a p-value less than 0.05 indicate?
A) Fail to reject the null hypothesis
B) Reject the null hypothesis
C) Results are insignificant
D) None of the above
Answer: B
25. What type of error occurs when the null hypothesis is rejected but is actually
true?
A) Type I error
B) Type II error
C) Sampling error
D) None of the above
Answer: A

6. Probability Theory

26. What is the range of probability values?

A) -1 to 1
B) 0 to 1
C) 0 to 100
D) None of the above
Answer: B
27. What is the probability of an impossible event?
A) 0
B) 0.5
C) 1
D) Undefined
Answer: A
28. What is the sum of probabilities of all outcomes in a sample space?
A) 0
B) 1
C) Infinity
D) Depends on the event
Answer: B
29. What is conditional probability?
A) Probability of A given B
B) Probability of B given A
C) Probability of A and B
D) None of the above
Answer: A
30. What formula represents Bayes’ Theorem?
A) P(A∩B)×P(B)P(A)
B) P(A∣B)=P(B∣A)P(A)/P(B)
C) P(A)+P(B)P(A) + P(B)P(A)+P(B)
D) None of the above
Answer: B

7. Pandas Library

31. What is the primary purpose of the Pandas library?

A) Data manipulation and analysis
B) Web scraping
C) Numerical computing
D) None of the above
Answer: A
32. Which object in Pandas represents tabular data?
A) Series
B) DataFrame
C) Array
D) List
Answer: B
33. How do you import Pandas in Python?
A) import pandas as pd
B) include pandas
C) require pandas
D) import pd
Answer: A
34. Which method reads a CSV file into a DataFrame?
A) pd.read_table()
B) pd.read_csv()
C) pd.read_file()
D) None of the above
Answer: B
35. How do you select a column named "Age" from a DataFrame df?
A) df[Age]
B) df["Age"]
C) df.Age
D) Both B and C
Answer: D

8. DataFrame Operations

36. Which method adds a new column to a DataFrame?

A) append()
B) insert()
C) assign()
D) None of the above
Answer: C
37. What does the head() method do?
A) Shows the first few rows of a DataFrame
B) Deletes rows
C) Sorts rows
D) Merges two DataFrames
Answer: A
38. Which method removes missing values from a DataFrame?
A) drop()
B) dropna()
C) fillna()
D) None of the above
Answer: B
39. How do you sort a DataFrame by a column?
A) sort()
B) sort_values()
C) arrange()
D) None of the above
Answer: B
40. Which method provides a summary of statistics for a DataFrame?
A) describe()
B) info()
C) summary()
D) stats()
Answer: A

Unit IV

1. Data Cleaning and Preparation

1. What is the primary goal of data cleaning?

A) Data modeling
B) Remove inconsistencies and errors
C) Predict future outcomes
D) Visualize data
Answer: B
2. Which of the following is NOT part of data preparation?
A) Data transformation
B) Model evaluation
C) Removing duplicates
D) Handling missing values
Answer: B
3. What is data normalization?
A) Removing duplicates
B) Converting data to a uniform scale
C) Identifying outliers
D) None of the above
Answer: B
4. Which method is commonly used for text data cleaning?
A) Encoding
B) Tokenization
C) Visualization
D) Regression
Answer: B
5. What is the process of reducing a dataset's dimensionality called?
A) Data cleaning
B) Feature selection
C) Data wrangling
D) Data scaling
Answer: B

2. Handling Missing Data

6. What is the simplest way to handle missing data?

A) Replace with zeros
B) Remove rows/columns with missing values
C) Predict missing values
D) All of the above
Answer: D
7. Which method in pandas removes rows with missing values?
A) drop()
B) dropna()
C) fillna()
D) replace()
Answer: B
8. What does the fillna() method in pandas do?
A) Drops missing values
B) Fills missing values
C) Detects missing values
D) Removes duplicates
Answer: B
9. Which method in pandas can interpolate missing values?
A) fillna()
B) interpolate()
C) replace()
D) dropna()
Answer: B
10. What is imputation in the context of handling missing data?
A) Removing duplicates
B) Replacing missing values with statistical measures
C) Normalizing data
D) Creating new features
Answer: B

3. Data Transformations (pandas and sklearn)

11. What does the apply() function in pandas do?

A) Filters rows
B) Applies a function to a DataFrame or Series
C) Removes duplicates
D) Converts data types
Answer: B
12. Which sklearn function is used to standardize features?
A) MinMaxScaler
B) StandardScaler
C) LabelEncoder
D) OneHotEncoder
Answer: B
13. What is the range of values after using MinMaxScaler?
A) -1 to 1
B) 0 to 1
C) No fixed range
D) None of the above
Answer: B
14. Which sklearn class is used for encoding categorical variables?
A) LabelEncoder
B) OneHotEncoder
C) Both A and B
D) None of the above
Answer: C
15. Which pandas method is used for renaming columns?
A) rename()
B) rename_columns()
C) reindex()
D) None of the above
Answer: A

4. Removing Duplicates

16. How do you remove duplicate rows in pandas?

A) drop_duplicates()
B) remove_duplicates()
C) delete_duplicates()
D) None of the above
Answer: A
17. What is the default behavior of drop_duplicates() in pandas?
A) Removes all duplicates
B) Removes the first occurrence of a duplicate
C) Keeps the first occurrence and removes the rest
D) Does not remove any rows
Answer: C
18. Which parameter in drop_duplicates() specifies columns to check for
duplicates?
A) subset
B) columns
C) check_cols
D) None of the above
Answer: A
19. What does inplace=True do in pandas methods?
A) Creates a new DataFrame
B) Modifies the original DataFrame
C) Copies the DataFrame
D) None of the above
Answer: B
20. How can you detect duplicate rows in a DataFrame?
A) duplicated()
B) is_duplicate()
C) check_duplicates()
D) None of the above
Answer: A

5. Replacing Values

21. Which pandas method is used to replace specific values?

A) replace()
B) update()
C) fillna()
D) None of the above
Answer: A
22. How do you replace all occurrences of 10 with 0 in a DataFrame?
A) df.replace(10, 0)
B) df.fillna(10, 0)
C) df.drop(10, 0)
D) df.update(10, 0)
Answer: A
23. Which parameter in replace() allows replacing values with a dictionary?
A) mapping
B) dict
C) to_replace
D) None of the above
Answer: C
24. What does the regex=True option in replace() enable?
A) Regex-based value matching
B) String replacement only
C) Numerical replacement only
D) None of the above
Answer: A
25. Can replace() work on both rows and columns?
A) Yes
B) No
C) Only rows
D) Only columns
Answer: A

6. Detecting Outliers

26. Which plot is most commonly used to detect outliers?

A) Box plot
B) Histogram
C) Scatter plot
D) Line plot
Answer: A
27. What is the IQR (Interquartile Range)?
A) Q1 - Q3
B) Q3 - Q1
C) Mean of the dataset
D) None of the above
Answer: B
28. Which formula is used to detect outliers based on IQR?
A) Values < Q1 - 1.5 * IQR or > Q3 + 1.5 * IQR
B) Values < Q1 - IQR or > Q3 + IQR
C) Values > Mean + Std. Dev
D) None of the above
Answer: A
29. Which library provides the IsolationForest algorithm for detecting outliers?
A) sklearn
B) pandas
C) matplotlib
D) seaborn
Answer: A
30. What does a Z-score measure in outlier detection?
A) Distance from the mean in terms of standard deviations
B) Distance from the median
C) Difference between two values
D) None of the above
Answer: A

7. Data Visualization

31. Which library is used for creating static, interactive, and animated
visualizations?
A) matplotlib
B) seaborn
C) pandas
D) sklearn
Answer: A
32. Which seaborn function is used to create pair plots?
A) pairplot()
B) scatterplot()
C) boxplot()
D) lineplot()
Answer: A
33. What type of plot is best for visualizing data distribution?
A) Histogram
B) Scatter plot
C) Line plot
D) Bar plot
Answer: A
34. Which plot visualizes the relationship between two variables?
A) Scatter plot
B) Histogram
C) Box plot
D) Pie chart
Answer: A
35. Which seaborn function creates a heatmap?
A) heatmap()
B) barplot()
C) scatterplot()
D) None of the above
Answer: A

8. Scatter Plot

36. What does a scatter plot show?

A) Relationships between two variables
B) Data distribution
C) Outliers only
D) None of the above
Answer: A
37. Which function creates a scatter plot in matplotlib?
A) plt.scatter()
B) plt.plot()
C) plt.scatterplot()
D) plt.line()
Answer: A

9. Line Plot

38. Which function is used to plot a line graph in matplotlib?

A) plt.plot()
B) plt.line()
C) plt.scatter()
D) None of the above
Answer: A

Unit V
1. Supervised Learning: Basics

1. What is supervised learning?

A) Training a model with labeled data
B) Training a model with unlabeled data
C) Reinforcement through trial and error
D) None of the above
Answer: A
2. Which of the following is NOT an example of supervised learning?
A) Linear regression
B) Clustering
C) Decision tree
D) Logistic regression
Answer: B
3. What are the two main categories of supervised learning?
A) Regression and clustering
B) Classification and regression
C) Clustering and classification
D) Regression and reinforcement learning
Answer: B

2. Regression

4. What is the primary goal of regression?

A) Predict continuous values
B) Classify data into categories
C) Identify clusters in data
D) Reinforce learning from past actions
Answer: A
5. Which metric is commonly used to evaluate regression models?
A) Accuracy
B) Mean Squared Error (MSE)
C) Precision
D) Recall
Answer: B
6. In regression, the line that minimizes the sum of squared errors is called the:
A) Decision boundary
B) Regression line
C) Margin
D) Hyperplane
Answer: B
7. Which algorithm is commonly used for regression problems?
A) K-means
B) Linear regression
C) Naïve Bayes
D) DBSCAN
Answer: B
8. The slope in a simple linear regression equation represents:
A) Intercept
B) Rate of change in the dependent variable
C) Sum of squared errors
D) None of the above
Answer: B

3. Classification

9. What is the primary goal of classification?

A) Predict categories or labels
B) Predict continuous values
C) Identify clusters
D) Reinforce learning from actions
Answer: A
10. Which algorithm is used for binary classification?
A) Logistic regression
B) K-means
C) DBSCAN
D) PCA
Answer: A
11. Which of the following is a classification metric?
A) R-squared
B) Confusion matrix
C) Mean Absolute Error
D) Sum of squares
Answer: B
12. Which algorithm assumes conditional independence of features?
A) Decision tree
B) Random forest
C) Naïve Bayes
D) K-Nearest Neighbor
Answer: C

4. Linear Regression

13. What is the assumption in linear regression about the relationship between
variables?
A) Non-linear
B) Linear
C) Polynomial
D) Exponential
Answer: B
14. What is the cost function used in linear regression?
A) Entropy
B) Mean Squared Error (MSE)
C) Log loss
D) Gini impurity
Answer: B
15. What does multicollinearity refer to in linear regression?
A) High correlation between independent variables
B) High correlation between dependent and independent variables
C) Low correlation between all variables
D) No correlation between variables
Answer: A

5. Logistic Regression

16. What is the primary use of logistic regression?

A) Regression tasks
B) Classification tasks
C) Clustering tasks
D) Reinforcement tasks
Answer: B
17. Which function does logistic regression use to predict probabilities?
A) Linear function
B) Sigmoid function
C) Polynomial function
D) Exponential function
Answer: B
18. What is the range of predicted values in logistic regression?
A) -∞ to ∞
B) 0 to 1
C) -1 to 1
D) None of the above
Answer: B
19. Which loss function is used in logistic regression?
A) Mean Squared Error
B) Log loss (Cross-Entropy)
C) Gini impurity
D) Entropy
Answer: B

6. Decision Tree

20. What is a decision tree?

A) A clustering algorithm
B) A model that splits data based on feature values
C) A reinforcement learning algorithm
D) None of the above
Answer: B
21. What is the purpose of entropy in a decision tree?
A) Measure information gain
B) Identify clusters
C) Perform regression analysis
D) None of the above
Answer: A
22. Which algorithm is used to build decision trees using entropy?
A) Random Forest
B) ID3
C) KNN
D) Naïve Bayes
Answer: B
23. Information gain measures:
A) Reduction in entropy after a split
B) Increase in variance after a split
C) Distance between clusters
D) None of the above
Answer: A

7. Random Forest

24. What is random forest?

A) A single decision tree
B) An ensemble of decision trees
C) A clustering algorithm
D) None of the above
Answer: B
25. Which method is used to combine multiple decision trees in random forest?
A) Averaging for regression, voting for classification
B) Clustering
C) Bagging
D) Boosting
Answer: A

8. K-Nearest Neighbors (KNN)

26. What is the main idea of KNN?

A) Identify the nearest neighbors and classify based on majority voting
B) Build a tree structure
C) Perform regression analysis
D) None of the above
Answer: A
27. K in KNN represents:
A) Number of clusters
B) Number of neighbors to consider
C) Number of classes
D) None of the above
Answer: B
28. Which distance metric is commonly used in KNN?
A) Manhattan
B) Euclidean
C) Minkowski
D) All of the above
Answer: D

9. Unsupervised Learning: Clustering

29. What is the main goal of clustering?

A) Group similar data points
B) Predict future outcomes
C) Perform classification
D) None of the above
Answer: A
30. Which of the following is NOT a clustering algorithm?
A) K-means
B) DBSCAN
C) Random Forest
D) Agglomerative Clustering
Answer: C
31. Which parameter specifies the number of clusters in K-means?
A) k
B) max_iter
C) epsilon
D) None of the above
Answer: A

10. Reinforcement Learning

32. What is the primary goal of reinforcement learning?

A) Find an optimal policy to maximize cumulative reward
B) Classify data into categories
C) Reduce dimensionality
D) None of the above
Answer: A
33. In reinforcement learning, the agent interacts with:
A) The environment
B) A supervisor
C) Labeled data
D) None of the above
Answer: A
34. Which algorithm is commonly used in reinforcement learning?
A) Q-learning
B) Linear regression
C) K-means
D) Logistic regression
Answer: A

MCQ Interview Questions
No ratings yet
MCQ Interview Questions
16 pages
PDS Bits
No ratings yet
PDS Bits
6 pages
Top 50 Python Interview Questions
No ratings yet
Top 50 Python Interview Questions
8 pages
Data Science QnA
No ratings yet
Data Science QnA
15 pages
Day 1-Quiz
No ratings yet
Day 1-Quiz
7 pages
Python Programming
No ratings yet
Python Programming
9 pages
Computer Programming
No ratings yet
Computer Programming
21 pages
Final Exam Reviewer
No ratings yet
Final Exam Reviewer
10 pages
Python 1
No ratings yet
Python 1
18 pages
UNIT 3 Python ProgrammingExtra
No ratings yet
UNIT 3 Python ProgrammingExtra
7 pages
CT-3 QB
No ratings yet
CT-3 QB
12 pages
Python Interview QA DataScience GenAI
No ratings yet
Python Interview QA DataScience GenAI
4 pages
Python and Libraries For AI
No ratings yet
Python and Libraries For AI
34 pages
Check Your Progress 5
No ratings yet
Check Your Progress 5
7 pages
UNIT 3 Python Programming
No ratings yet
UNIT 3 Python Programming
7 pages
AI Viva Questions for Class 10
No ratings yet
AI Viva Questions for Class 10
3 pages
Final Ga
No ratings yet
Final Ga
152 pages
Python For Data Science PDF
100% (10)
Python For Data Science PDF
30 pages
Computer Science: Supporting Material
No ratings yet
Computer Science: Supporting Material
54 pages
ML Lab Viva
No ratings yet
ML Lab Viva
6 pages
Python Quizzes for Beginners
No ratings yet
Python Quizzes for Beginners
29 pages
Programming in Python Mcqs With Answers (2022) CSD
No ratings yet
Programming in Python Mcqs With Answers (2022) CSD
63 pages
Who Is The Developer of Python
No ratings yet
Who Is The Developer of Python
12 pages
2 Marks
No ratings yet
2 Marks
5 pages
Exam Questions Computer Programming I
No ratings yet
Exam Questions Computer Programming I
14 pages
APL Mن
No ratings yet
APL Mن
18 pages
100 MCQs On Python, NumPy, and Pandas For Amex Onl
No ratings yet
100 MCQs On Python, NumPy, and Pandas For Amex Onl
23 pages
IP - Computer Science - MCQs - SW - 2 - Q + Soln
No ratings yet
IP - Computer Science - MCQs - SW - 2 - Q + Soln
7 pages
Python Basics for Beginners
No ratings yet
Python Basics for Beginners
9 pages
Python Important
No ratings yet
Python Important
35 pages
3 Python MCQs Exam
No ratings yet
3 Python MCQs Exam
1 page
4-Python Programming Mcqs With Answers PDF
No ratings yet
4-Python Programming Mcqs With Answers PDF
43 pages
Bridge Course QP2 19.04
No ratings yet
Bridge Course QP2 19.04
3 pages
End Module A Mock Questions
No ratings yet
End Module A Mock Questions
28 pages
Namma Kalvi 12th Computer Science 1 Mark Question Bank em 217866
No ratings yet
Namma Kalvi 12th Computer Science 1 Mark Question Bank em 217866
18 pages
Final Question
No ratings yet
Final Question
28 pages
Sample Questions
No ratings yet
Sample Questions
16 pages
Python 2025 Mcqs
No ratings yet
Python 2025 Mcqs
56 pages
Sample Paper 14 IP
No ratings yet
Sample Paper 14 IP
9 pages
Sec A Question Paper
No ratings yet
Sec A Question Paper
4 pages
12CS em MLM
No ratings yet
12CS em MLM
41 pages
Complete Python Questions With Answers
No ratings yet
Complete Python Questions With Answers
13 pages
More Mcqs
No ratings yet
More Mcqs
14 pages
DATASCIENCE (Unit-1) Question Bank
No ratings yet
DATASCIENCE (Unit-1) Question Bank
6 pages
12 CS em
No ratings yet
12 CS em
15 pages
12th CS MLM
No ratings yet
12th CS MLM
51 pages
Python MCQs
No ratings yet
Python MCQs
70 pages
+2 CS One Mark RM
No ratings yet
+2 CS One Mark RM
12 pages
Kunci Jawaban PYTHON
No ratings yet
Kunci Jawaban PYTHON
6 pages
Final Exam Data Mining and Machine Learning
No ratings yet
Final Exam Data Mining and Machine Learning
5 pages
Python Programming MCQs for BCA Students
No ratings yet
Python Programming MCQs for BCA Students
20 pages
Lecture 2 Exercises
No ratings yet
Lecture 2 Exercises
4 pages
Python Exam Paper Solve
No ratings yet
Python Exam Paper Solve
7 pages
Python MCQ 100
No ratings yet
Python MCQ 100
15 pages
Python MCQ Set of All Units
No ratings yet
Python MCQ Set of All Units
23 pages
Week 1
No ratings yet
Week 1
7 pages
Iot (Internet of Things) Advanced Top Objectives For Practice
No ratings yet
Iot (Internet of Things) Advanced Top Objectives For Practice
31 pages
PDSC Few Questions Answers 2020
No ratings yet
PDSC Few Questions Answers 2020
36 pages
Python English. - January 2023
No ratings yet
Python English. - January 2023
19 pages
Seminar On TOPIC Data Science For Health Care
No ratings yet
Seminar On TOPIC Data Science For Health Care
13 pages
Technohacks Internship Report
No ratings yet
Technohacks Internship Report
22 pages
127+ Data Science Projects With Python Code.
No ratings yet
127+ Data Science Projects With Python Code.
9 pages
Google Product Analyst Prep Guide
No ratings yet
Google Product Analyst Prep Guide
7 pages
Aspiring Data Scientist's Journey
No ratings yet
Aspiring Data Scientist's Journey
2 pages
MS Datascience Worksheet
No ratings yet
MS Datascience Worksheet
4 pages
Timetable For B.Tech III II R20 Supple Nov Dec 2024 Exams
No ratings yet
Timetable For B.Tech III II R20 Supple Nov Dec 2024 Exams
9 pages
Yahia Omar Data Engineering CV
No ratings yet
Yahia Omar Data Engineering CV
2 pages
DS G (8) - Staff Data Scientist
No ratings yet
DS G (8) - Staff Data Scientist
4 pages
A Functional Approach To Basics of Data Science With Excel-Book - Chapter 1 and 2 - 1st Print
No ratings yet
A Functional Approach To Basics of Data Science With Excel-Book - Chapter 1 and 2 - 1st Print
13 pages
Big - Data Unit-2
100% (2)
Big - Data Unit-2
64 pages
DSA2324 Lecture 01 Introduction To Data Science
No ratings yet
DSA2324 Lecture 01 Introduction To Data Science
96 pages
Manpower Planning Final
No ratings yet
Manpower Planning Final
11 pages
DS&ML 1
No ratings yet
DS&ML 1
9 pages
Healthcare Data Scientist Expertise
No ratings yet
Healthcare Data Scientist Expertise
2 pages
Strategic Intelligence Boosts Donor Projects
No ratings yet
Strategic Intelligence Boosts Donor Projects
12 pages
zdEDM98PSACRAzPfD7gARA - Data Driven Decisions With Power BI - Coursera Readings Knowledge Accelerators
No ratings yet
zdEDM98PSACRAzPfD7gARA - Data Driven Decisions With Power BI - Coursera Readings Knowledge Accelerators
30 pages
Database ABES Faculty
No ratings yet
Database ABES Faculty
33 pages
Assignment 1 - Big Data in Big Companies
No ratings yet
Assignment 1 - Big Data in Big Companies
5 pages
Deena Christina CV
No ratings yet
Deena Christina CV
3 pages
ALX Milestone 5: Personal Growth & Community Issues
No ratings yet
ALX Milestone 5: Personal Growth & Community Issues
13 pages
Data Analytics (Da) by I Tech World
No ratings yet
Data Analytics (Da) by I Tech World
65 pages
Data Science Techniques, Tools and Predictions: March 2020
No ratings yet
Data Science Techniques, Tools and Predictions: March 2020
9 pages
DMM - Presentation - Edgar Ruiz - 3484
No ratings yet
DMM - Presentation - Edgar Ruiz - 3484
13 pages
ZG536 L1 Introduction 140124
No ratings yet
ZG536 L1 Introduction 140124
18 pages
DLC Catalogue
No ratings yet
DLC Catalogue
16 pages
Fdsa Unit 1
No ratings yet
Fdsa Unit 1
25 pages
Energies 16 04025
No ratings yet
Energies 16 04025
31 pages
DSUP Chapter 1 PDF
No ratings yet
DSUP Chapter 1 PDF
31 pages
TT 2
No ratings yet
TT 2
14 pages

Sample MCQ Questions

Uploaded by

Sample MCQ Questions

Uploaded by

UNIT I

1. Need for Data Science

1. What is the primary purpose of data science?

2. Benefits and Uses of Data Science

6. How does data science benefit organizations?

11. What are the facets of data?

4. Data Science Process

16. What is the first step in the data science process?

21. Which file extension is used for Python scripts?

6. Setting Working Directory

29. How do you execute a Python script?

31. How do you delete a variable in Python?

9. Commenting Script Files

33. How do you write single-line comments in Python?

10. Data Types and Operators

34. Which is a numeric data type in Python?

1. What is the purpose of control structures in programming?

6. Which keyword is used to terminate a loop prematurely?

11. Which keyword is used to define a function in Python?

16. Which data structure is mutable in Python?

21. What is the primary purpose of the NumPy library?

25. What is the difference between a list and a NumPy array?

6. Data Collection and Types

26. What is primary data?

31. Which of the following is a step in data preprocessing?

8. Exploratory Data Analysis (EDA)

34. What is the primary goal of EDA?

1. What is the purpose of descriptive statistics?

6. How is the mean calculated?

11. What does standard deviation measure?

4. Skewness and Kurtosis

16. What does skewness measure?

21. What is the primary goal of inferential statistics?

26. What is the range of probability values?

31. What is the primary purpose of the Pandas library?

36. Which method adds a new column to a DataFrame?

1. Data Cleaning and Preparation

1. What is the primary goal of data cleaning?

2. Handling Missing Data

6. What is the simplest way to handle missing data?

3. Data Transformations (pandas and sklearn)

11. What does the apply() function in pandas do?

16. How do you remove duplicate rows in pandas?

21. Which pandas method is used to replace specific values?

26. Which plot is most commonly used to detect outliers?

36. What does a scatter plot show?

38. Which function is used to plot a line graph in matplotlib?

1. What is supervised learning?

4. What is the primary goal of regression?

9. What is the primary goal of classification?

16. What is the primary use of logistic regression?

20. What is a decision tree?

24. What is random forest?

8. K-Nearest Neighbors (KNN)

26. What is the main idea of KNN?

9. Unsupervised Learning: Clustering

29. What is the main goal of clustering?

10. Reinforcement Learning

32. What is the primary goal of reinforcement learning?

You might also like