Unit I – Introduction to Data Science (20 Qs)
Multiple Choice Questions
1. Data Science is primarily concerned with:
a) Hardware Design
b) Data Analysis and Insights
c) Network Security
d) Software Installation
Answer: b
2. Which of the following fields is closely related to Data Science?
a) Information Science
b) Civil Engineering
c) Mechanical Design
d) None
Answer: a
3. The skill that involves solving problems step by step using logic is called:
a) Experimentation
b) Computational Thinking
c) Debugging
d) Simulation
Answer: b
4. Tools like Python, R, and SQL are used in:
a) Data Science
b) Mechanical Drawing
c) Physics
d) Chemistry
Answer: a
5. Ethics in data science includes issues like:
a) Privacy
b) Bias
c) Fairness
d) All of the above
Answer: d
6. Which of the following is NOT a tool for Data Science?
a) Jupyter Notebook
b) Google Colab
c) AutoCAD
d) RStudio
Answer: c
7. The relationship between Data Science and Information Science is:
a) Independent
b) Closely related
c) No connection
d) Opposite
Answer: b
8. Which of these is a required skill for a Data Scientist?
a) Machine Learning
b) Data Visualization
c) Programming
d) All of the above
Answer: d
9. Data Science is mainly applied in:
a) Healthcare
b) Finance
c) Social Media
d) All of the above
Answer: d
10. Which programming language is most popular in Data Science?
a) Java
b) Python
c) C++
d) Ruby
Answer: b
Fill in the blanks
11. __________ is defined as extracting knowledge and insights from data.
Answer: Data Science
12. Data Science combines mathematics, statistics, and __________.
Answer: Computer Science
13. __________ is the mindset for solving problems step by step.
Answer: Computational Thinking
14. One major issue in Data Science is __________ of user information.
Answer: Privacy
15. __________ is a visualization library widely used in Python.
Answer: Matplotlib
16. __________ helps identify hidden patterns in data.
Answer: Machine Learning
17. Skills for Data Science include domain knowledge, programming, and __________.
Answer: Statistics
18. Jupyter Notebook is an example of a __________ tool.
Answer: Data Science
19. Ethics in Data Science ensures __________ use of data.
Answer: Responsible
20. __________ is considered the fuel for Data Science.
Answer: Data
Unit II – Data (20 Qs)
Multiple Choice Questions
21. Which of the following is NOT a type of data?
a) Structured
b) Semi-Structured
c) Circular
d) Unstructured
Answer: c
22. Tabular data with rows and columns is called:
a) Unstructured Data
b) Structured Data
c) Multimedia Data
d) Random Data
Answer: b
23. Images, videos, and audio files are examples of:
a) Structured Data
b) Unstructured Data
c) Open Data
d) None
Answer: b
24. Which is a challenge with unstructured data?
a) Easy to analyze
b) Hard to store and process
c) Always numeric
d) Requires no cleaning
Answer: b
25. Social media data is often:
a) Structured
b) Unstructured
c) Multimodal
d) Both b and c
Answer: d
26. Data cleaning involves:
a) Removing errors
b) Adding noise
c) Increasing storage
d) Encrypting data
Answer: a
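Illustration for question 26: a minimal sketch of removing erroneous values with NumPy. The readings and the -999.0 error code are hypothetical values chosen for the example, not data from the source.

```python
import numpy as np

# Hypothetical sensor readings: NaN marks a missing value,
# and -999.0 is an assumed error code.
readings = np.array([21.5, np.nan, 22.1, -999.0, 20.8])

# Keep only entries that are present and are not the error code.
valid = ~np.isnan(readings) & (readings != -999.0)
cleaned = readings[valid]

print(cleaned.tolist())  # [21.5, 22.1, 20.8]
```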
27. Data reduction aims to:
a) Increase data size
b) Reduce data volume but keep information
c) Delete all data
d) Ignore analysis
Answer: b
28. The process of combining data from multiple sources is called:
a) Data Discretization
b) Data Integration
c) Data Cleaning
d) Data Reduction
Answer: b
29. Which type of data includes text, images, audio, and video?
a) Structured
b) Multimodal
c) Tabular
d) None
Answer: b
30. Open data is:
a) Free to use
b) Confidential
c) Encrypted
d) Private
Answer: a
Fill in the blanks
31. __________ data is organized in rows and columns.
Answer: Structured
32. __________ data is difficult to store in traditional databases.
Answer: Unstructured
33. Data __________ is the process of removing incorrect values.
Answer: Cleaning
34. __________ is the process of converting data into a useful format.
Answer: Transformation
35. __________ is the process of converting continuous values into a small number of intervals (bins).
Answer: Discretization
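Illustration for question 35: a minimal sketch of discretization using np.digitize. The ages, the cut points (18 and 65), and the group labels are assumptions made up for the example.

```python
import numpy as np

ages = np.array([5, 17, 18, 42, 65, 80])

# Assumed cut points: below 18, 18 to 64, 65 and above.
bin_edges = np.array([18, 65])
labels = np.array(["child", "adult", "senior"])

# np.digitize returns, for each age, the index of the interval it falls in.
bin_index = np.digitize(ages, bin_edges)
groups = labels[bin_index]

print(bin_index.tolist())  # [0, 0, 1, 1, 2, 2]
print(groups.tolist())     # ['child', 'child', 'adult', 'adult', 'senior', 'senior']
```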
36. __________ is often collected from social media platforms.
Answer: User-Generated Data
37. __________ is the process of saving and organizing data so that it is available for analysis.
Answer: Data Storage
38. __________ data is publicly available for everyone.
Answer: Open
39. Data __________ is needed before analysis for accuracy.
Answer: Preprocessing
40. __________ data is a combination of multiple formats like text, image, and video.
Answer: Multimodal
Unit III – NumPy Basics (First Half Only)
Multiple Choice Questions (10)
41. NumPy’s core data structure for numerical computing is:
a) list
b) tuple
c) ndarray
d) dict
Answer: c
42. Which function is used to create a NumPy array from a Python list?
a) np.create()
b) np.array()
c) np.ndarray()
d) np.list()
Answer: b
43. NumPy arrays are:
a) Mutable
b) Immutable
c) Only one-dimensional
d) None
Answer: a
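Illustration for questions 42 and 43: a short sketch showing that np.array() builds an ndarray from a Python list and that the resulting array can be modified in place. The variable names are chosen for the example.

```python
import numpy as np

values = [1, 2, 3]
arr = np.array(values)        # copy the list into a new ndarray

print(type(arr).__name__)     # ndarray

arr[0] = 10                   # arrays are mutable: assignment works in place
print(arr.tolist())           # [10, 2, 3]
print(values)                 # [1, 2, 3]  (the original list is unchanged)
```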
44. The attribute that gives the shape of a NumPy array is:
a) arr.size
b) arr.shape
c) arr.ndim
d) arr.length
Answer: b
45. Which function can be used to create an array of evenly spaced values?
a) np.linspace()
b) np.arange()
c) Both a and b
d) None
Answer: c
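Illustration for question 45: both functions produce evenly spaced values, but np.arange() takes a step size and excludes the stop value, while np.linspace() takes a number of points and includes the stop value by default. The ranges below are arbitrary examples.

```python
import numpy as np

# arange: start, stop (excluded), step
by_step = np.arange(0, 10, 2)
print(by_step.tolist())    # [0, 2, 4, 6, 8]

# linspace: start, stop (included), number of points
by_count = np.linspace(0, 1, 5)
print(by_count.tolist())   # [0.0, 0.25, 0.5, 0.75, 1.0]
```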
46. The default data type of arrays created by np.zeros() or np.ones() is:
a) int
b) float64
c) object
d) str
Answer: b
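Note on question 46: float64 is the default for creation functions such as np.zeros() and np.ones() and for lists of floats, whereas np.array() called on a list of Python integers infers an integer dtype. A short sketch to check this:

```python
import numpy as np

zeros = np.zeros(3)
from_floats = np.array([1.0, 2.0])
from_ints = np.array([1, 2])

print(zeros.dtype)        # float64
print(from_floats.dtype)  # float64
print(from_ints.dtype)    # an integer dtype (int64 on most platforms)
```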
47. Operations between NumPy arrays and scalars are applied:
a) Element-wise
b) Row-wise
c) Column-wise
d) Randomly
Answer: a
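Illustration for question 47: a scalar operand is applied to every element of the array. The array contents are arbitrary.

```python
import numpy as np

arr = np.array([[1, 2], [3, 4]])

doubled = arr * 2      # every element is multiplied by 2
shifted = arr + 0.5    # every element is increased by 0.5

print(doubled.tolist())  # [[2, 4], [6, 8]]
print(shifted.tolist())  # [[1.5, 2.5], [3.5, 4.5]]
```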
48. Which indexing method allows selection using True/False arrays?
a) Basic Indexing
b) Boolean Indexing
c) Fancy Indexing
d) Slice Indexing
Answer: b
49. Which indexing method allows using lists or arrays of indices?
a) Boolean
b) Fancy
c) Slice
d) None
Answer: b
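Illustration for questions 48 and 49: a sketch contrasting boolean indexing (a True/False mask) with fancy indexing (a list of integer positions). The scores and the pass mark of 60 are made-up values.

```python
import numpy as np

scores = np.array([55, 82, 47, 91, 68])

# Boolean indexing: the comparison yields a True/False array used as a mask.
mask = scores >= 60
passed = scores[mask]
print(mask.tolist())    # [False, True, False, True, True]
print(passed.tolist())  # [82, 91, 68]

# Fancy indexing: a list of positions, which may repeat and be in any order.
picked = scores[[0, 3, 3]]
print(picked.tolist())  # [55, 91, 91]
```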
50. Which method returns the transpose of a NumPy array?
a) arr.reverse()
b) arr.transpose()
c) arr.swap()
d) arr.reshape()
Answer: b
Fill in the blanks (10)
51. The abbreviation NumPy stands for __________.
Answer: Numerical Python
52. The function __________ is used to create arrays filled with zeros.
Answer: np.zeros()
53. The function __________ is used to create arrays filled with ones.
Answer: np.ones()
54. The attribute __________ gives the number of dimensions of an array.
Answer: ndim
55. The attribute __________ gives the total number of elements in an array.
Answer: size
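Illustration for questions 52 to 55: a sketch that creates arrays with np.zeros() and np.ones() and inspects their ndim, size, and shape attributes. The shapes are arbitrary examples.

```python
import numpy as np

grid = np.zeros((2, 3))   # 2 rows, 3 columns, all 0.0
row = np.ones(4)          # one dimension, four elements, all 1.0

print(grid.ndim)   # 2
print(grid.size)   # 6
print(grid.shape)  # (2, 3)
print(row.ndim)    # 1
print(row.size)    # 4
```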
56. Slicing in NumPy uses the format __________.
Answer: [start:stop:step]
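Illustration for question 56: a sketch of the [start:stop:step] format on a one-dimensional array. The stop index is excluded, and omitted parts fall back to their defaults.

```python
import numpy as np

arr = np.arange(10)   # 0, 1, ..., 9

every_other = arr[2:8:2]   # start at 2, stop before 8, step 2
first_three = arr[:3]      # start and step omitted
backwards = arr[::-1]      # negative step reverses the array

print(every_other.tolist())  # [2, 4, 6]
print(first_three.tolist())  # [0, 1, 2]
print(backwards.tolist())    # [9, 8, 7, 6, 5, 4, 3, 2, 1, 0]
```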
57. Boolean indexing in NumPy uses __________ conditions.
Answer: Logical (True/False)
58. Fancy indexing allows indexing using __________ of integers.
Answer: Arrays/lists
59. The method __________ swaps two axes of an array.
Answer: swapaxes()
60. The shorthand property for transposing an array is __________.
Answer: arr.T
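Illustration for questions 50, 59, and 60: a sketch comparing transpose(), the .T shorthand, and swapaxes(). The array shapes are arbitrary examples.

```python
import numpy as np

arr = np.arange(6).reshape(2, 3)   # shape (2, 3)

# transpose() and .T give the same result for a 2-D array.
print(arr.transpose().shape)                   # (3, 2)
print(np.array_equal(arr.T, arr.transpose()))  # True

# swapaxes() exchanges exactly the two axes that are named.
cube = np.arange(24).reshape(2, 3, 4)
swapped = cube.swapaxes(0, 2)
print(swapped.shape)                           # (4, 3, 2)
```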