0% found this document useful (0 votes)

175 views6 pages

Structured Data Classification MCQ's

The document contains multiple-choice questions (MCQs) related to structured data classification, covering topics such as hyperparameters, classification techniques, and data preprocessing. Key concepts discussed include the importance of model evaluation, handling imbalanced classes, and the use of various classifiers like Decision Trees and Naive Bayes. Additionally, it emphasizes the significance of using appropriate metrics and techniques for accurate model performance assessment.

Uploaded by

Gurram Anurag

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

175 views6 pages

Structured Data Classification MCQ's

Uploaded by

Gurram Anurag

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Join our channel if you haven’t joined yet https://t.

me/fresco_milestone ( @fresco_milestone )

Structured Data Classification MCQ's

Which of the given hyper parameter(s), when increased may cause random forest to over fit the
data?

Answer : Depth of Tree

To view the first 3 rows of the dataset, which of the following commands are used?Download the
dataset
from:https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2
f487608c537c05e22e4b221/iris.csv to answer the question.

Answer : iris.head(3)

Pruning is a technique associated with

Answer : Decision tree

High classification accuracy always indicates a good classifier.

Answer : True

Categorical variables has

Answer : no logical order

Cross-validation technique will provide accurate results when the training set and the testing set are
from two different populations.

Answer : True

Let's assume, you are solving a classification problem with highly imbalanced class. The majority
class is observed 99% of times in the training data. Which of the following is true when your model
has 99% accuracy after taking the predictions on test data. ?

Answer : For imbalanced class problems, accuracy metric is not a good idea.

Email spam detection is an example of

Answer : supervised classification

A technique used to depict the performance in a tabular form that has 2 dimensions namely “actual”
and “predicted” sets of data.

Answer : Confusion Matrix

Choose the correct sequence for classifier building from the following:

Answer : Initialize -> Train - -> Predict-->Evaluate

The commonly used package for machine learning in python is

Answer : sklearn
Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

A classifer that can compute using numeric as well as categorical values is

Answer : Decision Tree Classifier

Can we consider sentiment classification as a text classification problem?

Answer : yes

What kind of classification is the given case study(IRIS dataset)?Download the dataset from:
https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f4876
08c537c05e22e4b221/iris.csv to answer the question.

Answer : Multi class classification

Ensemble learning is used when you build component classifiers that are more accurate and
independent from each other.

Answer : true

clustering is an example of

Answer : unsupervised classification

Model Tuning helps to increase the accuracy

Answer : True

Images and documents are examples of _________

Answer : Unstructured Data

Ordinal variables has

Answer : clear logical order

Which command is used to select all NUMERIC types in the dataset.Download the dataset from:
https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f4876
08c537c05e22e4b221/iris.csv to answer the question.

Answer : iris_num = iris_data.select_dtypes(include=[numpy.number])

The number of categorical attributes in the original dataset.Download the dataset from:
https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f4876
08c537c05e22e4b221/iris.csv to answer the question.

Answer : 3

Which classifier converges easily with less training data?

Answer : Naive Bayes Classifier

Imputing is a strategy to handle

Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

Answer : Missing Values

classification where each data is mapped to more than one class is called

Answer : Binary Classification.

The fit(X, y) is used to

Answer : Train the Classifier

Supervised learning differs from unsupervised learning as supervised learning requires __________

Answer : Labeled data

Clustering is a supervised classification.

Answer : False

Select the correct option which directly achieve multi-class classification (without support of binary
classifiers).

Answer : K Nearest Neighbor

The classification where each data is mapped to more than one class is called ___________

Answer : Multi Label Classification

Email spam data is an example of __________

Answer : unstructed Data

The most widely used package for machine learning in Python is _________

Answer : sklearn

Pruning is a technique associated with __________

Answer : dt

What does the command sentiment_analysis_data['label'].value_counts() return?

Answer : counts of unique values in the 'label' column

Select the pre-processing technique(s) from the following.

Answer : all

Which of the given hyper parameter, when increased, may cause random forest to over fit the data?

Answer : depth of tree

Select the correct statement about Nonlinear classification.

Answer : Kernel tricks are used by Nonlinear classifiers to achieve maximum-margin hyperplanes.
Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

Choose the correct sequence for classifier building from the following.

Answer : Initialize -> Train - -> Predict-->Evaluate

What command should be given to tokenize a sentence into words?

Answer : from nltk.tokenize import word_tokenize, Word_tokens =word_tokenize(sentence)

Choose the correct sequence from the following.

Answer : Data Analysis -> PreProcessing -> Model Building--> Predict

The following are all classification techniques, except ___________

Answer : StratifiedShuffleSplit

The commonly used package for machine learning in python is

Answer : sklearn

How many new columns does the following command return?

Answer : iris_series = pd.get_dummies(iris['Species'])

Download the dataset from:

https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f4876
08c537c05e22e4b221/iris.csv to answer the question.

Answer : 3

Identify the command used to view the dataset SIZE and what is the value returned?Download the
dataset from:
https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f4876
08c537c05e22e4b221/iris.csv to answer the question.

Answer : iris.shape,(150,6) (Incorrect)

Which type of cross validation is used for imbalanced dataset?

Answer : K fold

To view the first 3 rows of the dataset, which of the following commands are used?Download the
dataset from:
https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f4876
08c537c05e22e4b221/iris.csv to answer the question.

Answer : iris.head(3)

Naive Bayes Algorithm is useful for :

Answer : indepth analysis

Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

A process used to identify data points that are simply unusual

Answer : Anomaly Detection

Is there a class imbalance problem in the given data set?

Answer : no

Which of the following is not a technique to process missing values?

Answer : One hot encoding

Images,documents are examples of

Answer : Unstructured Data

email spam detection is an example of

Answer : The count with unique values in the iris['species'] column

Choose the correct sequence for classifier building from the following:

Answer : Initialize -> Train -> Predict -> Evaluate

Imagine you have just finished training a decision tree for spam classication and it is showing
abnormal bad performance on both your training and test sets. Assume that your implementation
has no bugs. What could be reason for this problem.

Answer : All

Identify the structured data from the following.

Answer : Data from mySQL DB and Excel

True Negative is when the predicted instance and the actual is positive.

Answer : False

What does the command iris['species'].value_counts() return?Download the dataset

fromhttps://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f
487608c537c05e22e4b221/iris.csv to answer the question.

Answer : The count with unique values in the iris['species'] column

A process used to identify unusual data points is _________

Answer : Anomaly Detection

The following are techniques to process missing values, except _______

Answer : of the options

Join our channel if you haven’t joined yet https://t.me/fresco_milestone ( @fresco_milestone )

How many classes will the following command return?(target classes in the dataset) :
classes=list(iris['species'].unique())Download the dataset
fromhttps://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f
487608c537c05e22e4b221/iris.csv to answer the question.

Answer : 3

Cross-validation causes over-fitting.

Answer : False

True Positive is when the predicted instance and the actual instance is not negative.

Answer : True

What kind of classification is our case study 'Churn Analysis'?

Answer : Binary

Which command is used to identify the unique values of a column?

Answer : unique()

Which preprocessing technique is used to make the data gaussian with zero mean and unit variance?

Answer : Standardisation

Cross-validation technique is used to evaluate a classifier by dividing the data set into training set to
train the classifier and testing set to test the same.

Answer : True

What are the advantages of Naive Bayes?

Answer : Both the options

What kind of classification is the given case study (Iris dataset)?Download the dataset
fromhttps://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f
487608c537c05e22e4b221/iris.csv to answer the question.

Answer : Binary classification (Incorrect)

Let's assume you are solving a classification problem with a highly imbalanced class.The majority
class is observed 99% of the time in the training data.Which of thefollowing is true when your model
has 99% accuracy after taking the predictions on test data?

Answer : For imbalanced class problems, the accuracy metric is not a good idea.

The cross-validation technique will provide accurate results when the training set and the testing set
are from two different populations.

Answer : False

Structured Data Classification
No ratings yet
Structured Data Classification
3 pages
Unstructured Data Classification Guide
No ratings yet
Unstructured Data Classification Guide
5 pages
Machine Learning Suggestion (2 Marks) MCQ
No ratings yet
Machine Learning Suggestion (2 Marks) MCQ
5 pages
ML MCQs Set
No ratings yet
ML MCQs Set
18 pages
Machine Learning Multiple Choice Questions - Free Practice Test
100% (1)
Machine Learning Multiple Choice Questions - Free Practice Test
12 pages
Sem3 Asmt Answers
No ratings yet
Sem3 Asmt Answers
20 pages
ML MCQ Questions and Answer PDF
No ratings yet
ML MCQ Questions and Answer PDF
10 pages
This Sheet Is For 1 Mark Questions S.R No
100% (1)
This Sheet Is For 1 Mark Questions S.R No
69 pages
MCQ of Machine Learning
100% (2)
MCQ of Machine Learning
151 pages
Data Science in FInancial Services - 3
No ratings yet
Data Science in FInancial Services - 3
76 pages
Machine Learning & AI Quiz Answers
No ratings yet
Machine Learning & AI Quiz Answers
15 pages
True/False and Multiple Choice Quiz on Machine Learning and Python
50% (2)
True/False and Multiple Choice Quiz on Machine Learning and Python
18 pages
End SEM V IMP DSE 2
No ratings yet
End SEM V IMP DSE 2
9 pages
This Sheet Is For 1 Mark Questions S.R No
No ratings yet
This Sheet Is For 1 Mark Questions S.R No
56 pages
Soal CISDM
No ratings yet
Soal CISDM
3 pages
This Sheet Is For 1 Mark Questions S.R No
No ratings yet
This Sheet Is For 1 Mark Questions S.R No
63 pages
Mcqs 1
No ratings yet
Mcqs 1
34 pages
Ai ML Unit 3
No ratings yet
Ai ML Unit 3
15 pages
ML Using Scikit
50% (4)
ML Using Scikit
23 pages
Classification
No ratings yet
Classification
4 pages
Scikit - Notes ML
100% (2)
Scikit - Notes ML
12 pages
Department of Computer Science & Engineering Machine Learning Quiz - I Set - I
No ratings yet
Department of Computer Science & Engineering Machine Learning Quiz - I Set - I
3 pages
Machine Learning Multiple Choice Questions
100% (1)
Machine Learning Multiple Choice Questions
20 pages
Q1-What's The Trade-Off Between Bias and Variance?
100% (1)
Q1-What's The Trade-Off Between Bias and Variance?
5 pages
Question Bank
No ratings yet
Question Bank
67 pages
Supervised Learning Classification Algorithms Comparison
No ratings yet
Supervised Learning Classification Algorithms Comparison
6 pages
Machine Learning Exam for Students
No ratings yet
Machine Learning Exam for Students
7 pages
Data Mining Classification Guide
No ratings yet
Data Mining Classification Guide
54 pages
11 W11NSE6220 - Fall 2023 - Zeng
No ratings yet
11 W11NSE6220 - Fall 2023 - Zeng
43 pages
Advanced Data Analytics Exam Questions and Answers
No ratings yet
Advanced Data Analytics Exam Questions and Answers
7 pages
DMT MCQ
No ratings yet
DMT MCQ
15 pages
ML Objectives Answers
No ratings yet
ML Objectives Answers
8 pages
Data Analytic MCQ
No ratings yet
Data Analytic MCQ
5 pages
DWM Exp3 63
No ratings yet
DWM Exp3 63
7 pages
MachineLearning MidTerm UMT Spring 2021
100% (1)
MachineLearning MidTerm UMT Spring 2021
12 pages
IT 802 ML Unit-2 Notes
No ratings yet
IT 802 ML Unit-2 Notes
19 pages
20150908-Lecture-3-Draft Asd Def HFL DFGF Lkreglker Lerg Kelr GK
No ratings yet
20150908-Lecture-3-Draft Asd Def HFL DFGF Lkreglker Lerg Kelr GK
15 pages
Machine Learning mcq1 PDF
No ratings yet
Machine Learning mcq1 PDF
10 pages
HUAWEI Final Written Exam 3333
50% (2)
HUAWEI Final Written Exam 3333
13 pages
Amlt Bca Unit-1
No ratings yet
Amlt Bca Unit-1
24 pages
Coincent - Data Science With Python Assignment
100% (2)
Coincent - Data Science With Python Assignment
23 pages
Data Science Final Mock Test
No ratings yet
Data Science Final Mock Test
47 pages
Hatdog 1.2
No ratings yet
Hatdog 1.2
18 pages
DIT865 2018 Mar Solution
No ratings yet
DIT865 2018 Mar Solution
9 pages
DATA - FA 2024 - Dist
No ratings yet
DATA - FA 2024 - Dist
85 pages
Python ML Algorithm
No ratings yet
Python ML Algorithm
30 pages
Huawei Final Written Exam 2.2 Attempts
No ratings yet
Huawei Final Written Exam 2.2 Attempts
19 pages
Machine Learning For Interviews
No ratings yet
Machine Learning For Interviews
12 pages
Machine Learning Supervised 100 MCQs With Answers
No ratings yet
Machine Learning Supervised 100 MCQs With Answers
22 pages
Lecture 11 - 09.09.24 Classification Part 1
No ratings yet
Lecture 11 - 09.09.24 Classification Part 1
51 pages
Module2 25
No ratings yet
Module2 25
119 pages
Implementation of Credit Card Fraud Detection Using Random Forest Algorithm
100% (1)
Implementation of Credit Card Fraud Detection Using Random Forest Algorithm
10 pages
Assignment 9
No ratings yet
Assignment 9
4 pages
Machine Learning MCQ
No ratings yet
Machine Learning MCQ
11 pages
DWM - END SEM LAB Questions
No ratings yet
DWM - END SEM LAB Questions
9 pages
Machine Learning and Python Quiz
No ratings yet
Machine Learning and Python Quiz
13 pages
MLP Question Bank of AI and ML and NLP
No ratings yet
MLP Question Bank of AI and ML and NLP
7 pages
Hyperledger Fabric
No ratings yet
Hyperledger Fabric
2 pages
Internet of Things Prime
No ratings yet
Internet of Things Prime
4 pages
Crypto
No ratings yet
Crypto
2 pages
Frescoplay Internet of Things Internet of Things Prime
No ratings yet
Frescoplay Internet of Things Internet of Things Prime
2 pages
Web Control Room Assessment
No ratings yet
Web Control Room Assessment
3 pages
Purview of Icon Design 2
No ratings yet
Purview of Icon Design 2
1 page
T Factor Software Defined Networking Answers
No ratings yet
T Factor Software Defined Networking Answers
4 pages
Unstructured Data Classification
No ratings yet
Unstructured Data Classification
2 pages
Unittest
No ratings yet
Unittest
5 pages
ANOVA Single & Two-Factor Analysis
No ratings yet
ANOVA Single & Two-Factor Analysis
6 pages
Regression Analysis - VCE Further Mathematics
No ratings yet
Regression Analysis - VCE Further Mathematics
5 pages
ST350 NCSU Practice Problems Final Exam
No ratings yet
ST350 NCSU Practice Problems Final Exam
13 pages
Lecture 21 Sarima Modeling
No ratings yet
Lecture 21 Sarima Modeling
28 pages
Answer Any Two Full Questions, Each Carries 15 Marks.: Reg No.: - Name
No ratings yet
Answer Any Two Full Questions, Each Carries 15 Marks.: Reg No.: - Name
2 pages
STAT 443 Project
No ratings yet
STAT 443 Project
19 pages
Midterm 2022 Sol
No ratings yet
Midterm 2022 Sol
7 pages
Course - Syllabus - 2024 WAY - ECO3104-11 - ECONOMETRICS (1) - SEOKJOO ANDREW CHANG
No ratings yet
Course - Syllabus - 2024 WAY - ECO3104-11 - ECONOMETRICS (1) - SEOKJOO ANDREW CHANG
2 pages
Violation of The Classical Assumptions
No ratings yet
Violation of The Classical Assumptions
25 pages
03 x03 Activity Costs WP
No ratings yet
03 x03 Activity Costs WP
21 pages
Instrumental Variables & 2SLS Guide
No ratings yet
Instrumental Variables & 2SLS Guide
28 pages
Explain The Linear Regression Algorithm in Detail
No ratings yet
Explain The Linear Regression Algorithm in Detail
12 pages
Multilevel Analysis For Applied Research It S Just Regression Methodology in The Social Sciences 1st Edition Robert Bickel Download
100% (6)
Multilevel Analysis For Applied Research It S Just Regression Methodology in The Social Sciences 1st Edition Robert Bickel Download
71 pages
Handout - BITS-F464 - Machine - Learning - August 2019
No ratings yet
Handout - BITS-F464 - Machine - Learning - August 2019
4 pages
GEC3 Assignment 5 PDF
No ratings yet
GEC3 Assignment 5 PDF
5 pages
Uji SPSS Rita
No ratings yet
Uji SPSS Rita
3 pages
Jurnal 1
No ratings yet
Jurnal 1
10 pages
Slides On ARIMA Models - Robert Nau
No ratings yet
Slides On ARIMA Models - Robert Nau
21 pages
Econo Mid-Term Exam
No ratings yet
Econo Mid-Term Exam
4 pages
Tutorial 8
No ratings yet
Tutorial 8
2 pages
Experimental Design Report 2024 Div C
No ratings yet
Experimental Design Report 2024 Div C
11 pages
UNIT-5 Detailed Notes
No ratings yet
UNIT-5 Detailed Notes
50 pages
Fotosinteza, Pigmenti Asimilatori
No ratings yet
Fotosinteza, Pigmenti Asimilatori
5 pages
Assignment 2
No ratings yet
Assignment 2
2 pages
Lecture 12 - Adv. Correlation and Multiple Regression
No ratings yet
Lecture 12 - Adv. Correlation and Multiple Regression
32 pages
VAR Slides
No ratings yet
VAR Slides
12 pages
Quantitative Techniques Exam 2019
No ratings yet
Quantitative Techniques Exam 2019
3 pages
Stats and Probability
No ratings yet
Stats and Probability
13 pages
Quantitative Analysis Forecasting
100% (1)
Quantitative Analysis Forecasting
25 pages
Cap787-Data Science Toolbox
No ratings yet
Cap787-Data Science Toolbox
1 page