0% found this document useful (0 votes)

22 views3 pages

SL - Problem Statement

Uploaded by

shreyasgawade12

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views3 pages

SL - Problem Statement

Uploaded by

shreyasgawade12

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

T

J EC
O
PR
U LE
M OD
SL

AIML MODULE
PROJECT
©Great Learning. Proprietary content. All Rights Reserved. Unauthorised use or distribution prohibited
AIML MODULE PROJECT

Supervised Learning TOTAL

SCORE 60
Part A - 30 Marks

• DOMAIN: Medical

• CONTEXT: Medical research university X is undergoing a deep research on patients with certain conditions. University has an internal AI team.

Due to con identiality the patient’s details and the conditions are masked by the client by providing different datasets to the AI team for

developing a AIML model which can predict the condition of the patient depending on the received test results.

• DATA DESCRIPTION: The data consists of biomechanics features of the patients according to their current conditions. Each patient is

represented in the data set by six biomechanics attributes derived from the shape and orientation of the condition to their body part.

• PROJECT OBJECTIVE: To Demonstrate the ability to fetch, process and leverage data to generate useful predictions by training Supervised

Learning algorithms.

• STEPS AND TASK [30 Marks]:

1. Data Understanding: [5 Marks]

A. Read all the 3 CSV iles as DataFrame and store them into 3 separate variables. [1 Mark]

B. Print Shape and columns of all the 3 DataFrames. [1 Mark]

C. Compare Column names of all the 3 DataFrames and clearly write observations. [1 Mark]

D. Print DataTypes of all the 3 DataFrames. [1 Mark]

E. Observe and share variation in ‘Class’ feature of all the 3 DaraFrames. [1 Mark]

2. Data Preparation and Exploration: [5 Marks]

A. Unify all the variations in ‘Class’ feature for all the 3 DataFrames. [1 Marks]

For Example: ‘tp_s’, ‘Type_S’, ‘type_s’ should be converted to ‘type_s’

B. Combine all the 3 DataFrames to form a single DataFrame [1 Marks]

Checkpoint: Expected Output shape = (310,7)

C. Print 5 random samples of this DataFrame [1 Marks]

D. Print Feature-wise percentage of Null values. [1 Mark]

E. Check 5-point summary of the new DataFrame. [1 Mark]

3. Data Analysis: [10 Marks]

A. Visualize a heatmap to understand correlation between all features [2 Marks]

B. Share insights on correlation. [2 Marks]

A. Features having stronger correlation with correlation value.

B. Features having weaker correlation with correlation value.

C. Visualize a pairplot with 3 classes distinguished by colors and share insights. [2 Marks]

D. Visualize a jointplot for ‘P_incidence’ and ‘S_slope’ and share insights. [2 Marks]

E. Visualize a boxplot to check distribution of the features and share insights. [2 Marks]

4. Model Building: [6 Marks]

A. Split data into X and Y. [1 Marks]

B. Split data into train and test with 80:20 proportion. [1 Marks]

C. Train a Supervised Learning Classi ication base model using KNN classi ier. [2 Marks]

D. Print all the possible performance metrics for both train and test data. [2 Marks]

5. Performance Improvement: [4 Marks]

A. Experiment with various parameters to improve performance of the base model. [2 Marks]

(Optional: Experiment with various Hyperparameters - Research required)

B. Clearly showcase improvement in performance achieved. [1 Marks]

For Example:

A. Accuracy: +15% improvement

B. Precision: +10% improvement.

C. Clearly state which parameters contributed most to improve model performance. [1 Marks]

©Great Learning. Proprietary content. All Rights Reserved. Unauthorised use or distribution prohibited
f
f
f
f
AIML MODULE PROJECT
Part B - 30 Marks

• DOMAIN: Banking, Marketing

• CONTEXT: A bank X is on a massive digital transformation for all its departments. Bank has a growing customer base whee majority of them are

liability customers (depositors) vs borrowers (asset customers). The bank is interested in expanding the borrowers base rapidly to bring in more

business via loan interests. A campaign that the bank ran in last quarter showed an average single digit conversion rate. Digital transformation

being the core strength of the business strategy, marketing department wants to devise effective campaigns with better target marketing to

increase the conversion ratio to double digit with same budget as per last campaign.

• DATA DICTIONARY:
1. ID: Customer ID
2. Age: Customer’s approximate age.
3. CustomerSince: Customer of the bank since. [unit is masked]
4. HighestSpend: Customer’s highest spend so far in one transaction. [unit is masked]
5. ZipCode: Customer’s zip code.
6. HiddenScore: A score associated to the customer which is masked by the bank as an IP.
7. MonthlyAverageSpend: Customer’s monthly average spend so far. [unit is masked]
8. Level: A level associated to the customer which is masked by the bank as an IP.
9. Mortgage: Customer’s mortgage. [unit is masked]
10. Security: Customer’s security asset with the bank. [unit is masked]
11. FixedDepositAccount: Customer’s ixed deposit account with the bank. [unit is masked]
12. InternetBanking: if the customer uses internet banking.
13. CreditCard: if the customer uses bank’s credit card.
14. LoanOnCard: if the customer has a loan on credit card.

• PROJECT OBJECTIVE: Build a Machine Learning model to perform focused marketing by predicting the potential customers who will convert

using the historical dataset.

• STEPS AND TASK [30 Marks]:

1. Data Understanding and Preparation: [5 Marks]

A. Read both the Datasets ‘Data1’ and ‘Data 2’ as DataFrame and store them into two separate variables. [1 Marks]

B. Print shape and Column Names and DataTypes of both the Dataframes. [1 Marks]

C. Merge both the Dataframes on ‘ID’ feature to form a single DataFrame [2 Marks]

D. Change Datatype of below features to ‘Object’ [1 Marks]

‘CreditCard’, ‘InternetBanking’, ‘FixedDepositAccount’, ‘Security’, ‘Level’, ‘HiddenScore’.

[Reason behind performing this operation:- Values in these features are binary i.e. 1/0. But DataType is ‘int’/’ loat’ which is not expected.]

2. Data Exploration and Analysis: [5 Marks]

A. Visualize distribution of Target variable ‘LoanOnCard’ and clearly share insights. [2 Marks]

B. Check the percentage of missing values and impute if required. [1 Marks]

C. Check for unexpected values in each categorical variable and impute with best suitable value. [2 Marks]

[Unexpected values means if all values in a feature are 0/1 then ‘?’, ‘a’, 1.5 are unexpected values which needs treatment ]

3. Data Preparation and model building: [10 Marks]

A. Split data into X and Y. [1 Marks]

[Recommended to drop ID & ZipCode. LoanOnCard is target Variable]

B. Split data into train and test. Keep 25% data reserved for testing. [1 Marks]

C. Train a Supervised Learning Classi ication base model - Logistic Regression. [2 Marks]

D. Print evaluation metrics for the model and clearly share insights. [1 Marks]

E. Balance the data using the right balancing technique. [2 Marks]

i. Check distribution of the target variable

ii. Say output is class A : 20% and class B : 80%

iii. Here you need to balance the target variable as 50:50.

iv. Try appropriate method to achieve the same.

F. Again train the same previous model on balanced data. [1 Marks]

G. Print evaluation metrics and clearly share di erences observed. [2 Marks]

4. Performance Improvement: [10 Marks]

A. Train a base model each for SVM, KNN. [4 Marks]

B. Tune parameters for each of the models wherever required and inalize a model. [3 Marks]

(Optional: Experiment with various Hyperparameters - Research required)

C. Print evaluation metrics for inal model. [1 Marks]

D. Share improvement achieved from base model to inal model. [2 Marks]

Project: ©great Learning. Proprietary Content. All Rights Reserved. Unauthorised Use or Distribution Prohibited
No ratings yet
Project: ©great Learning. Proprietary Content. All Rights Reserved. Unauthorised Use or Distribution Prohibited
8 pages
Assignment 2 - Machine Learning
No ratings yet
Assignment 2 - Machine Learning
3 pages
DS Assignment
No ratings yet
DS Assignment
7 pages
EST - Problem Statement-3
No ratings yet
EST - Problem Statement-3
3 pages
FMT - Problem - Statement
No ratings yet
FMT - Problem - Statement
2 pages
FAQ's - Supervised Learning
No ratings yet
FAQ's - Supervised Learning
4 pages
Milestone FMT
No ratings yet
Milestone FMT
2 pages
Supervised Learning - Milestones
No ratings yet
Supervised Learning - Milestones
2 pages
Data Science Exam: Classification Task
No ratings yet
Data Science Exam: Classification Task
3 pages
MLPC Midterm
No ratings yet
MLPC Midterm
18 pages
Project: ©great Learning. Proprietary Content. All Rights Reserved. Unauthorised Use or Distribution Prohibited
100% (1)
Project: ©great Learning. Proprietary Content. All Rights Reserved. Unauthorised Use or Distribution Prohibited
8 pages
Technical Assignment 2
No ratings yet
Technical Assignment 2
3 pages
Project: ©great Learning. Proprietary Content. All Rights Reserved. Unauthorised Use or Distribution Prohibited
No ratings yet
Project: ©great Learning. Proprietary Content. All Rights Reserved. Unauthorised Use or Distribution Prohibited
7 pages
Machine Learning Project Guide
No ratings yet
Machine Learning Project Guide
3 pages
PAMLSET1 New
No ratings yet
PAMLSET1 New
4 pages
A1991370857 65680 10 2025 Csm355ca1
No ratings yet
A1991370857 65680 10 2025 Csm355ca1
6 pages
Important Questions
No ratings yet
Important Questions
4 pages
CSL7620 A2
No ratings yet
CSL7620 A2
2 pages
Mittal School of Business Lovely Professional University Academic Task-2
No ratings yet
Mittal School of Business Lovely Professional University Academic Task-2
1 page
ML Assignment
No ratings yet
ML Assignment
3 pages
PAMLSET2
No ratings yet
PAMLSET2
4 pages
Answer Adm Sample
No ratings yet
Answer Adm Sample
4 pages
AI - ML Dev Plan - 29102018
No ratings yet
AI - ML Dev Plan - 29102018
10 pages
Data Scientist Exercise
No ratings yet
Data Scientist Exercise
2 pages
Machine Learning Assignment Guide
No ratings yet
Machine Learning Assignment Guide
2 pages
ML Question Bank
No ratings yet
ML Question Bank
7 pages
ML File External File
No ratings yet
ML File External File
25 pages
30 Days ML Projects Challenge
No ratings yet
30 Days ML Projects Challenge
288 pages
FML Solution 3
No ratings yet
FML Solution 3
11 pages
ML Assignment Questions SyllabusWise 2
No ratings yet
ML Assignment Questions SyllabusWise 2
3 pages
ML Assignment 1
No ratings yet
ML Assignment 1
57 pages
SUB Final Question
No ratings yet
SUB Final Question
2 pages
Data Mining & Machine Learning Courseoutline
No ratings yet
Data Mining & Machine Learning Courseoutline
7 pages
Aishwarya Swetha Data Science
No ratings yet
Aishwarya Swetha Data Science
1 page
ML Question
No ratings yet
ML Question
2 pages
Business Anlytics
No ratings yet
Business Anlytics
1 page
Assignment 2
No ratings yet
Assignment 2
3 pages
FA1 Module 1,2,3 ML
No ratings yet
FA1 Module 1,2,3 ML
6 pages
Ai Fall-23 Assignment
No ratings yet
Ai Fall-23 Assignment
5 pages
ML Ia! Final PDF
No ratings yet
ML Ia! Final PDF
20 pages
Python - Project 2 Problem Statement
No ratings yet
Python - Project 2 Problem Statement
3 pages
AS - Problem Statement
No ratings yet
AS - Problem Statement
4 pages
Credit Risk Project
No ratings yet
Credit Risk Project
11 pages
CNN - Project
No ratings yet
CNN - Project
8 pages
Problem Statement - Graded Project: Variable Details
0% (1)
Problem Statement - Graded Project: Variable Details
3 pages
Data Science Exam Analysis
No ratings yet
Data Science Exam Analysis
16 pages
Project - Machine Learning-Business Report: By: K Ravi Kumar PGP-Data Science and Business Analytics (PGPDSBA.O.MAR23.A)
No ratings yet
Project - Machine Learning-Business Report: By: K Ravi Kumar PGP-Data Science and Business Analytics (PGPDSBA.O.MAR23.A)
38 pages
ML Assignment Questions
No ratings yet
ML Assignment Questions
2 pages
FIT1043 A2 Specification - S2 2024 - Gks6arg
No ratings yet
FIT1043 A2 Specification - S2 2024 - Gks6arg
5 pages
SPA Group 13 - Assignment 2 Problem Statement
No ratings yet
SPA Group 13 - Assignment 2 Problem Statement
2 pages
EML Midterm Answer Keys
No ratings yet
EML Midterm Answer Keys
3 pages
FinalTerm - Muhammad Hassan - 2516
No ratings yet
FinalTerm - Muhammad Hassan - 2516
16 pages
Assignment - 1 - Machine Learning
No ratings yet
Assignment - 1 - Machine Learning
3 pages
01 Apply Data Preprocessing On Heart Dataset and Evaluate Performance Using Confusion Matrix
No ratings yet
01 Apply Data Preprocessing On Heart Dataset and Evaluate Performance Using Confusion Matrix
19 pages
TE ML LAB Mannual
No ratings yet
TE ML LAB Mannual
21 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
Activities Super
No ratings yet
Activities Super
6 pages
COS10022 Data Science Assignment 1 Question
No ratings yet
COS10022 Data Science Assignment 1 Question
3 pages
A Study of Cash Management in Banking Sector PDF
No ratings yet
A Study of Cash Management in Banking Sector PDF
87 pages
Form ISR 5
No ratings yet
Form ISR 5
3 pages
Roll No 35 Study On Post Office Saving Schemes
100% (2)
Roll No 35 Study On Post Office Saving Schemes
75 pages
Merchant of Venice Summary
No ratings yet
Merchant of Venice Summary
6 pages
Chapter 9-Cash and Marketable Securities Management
No ratings yet
Chapter 9-Cash and Marketable Securities Management
60 pages
IDFCFIRSTBankstatement 10161262719 122724463
No ratings yet
IDFCFIRSTBankstatement 10161262719 122724463
6 pages
Lounge Access for HSBC Cardholders
No ratings yet
Lounge Access for HSBC Cardholders
3 pages
Assignment 3 (Group 9) - LM (1) - 1
No ratings yet
Assignment 3 (Group 9) - LM (1) - 1
27 pages
Axis Bank Account Statement May 2023-Feb 2024
No ratings yet
Axis Bank Account Statement May 2023-Feb 2024
3 pages
KNT Advocates - Firm Profile - 180
No ratings yet
KNT Advocates - Firm Profile - 180
9 pages
Depreciation Methods Guide
No ratings yet
Depreciation Methods Guide
15 pages
XXXXXXXX00501 Canara Bank Statement
No ratings yet
XXXXXXXX00501 Canara Bank Statement
5 pages
Anil Bank Stat Dec-23
No ratings yet
Anil Bank Stat Dec-23
11 pages
BusinessStudies Sec 2023-24
No ratings yet
BusinessStudies Sec 2023-24
6 pages
Account Summary-Savings - Current Account Details
No ratings yet
Account Summary-Savings - Current Account Details
12 pages
PMT - Payment Instructions
No ratings yet
PMT - Payment Instructions
4 pages
Finacial Stability Report NBE WF
No ratings yet
Finacial Stability Report NBE WF
65 pages
UP Notes - Credit Transactions
No ratings yet
UP Notes - Credit Transactions
45 pages
Savings Account - Debit Card Facility
No ratings yet
Savings Account - Debit Card Facility
6 pages
Cryptocurrency BTC
No ratings yet
Cryptocurrency BTC
4 pages
Kotak Mahindra Bank
No ratings yet
Kotak Mahindra Bank
11 pages
Directory of Officers Employees - 31.03.2025 1
No ratings yet
Directory of Officers Employees - 31.03.2025 1
707 pages
RBI's Evolution and Structure
No ratings yet
RBI's Evolution and Structure
12 pages
Tathastu Mobile Sales Daybook Paper
No ratings yet
Tathastu Mobile Sales Daybook Paper
4 pages
Chime Direct Deposit
No ratings yet
Chime Direct Deposit
1 page
Affidavit 1748951197
No ratings yet
Affidavit 1748951197
22 pages
DetailedStatement 48 2
No ratings yet
DetailedStatement 48 2
51 pages
Updated HP Islamic Form26624
100% (1)
Updated HP Islamic Form26624
2 pages
Chase Checking Account Statement
No ratings yet
Chase Checking Account Statement
2 pages
Indian Banking Financial System
No ratings yet
Indian Banking Financial System
20 pages

SL - Problem Statement

Uploaded by

SL - Problem Statement

Uploaded by

T

Supervised Learning TOTAL

• STEPS AND TASK [30 Marks]:

1. Data Understanding: [5 Marks]

B. Print Shape and columns of all the 3 DataFrames. [1 Mark]

D. Print DataTypes of all the 3 DataFrames. [1 Mark]

2. Data Preparation and Exploration: [5 Marks]

For Example: ‘tp_s’, ‘Type_S’, ‘type_s’ should be converted to ‘type_s’

B. Combine all the 3 DataFrames to form a single DataFrame [1 Marks]

Checkpoint: Expected Output shape = (310,7)

C. Print 5 random samples of this DataFrame [1 Marks]

D. Print Feature-wise percentage of Null values. [1 Mark]

E. Check 5-point summary of the new DataFrame. [1 Mark]

3. Data Analysis: [10 Marks]

A. Visualize a heatmap to understand correlation between all features [2 Marks]

B. Share insights on correlation. [2 Marks]

A. Features having stronger correlation with correlation value.

B. Features having weaker correlation with correlation value.

4. Model Building: [6 Marks]

A. Split data into X and Y. [1 Marks]

5. Performance Improvement: [4 Marks]

(Optional: Experiment with various Hyperparameters - Research required)

B. Clearly showcase improvement in performance achieved. [1 Marks]

A. Accuracy: +15% improvement

B. Precision: +10% improvement.

• DOMAIN: Banking, Marketing

using the historical dataset.

• STEPS AND TASK [30 Marks]:

1. Data Understanding and Preparation: [5 Marks]

D. Change Datatype of below features to ‘Object’ [1 Marks]

‘CreditCard’, ‘InternetBanking’, ‘FixedDepositAccount’, ‘Security’, ‘Level’, ‘HiddenScore’.

2. Data Exploration and Analysis: [5 Marks]

B. Check the percentage of missing values and impute if required. [1 Marks]

3. Data Preparation and model building: [10 Marks]

A. Split data into X and Y. [1 Marks]

[Recommended to drop ID & ZipCode. LoanOnCard is target Variable]

E. Balance the data using the right balancing technique. [2 Marks]

i. Check distribution of the target variable

ii. Say output is class A : 20% and class B : 80%

iii. Here you need to balance the target variable as 50:50.

iv. Try appropriate method to achieve the same.

F. Again train the same previous model on balanced data. [1 Marks]

G. Print evaluation metrics and clearly share di erences observed. [2 Marks]

4. Performance Improvement: [10 Marks]

A. Train a base model each for SVM, KNN. [4 Marks]

(Optional: Experiment with various Hyperparameters - Research required)

C. Print evaluation metrics for inal model. [1 Marks]

D. Share improvement achieved from base model to inal model. [2 Marks]

You might also like