MST Mini Project Statements

The document outlines ten machine learning project statements for various companies, including tasks such as predicting house prices, classifying student performance, segmenting customers, and predicting protein solubility. Each project requires data pre-processing, model development, and performance evaluation, with specific datasets and objectives provided. Additionally, guidelines for dataset sourcing, feature selection, and evaluation parameters are included, along with a rubric for assessment.

Uploaded by

J C

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views2 pages

MST Mini Project Statements

Uploaded by

J C

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

MST Project Statements (UCS321)

1. Develop a machine learning-based regression model for Larsen & Toubro Realty to predict house
prices using features such as location, size, number of rooms, property age, and amenities. The
workflow must include data pre-processing, exploratory data analysis, model building, performance
evaluation, and optimization for improved accuracy. The final model should provide accurate price
predictions and highlight the most influential factors affecting property value, aiding better decision-
making for buyers, sellers, and investors.

2. Pearson VUE, a global leader in computer-based testing, seeks to classify students into categories
such as High Performer, Average Performer, and Needs Improvement. Using a dataset containing
students Mid-Semester Test (MST) scores, Quiz results, Attendance records, and Assignment
performance, you are required to develop a Python-based machine learning classification model. The
project should include:

 Pre-processing the dataset (handling missing values, normalization)

 Training and evaluating suitable classification algorithms
 Model performance

3. Big Bazaar aims to segment its customers into distinct groups to optimize promotional strategies
and improve sales. Using customer purchase history (annual income, spending score, and visit
frequency etc.), develop a Python-based clustering model to identify customer segments. The project
should include:Data cleaning and pre-processing, Performance parameters etc.

4. Pfizer Inc. is focusing on enhancing the success rate of therapeutic protein development. One key
challenge in biotechnology is predicting whether a protein will be soluble or insoluble during
expression in E. coli. Using a dataset containing amino acid composition, molecular weight,
isoelectric point (pI), hydrophobicity index, and other physicochemical properties, develop
a supervised machine learning classification model to categorize proteins as Soluble or Insoluble.
5. HDFC Bank Ltd. aims to improve its loan processing efficiency by predicting whether a loan
application should be approved or rejected based on applicant details. Using a dataset containing
information such as applicant income, loan amount, credit history, employment status, property area,
and other financial indicators, develop a supervised machine learning classification model to
categorize applications as Approved or Rejected.

6. General Electric (GE) Power Systems aims to improve the operational reliability of its electric
motors by predicting winding temperatures under diverse working conditions. Using a dataset
containing parameters such as ambient temperature, motor speed, load torque, supply voltage, and
current, students will develop a supervised machine learning regression model to estimate motor
temperature.

7. Siemens Energy is exploring efficient material selection for manufacturing components in turbines,
engines, and heavy machinery. Given a dataset containing mechanical properties such as tensile
strength, yield strength, hardness, density, thermal conductivity, and elasticity, students will
apply unsupervised learning techniques (e.g., K-Means clustering) to group materials with similar
characteristics.The aim is to help engineers quickly identify suitable materials for specific applications
based on property clusters, reducing selection time and improving performance. The project should
include:

 Data cleaning and handling of missing values

 Visualizing clusters using PCA or t-SNE
8. BASF SE, a global leader in chemical manufacturing, is seeking to enhance environmental
monitoring capabilities by predicting the Air Quality Index (AQI) in industrial and urban areas. Using
historical air quality datasets containing parameters such as PM2.5, PM10, NO₂, SO₂, CO, O₃,
temperature, and humidity, students will develop a supervised regression model to predict future AQI
values.The goal is to assist environmental engineers and regulatory bodies in taking proactive
measures to control emissions and safeguard public health.

9. Customer churn is a critical challenge in the telecom industry, where customers discontinue their
services and move to competitors. Reducing churn can significantly improve revenue and customer
loyalty. Build a classification-based machine learning model to predict whether a customer is likely to
churn based on their demographic details, usage patterns, billing information, and service feedback.

10. Analyze energy consumption data from Siemens Energy using PCA for dimensionality reduction
and K-Means clustering to segment consumers into distinct usage patterns. The goal is to identify key
factors affecting consumption and propose targeted optimization strategies.

Instructions:

 Dataset can be self-generated or obtained from standard platforms (e.g., Kaggle, GitHub).
 Number of features can be selected as per project scope and relevance.
 Performance evaluation parameters can be chosen freely.
 Flow diagram must be given with Pre-processing and visualization steps clearly documented.

A. Rubrics (40 marks) (Group of 4-5 students)

 Problem Understanding & Objective Clarity – 5 marks

 Data Collection & Pre-processing – 10 marks
 Model Development & Implementation – 12 marks
 Performance Evaluation and Interpretation of Results – 8 marks
 Innovation / Creativity in Approach – 5 marks

B. Presentation/viva (20 Marks)

A+B = 60 Marks

List of Projects
No ratings yet
List of Projects
1 page
ML Assignment
No ratings yet
ML Assignment
3 pages
D Caltech PG AI & ML Project
No ratings yet
D Caltech PG AI & ML Project
4 pages
Machine Learning Project in Python Step-By-Step
No ratings yet
Machine Learning Project in Python Step-By-Step
23 pages
Final Projects ATI
No ratings yet
Final Projects ATI
1 page
Data Mining & Machine Learning Courseoutline
No ratings yet
Data Mining & Machine Learning Courseoutline
7 pages
Skill Based Projects - Data - Science (See List On Last Page)
No ratings yet
Skill Based Projects - Data - Science (See List On Last Page)
4 pages
Machine Learning Project Guide
No ratings yet
Machine Learning Project Guide
3 pages
AI - ML Projects Titles With Abstracts
No ratings yet
AI - ML Projects Titles With Abstracts
9 pages
Naukri TejaswihiAhirkar (4y 0m)
No ratings yet
Naukri TejaswihiAhirkar (4y 0m)
2 pages
Cvgenerate 1741866832
No ratings yet
Cvgenerate 1741866832
1 page
In-House Project Titles-2022
No ratings yet
In-House Project Titles-2022
12 pages
Main Project Titles
No ratings yet
Main Project Titles
16 pages
Name - Anil Daharwal
No ratings yet
Name - Anil Daharwal
2 pages
Data Science Fundamentals
No ratings yet
Data Science Fundamentals
44 pages
Sari Go MM Ulaan U Deep Resume
No ratings yet
Sari Go MM Ulaan U Deep Resume
3 pages
Kandarp Dave
No ratings yet
Kandarp Dave
1 page
Supriya Synopsis Final
No ratings yet
Supriya Synopsis Final
27 pages
Mayuri Sonawane: Objective
No ratings yet
Mayuri Sonawane: Objective
3 pages
Adnan Internship
No ratings yet
Adnan Internship
15 pages
IEEE Python & ML Projects 2019
No ratings yet
IEEE Python & ML Projects 2019
2 pages
Rishitha Resume Main
No ratings yet
Rishitha Resume Main
2 pages
Important Questions
No ratings yet
Important Questions
4 pages
Data Science, Machine Learning, Python, Basics of SQL.: Professional Summary
No ratings yet
Data Science, Machine Learning, Python, Basics of SQL.: Professional Summary
5 pages
Data Science Project List - Sheet1
No ratings yet
Data Science Project List - Sheet1
5 pages
Wa0013.
No ratings yet
Wa0013.
83 pages
Beginner AI
No ratings yet
Beginner AI
3 pages
Beginner AI
No ratings yet
Beginner AI
3 pages
Data Science Resume: Tarun Chauhan
No ratings yet
Data Science Resume: Tarun Chauhan
1 page
AIML 2nd Year
No ratings yet
AIML 2nd Year
5 pages
FEBS Project Prodigy
No ratings yet
FEBS Project Prodigy
11 pages
Final Yatin S5 Int
No ratings yet
Final Yatin S5 Int
114 pages
Machine Learning Guide
No ratings yet
Machine Learning Guide
10 pages
2203a52154 Daup Report
No ratings yet
2203a52154 Daup Report
13 pages
Kenny-230724-Top 50 Data Science Projects
No ratings yet
Kenny-230724-Top 50 Data Science Projects
9 pages
Top 5 Free AI ML Deep Learning Projects For KSCST 2025
No ratings yet
Top 5 Free AI ML Deep Learning Projects For KSCST 2025
2 pages
Lavajiit Singh CV
No ratings yet
Lavajiit Singh CV
3 pages
Raushan Dec-2023
No ratings yet
Raushan Dec-2023
2 pages
Internship Report - Merged
No ratings yet
Internship Report - Merged
29 pages
Ay-Sem8-Internship Report
No ratings yet
Ay-Sem8-Internship Report
34 pages
Project Description Document
No ratings yet
Project Description Document
7 pages
Faculty Project Titles 2024
No ratings yet
Faculty Project Titles 2024
26 pages
PBL of Data VISUALIZATION
No ratings yet
PBL of Data VISUALIZATION
2 pages
List of Experiments - CL-I
No ratings yet
List of Experiments - CL-I
3 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
Naukri YogendraVerma (6y 6m)
No ratings yet
Naukri YogendraVerma (6y 6m)
3 pages
Data Science & Engineering Project Ideas
No ratings yet
Data Science & Engineering Project Ideas
2 pages
Data Scientist Resume Gowsik
No ratings yet
Data Scientist Resume Gowsik
1 page
Madras: Indian Institute OF Technology
No ratings yet
Madras: Indian Institute OF Technology
2 pages
ThoufeeqM Sainokoyo
No ratings yet
ThoufeeqM Sainokoyo
3 pages
Lokesh Reddy Original
No ratings yet
Lokesh Reddy Original
3 pages
Sai Krishna Neelam Resume
No ratings yet
Sai Krishna Neelam Resume
4 pages
Titles of Mini-Micro Projects
No ratings yet
Titles of Mini-Micro Projects
2 pages
Heart Disease
No ratings yet
Heart Disease
28 pages
Final Int. Report
No ratings yet
Final Int. Report
14 pages
Machine Learning Algorithms
No ratings yet
Machine Learning Algorithms
32 pages
Smart Blinds Automation Presentation
No ratings yet
Smart Blinds Automation Presentation
6 pages
Raw Milk Questionnaire Template
No ratings yet
Raw Milk Questionnaire Template
8 pages
EEC Scheme 2023 Senate Approved
No ratings yet
EEC Scheme 2023 Senate Approved
137 pages
MST Even 23
No ratings yet
MST Even 23
2 pages
Introduction To Probability With Texas Hold em Examples 1st Schoenberg Solution Manual
No ratings yet
Introduction To Probability With Texas Hold em Examples 1st Schoenberg Solution Manual
3 pages
Minimum Samples Size Conjoint
No ratings yet
Minimum Samples Size Conjoint
6 pages
Fibridge E1V35
No ratings yet
Fibridge E1V35
20 pages
Open Ended Lab
No ratings yet
Open Ended Lab
13 pages
02 Experiment 2 DEKP2213 Sem2 20222023
No ratings yet
02 Experiment 2 DEKP2213 Sem2 20222023
10 pages
Intro to Chemistry for Students
No ratings yet
Intro to Chemistry for Students
2 pages
2023 Atcd
No ratings yet
2023 Atcd
2 pages
Body Bias
No ratings yet
Body Bias
7 pages
Probability of Simple Events
No ratings yet
Probability of Simple Events
19 pages
Forecasting and Procurement at Le Club Français Du Vin
No ratings yet
Forecasting and Procurement at Le Club Français Du Vin
8 pages
Jeppview For Windows: List of Pages in This Trip Kit
No ratings yet
Jeppview For Windows: List of Pages in This Trip Kit
80 pages
Harusdewi 2018
No ratings yet
Harusdewi 2018
7 pages
Color Harmony and Interaction Guide
No ratings yet
Color Harmony and Interaction Guide
32 pages
Aptech Quiz Contest 2010
No ratings yet
Aptech Quiz Contest 2010
190 pages
Fortimanager Cli 520 PDF
No ratings yet
Fortimanager Cli 520 PDF
243 pages
Lecture 1 - Introduction To Building Automation
No ratings yet
Lecture 1 - Introduction To Building Automation
20 pages
Optimizing Premium Reserving Methods
No ratings yet
Optimizing Premium Reserving Methods
13 pages
08 Chapter 3
No ratings yet
08 Chapter 3
35 pages
Laser Notes
No ratings yet
Laser Notes
6 pages
(Ebook) Introducing Maya 2011 by Dariush Derakhshani ISBN 9780470502167, 0470502169 Download
100% (1)
(Ebook) Introducing Maya 2011 by Dariush Derakhshani ISBN 9780470502167, 0470502169 Download
53 pages
30-Second Biochemistry - The 50 Vital Processes in and Around Living Organisms, Each Explained
100% (1)
30-Second Biochemistry - The 50 Vital Processes in and Around Living Organisms, Each Explained
162 pages
Grade 4 Math Worksheet
100% (1)
Grade 4 Math Worksheet
3 pages
Pirate Math: Solving Equations
No ratings yet
Pirate Math: Solving Equations
4 pages
Chemistry EoS1 Test V1 1617
No ratings yet
Chemistry EoS1 Test V1 1617
10 pages
Wireless Security Design PDF
No ratings yet
Wireless Security Design PDF
8 pages
The Structure of Kaolinite and Metakaolinite
No ratings yet
The Structure of Kaolinite and Metakaolinite
4 pages
OCI Foundations Associate
No ratings yet
OCI Foundations Associate
8 pages
The National Shipbuilding Research Program: Carbon Equivalent (PCM) Limits For Thick Carbon and Low Alloy Steels
No ratings yet
The National Shipbuilding Research Program: Carbon Equivalent (PCM) Limits For Thick Carbon and Low Alloy Steels
81 pages
2025 Class Rules - Feb WS - PDF - 3120 - en
No ratings yet
2025 Class Rules - Feb WS - PDF - 3120 - en
45 pages
Verified PDF Download MANAGERIAL ACCOUNTING Version 20 by Kurt Heisinger and Joe Hoyle Ebook and TestBank Bundle FULL Version
0% (1)
Verified PDF Download MANAGERIAL ACCOUNTING Version 20 by Kurt Heisinger and Joe Hoyle Ebook and TestBank Bundle FULL Version
413 pages

MST Mini Project Statements

Uploaded by

MST Mini Project Statements

Uploaded by

MST Project Statements (UCS321)

 Pre-processing the dataset (handling missing values, normalization)

 Data cleaning and handling of missing values

A. Rubrics (40 marks) (Group of 4-5 students)

 Problem Understanding & Objective Clarity – 5 marks

B. Presentation/viva (20 Marks)

You might also like