0% found this document useful (0 votes)

12 views14 pages

Project1 Research Report Week2 FullPages

The report outlines a comprehensive analysis of a marketing dataset, focusing on the relationship between advertising spends and sales. Key findings include strong correlations between TV and Radio spending with sales, the significance of linear regression modeling, and the importance of data cleaning and preprocessing. The report also discusses model evaluation techniques and the trade-offs involved in using polynomial features and interaction effects in modeling.

Uploaded by

daabhu62

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views14 pages

Project1 Research Report Week2 FullPages

Uploaded by

daabhu62

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Chapter-wise Research Report - Project 1

Chapter 1: Understanding the Dataset and Problem Statement

Learned to identify the structure and objective of the dataset. Understood the relationship between marketing

spends (TV, Radio, Newspaper) and sales. Recognized the importance of defining dependent and

independent variables.
Chapter-wise Research Report - Project 1

Chapter 2: Data Cleaning and Preprocessing

Verified the dataset for missing values, proper column names, and data types. Learned how early checks

prevent issues during modeling. No missing values were found in the given dataset.
Chapter-wise Research Report - Project 1

Chapter 3: Exploratory Data Analysis (EDA)

Applied scatterplots and pairplots to observe relationships between variables. Found strong correlation of TV

and Radio with Sales. Used visualization to form hypotheses about variable behavior.
Chapter-wise Research Report - Project 1

Chapter 4: Correlation and Statistical Understanding

Calculated correlation coefficients. Learned to interpret strength and direction of relationships. Understood

that correlation does not imply causation.

Chapter-wise Research Report - Project 1

Chapter 5: Linear Regression Modeling

Built simple and multiple linear regression models. Learned about coefficients, intercept, R² value, and

adjusted R². Found that TV and Radio are statistically significant predictors.
Chapter-wise Research Report - Project 1

Chapter 6: Model Evaluation and Interpretation

Evaluated regression models using R² and p-values. Understood implications of high R² and the risk of

including insignificant predictors like Newspaper.

Chapter-wise Research Report - Project 1

Chapter 7: Polynomial Features and Interaction Effects

Learned to include polynomial terms and interaction features to model non-linear effects. Recognized the

tradeoff between accuracy improvement and risk of overfitting.

Additional Content for Chapter 1

In this chapter, we explored the business context of the dataset, focusing on the role of advertising
The dataset comprises data from a marketing campaign, with spending figures across TV, Radio, a
We emphasized the significance of clearly defining the dependent and independent variables to bui
Initial exploration helped us hypothesize how different media channels might impact sales differentl
This understanding guided our expectations and analysis in the subsequent chapters..
Additional Content for Chapter 2

Data cleaning is crucial for reliable results in any data science project.
We checked for missing values using functions like isnull().sum() and ensured that the column nam
Data types were examined and found appropriate.
The absence of missing data simplified the preprocessing.
We also considered renaming columns for clarity but retained the original names for consistency.
This chapter underlines the importance of validating data before proceeding to modeling..
Additional Content for Chapter 3

Using seaborn and matplotlib, we conducted an in-depth exploratory data analysis.

Pairplots revealed that TV and Radio spending showed a strong positive linear relationship with Sa
Boxplots helped identify the distribution and potential outliers in the dataset.
Correlation heatmaps visually supported our hypothesis about the varying impacts of each channel
These visual tools provided intuition and direction for model building..
Additional Content for Chapter 4

We computed Pearson correlation coefficients to quantify the strength of relationships between eac
The strongest correlation was observed between TV and Sales, followed by Radio.
Newspaper had a relatively weak correlation, suggesting it might not be a strong predictor.
We discussed the difference between correlation and causation and how this distinction affects bus
Additional Content for Chapter 5

Linear regression was implemented using sklearn.

We began with simple linear regression for individual predictors, followed by multiple regression inc
Model summaries provided insight into coefficients, intercepts, and R² values.
TV and Radio had statistically significant coefficients, reinforcing their importance as predictors.
Newspaper's coefficient was not significant, which raised considerations about model simplification
Additional Content for Chapter 6

Model evaluation was performed using R² and adjusted R² to assess fit quality.
We also reviewed p-values for each predictor to determine their statistical significance.
High R² values from models including TV and Radio indicated good fit, whereas including Newspap
This step taught the importance of balancing complexity with interpretability..
Additional Content for Chapter 7

Polynomial regression and interaction terms were introduced to capture non-linear and combined e
Polynomial terms like TV² and interaction terms like TV*Radio were added to improve prediction.
While model accuracy improved slightly, it also increased the risk of overfitting.
This experiment highlighted the trade-offs between model complexity and generalization capability

GMC Final Project - Maha
No ratings yet
GMC Final Project - Maha
20 pages
Ds - Lab - 4.ipynb - Colab
No ratings yet
Ds - Lab - 4.ipynb - Colab
7 pages
0.1 Advertising Dataset: Linear Regression and Model Assumption
No ratings yet
0.1 Advertising Dataset: Linear Regression and Model Assumption
42 pages
Exercise#8 Instructions Linear Regression Model
No ratings yet
Exercise#8 Instructions Linear Regression Model
4 pages
Assignment 03 - Report
No ratings yet
Assignment 03 - Report
14 pages
Exemplar - Hypothesis Testing With Python
No ratings yet
Exemplar - Hypothesis Testing With Python
14 pages
Sales
No ratings yet
Sales
7 pages
Statistics For Data Science
No ratings yet
Statistics For Data Science
4 pages
Exemplar - Perform Multiple Linear Regression
No ratings yet
Exemplar - Perform Multiple Linear Regression
20 pages
Linear Regression Analysis Report
No ratings yet
Linear Regression Analysis Report
21 pages
CIA Understanding
No ratings yet
CIA Understanding
5 pages
S02 - Regression Modelling
No ratings yet
S02 - Regression Modelling
17 pages
Abinash Nag Project Report CART
No ratings yet
Abinash Nag Project Report CART
40 pages
Regression Analysis Report - Sanjeev Kumar - 24MSG1R43
No ratings yet
Regression Analysis Report - Sanjeev Kumar - 24MSG1R43
6 pages
Sukanya December Predictive Modeling 14th Jan 2024
No ratings yet
Sukanya December Predictive Modeling 14th Jan 2024
50 pages
R Programming
No ratings yet
R Programming
40 pages
Class Exercise 1
No ratings yet
Class Exercise 1
2 pages
Predective Analytics
No ratings yet
Predective Analytics
11 pages
Black Friday Sales
No ratings yet
Black Friday Sales
26 pages
ReCell Project PDF
No ratings yet
ReCell Project PDF
21 pages
Business Report: Predictive Modelling
100% (2)
Business Report: Predictive Modelling
37 pages
Building Statistical Models in Python 1st Edition Anonymous Download
100% (1)
Building Statistical Models in Python 1st Edition Anonymous Download
47 pages
AAS DSExam
No ratings yet
AAS DSExam
5 pages
ML Project Stage 2
No ratings yet
ML Project Stage 2
9 pages
Walmart Case Study - Solution Approach
No ratings yet
Walmart Case Study - Solution Approach
6 pages
Predicting Cubic Zirconia Prices Using Linear Regression
100% (1)
Predicting Cubic Zirconia Prices Using Linear Regression
58 pages
BT4211 Data-Driven Marketing: Fundamentals: Process and Statistical Issues in Predictive Modeling
No ratings yet
BT4211 Data-Driven Marketing: Fundamentals: Process and Statistical Issues in Predictive Modeling
38 pages
Essential Topics For A Data Scientist's Daily Workflow
No ratings yet
Essential Topics For A Data Scientist's Daily Workflow
6 pages
Sample - Customer Churn Prediction Python Documentation
No ratings yet
Sample - Customer Churn Prediction Python Documentation
33 pages
Azure Data Studio 1694473395
No ratings yet
Azure Data Studio 1694473395
15 pages
COS10022 DSP Week02 Regressions
No ratings yet
COS10022 DSP Week02 Regressions
41 pages
Predictive Modeling for Business Insights
100% (3)
Predictive Modeling for Business Insights
69 pages
Data Understanding and Prepration
100% (1)
Data Understanding and Prepration
10 pages
Ds Lab 4.ipynb - TARUN
No ratings yet
Ds Lab 4.ipynb - TARUN
6 pages
A Strategic Framework For Predictive Feature Engineering - Maximizing Model Performance On E-Commerce Transaction Data
No ratings yet
A Strategic Framework For Predictive Feature Engineering - Maximizing Model Performance On E-Commerce Transaction Data
19 pages
Car Prediction Analysis
No ratings yet
Car Prediction Analysis
19 pages
Data Mining Problem 2 Report
No ratings yet
Data Mining Problem 2 Report
13 pages
1.descriptive Statistics and Probability Distributions:: Datascience Course Content
No ratings yet
1.descriptive Statistics and Probability Distributions:: Datascience Course Content
10 pages
Predictive Modelling Project Report Final
45% (11)
Predictive Modelling Project Report Final
49 pages
Handout PS 1 - Customer Analytics
No ratings yet
Handout PS 1 - Customer Analytics
16 pages
ADS IA 1 Syllabus Prep
No ratings yet
ADS IA 1 Syllabus Prep
5 pages
Linear Regression for Beginners
No ratings yet
Linear Regression for Beginners
46 pages
Cart-Rf-Ann: Prepared by Muralidharan N
67% (3)
Cart-Rf-Ann: Prepared by Muralidharan N
33 pages
SigmaDAInduction25 Analytics Task 1
No ratings yet
SigmaDAInduction25 Analytics Task 1
5 pages
Analysis and Presentation For Bank Marketing Data: Vinay Kumar MS by Research Scholar IIT Kharagpur +91-8348575432
No ratings yet
Analysis and Presentation For Bank Marketing Data: Vinay Kumar MS by Research Scholar IIT Kharagpur +91-8348575432
20 pages
Nanduri Naga Sowri Pgp-Dsba - Octa - G2 Great Learning
No ratings yet
Nanduri Naga Sowri Pgp-Dsba - Octa - G2 Great Learning
40 pages
Module 2
No ratings yet
Module 2
38 pages
Data Science Course Agenda
No ratings yet
Data Science Course Agenda
29 pages
Big Mart Sales Prediction Using Machine Learning Report PDF
No ratings yet
Big Mart Sales Prediction Using Machine Learning Report PDF
56 pages
Statistics For Data Science
100% (2)
Statistics For Data Science
39 pages
Dap
No ratings yet
Dap
1,254 pages
Cart-Rf-ANN: Prepared by Muralidharan N
0% (1)
Cart-Rf-ANN: Prepared by Muralidharan N
16 pages
Module 3: Introduction To Machine Learning With Python: Case Study
No ratings yet
Module 3: Introduction To Machine Learning With Python: Case Study
3 pages
Revenue Predictor - Udit Ennam PDF
No ratings yet
Revenue Predictor - Udit Ennam PDF
30 pages
Machine Learning
100% (1)
Machine Learning
33 pages
Regression
No ratings yet
Regression
21 pages
TE AINDS Syllabus REV 2019 - DAV
No ratings yet
TE AINDS Syllabus REV 2019 - DAV
3 pages
Realms of The Living Dead - Curtiss, Order of The 15, Mystics (1926)
100% (1)
Realms of The Living Dead - Curtiss, Order of The 15, Mystics (1926)
342 pages
United States v. Laureano-Perez, 1st Cir. (2015)
No ratings yet
United States v. Laureano-Perez, 1st Cir. (2015)
77 pages
Nepal Drugs Category Rules 1986
No ratings yet
Nepal Drugs Category Rules 1986
27 pages
Nepali Class 11211.
No ratings yet
Nepali Class 11211.
163 pages
Fundamentals of Marketing: Final Project
No ratings yet
Fundamentals of Marketing: Final Project
24 pages
Get Test Bank For Fitzgeralds Clinical Neuroanatomy and Neuroscience 8th Edition Estomih Mtui HQ File PDF Download
No ratings yet
Get Test Bank For Fitzgeralds Clinical Neuroanatomy and Neuroscience 8th Edition Estomih Mtui HQ File PDF Download
408 pages
Maths 59b0
No ratings yet
Maths 59b0
8 pages
SOP Maintenance AC
No ratings yet
SOP Maintenance AC
2 pages
CCProject Phase One
No ratings yet
CCProject Phase One
2 pages
BlueCrest College Fees Guide
No ratings yet
BlueCrest College Fees Guide
1 page
Asian Culture Brief Philippines
No ratings yet
Asian Culture Brief Philippines
4 pages
New Microsoft Office Word Document
No ratings yet
New Microsoft Office Word Document
4 pages
PostgreSQL & PostgREST Deployment Guide
No ratings yet
PostgreSQL & PostgREST Deployment Guide
5 pages
05 - BTJB - Quiz4 - Database Programming With JDBC: Tests & Quizzes
No ratings yet
05 - BTJB - Quiz4 - Database Programming With JDBC: Tests & Quizzes
7 pages
Velocity 3.6 SP1 Installation Guide
No ratings yet
Velocity 3.6 SP1 Installation Guide
172 pages
SIP-IMS Model PDF
No ratings yet
SIP-IMS Model PDF
4 pages
Guidlines For Nach Documentation
No ratings yet
Guidlines For Nach Documentation
16 pages
Systems Sthinking Smart and Sustainable
No ratings yet
Systems Sthinking Smart and Sustainable
19 pages
Overview, Analyzes The Entire Afghan War of The 1980s and Explains How The Rise of The Taliban
No ratings yet
Overview, Analyzes The Entire Afghan War of The 1980s and Explains How The Rise of The Taliban
2 pages
Calcium Hydroxide Solubility Study
No ratings yet
Calcium Hydroxide Solubility Study
3 pages
Dungeons & Lairs #48: Assassin School: Level (APL) of 3, 5, 8, or 11. This Document Of-Credits
No ratings yet
Dungeons & Lairs #48: Assassin School: Level (APL) of 3, 5, 8, or 11. This Document Of-Credits
20 pages
Wincom Quality Manual
No ratings yet
Wincom Quality Manual
5 pages
WWW - Incar.tw-Cd30 Mp3 User Manual PDF
No ratings yet
WWW - Incar.tw-Cd30 Mp3 User Manual PDF
5 pages
Nhẫn Marquis - GIA 2417823853
No ratings yet
Nhẫn Marquis - GIA 2417823853
1 page
Kohlberg Reviewer
No ratings yet
Kohlberg Reviewer
3 pages
C.N.H. Holding EUR Transaction Statement
No ratings yet
C.N.H. Holding EUR Transaction Statement
3 pages
Flight Planning and Monitering
No ratings yet
Flight Planning and Monitering
61 pages
ABC of Salvation
No ratings yet
ABC of Salvation
3 pages
Vehicle Yaw & Sideslip Control Review
No ratings yet
Vehicle Yaw & Sideslip Control Review
19 pages

Project1 Research Report Week2 FullPages

Uploaded by

Project1 Research Report Week2 FullPages

Uploaded by

Chapter-wise Research Report - Project 1

Chapter 1: Understanding the Dataset and Problem Statement

Chapter 2: Data Cleaning and Preprocessing

Chapter 3: Exploratory Data Analysis (EDA)

Chapter 4: Correlation and Statistical Understanding

that correlation does not imply causation.

Chapter 5: Linear Regression Modeling

Chapter 6: Model Evaluation and Interpretation

including insignificant predictors like Newspaper.

Chapter 7: Polynomial Features and Interaction Effects

tradeoff between accuracy improvement and risk of overfitting.

Using seaborn and matplotlib, we conducted an in-depth exploratory data analysis.

Linear regression was implemented using sklearn.

You might also like