import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import classification_report, confusion_matrix, roc_auc_score
from sklearn.linear_model import LogisticRegression
from sklearn.ensemble import RandomForestClassifier
from xgboost import XGBClassifier
from imblearn.over_sampling import SMOTE
# Load dataset
df = pd.read_csv('/content/WA_Fn-UseC_-Telco-Customer-Churn.csv')
# Drop customerID
df.drop('customerID', axis=1, inplace=True)
# TotalCharges is read in as strings; coerce to numeric and drop the
# handful of rows left blank (they become NaN after coercion)
df['TotalCharges'] = pd.to_numeric(df['TotalCharges'], errors='coerce')
df.dropna(inplace=True)
# Encode target variable
df['Churn'] = df['Churn'].map({'Yes': 1, 'No': 0})
# Convert binary categorical features
binary_cols = ['gender', 'Partner', 'Dependents', 'PhoneService', 'PaperlessBilling']
for col in binary_cols:
    df[col] = df[col].map({'Yes': 1, 'No': 0, 'Male': 1, 'Female': 0})
# One-hot encode remaining categorical variables
df = pd.get_dummies(df, drop_first=True)
# Features and target
X = df.drop('Churn', axis=1)
y = df['Churn']
# Scale numerical features
scaler = StandardScaler()
num_cols = ['tenure', 'MonthlyCharges', 'TotalCharges']
X[num_cols] = scaler.fit_transform(X[num_cols])
# Balance the dataset using SMOTE
smote = SMOTE(random_state=42)
X_bal, y_bal = smote.fit_resample(X, y)
# Split the data
X_train, X_test, y_train, y_test = train_test_split(
    X_bal, y_bal, test_size=0.2, random_state=42)
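# Caveat: oversampling (and fitting the scaler) before the split lets
# information derived from test rows leak into training, so the scores
# below are likely somewhat optimistic. A minimal leakage-free sketch,
# not wired into the models below (names X_tr/X_te etc. are illustrative):
# split first, then resample only the training fold.
X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42)
X_tr_bal, y_tr_bal = SMOTE(random_state=42).fit_resample(X_tr, y_tr)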
# --- 1. Logistic Regression ---
logreg = LogisticRegression(max_iter=1000)
logreg.fit(X_train, y_train)
y_pred_log = logreg.predict(X_test)
print("Logistic Regression:")
print(classification_report(y_test, y_pred_log))
print("ROC-AUC:", roc_auc_score(y_test, logreg.predict_proba(X_test)
[:, 1]))
print("-" * 60)
Logistic Regression:
              precision    recall  f1-score   support

           0       0.81      0.78      0.80      1037
           1       0.79      0.82      0.80      1029

    accuracy                           0.80      2066
   macro avg       0.80      0.80      0.80      2066
weighted avg       0.80      0.80      0.80      2066

ROC-AUC: 0.880777135210056
------------------------------------------------------------
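# Optional sanity check: the confusion_matrix and seaborn imports above
# are otherwise unused; a quick sketch visualizing where the logistic
# regression errs (all names come from the cells above):
cm = confusion_matrix(y_test, y_pred_log)
sns.heatmap(cm, annot=True, fmt='d', cmap='Blues',
            xticklabels=['No churn', 'Churn'],
            yticklabels=['No churn', 'Churn'])
plt.xlabel('Predicted')
plt.ylabel('Actual')
plt.title('Logistic Regression Confusion Matrix')
plt.show()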
# --- 2. Random Forest ---
rf = RandomForestClassifier(n_estimators=100, random_state=42)
rf.fit(X_train, y_train)
y_pred_rf = rf.predict(X_test)
print("Random Forest:")
print(classification_report(y_test, y_pred_rf))
print("ROC-AUC:", roc_auc_score(y_test, rf.predict_proba(X_test)[:,
1]))
print("-" * 60)
Random Forest:
              precision    recall  f1-score   support

           0       0.84      0.82      0.83      1037
           1       0.82      0.85      0.83      1029

    accuracy                           0.83      2066
   macro avg       0.83      0.83      0.83      2066
weighted avg       0.83      0.83      0.83      2066

ROC-AUC: 0.9135555861688939
------------------------------------------------------------
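# Optional: 5-fold cross-validated ROC-AUC for the random forest, using
# the cross_val_score import above. A sketch only; because X_bal/y_bal
# contain synthetic samples, these scores share the optimism noted earlier.
cv_auc = cross_val_score(rf, X_bal, y_bal, cv=5, scoring='roc_auc')
print("CV ROC-AUC: %.3f +/- %.3f" % (cv_auc.mean(), cv_auc.std()))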
# --- 3. XGBoost ---
# use_label_encoder is deprecated and ignored by recent XGBoost releases
xgb = XGBClassifier(eval_metric='logloss', random_state=42)
xgb.fit(X_train, y_train)
y_pred_xgb = xgb.predict(X_test)
print("XGBoost Classifier:")
print(classification_report(y_test, y_pred_xgb))
print("ROC-AUC:", roc_auc_score(y_test, xgb.predict_proba(X_test)[:,
1]))
print("-" * 60)
XGBoost Classifier:
              precision    recall  f1-score   support

           0       0.84      0.81      0.82      1037
           1       0.81      0.84      0.83      1029

    accuracy                           0.83      2066
   macro avg       0.83      0.83      0.83      2066
weighted avg       0.83      0.83      0.83      2066

ROC-AUC: 0.9048410933460034
------------------------------------------------------------
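# Optional: overlay the three ROC curves on one axis for comparison.
# A sketch assuming scikit-learn >= 1.0, where RocCurveDisplay is available.
from sklearn.metrics import RocCurveDisplay
fig, ax = plt.subplots(figsize=(8, 6))
for name, model in [('Logistic Regression', logreg),
                    ('Random Forest', rf), ('XGBoost', xgb)]:
    RocCurveDisplay.from_estimator(model, X_test, y_test, name=name, ax=ax)
plt.title('ROC Curves on the Test Set')
plt.show()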
# Feature Importance from Random Forest
importances = rf.feature_importances_
indices = np.argsort(importances)[-10:]
features = X.columns[indices]
plt.figure(figsize=(10, 6))
plt.title("Top 10 Feature Importances (Random Forest)")
plt.barh(range(len(indices)), importances[indices], align="center")
plt.yticks(range(len(indices)), features)
plt.xlabel("Relative Importance")
plt.show()
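# Optional cross-check: impurity-based importances can overweight
# high-cardinality features; permutation importance on the test set is a
# common alternative. A sketch (n_repeats and scoring are illustrative):
from sklearn.inspection import permutation_importance
perm = permutation_importance(rf, X_test, y_test, n_repeats=10,
                              random_state=42, scoring='roc_auc')
for i in np.argsort(perm.importances_mean)[-10:]:
    print(f"{X.columns[i]}: {perm.importances_mean[i]:.4f}")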