Skip to content

kumarsubodh05/Telco_Customer_Churn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

10 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Perfect πŸ‘Œ Here is your corporate-level, ready-to-paste README.md for the Telco Customer Churn Prediction project.

You can directly copy this into GitHub.

πŸ“Š Telco Customer Churn Prediction Machine Learning-Based Retention Strategy Optimization πŸ“– Project Overview

Customer churn is one of the biggest challenges in subscription-based industries like telecommunications. Acquiring new customers is significantly more expensive than retaining existing ones.

This project builds a machine learning model to predict customer churn using demographic, service usage, and billing data. The objective is to identify high-risk customers early and enable proactive retention strategies.

🎯 Business Problem

Telecom companies face:

High market competition

Low switching barriers

Rising customer acquisition costs

The inability to identify customers likely to churn leads to revenue loss and inefficient retention campaigns.

Objective: Develop a predictive system to identify customers at risk of churning before they leave.

πŸ“Š Dataset Information

Total Customers: 7,043

Total Features: 20 (excluding target variable)

Target Variable: Churn (Yes/No)

Feature Categories:

Demographics

Service Usage

Contract Details

Billing Information

πŸ”Ž Exploratory Data Analysis (EDA)

Key insights discovered:

Month-to-month contracts show highest churn rates

Customers with low tenure churn more frequently

Higher monthly charges correlate with increased churn

Electronic check payment users exhibit higher churn behavior

βš™οΈ Data Preparation & Feature Engineering

Missing value handling

Categorical variable encoding

Feature scaling

Stratified 80/20 train-test split

Creation of tenure groups

Engineering billing and contract indicators

These steps improved model interpretability and predictive performance.

πŸ€– Modeling Approach Model Used:

Logistic Regression (Primary Model)

Evaluation Metrics:

Accuracy

Precision

Recall

F1-Score

Special focus was placed on Recall for churn (Class 1) to minimize missed churn cases.

πŸ“ˆ Model Performance

Overall Accuracy: 79%

Recall (Non-Churn): 94%

Recall (Churn): 43%

Precision (Churn): 72%

Key Insight:

The model performs well in identifying stable customers but requires improvement in detecting churn cases (false negatives).

πŸ’‘ Business Impact

Establishes a baseline predictive retention system

Identifies high-risk customer segments

Supports targeted retention campaigns

Enables data-driven decision making

False negatives (missed churn cases) highlight opportunity for further model optimization.

πŸš€ Strategic Recommendations

Improve churn recall using class balancing and advanced models

Focus retention efforts on high-risk segments:

Month-to-month contracts

Low tenure customers

High monthly charges

Electronic check users

Implement risk-based retention strategy instead of blanket campaign

Conclusion

This project demonstrates how predictive analytics can shift churn management from reactive response to proactive revenue protection strategy.

Author

Subodh Kumar Machine Learning & Data Analytics Enthusiast

About

Machine learning-based customer churn prediction system to identify high-risk telecom customers and enable data-driven retention strategies.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors