Back to Portfolio
Customer Analytics

Customer Churn Prediction

Classification model predicting telecom customer churn — focused on high recall to enable targeted retention before customers leave.

85%
Recall
4/5
Churners Identified
XGBoost
Best Model

Problem Statement

Customer churn costs telecom companies billions annually. Identifying at-risk customers before they leave enables targeted retention campaigns. The challenge: churn is rare (~15-20%), requiring models optimized for recall over precision to catch as many churners as possible.

Technical Approach

Model Development

Feature Engineering

Engineered 40+ behavioral features from raw telecom subscription data: usage pattern velocity (week-over-week changes in call minutes and data usage), customer service interaction frequency, payment delay indicators, and contract lifecycle stage. The top three predictors: contract type (month-to-month vs annual), tenure length, and monthly charge amount.

Threshold Optimization

Tuned classification threshold to maximize recall while keeping precision above 50%. This ensured marketing teams could act on predictions without an overwhelming false-positive rate. Delivered actionable customer segments ranked by churn probability for phased retention outreach.

Key Results

Tech Stack

PythonXGBoostScikit-learnPandasSeabornMatplotlibJupyter
Back to Portfolio View on GitHub