Back to Portfolio
Healthcare

Sleep Disorder Prediction

Multi-class classification model predicting sleep disorders from lifestyle and health metrics — focus on feature interpretability.

89%
F1-Score
3 Classes
Insomnia / Apnea / None
XGBoost
Best Model

Problem Statement

Sleep disorders affect 30% of adults and are linked to cardiovascular disease, depression, and reduced quality of life. Early screening using non-clinical data — lifestyle factors, physical metrics, and daily habits — can identify at-risk individuals before they develop chronic conditions. The challenge: distinguishing between multiple disorder types (insomnia vs sleep apnea) from overlapping symptoms.

Technical Approach

Data & Features

The dataset contained health and lifestyle metrics for 400+ individuals across three classes: no disorder, insomnia, and sleep apnea. Engineered features included BMI, daily step count, physical activity minutes, stress level (self-reported scale 1-10), resting heart rate, sleep duration, age, and occupation type. Created composite features: activity-to-stress ratio and sleep efficiency index (sleep duration / time in bed).

Model Selection

Feature Importance Analysis

Built comprehensive visualizations showing stress level as the strongest single predictor, followed by daily steps and BMI. Physical activity showed a protective effect — individuals with >7,000 daily steps had 60% lower predicted probability of sleep apnea. Interactive Seaborn plots enabled exploration of feature interactions and threshold effects.

Key Results

Tech Stack

PythonXGBoostScikit-learnSeabornPandasMatplotlibJupyter
Back to Portfolio View on GitHub