Multi-class classification model predicting sleep disorders from lifestyle and health metrics — focus on feature interpretability.
Sleep disorders affect 30% of adults and are linked to cardiovascular disease, depression, and reduced quality of life. Early screening using non-clinical data — lifestyle factors, physical metrics, and daily habits — can identify at-risk individuals before they develop chronic conditions. The challenge: distinguishing between multiple disorder types (insomnia vs sleep apnea) from overlapping symptoms.
The dataset contained health and lifestyle metrics for 400+ individuals across three classes: no disorder, insomnia, and sleep apnea. Engineered features included BMI, daily step count, physical activity minutes, stress level (self-reported scale 1-10), resting heart rate, sleep duration, age, and occupation type. Created composite features: activity-to-stress ratio and sleep efficiency index (sleep duration / time in bed).
Built comprehensive visualizations showing stress level as the strongest single predictor, followed by daily steps and BMI. Physical activity showed a protective effect — individuals with >7,000 daily steps had 60% lower predicted probability of sleep apnea. Interactive Seaborn plots enabled exploration of feature interactions and threshold effects.