Solution
We Trained on 750K Customers
We used two years of NPS and CSAT data—linked to service usage, care interactions, call logs, network incidents, and mobile app activity. A supervised ML model was trained and validated on over 750,000 customers, predicting satisfaction with 86% accuracy at the segment level.
Methodology
We followed a structured pipeline: (1) unify survey and operational data at customer level with consistent identifiers and time windows; (2) engineer features from usage, care, network, and app behavior; (3) train supervised models (e.g. gradient boosting) to predict NPS/CSAT segment, with cross-validation and holdout testing; (4) deploy weekly score updates and integrate outputs into CRM and care dashboards; (5) measure lift from proactive interventions vs. control segments.
Data sources
NPS and CSAT survey responses (two years; 750K+ linked customers). Service and usage data: plan type, tenure, data usage, call/SMS patterns. Care interactions: contact frequency, channel, resolution codes. Network and incident data: outages, latency, coverage. Mobile app activity: sessions, feature use, support requests. All joined at customer level with appropriate lag windows for prediction.