Your model's accuracy dropped 8% over 2 weeks. Walk through your complete debugging process. How do you isolate the cause and decide whether to rollback, hotfix, or retrain?
Formulate your own answer first, then compare.
tldr
Debug in order: (1) validate that the metric drop is real and understand its temporal shape; (2) check serving infrastructure: feature pipeline health, null rates, recent code changes; (3) compare current feature distributions against a training-time baseline using the Population Stability Index (PSI); (4) run segmented error analysis to find where the failure concentrates; (5) classify the cause: pipeline failure → rollback or hotfix, genuine drift → retrain. A cliff suggests a code or pipeline regression; a gradual slide suggests drift. Assess blast radius before rolling back. Post-incident: close the observability gap that let an 8% drop accumulate undetected.
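Step (3) above can be sketched concretely. A minimal PSI implementation, assuming two 1-D NumPy arrays of a single feature's values (a training-time baseline sample and a recent serving sample); the quantile-binning choice and the epsilon clipping are implementation decisions, not a fixed standard:

```python
import numpy as np

def psi(baseline, current, n_bins=10):
    """Population Stability Index between two 1-D samples of one feature.

    Bin edges are quantiles of the baseline, so each baseline bin holds
    ~1/n_bins of the mass; PSI then measures how far the current sample's
    bin proportions have moved from that reference.
    """
    edges = np.quantile(baseline, np.linspace(0, 1, n_bins + 1))
    edges[0], edges[-1] = -np.inf, np.inf  # catch values outside the baseline range
    base_pct = np.histogram(baseline, bins=edges)[0] / len(baseline)
    curr_pct = np.histogram(current, bins=edges)[0] / len(current)
    eps = 1e-6  # guard against log(0) in empty bins
    base_pct = np.clip(base_pct, eps, None)
    curr_pct = np.clip(curr_pct, eps, None)
    return float(np.sum((curr_pct - base_pct) * np.log(curr_pct / base_pct)))
```

A common rule of thumb (a convention, not a hard standard): PSI < 0.1 is stable, 0.1–0.25 warrants investigation, and > 0.25 indicates a significant shift. Computing this per feature and sorting descending gives a cheap first-pass prioritization across a large feature set.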
follow-up
- How do you compute Population Stability Index and what threshold triggers investigation?
- Your feature pipeline has 200 features. How do you prioritize which ones to check first?
- You identify concept drift as the cause. How do you decide how much recent data to retrain on — and how do you validate the retrained model before deploying?