mlprep / ML Breadth · hard · 12 min

Your model improves NDCG by 2% in offline evaluation but shows no movement in the A/B test. Walk me through your debugging checklist.

formulate your answer, then read on.

tldr

An offline-online gap has three root causes: (1) offline eval problems (stale data, biased labels, the wrong metric, an unrepresentative eval set); (2) experiment design (insufficient statistical power, a diluted treatment, implementation bugs); (3) metric misalignment (optimizing a proxy that doesn't drive the business metric). Debug in order: validate deployment → check statistical power → slice by affected population → verify metric correlation → extend runtime.
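
To make the "check statistical power" step concrete, here is a minimal sketch of a minimum detectable effect (MDE) calculation, assuming a two-arm test on a binary metric with a normal approximation. The baseline rate and per-arm traffic below are hypothetical placeholders.

```python
# Minimal sketch: minimum detectable effect (MDE) for a two-arm A/B test
# on a binary metric (e.g. click-through rate), normal approximation.
from scipy.stats import norm

def mde(baseline_rate: float, n_per_arm: int,
        alpha: float = 0.05, power: float = 0.8) -> float:
    """Smallest absolute lift detectable at the given significance and power."""
    z_alpha = norm.ppf(1 - alpha / 2)  # two-sided test
    z_power = norm.ppf(power)
    se = (2 * baseline_rate * (1 - baseline_rate) / n_per_arm) ** 0.5
    return (z_alpha + z_power) * se

# Hypothetical numbers: 5% baseline CTR, 50k users per arm.
lift = mde(baseline_rate=0.05, n_per_arm=50_000)
print(f"MDE: {lift:.4f} absolute ({lift / 0.05:.1%} relative)")
```

With these placeholder numbers the MDE works out to roughly a 7.7% relative lift; if a 2% NDCG gain plausibly moves the online metric by less than that, "no movement" is exactly what an underpowered test would show even when the model is genuinely better.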

follow-up

  • How do you compute the minimum detectable effect for an A/B test, and how do you decide whether your experiment has enough power?
  • What are surrogate metrics, and how do you validate that a surrogate reliably predicts the business metric before relying on it for launch decisions? (A sketch follows this list.)
  • You find that the NDCG improvement comes entirely from power users, but the A/B test's primary metric is averaged across all users. How do you surface this to stakeholders?
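
On the "verify metric correlation" step (and the surrogate-metric question above), a minimal sketch of one common validation: rank-correlate offline metric deltas with online business-metric deltas across past experiments. The `history` values are hypothetical placeholders, one row per previous A/B test.

```python
# Minimal sketch: validating offline NDCG delta as a surrogate for the
# online business metric, using results from historical experiments.
import numpy as np
from scipy.stats import spearmanr

history = np.array([
    # (offline NDCG delta, online business-metric delta): placeholder values
    (0.020,  0.004),
    (0.015,  0.002),
    (0.030,  0.006),
    (0.010, -0.001),
    (0.025,  0.005),
    (0.005,  0.000),
])

rho, p_value = spearmanr(history[:, 0], history[:, 1])
print(f"Spearman rho = {rho:.2f} (p = {p_value:.3f})")
```

A high, stable rank correlation across many past experiments is what justifies treating the offline metric as a launch surrogate; if it is weak, a 2% offline gain carries little predictive weight, and the observed offline-online gap is the expected outcome rather than a bug.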