mlprep

Explain the Central Limit Theorem in plain terms. Why does it matter for A/B testing, confidence intervals, and building ML systems? When does it break down?

formulate your answer, then —

tldr

CLT: sample means of any finite-variance distribution converge to N(μ, σ²/n) as n grows. This is why z-tests, t-tests, and confidence intervals work without knowing the true data distribution. Standard error = σ/√n — halving it requires 4× more data. Breaks down for heavy-tailed distributions (revenue, session time), very small n on skewed data, and correlated observations. Foundation of all frequentist inference and A/B testing.

follow-up

  • Why does averaging predictions in an ensemble reduce variance, and what does CLT have to do with it?
  • When would you use a bootstrap confidence interval instead of a CLT-based one?
  • How does the CLT break down when your metric has a heavy tail, and what alternatives exist?