Explain the Central Limit Theorem in plain terms. Why does it matter for A/B testing, confidence intervals, and building ML systems? When does it break down?
formulate your answer, then —
tldr
CLT: sample means of any finite-variance distribution converge to N(μ, σ²/n) as n grows. This is why z-tests, t-tests, and confidence intervals work without knowing the true data distribution. Standard error = σ/√n — halving it requires 4× more data. Breaks down for heavy-tailed distributions (revenue, session time), very small n on skewed data, and correlated observations. Foundation of all frequentist inference and A/B testing.
follow-up
- Why does averaging predictions in an ensemble reduce variance, and what does CLT have to do with it?
- When would you use a bootstrap confidence interval instead of a CLT-based one?
- How does the CLT break down when your metric has a heavy tail, and what alternatives exist?