Walk me through your feature engineering process for a new tabular ML problem. What do you look at first, and what transformations do you commonly apply?
Formulate your own answer first, then compare —
tldr
Feature engineering pipeline: (1) understand data types and distributions; (2) handle missing values: impute and add a missingness indicator column; (3) transform numerics: log-transform skewed features, clip outliers, create domain ratios; (4) scale for non-tree models using training statistics only; (5) encode categoricals by cardinality (one-hot when low, target or frequency encoding when high); (6) decompose datetimes into cyclical sin/cos features. Always fit transformations on the training split only, then apply them to val/test. Tree models can skip scaling; linear models and NNs need it.
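A minimal sketch of steps (2)-(4) and (6) on a single toy numeric feature, using numpy only; the data here is synthetic and illustrative, not from any real problem:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical right-skewed feature with ~10% missing values.
train = rng.lognormal(mean=2.0, sigma=1.0, size=100)
test = rng.lognormal(mean=2.0, sigma=1.0, size=20)
train[rng.random(100) < 0.1] = np.nan

# (2) Impute with the TRAIN median and keep a missingness indicator.
med = np.nanmedian(train)
train_missing = np.isnan(train).astype(float)  # indicator feature
train_imp = np.where(np.isnan(train), med, train)
test_imp = np.where(np.isnan(test), med, test)  # same train-derived median

# (3) Log-transform the skewed numeric; log1p is safe at zero.
train_log = np.log1p(train_imp)
test_log = np.log1p(test_imp)

# (4) Standardize with TRAIN statistics only, then apply to test.
mu, sd = train_log.mean(), train_log.std()
train_scaled = (train_log - mu) / sd
test_scaled = (test_log - mu) / sd

# (6) Cyclical encoding of a datetime component, e.g. hour of day,
# so hour 23 sits next to hour 0 in feature space.
hours = np.arange(24)
hour_sin = np.sin(2 * np.pi * hours / 24)
hour_cos = np.cos(2 * np.pi * hours / 24)
```

The key discipline is that `med`, `mu`, and `sd` come from the training split and are reused unchanged on test data.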
follow-up
- How do you prevent data leakage when using target encoding in cross-validation?
- Your dataset has a feature with 10% missing values. The missingness correlates with the target. How do you handle it?
- When would you use automated feature engineering (e.g., Featuretools) vs. manual domain-driven engineering?
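One common answer to the first follow-up is out-of-fold target encoding: each row is encoded using target statistics computed only from the other folds, so a row's own label never leaks into its feature. A hedged sketch (the function name and random fold assignment are illustrative, not a standard API):

```python
import numpy as np

def oof_target_encode(cat, y, n_splits=5, seed=0):
    """Out-of-fold target encoding: each row's encoding uses target
    means computed from the OTHER folds only, preventing leakage."""
    rng = np.random.default_rng(seed)
    folds = rng.integers(0, n_splits, size=len(y))  # random fold labels
    enc = np.empty(len(y), dtype=float)
    global_mean = y.mean()  # fallback for categories unseen out-of-fold
    for k in range(n_splits):
        mask = folds == k
        oof_cat, oof_y = cat[~mask], y[~mask]  # rows outside fold k
        means = {c: oof_y[oof_cat == c].mean() for c in np.unique(oof_cat)}
        enc[mask] = [means.get(c, global_mean) for c in cat[mask]]
    return enc
```

For test-time data, the encoding is fit once on the full training set; the out-of-fold scheme applies only to the training rows themselves.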