Whoever reads this - please, please, please ignore the posts that suggest just playing with numbers. That's the equivalent of telling someone who wants to learn how to code to copy-paste formulas into Excel. Just don't be that person.
To be very blunt, in 2020 most ML is still glorified statistics, except you lose the insights and explanations. The only tangible improvement is sometimes random forests. 99% of the stuff you can do with basic statistics. 99% of the coders I know don't know any statistics beyond the mean (and even with that, they do senseless things like taking means of means).
So learn statistics - basic statistics, like in the "for dummies" book series.
If you want to be a little more practical, stats "for dummies" is often found in disciplines that depend on stats but aren't very strong in math - biology, psychology, and economics are great candidates.
So just grab basic biology stats (to learn how to compare means - this gives you the A/B test superpower), then psychology's factor analysis (to learn PCA - this gives you the dimension reduction superpower), then basic econometrics regression (to learn linear regression).
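Just to make those three concrete, here's a rough sketch in Python (toy data I made up, nothing from any particular book; assumes scipy and scikit-learn are installed):

    import numpy as np
    from scipy import stats
    from sklearn.decomposition import PCA
    from sklearn.linear_model import LinearRegression

    rng = np.random.default_rng(0)

    # 1. Comparing means (the A/B test superpower): two-sample t-test
    a = rng.normal(0.10, 0.05, size=500)   # metric under variant A
    b = rng.normal(0.12, 0.05, size=500)   # metric under variant B
    t_stat, p_value = stats.ttest_ind(a, b)
    print(f"t = {t_stat:.2f}, p = {p_value:.4f}")

    # 2. Dimension reduction (PCA): project 10 correlated features down to 2
    X = rng.normal(size=(500, 10))
    X[:, 1] = X[:, 0] + rng.normal(scale=0.1, size=500)  # make two columns correlated
    pca = PCA(n_components=2)
    X_reduced = pca.fit_transform(X)
    print("explained variance ratios:", pca.explained_variance_ratio_)

    # 3. Linear regression: fit a model and look at its coefficients
    y = 3.0 * X[:, 0] - 2.0 * X[:, 2] + rng.normal(scale=0.5, size=500)
    reg = LinearRegression().fit(X, y)
    print("coefficients:", reg.coef_.round(2))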
With these 3 superpowers, you will be able to do more than most of the "machine learning" people. When you have mastered those, try stuff like random forests, and see if you still think it's as cool as it's hyped to be.
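If you want to check the random forest hype for yourself, a quick comparison like this is usually enough to form an opinion (a sketch, not a benchmark; scikit-learn's built-in breast cancer dataset is just a convenient stand-in for small tabular data):

    from sklearn.datasets import load_breast_cancer
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import cross_val_score
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import StandardScaler

    # Compare plain (scaled) logistic regression against a random forest
    # on a small tabular dataset; on data like this the gap is often modest.
    X, y = load_breast_cancer(return_X_y=True)

    logit = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
    forest = RandomForestClassifier(n_estimators=200, random_state=0)

    print("logistic regression:", round(cross_val_score(logit, X, y, cv=5).mean(), 3))
    print("random forest:      ", round(cross_val_score(forest, X, y, cv=5).mean(), 3))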
Given that much of the data people run across is tabular, I appreciate your advice about the importance of statistics. Also, kudos for mentioning hypothesis testing (no one else in this thread mentioned it). Lastly, I'd add that ML practitioners gain a lot by listening to statisticians and economists on the issue of data quality, e.g. selection bias.
That said, I am not as cynical about "machine learning." ML and "data science" brought the importance of prediction front and center, i.e. can you fit a model that accurately predicts the target value for a previously unseen input? This point is made by the recently published stats textbook Computer Age Statistical Inference (Efron and Hastie).
In some applications, it may be beneficial to choose black box models with high predictive accuracy, as the goal for these applications is prediction, not interpreting individual model coefficients.
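To make that concrete, here's a minimal sketch (assuming scikit-learn; synthetic nonlinear data from make_friedman1, with gradient boosting standing in for a "black box"): both models are judged purely on held-out predictive accuracy, not on their coefficients.

    from sklearn.datasets import make_friedman1
    from sklearn.ensemble import GradientBoostingRegressor
    from sklearn.linear_model import LinearRegression
    from sklearn.metrics import r2_score
    from sklearn.model_selection import train_test_split

    # When the goal is prediction, score models on a held-out test split
    # rather than by reading individual coefficients.
    X, y = make_friedman1(n_samples=2000, noise=1.0, random_state=0)  # nonlinear toy data
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    linear = LinearRegression().fit(X_train, y_train)
    boosted = GradientBoostingRegressor(random_state=0).fit(X_train, y_train)

    print("linear model, test R^2:     ", round(r2_score(y_test, linear.predict(X_test)), 3))
    print("gradient boosting, test R^2:", round(r2_score(y_test, boosted.predict(X_test)), 3))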