六个统计上的错误, 而且被常用到
This article discusses a number of incorrect statements appearing in textbooks on data analysis, machine learning, or computational methods; the common theme in all these cases is the relevance and application of statistics to the study of scientific or engineering data; these mistakes are also quite prevalent in the research literature. Crucially, we do not address errors made by an individual author, focusing instead on mistakes that are widespread in the introductory literature. After some background on frequentist and Bayesian linear regression, we turn to our six paradigmatic cases, providing in each instance a specific example of the textbook mistake, pointers to the specialist literature where the topic is handled properly, along with a correction that summarizes the salient points. The mistakes (and corrections) are broadly relevant to any technical setting where statistical techniques are used to draw practical conclusions, ranging from topics introduced in an elementary course on experimental measurements all the way to more involved approaches to regression.
A. Maximum-likelihood parameter estimation
小量的时候, MLE不能用于uncertainty
B. Chi-squared statistic and quality of fit不能过度追求ki^2/dof=1
C. Confidence intervals for model parameters概念问题, 实在想要置信范围的话, 用贝叶斯统计.
D. Empirical rule in the multivariate case 这个错误很常见E. What is random in frequentist vs Bayesian regression概念错误
F. Posterior predictive distribution, noise, and samples概念错误.
附件列表
标签:常用,literature,错误,frequentist,六个,mistakes,data,regression From: https://www.cnblogs.com/zouastro/p/16710623.html