|
In statistics, the mean squared error or MSE of an estimator is one of many ways to quantify the amount by which an estimator differs from the true value of the quantity being estimated. As a loss function, MSE is called squared error loss. MSE measures the average of the square of the "error." The error is the amount by which the estimator differs from the quantity to be estimated. The difference occurs because of randomness or because the estimator doesn't account for information that could produce a more accurate estimate.[1] The MSE is the second moment (about the origin) of the error, and thus incorporates both the variance of the estimator and its bias. For an unbiased estimator, the MSE is the variance. Like the variance, MSE has the same unit of measurement as the square of the quantity being estimated. In an analogy to standard deviation, taking the square root of MSE yields the root mean squared error or RMSE, which has the same units as the quantity being estimated; for an unbiased estimator, the RMSE is the square root of the variance, known as the standard error.
Definition and basic propertiesIn statistics, Mean squared error is used in two distinct senses. EstimationThe MSE of an estimator The MSE can be written as the sum of the variance and the squared bias of the estimator The MSE thus assesses the quality of an estimator in terms of its variation and unbiasedness. Note that the MSE is not equivalent to the expected value of the absolute error. Since MSE is an expectation, it is a number, and not a random variable. It may be a function of the unknown parameter θ, but it does not depend on any random quantities. ResidualsIn a linear model and other regression models, the residuals, or estimated errors, are the differences between the observed data and fitted model, (the n in the denominator is often modified by a correction for degrees of freedom). In this case the MSE depends on data, and is a random variable. If the true errors have mean 0 and variance σ2, then the MSE is an estimate of σ2. ExamplesSuppose we have a random sample of size n from an identically distributed population, Some commonly-used estimators of the true parameters of the population, μ and σ2, are[2] shown in the following table (see notes for distribution requirements for the MSEs in the table related to variance estimators).
Note that:
InterpretationAn MSE of zero, meaning that the estimator While particular values of MSE other than zero are meaningless in and of themselves, they may be used for comparative purposes. Two or more statistical models may be compared using their MSEs as a measure of how well they explain a given set of observations: The unbiased model with the smallest MSE is generally interpreted as best explaining the variability in the observations. Both Analysis of Variance and Linear Regression techniques estimate MSE as part of the analysis and use the estimated MSE to determine the statistical significance of the factors or predictors under study. The goal of Design of Experiments is to construct experiments in such a way that when the observations are analyzed, the MSE is close to zero relative to the magnitude of at least one of the estimated treatment effects. MSE is also used in several stepwise regression techniques as part of the determination as to how many predictors from a candidate set to include in a model for a given set of observations. Applications
CriticismThe MSE is one of the most widely-used loss functions in statistics. Its widespread use stems more from mathematical convenience than considerations of actual loss in applications. Carl Friedrich Gauss, who introduced the use of mean squared error, was aware of its arbitrariness and was in agreement with objections to it on these grounds.[1] The mathematical benefits of mean squared error are particularly evident in its use at analyzing the performance of linear regression, as it allows one to partition the variation in a dataset into variation explained by the model and variation explained by randomness. The use of mean squared error without question has been criticized by the decision theorist J.O. Berger. Mean squared error conflicts with most losses derived from utility functions; mean squared error is convex everywhere, whereas most losses derived from utility theory have concave tails (and may be concave everywhere). There are, however, some scenarios where mean squared error can serve as a good approximation to a loss function occurring naturally in an application.[4] Like variance, mean squared error has the disadvantage of heavily weighting outliers.[5] This is a result of the squaring of each term, which effectively weights large errors more heavily than small ones. This property, undesirable in many applications, has led researchers to use alternatives such as the mean absolute error, or those based on the median. References
See also |
This article is from Wikipedia. All text is available under the terms of the GNU Free Documentation License.
Mercedes Car
This site monitored by SitePinger.net