Education
 

Uniformly most powerful test

From Psychology Wiki

Community portal · Tasks to do · News · Help

Clinical · Educational · Ind&Org · Other fields · Professional · Transpersonal · World

Assessment | Biopsychology | Comparative | Cognitive | Developmental | Language
Personality | Philosophy | Research Methods | Social | Statistics

Statistics: Scientific method · Research methods · Experimental design · Undergraduate statistics courses · Statistical tests · Game theory · Decision theory


In statistical hypothesis testing, a uniformly most powerful (UMP) test is a hypothesis test which has the greatest power math among all possible tests of a given size α. For example, according to the Neyman-Pearson lemma, the likelihood-ratio test is UMP for testing simple (point) hypotheses.

Contents

[edit] Setting

Let math denote a random vector (corresponding to the measurements), taken from a parametrized family of probability density functions or probability mass functions math, which depends on the unknown deterministic parameter math. The parameter space math is partitioned into two disjoint sets math and math. Let math denote the hypothesis that math, and let math denote the hypothesis that math. The binary test of hypotheses is performed using a test function math.

math

meaning that math is in force if the measurement math and that math is in force if the measurement math. math is a disjoint covering of the measurement space.

[edit] Formal definition

A test function math is UMP of size math if for any other test function math we have:

math
math

[edit] The Karlin-Rubin theorem

The Karlin-Rubin theorem can be regarded as an extension of the Neyman-Pearson lemma for composite hypotheses. Consider a scalar measurement having a probability density function parameterized by a scalar parameter θ, and define the likelihood ratio math. If math is monotone non-decreasing for any pair math (meaning that the greater math is, the more likely math is), then the threshold test:

math
math

is the UMP test of size α for testing math

Note that exactly the same test is also UMP for testing math

[edit] Important case: The exponential family

Although the Karlin-Rubin may seem weak because of its restriction to scalar parameter and scalar measurement, it turns out that there exist a host of problems for which the theorem holds. In particular, the one-dimensional exponential family of probability density functions or probability mass functions with math has a monotone non-decreasing likelihood ratio in the sufficient statistic T(x), provided that math is non-decreasing.

[edit] Example

Let math denote i.i.d. normally distributed math-dimensional random vectors with mean math and covariance matrix math. We then have

math
math

which is exactly in the form of the exponential family shown in the previous section, with the sufficient statistic being

math

Thus, we conclude that the test

math
math

is the UMP test of size math for testing math vs. math

[edit] Further discussion

Finally, we note that in general, UMP tests do not exist for vector parameters or for two-sided tests (a test in which one hypothesis lies on both sides of the alternative). Why is it so?

The reason is that in these situations, the most powerful test of a given size for one possible value of the parameter (e.g. for math where math) is different than the most powerful test of the same size for a different value of the parameter (e.g. for math where math). As a result, no test is Uniformly most powerful.

[edit] References

  • L. L. Scharf, Statistical Signal Processing, Addison-Wesley, 1991, section 4.7.
Smallwikipedialogo.png This page uses content from the English-language version of Wikipedia. The original article was at Uniformly most powerful test. The list of authors can be seen in the page history. As with Psychology Wiki, the text of Wikipedia is available under the GNU Free Documentation License.