# Odds ratio

34,142pages on
this wiki

The odds ratio is a measure of effect size particularly important in Bayesian statistics and logistic regression.

It is defined as the ratio of the odds of an event occurring in one group to the odds of it occurring in another group, or to a data-based estimate of that ratio. These groups might be men and women, an experimental group and a control group, or any other dichotomous classification. So if the probabilities of the event in each of the groups are p (first group) and q (second group), then the odds-ratio is:

${ p/(1-p) \over q/(1-q)}=\frac{\;p(1-q)\;}{\;(1-p)q\;}.$

An odds ratio of 1 indicates that the condition or event under study is equally likely in both groups. An odds ratio greater than 1 indicates that the condition or event is more likely in the first group. And an odds ratio less than 1 indicates that the condition or event is less likely in the first group. The odds ratio must be zero or greater than zero. As the odds of the first group approaches zero, the odds ratio approaches zero. As the odds of the second group approaches zero, the odds ratio approaches positive infinity.

For example, imagine that out of a sample of 100 men, 80 have drunk beer in the previous week, while a sample of 100 women reveals that only 20 have drunk beer during the previous seven days. The odds of a man drinking beer are 80 to 20, or 4:1 while the odds of a women drinking beer are only 20 to 80, or 1:4 = 0.25:1. Now, 4 divided by 0.25 equals 16. So the odds ratio is 16, showing that men are much more likely to drink beer than women. Using the above formula for the calculation yields:

${ 0.8/0.2 \over 0.2/0.8}=\frac{\;0.8\times 0.8\;}{\;0.2\times 0.2\;}={0.64 \over 0.04} = 16.$

This example also shows how odds ratios can sometimes seem to overstate relative positions: in this sample men are 4 times more likely to have drunk beer than women, but have 16 times the odds.

The logarithm of the odds-ratio is the difference of the logits of the probabilities.

The increased use of logistic regression in medical and social science research means that the odds-ratio is commonly used as a means of expressing the results in some forms of clinical trials, such as case-controlled trials, and in survey research. It is often abbreviated "OR" in reports. When data from multiple surveys is combined, it will often be expressed as "Pooled OR".