Dunnett's test

In statistics, Dunnett's test is a multiple comparison procedure developed by Canadian statistician Charles Dunnett to compare each of a number of treatments with a single control. Multiple comparisons to a control are also referred to as many-to-one comparisons.

History

Dunnett's test was developed in 1955; an updated table of critical values was published in 1964.

Multiple Comparisons Problem

The multiple comparisons, multiplicity or multiple testing problem occurs when one considers a set of statistical inferences simultaneously or infers a subset of parameters selected based on the observed values. The major issue in any discussion of multiple-comparison procedures is the question of the probability of Type I errors. Most differences among alternative techniques result from different approaches to the question of how to control these errors. The problem is in part technical; but it is really much more a subjective question of how you want to define the error rate and how large you are willing to let the maximum possible error rate be.
Dunnett's test is a well-known and widely used multiple comparison procedure for simultaneously comparing, by interval estimation or hypothesis testing, all active treatments with a control when sampling from a distribution where the normality assumption is reasonable.
Dunnett's test is designed to hold the familywise error rate at or below $\alpha$ when performing multiple comparisons of treatment groups with a control.
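To illustrate why the familywise error rate needs controlling at all, the following minimal sketch computes the chance of at least one Type I error when several tests are each run at the same per-test level. It assumes the tests are independent, which is an idealization: comparisons against a shared control are correlated, so the numbers only indicate the size of the problem. The function name `familywise_error_rate` is illustrative.

```python
def familywise_error_rate(alpha, m):
    """Probability of at least one false rejection among m independent
    tests, each carried out at per-test level alpha (idealized case)."""
    return 1 - (1 - alpha) ** m


# Uncorrected error rate for a growing number of comparisons at alpha = 0.05.
rates = {m: familywise_error_rate(0.05, m) for m in (1, 3, 5, 10)}
for m, rate in rates.items():
    print(f"{m:2d} comparisons: familywise error rate = {rate:.3f}")
```

Even a handful of uncorrected comparisons pushes the overall error rate well above the nominal per-test level, which is the gap a procedure such as Dunnett's is meant to close.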

Uses of Dunnett’s test

The original work on the multiple comparisons problem was done by Tukey and Scheffé. Their methods were general ones that considered all kinds of pairwise comparisons: Tukey's and Scheffé's methods allow any number of comparisons among a set of sample means. Dunnett's test, on the other hand, only compares one group with the others, addressing a special case of the multiple comparisons problem: pairwise comparisons of multiple treatment groups with a single control group. In the general case, where we compare each of the pairs, we make $k(k-1)/2$ comparisons (where $k$ is the number of groups), but in the treatment vs. control case we make only $k-1$ comparisons. If we were to use the more general Tukey's and Scheffé's methods in the treatment-and-control case, they could result in unnecessarily wide confidence intervals. Dunnett's test takes into account the special structure of comparing treatments against a control, yielding narrower confidence intervals.
It is very common to use Dunnett's test in medical experiments, for example comparing blood count measurements on three groups of animals, one of which served as a control while the other two were treated with two different drugs. Another common use of this method is among agronomists: agronomists may want to study the effect of certain chemicals added to the soil on crop yield, so they will leave some plots untreated and compare them to the plots where chemicals were added to the soil.
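The difference in the number of comparisons between the all-pairs setting and the many-to-one setting described in this section can be made concrete with a short sketch. The helper names `all_pairwise` and `many_to_one` are hypothetical, and `k` counts all groups including the control.

```python
from math import comb


def all_pairwise(k):
    """Number of comparisons when every pair of k groups is compared."""
    return comb(k, 2)


def many_to_one(k):
    """Number of comparisons when k - 1 treatments are each compared
    with a single control."""
    return k - 1


for k in (3, 4, 6, 10):
    print(f"k = {k:2d}: all pairs = {all_pairwise(k):2d}, "
          f"treatments vs. control = {many_to_one(k)}")
```

With fewer simultaneous statements to protect, a many-to-one procedure can use smaller critical values, which is where the narrower intervals come from.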

Formal description of Dunnett's test

Dunnett's test is performed by computing a Student's t-statistic for each experimental, or treatment, group where the statistic compares the treatment group to a single control group. Since each comparison has the same control in common, the procedure incorporates the dependencies between these comparisons. In particular, the t-statistics are all derived from the same estimate of the error variance which is obtained by pooling the sums of squares for error across all groups. The formal test statistic for Dunnett's test is either the largest in absolute value of these t-statistics, or the most negative or most positive of the t-statistics.
In Dunnett's test we can use a common table of critical values, but more flexible options are nowadays readily available in many statistics packages such as R. The critical values for any given percentage point depend on: whether a one- or two-tailed test is performed; the number of groups being compared; and the overall number of trials.
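The t-statistics described above can be sketched in a few lines. This is a minimal illustration rather than a full implementation: it computes only the statistics and the error degrees of freedom, and leaves the critical value to a table or a statistics package. The function name `dunnett_t_statistics` and the data are hypothetical.

```python
from math import sqrt
from statistics import fmean


def dunnett_t_statistics(control, treatments):
    """Return one t-statistic per treatment group (treatment vs. control)
    and the error degrees of freedom, using a variance pooled over ALL groups."""
    groups = [control, *treatments]
    means = [fmean(g) for g in groups]
    # Pool the within-group sums of squares across every group.
    ss_error = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    df = sum(len(g) for g in groups) - len(groups)
    s = sqrt(ss_error / df)
    t_stats = [
        (m - means[0]) / (s * sqrt(1 / len(g) + 1 / len(control)))
        for g, m in zip(treatments, means[1:])
    ]
    return t_stats, df


# Hypothetical data: one control and two treatments of unequal size.
control = [10, 12, 14]
treat_a = [15, 17]
treat_b = [11, 13, 12, 12]

t_stats, df = dunnett_t_statistics(control, [treat_a, treat_b])
print("t-statistics:", [round(t, 3) for t in t_stats], "df:", df)

# Candidate test statistics: largest absolute value (two-sided),
# or the most positive / most negative value (one-sided).
max_abs_t = max(abs(t) for t in t_stats)
```

Because every statistic shares the same control mean and the same pooled standard deviation, the statistics are correlated, which is why they are referred to a joint distribution rather than to separate Student's t tables.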

Assumptions

The analysis considers the case where the results of the experiment are numerical, and the experiment is performed to compare $p$ treatments with a control group. The results can be summarized as a set of $p+1$ calculated means of the sets of observations, $(\bar{X}_0, \ldots, \bar{X}_p)$, where $\bar{X}_1, \ldots, \bar{X}_p$ refer to the treatments and $\bar{X}_0$ refers to the control set of observations, and $s$ is an independent estimate of the common standard deviation of all $p+1$ sets of observations. The observations in all $p+1$ sets are assumed to be independently and normally distributed with a common variance $\sigma^2$ and means $\mu_i$. There is also an assumption that there is an available estimate $s^2$ for $\sigma^2$.

Calculation

Dunnett's test is based on calculating confidence statements about the true or expected values of the $p$ differences $\bar{X}_i - \bar{X}_0$, that is, the differences between each treatment group's mean and the control group's mean. The procedure ensures that the probability of all $p$ statements being simultaneously correct is equal to a specified value, $P$. When calculating a one-sided upper confidence interval for the true value of the difference between the mean of a treatment and the control group, $P$ is the probability that this true value will be less than the upper limit of that interval. When calculating a two-sided confidence interval, $P$ is the probability that the true value will lie between the upper and the lower limits.
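First, denote the available $N$ observations by $X_{ij}$, where $i = 0, 1, \ldots, p$ indexes the group ($i = 0$ being the control) and $j = 1, \ldots, N_i$, and estimate the common variance $\sigma^2$ by, for example:
$$s^2 = \frac{\sum_{i=0}^{p} \sum_{j=1}^{N_i} (X_{ij} - \bar{X}_i)^2}{n}$$
where $\bar{X}_i$ is the mean of group $i$, $N_i$ is the number of observations in group $i$, and $n = \sum_{i=0}^{p} N_i - (p+1)$ is the number of degrees of freedom. As mentioned before, we would like to obtain separate confidence limits for each of the differences $\mu_i - \mu_0$ $(i = 1, \ldots, p)$, where $\mu_i$ is the true mean of group $i$, such that the probability that all $p$ confidence intervals contain the corresponding $\mu_i - \mu_0$ is equal to the specified joint confidence coefficient $P$.
We will consider the general case where there are $p$ treatment groups and one control group. We will write:
$$z_i = \frac{\bar{X}_i - \bar{X}_0 - (\mu_i - \mu_0)}{\sqrt{\frac{1}{N_i} + \frac{1}{N_0}}}$$
We will also write $t_i = z_i / s$, which follows the Student's t-distribution with $n$ degrees of freedom. The lower confidence limits with joint confidence coefficient $P$ for the $p$ treatment effects $\mu_i - \mu_0$ $(i = 1, \ldots, p)$ will be given by:
$$\bar{X}_i - \bar{X}_0 - d_i' \, s \sqrt{\frac{1}{N_i} + \frac{1}{N_0}}, \quad i = 1, \ldots, p$$
and the $p$ constants $d_i'$ are chosen so that $\Pr(t_1 < d_1', \ldots, t_p < d_p') = P$.
Similarly, the upper limits will be given by:
$$\bar{X}_i - \bar{X}_0 + d_i' \, s \sqrt{\frac{1}{N_i} + \frac{1}{N_0}}, \quad i = 1, \ldots, p$$
For bounding $\mu_i - \mu_0$ in both directions, the following interval might be taken:
$$\bar{X}_i - \bar{X}_0 \pm d_i'' \, s \sqrt{\frac{1}{N_i} + \frac{1}{N_0}}, \quad i = 1, \ldots, p$$
where the $d_i''$ are chosen to satisfy $\Pr(|t_1| < d_1'', \ldots, |t_p| < d_p'') = P$.
The values of $d_i''$ for the two-sided test and $d_i'$ for the one-sided test are given in the tables. An updated table of critical values was published in 1964.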
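When no table is at hand, the constants can be approximated by simulation. The following is a minimal sketch, assuming a balanced design (equal group sizes), in which a single constant serves all treatments. It draws the statistics under the hypothesis of no treatment effect and takes an empirical quantile of their maximum. The function name `simulate_critical_values`, its parameters, and the simulation size are illustrative choices, not part of any library.

```python
import random
from math import sqrt


def simulate_critical_values(p, df, level=0.95, n_sims=100_000, seed=0):
    """Monte Carlo estimate of the one-sided and two-sided constants for
    p treatments vs. one control, equal group sizes, df error degrees of freedom."""
    rng = random.Random(seed)
    one_sided, two_sided = [], []
    for _ in range(n_sims):
        e0 = rng.gauss(0, 1)                         # standardized control mean
        s = sqrt(rng.gammavariate(df / 2, 2) / df)   # sqrt(chi-square_df / df)
        # With equal group sizes, z_i = (e_i - e0) / sqrt(2) has unit variance.
        t = [(rng.gauss(0, 1) - e0) / (sqrt(2) * s) for _ in range(p)]
        one_sided.append(max(t))
        two_sided.append(max(abs(v) for v in t))
    one_sided.sort()
    two_sided.sort()
    idx = min(int(level * n_sims), n_sims - 1)
    return one_sided[idx], two_sided[idx]


d_one, d_two = simulate_critical_values(p=3, df=8)
print(f"one-sided: {d_one:.2f}, two-sided: {d_two:.2f}")
```

For three treatments and 8 degrees of freedom the estimates should land near the tabulated values 2.42 and 2.88 quoted in the example below, up to Monte Carlo error. Unequal group sizes change the correlation between the statistics and would need a different simulation of the group means.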

Examples

Breaking strength of fabric

The following example was adapted from one given by Villars. The data represent measurements of the breaking strength of fabric treated by three different chemical processes, compared with a standard method of manufacture.

	standard	process 1	process 2	process 3
	55	55	55	50
	47	64	49	44
	48	64	52	41
Means	50	61	52	45
Variance	19	27	9	21

Here, $p = 3$ and $N = 3$. The average variance is $s^2 = 19$, which is an estimate of the common variance of the four sets with $(p+1)(N-1) = 8$ degrees of freedom.
This can be calculated as follows:
$$s^2 = \frac{19 + 27 + 9 + 21}{4} = 19$$
The standard deviation is $s = \sqrt{19} \approx 4.36$ and the estimated standard error of a difference between two means is $s\sqrt{2/N} = 4.36\sqrt{2/3} \approx 3.56$.
The quantity which must be added to and/or subtracted from the observed differences between the means to give their confidence limits has been called by Tukey an "allowance" and is given by $A = t \, s \sqrt{2/N}$, where $t$ is drawn from the multivariate t-distribution, or can be obtained from Dunnett's Table 1 if one-sided limits are desired or from Dunnett's Table 2 if two-sided limits are wanted.
For $p = 3$ and d.f. $= 8$, $t = 2.42$ for one-sided limits and $t = 2.88$ for two-sided limits at joint confidence $P = 95\%$. Analogous values of $t$ can be determined from the tables if $P = 99\%$ confidence is required.
For one-sided limits, the allowance is $A = (2.42)(3.56) \approx 8.6$, rounded up to 9, and the experimenter can conclude that:

The breaking strength using process 1 exceeds the standard by at least $61 - 50 - 9 = 2$.
The breaking strength using process 2 exceeds the standard by at least $52 - 50 - 9 = -7$.
The breaking strength using process 3 exceeds the standard by at least $45 - 50 - 9 = -14$.

The joint statement consisting of the above three conclusions has a confidence coefficient of 95%, i.e., in the long run, 95% of such joint statements will actually be correct. Upper limits for the three differences could be obtained in an analogous manner.
For two-sided limits, the allowance is $A = (2.88)(3.56) \approx 10.3$, rounded up to 11, and the experimenter can conclude that:

The breaking strength using process 1 exceeds the standard by an amount between $61 - 50 - 11 = 0$ and $61 - 50 + 11 = 22$.

The breaking strength using process 2 exceeds the standard by an amount between $52 - 50 - 11 = -9$ and $52 - 50 + 11 = 13$.

The breaking strength using process 3 exceeds the standard by an amount between $45 - 50 - 11 = -16$ and $45 - 50 + 11 = 6$.

The joint confidence coefficient for these three statements is greater than 95%.
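The arithmetic of this example can be retraced with a short script. It is a sketch under two stated assumptions: the critical values 2.42 and 2.88 are taken from the text rather than computed, and each allowance is rounded up to the next integer so that it matches the 9 and 11 used above. All variable names are illustrative.

```python
from math import ceil, sqrt
from statistics import fmean, variance

standard = [55, 47, 48]
processes = {
    "process 1": [55, 64, 64],
    "process 2": [55, 49, 52],
    "process 3": [50, 44, 41],
}

n_per_group = len(standard)
# Equal group sizes, so the pooled variance is the plain average of the
# four sample variances (19, 27, 9, 21).
s2 = fmean([variance(standard)] + [variance(g) for g in processes.values()])
se_diff = sqrt(s2) * sqrt(2 / n_per_group)   # standard error of a difference

T_ONE_SIDED, T_TWO_SIDED = 2.42, 2.88        # tabulated values quoted in the text
allow_one = ceil(T_ONE_SIDED * se_diff)      # rounded up (assumption, see lead-in)
allow_two = ceil(T_TWO_SIDED * se_diff)

diffs = {name: fmean(g) - fmean(standard) for name, g in processes.items()}
lower_one = {name: d - allow_one for name, d in diffs.items()}
two_sided = {name: (d - allow_two, d + allow_two) for name, d in diffs.items()}

for name in processes:
    lo, hi = two_sided[name]
    print(f"{name}: difference {diffs[name]:+.0f}, "
          f"one-sided lower limit {lower_one[name]:+.0f}, "
          f"two-sided limits [{lo:+.0f}, {hi:+.0f}]")
```

Only process 1 has a one-sided lower limit above zero, so it is the only process for which the data support an improvement over the standard at the 95% joint confidence level.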