Levene's test


In statistics, Levene's test is an inferential statistic used to assess the equality of variances for a variable calculated for two or more groups. Some common statistical procedures assume that variances of the populations from which different samples are drawn are equal. Levene's test assesses this assumption. It tests the null hypothesis that the population variances are equal. If the resulting p-value of Levene's test is less than some significance level, the obtained differences in sample variances are unlikely to have occurred based on random sampling from a population with equal variances. Thus, the null hypothesis of equal variances is rejected and it is concluded that there is a difference between the variances in the population.
Some of the procedures typically assuming homoscedasticity, for which one can use Levene's tests, include analysis of variance and t-tests.
Levene's test is often used before a comparison of means. When Levene's test shows significance, one should switch to more generalized tests that is free from homoscedasticity assumptions. Welch's t-test, or unequal variances t-test are more conservative test.
Levene's test may also be used as a main test for answering a stand-alone question of whether two sub-samples in a given population have equal or different variances.

Definition

Levene's test is equivalent to a 1-way between-groups analysis of variance with the dependent variable being the absolute value of the difference between a score and the mean of the group to which the score belongs. The test statistic,, is equivalent to the statistic that would be produced by such an ANOVA, and is defined as follows:
where
The test statistic is approximately F-distributed with and degrees of freedom, and hence is the significance of the outcome of tested against where is a quantile of the F-distribution, with and degrees of freedom, and is the chosen level of significance.

Comparison with the Brown–Forsythe test

The Brown–Forsythe test uses the median instead of the mean in computing the spread within each group. Although the optimal choice depends on the underlying distribution, the definition based on the median is recommended as the choice that provides good robustness against many types of non-normal data while retaining good statistical power. If one has knowledge of the underlying distribution of the data, this may indicate using one of the other choices. Brown and Forsythe performed Monte Carlo studies that indicated that using the trimmed mean performed best when the underlying data followed a Cauchy distribution and the median performed best when the underlying data followed a chi-squared distribution with four degrees of freedom. Using the mean provided the best power for symmetric, moderate-tailed, distributions.