What if Levene’s Test is “Significant”? (2024)

An assumption required for ANOVA is homogeneity of variances. We often run Levene’s test to check if this holds. But what if it doesn't? This tutorial walks you through.

  • SPSS ANOVA Dialogs I
  • Results I - Levene’s Test “Significant”
  • SPSS ANOVA Dialogs II
  • Results II - Welch and Games-Howell Tests
  • Plan B - Kruskal-Wallis Test

Example Data

All analyses in this tutorial use staff.sav, part of which is shown below. We encourage you to download these data and replicate our analyses.

[Screenshot: part of staff.sav]

Our data contain some details on a sample of N = 179 employees. The research question for today is: is salary associated with region? We'll try to support this claim by rejecting the null hypothesis that all regions have equal mean population salaries. A likely analysis for this is an ANOVA, but this requires a couple of assumptions.

ANOVA Assumptions

An ANOVA requires 3 assumptions:

  1. independent observations;
  2. normality: the dependent variable must follow a normal distribution within each subpopulation;
  3. homogeneity: the variance of the dependent variable must be equal over all subpopulations.

With regard to our data, independent observations seem plausible: each record represents a distinct person and people didn't interact in any way that's likely to affect their answers.

Second, normality is only needed for small sample sizes of, say, N < 25 per subgroup. We'll inspect if our data meet this requirement in a minute.

Last, homogeneity is only needed if sample sizes are sharply unequal. If so, we usually run Levene's test. This procedure tests if 2+ population variances are all likely to be equal.
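To make the idea behind Levene's test concrete, here's a minimal Python sketch. It is not part of the SPSS workflow in this tutorial. It assumes the classic version of the test, which boils down to a one-way ANOVA on the absolute deviations of each score from its own group mean. The function name `levene_w` and the toy numbers are made up for illustration and are not taken from staff.sav.

```python
from statistics import mean

def levene_w(groups):
    """Levene's W statistic based on absolute deviations from each group mean."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    group_means = [mean(g) for g in groups]
    # Absolute deviation of every observation from its own group mean.
    z = [[abs(y - m) for y in g] for g, m in zip(groups, group_means)]
    z_means = [mean(zi) for zi in z]
    z_grand = sum(sum(zi) for zi in z) / n_total
    between = sum(len(zi) * (zm - z_grand) ** 2 for zi, zm in zip(z, z_means))
    within = sum((v - zm) ** 2 for zi, zm in zip(z, z_means) for v in zi)
    return (n_total - k) / (k - 1) * between / within

# Made-up scores, NOT taken from staff.sav.
same_spread = [[1, 2, 3, 4], [11, 12, 13, 14]]   # shifted copies: equal variances
diff_spread = [[1, 2, 3, 4], [10, 20, 30, 40]]   # second group is far more spread out
w_same = levene_w(same_spread)
w_diff = levene_w(diff_spread)
print(w_same, w_diff)
```

W is compared against an F-distribution with k - 1 and N - k degrees of freedom. A large W (and hence a small p-value) suggests that the population variances are not all equal.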

Quick Data Check

Before running our ANOVA, let's first see if the reported salaries are even plausible. The best way to do so is inspecting a histogram which we'll create by running the syntax below.

*Run basic histogram on salary.

frequencies salary
/format notable
/histogram.

Result

[Figure: histogram of salary]
  • Note that our histogram reports N = 175 rather than our N = 179 respondents. This implies that salary contains 4 missing values.
  • The frequency distribution, however, looks plausible: there are no clear outliers or other abnormalities that should ring any alarm bells.
  • The distribution shows some positive skewness. However, this makes perfect sense and is no cause for concern.

Let's now proceed to the actual ANOVA.

SPSS ANOVA Dialogs I

After opening our data in SPSS, let's first navigate to Analyze - General Linear Model - Univariate as shown below.

[Screenshot: Analyze - General Linear Model - Univariate menu]

Let's now fill in the dialog that opens as shown below.

[Screenshot: Univariate dialog]

Completing these steps results in the syntax below. Let's run it.

*ANOVA with descriptive statistics, Levene's test and effect size: (partial) eta squared.

UNIANOVA salary BY region
/METHOD=SSTYPE(3)
/INTERCEPT=INCLUDE
/PRINT ETASQ DESCRIPTIVE HOMOGENEITY
/CRITERIA=ALPHA(.05)
/DESIGN=region.

Results I - Levene’s Test “Significant”

The very first thing we inspect are the sample sizes used for our ANOVA and Levene’s test as shown below.

[Tables: Descriptive Statistics and Levene’s test]
  • First off, note that our Descriptive Statistics table is based on N = 171 respondents (bottom row). This is due to some missing values in both region and salary.
  • Second, sample sizes for “North” and “East” are rather small. We may therefore need the normality assumption. For now, let's just assume it's met.
  • Next, our sample sizes are sharply unequal so we really need to meet the homogeneity of variances assumption.
  • However, Levene’s test is statistically significant because its p < 0.05: we reject its null hypothesis of equal population variances.

The combination of these last 2 points implies that we cannot interpret or report the F-test shown in the table below.

[Table: ANOVA results with F-test and eta squared]

As discussed, we can't rely on this p-value for the usual F-test.

However, we can still interpret eta squared (often written as η²). This is a descriptive statistic that neither requires normality nor homogeneity. η² = 0.046 implies a small to medium effect size for our ANOVA.
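As a rough illustration of what eta squared measures, the Python sketch below computes it as the between-groups sum of squares divided by the total sum of squares. In a one-way design like ours, the partial eta squared requested in the syntax above is the same number as plain eta squared. The function name `eta_squared` and the toy data are made up for illustration and are not taken from staff.sav.

```python
def eta_squared(groups):
    """Eta squared = SS_between / SS_total for a one-way layout."""
    values = [y for g in groups for y in g]
    grand_mean = sum(values) / len(values)
    # Total variation of all scores around the grand mean.
    ss_total = sum((y - grand_mean) ** 2 for y in values)
    # Variation of the group means around the grand mean, weighted by group size.
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    return ss_between / ss_total

# Made-up scores, NOT taken from staff.sav.
toy = [[1, 2, 3], [4, 5, 6]]
eta2 = eta_squared(toy)
print(round(eta2, 3))  # 13.5 / 17.5, roughly 0.771
```

A commonly cited rule of thumb treats values around 0.01, 0.06 and 0.14 as small, medium and large effects, which is why 0.046 reads as small to medium.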

Now, if we can't interpret our F-test, then how can we know if our mean salaries differ? Two good alternatives are:

  • running an ANOVA with the Welch statistic or
  • a Kruskal-Wallis test.

Let's start off with the Welch statistic.

SPSS ANOVA Dialogs II

For inspecting the Welch statistic, first navigate to Analyze - Compare Means - One-Way ANOVA as shown below.

[Screenshot: Analyze - Compare Means - One-Way ANOVA menu]

Next, we'll fill out the dialogs that open as shown below.

[Screenshot: One-Way ANOVA dialogs]

This results in the syntax below. Again, let's run it.

*ANOVA with Welch statistic and Games-Howell post hoc tests.

ONEWAY salary BY region
/STATISTICS HOMOGENEITY WELCH
/MISSING ANALYSIS
/POSTHOC=GH ALPHA(0.05).

Results II - Welch and Games-Howell Tests

As shown below, the Welch test rejects the null hypothesis of equal population means.

[Table: Robust Tests... with the Welch statistic]

This table is labelled “Robust Tests...” because it's robust to a violation of the homogeneity assumption as indicated by Levene’s test. So we now conclude that mean salaries are not equal over all regions.
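For readers who wonder what makes the Welch statistic robust: it weights each group mean by n / s², so groups with large variances and small samples count for less. The Python sketch below follows the textbook formula for Welch's one-way test. We assume (but haven't verified against SPSS output) that this is what SPSS reports. The function name `welch_anova` and the toy data are made up for illustration and are not taken from staff.sav.

```python
from statistics import mean, variance

def welch_anova(groups):
    """Return (F, df1, df2) for Welch's one-way test of equal means."""
    k = len(groups)
    n = [len(g) for g in groups]
    m = [mean(g) for g in groups]
    w = [ni / variance(g) for ni, g in zip(n, groups)]   # weight = n / s^2
    w_sum = sum(w)
    m_w = sum(wi * mi for wi, mi in zip(w, m)) / w_sum    # variance-weighted grand mean
    numer = sum(wi * (mi - m_w) ** 2 for wi, mi in zip(w, m)) / (k - 1)
    lam = sum((1 - wi / w_sum) ** 2 / (ni - 1) for wi, ni in zip(w, n))
    denom = 1 + 2 * (k - 2) / (k ** 2 - 1) * lam
    return numer / denom, k - 1, (k ** 2 - 1) / (3 * lam)

# Made-up scores, NOT taken from staff.sav.
a = [30, 32, 35, 31, 33]
b = [40, 55, 38, 61, 47, 52]
f_stat, df1, df2 = welch_anova([a, b])
# With only 2 groups, Welch's F should equal the squared unequal-variances t statistic.
t_sq = (mean(a) - mean(b)) ** 2 / (variance(a) / len(a) + variance(b) / len(b))
print(f_stat, t_sq, df1, df2)
```

Note that the second degrees of freedom are usually not a whole number. That's expected for this test.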

But precisely which regions differ with regard to mean salaries? This is answered by inspecting post hoc tests. And if the homogeneity assumption is violated, we usually prefer Games-Howell as shown below.

[Table: Games-Howell post hoc tests]

Note that each comparison is shown twice in this table. The only regions whose mean salaries differ “significantly” are North and Top 4 City.
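The Python sketch below illustrates two things about such a table. First, with 5 regions there are 10 unique pairs, but a table listing every ordered pair has 20 rows. Second, for each pair, Games-Howell uses a test statistic built from separate variances plus Welch-Satterthwaite degrees of freedom, which is then referred to the studentized range distribution (that last step is left out here). The function name `games_howell_stat` and the toy data are made up for illustration; the region codes 1 through 5 are assumed from the `region(1 5)` specification used later in this tutorial.

```python
from itertools import combinations, permutations
from math import sqrt
from statistics import mean, variance

def games_howell_stat(x, y):
    """Return (q, df) for one pairwise comparison without pooling variances."""
    vx, vy = variance(x) / len(x), variance(y) / len(y)
    t = (mean(x) - mean(y)) / sqrt(vx + vy)
    # Welch-Satterthwaite degrees of freedom.
    df = (vx + vy) ** 2 / (vx ** 2 / (len(x) - 1) + vy ** 2 / (len(y) - 1))
    return abs(t) * sqrt(2), df

regions = [1, 2, 3, 4, 5]                        # assumed region codes
unique_pairs = list(combinations(regions, 2))    # each comparison once
table_rows = list(permutations(regions, 2))      # each comparison twice
print(len(unique_pairs), len(table_rows))        # 10 20

# Made-up scores, NOT taken from staff.sav.
q, df = games_howell_stat([1, 2, 3], [2, 4, 6, 8])
print(q, df)
```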

Plan B - Kruskal-Wallis Test

So far, we overlooked one issue: some regions have sample sizes of n = 15 or n = 16. This implies that the normality assumption should be met as well. A terrible idea here is to run

  • a Kolmogorov-Smirnov test or
  • a Shapiro-Wilk test

for each region separately. Neither test rejects the null hypothesis of a normally distributed dependent variable but this is merely due to insufficient sample sizes.

A much better idea is running a Kruskal-Wallis test. You could do so with the syntax below.

*Kruskal-Wallis test from Analyze - Nonparametric Tests - Legacy Dialogs - K Independent Samples.

NPAR TESTS
/K-W=salary BY region(1 5)
/STATISTICS DESCRIPTIVES
/MISSING ANALYSIS.

Result

[Table: Kruskal-Wallis test results]

Sadly, our Kruskal-Wallis test doesn't detect any difference between mean salary ranks over regions, H(4) = 6.58, p = 0.16.
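As a quick sanity check on this p-value, the Python sketch below evaluates the chi-square approximation by hand. It assumes that H is referred to a chi-square distribution with k - 1 = 4 degrees of freedom, for which the upper-tail probability has the simple closed form exp(-x/2) · (1 + x/2). The function name `chi2_sf_df4` is made up for illustration.

```python
from math import exp

def chi2_sf_df4(x):
    """Upper-tail probability of a chi-square variable with 4 degrees of freedom."""
    return exp(-x / 2) * (1 + x / 2)

p_value = chi2_sf_df4(6.58)   # H as reported above
print(round(p_value, 2))      # 0.16
```

This matches the reported p = 0.16, so the result is at least internally consistent.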

In short, our analyses come up with inconclusive outcomes and it's unclear precisely why. If you've any suggestions, please throw us a comment below. Other than that,

Thanks for reading!

Author: Prof. An Powlowski
