Statistical Hypothesis Testing

What kind of data are you working with?

This determines the family of tests available to you.

📏

Continuous / Numerical

Measurements, scores, concentrations, times…

🗂️

Categorical / Count

Frequencies, proportions, pass/fail counts…

↔️

Comparing Variability

Are two groups equally variable? (variances)

What is the goal of your comparison?

Think about how many groups and whether observations are independent or paired.

🎯

One sample vs. a known value

Is the average equal to a target or standard?

⚖️

Two independent groups

Do two unrelated groups have the same mean?

🔗

Paired / Before-After

Same subjects measured twice — is there a change?

📊

Three or more groups

Comparing means across multiple groups simultaneously.

🔧

Non-normal data / outliers

Data is skewed or small samples — no normality assumption.

What is the goal of your analysis?

📐

Does distribution fit expected?

Goodness-of-fit: observed vs. expected frequencies.

🔀

Are two variables associated?

Independence test on a contingency table.

📈

Comparing proportions

Is a proportion equal to a target, or are two proportions equal?

How many groups are you comparing?

📉

Two groups

F-test (Variance Ratio Test)

📊

Three or more groups

Use Levene's test (via ANOVA on residuals)

✅ One-Sample t-test

Compare the mean of your sample to a hypothesized value μ₀. The t-test is appropriate when σ is unknown (which is almost always). For known σ and large n, a Z-test is equivalent.

Open t-test Tool →

Also consider: Z-test if population σ is known.

✅ Two-Sample Independent t-test (Welch's)

Compare the means of two independent groups. Welch's t-test is recommended as it does not assume equal variances, making it more robust for most situations.

Open t-test Tool →

If data is clearly non-normal: Mann-Whitney U Test

✅ Paired t-test (Dependent Samples)

When each observation in group 1 is matched to one in group 2 (same subject before/after, matched pairs). The paired t-test analyses the differences directly, reducing within-subject variability.

Open t-test Tool →

✅ One-Way ANOVA (or Two-Way ANOVA)

ANOVA (Analysis of Variance) tests whether at least one group mean differs from the others. Use one-way for a single factor, two-way when you have two categorical factors. Post-hoc tests identify which specific pairs differ.

Open ANOVA Tool →

Non-parametric alternative: Kruskal-Wallis Test

✅ Non-Parametric Tests

When normality cannot be assumed — use rank-based tests that make no distribution assumptions. These are also robust to outliers and suitable for ordinal data.

Mann-Whitney U (2 groups) → Kruskal-Wallis (3+ groups) →

✅ Chi-Square Goodness-of-Fit Test

Tests whether observed frequencies in categories match expected theoretical frequencies. Suitable for testing if data follows a uniform, Poisson, or any specified distribution.

Open Chi-Square Tool →

✅ Chi-Square Test of Independence

Tests whether two categorical variables are independent in a contingency table. Also quantifies the strength of association with Cramér's V.

Open Chi-Square Tool →

✅ Z-Test for Proportions

Test whether a proportion equals a target value, or compare two proportions from independent samples. Requires n·p ≥ 5 and n·(1-p) ≥ 5 for the normal approximation to hold.

Open Z-Test Tool →

✅ F-Test (Variance Ratio Test)

Tests whether two populations have equal variances. Commonly used as a preliminary test before a pooled two-sample t-test, or to assess process consistency.

Open F-Test Tool →

✅ Homogeneity of Variance

For 3+ groups, the Kruskal-Wallis tool includes a Levene's-style variance comparison. Alternatively, run pairwise F-tests between groups.

Open F-Test Tool →

Situation	Assumptions	Test	Link
1 sample vs. known mean	Normal / large n	One-Sample t-test	→ Open
2 independent groups, means	Normal / large n	Welch's t-test	→ Open
Paired / before-after	Differences normal	Paired t-test	→ Open
3+ independent groups, means	Normal, equal variance	One-Way ANOVA	→ Open
2 factors + interaction	Normal, balanced design	Two-Way ANOVA	→ Open
Categorical frequencies vs. expected	Expected freq ≥ 5	Chi-Square GoF	→ Open
Two categorical variables, independence	Expected freq ≥ 5	Chi-Square Independence	→ Open
Proportion vs. target value	np ≥ 5, n(1-p) ≥ 5	Z-Test (proportion)	→ Open
Two proportions compared	np ≥ 5 in both groups	Two-Proportion Z-Test	→ Open
Equality of two variances	Normal distributions	F-Test	→ Open
2 groups, no normality / ordinal	None (rank-based)	Mann-Whitney U	→ Open
3+ groups, no normality / ordinal	None (rank-based)	Kruskal-Wallis	→ Open

Statistical Hypothesis Testing

✅ One-Sample t-test

✅ Two-Sample Independent t-test (Welch's)

✅ Paired t-test (Dependent Samples)

✅ One-Way ANOVA (or Two-Way ANOVA)

✅ Non-Parametric Tests

✅ Chi-Square Goodness-of-Fit Test

✅ Chi-Square Test of Independence

✅ Z-Test for Proportions

✅ F-Test (Variance Ratio Test)

✅ Homogeneity of Variance

All Available Tests

Quick Reference: Which test do I need?