
Goodness of fit evaluation in SciPy - Step-by-Step Execution

Concept Flow - Goodness of fit evaluation
1. Collect observed data
2. Define the expected distribution
3. Calculate the test statistic
4. Compare the statistic to the chi-square distribution
5. Get the p-value
6. Decide whether the fit is good
We start with observed data and an expected distribution, calculate a test statistic, and then find a p-value to decide whether the data fits well.
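The flow above can be sketched directly with NumPy and scipy.stats.chi2. The counts are the lesson's example data; variable names are illustrative:

```python
import numpy as np
from scipy.stats import chi2

observed = np.array([16, 18, 16, 14, 12, 12])
expected = np.array([15, 15, 15, 15, 15, 15])

# Test statistic: sum of (observed - expected)^2 / expected.
statistic = np.sum((observed - expected) ** 2 / expected)

# p-value from the chi-square distribution with df = categories - 1.
df = len(observed) - 1
p_value = chi2.sf(statistic, df)

print(statistic)          # 2.0
print(round(p_value, 3))  # 0.849
```

Using the survival function `chi2.sf(x, df)` is equivalent to `1 - CDF(x, df)` but numerically more stable for small p-values.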
Execution Sample
SciPy
from scipy.stats import chisquare
observed = [16, 18, 16, 14, 12, 12]
expected = [15, 15, 15, 15, 15, 15]
stat, p = chisquare(f_obs=observed, f_exp=expected)
print(stat, p)
This code runs a chi-square goodness of fit test comparing observed counts to expected counts.
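As an aside, SciPy's `chisquare` defaults to equal expected counts when `f_exp` is omitted, so the call below should match the sample above (assuming the documented default behavior):

```python
from scipy.stats import chisquare

observed = [16, 18, 16, 14, 12, 12]

# With f_exp omitted, chisquare assumes all categories are
# equally likely, i.e. expected = [15, 15, 15, 15, 15, 15] here.
stat, p = chisquare(f_obs=observed)

print(stat)         # 2.0
print(round(p, 3))  # 0.849
```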
Execution Table
| Step | Action | Calculation | Result |
|---|---|---|---|
| 1 | Calculate differences (observed - expected) | [16-15, 18-15, 16-15, 14-15, 12-15, 12-15] | [1, 3, 1, -1, -3, -3] |
| 2 | Square the differences | [1^2, 3^2, 1^2, (-1)^2, (-3)^2, (-3)^2] | [1, 9, 1, 1, 9, 9] |
| 3 | Divide squared differences by expected | [1/15, 9/15, 1/15, 1/15, 9/15, 9/15] | [0.0667, 0.6, 0.0667, 0.0667, 0.6, 0.6] |
| 4 | Sum all values | 0.0667 + 0.6 + 0.0667 + 0.0667 + 0.6 + 0.6 | 2.0 |
| 5 | Calculate p-value from chi-square distribution with df = 5 | p = 1 - CDF(2.0, df=5) | p = 0.849 |
| 6 | Decision | p > 0.05: no evidence of poor fit | Fail to reject the null hypothesis |
💡 Test ends after p-value calculation and decision step
Variable Tracker
Each variable keeps its value once assigned, so the tracker condenses to the step where each is first set:

| Variable | First set | Value |
|---|---|---|
| observed | start | [16, 18, 16, 14, 12, 12] |
| expected | start | [15, 15, 15, 15, 15, 15] |
| diff | step 1 | [1, 3, 1, -1, -3, -3] |
| squared_diff | step 2 | [1, 9, 1, 1, 9, 9] |
| chi_components | step 3 | [0.0667, 0.6, 0.0667, 0.0667, 0.6, 0.6] |
| statistic | step 4 | 2.0 |
| p_value | step 5 | 0.849 |

All values remain unchanged through the final step.
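The tracker's intermediate arrays can be reproduced with a short NumPy sketch; the variable names mirror the tracker rows:

```python
import numpy as np

observed = np.array([16, 18, 16, 14, 12, 12])
expected = np.array([15, 15, 15, 15, 15, 15])

diff = observed - expected                 # step 1
squared_diff = diff ** 2                   # step 2
chi_components = squared_diff / expected   # step 3
statistic = chi_components.sum()           # step 4

print(diff.tolist())          # [1, 3, 1, -1, -3, -3]
print(squared_diff.tolist())  # [1, 9, 1, 1, 9, 9]
print(statistic)              # 2.0
```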
Key Moments - 3 Insights
Why do we square the differences between observed and expected counts?
Squaring makes every difference positive and emphasizes larger deviations, as shown in step 2 of the execution table.
What does a high p-value mean in this test?
A high p-value (like 0.849 in step 5) means the observed data is consistent with the expected distribution, so we fail to reject the null hypothesis.
Why do we divide squared differences by expected counts?
Dividing by expected counts normalizes the differences, so categories with larger expected counts don't dominate, as in step 3.
Visual Quiz - 3 Questions
Test your understanding
Looking at the execution table at step 4, what is the sum of the chi-square components?
A. 2.0
B. 1.5
C. 3.0
D. 0.85
💡 Hint
Check the 'Sum all values' calculation in step 4 of the execution table.
According to the variable tracker, what is the value of p_value after step 5?
A. 0.15
B. 0.849
C. 0.05
D. 2.0
💡 Hint
Look at the 'p_value' row under 'After Step 5' in the variable tracker.
If the observed counts were all equal to expected counts, what would the chi-square statistic be?
A. 6
B. 1
C. 0
D. 15
💡 Hint
When observed equals expected, differences are zero, so sum of squared differences is zero (see step 1 and 2).
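This can be confirmed with a quick check: with identical observed and expected counts, every difference is zero, so the statistic is exactly zero and the p-value is 1.

```python
from scipy.stats import chisquare

# Observed counts that exactly match the expected counts.
stat, p = chisquare(f_obs=[15, 15, 15, 15, 15, 15],
                    f_exp=[15, 15, 15, 15, 15, 15])
print(stat, p)  # 0.0 1.0
```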
Concept Snapshot
Goodness of fit test compares observed data to expected distribution.
Calculate chi-square statistic: sum((observed - expected)^2 / expected).
Find p-value from chi-square distribution with degrees of freedom = categories - 1.
High p-value means data fits well; low p-value means poor fit.
Use scipy.stats.chisquare for easy calculation.
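For contrast with the high p-value in this lesson, here is a sketch of a poorly fitting sample; the counts are made up for illustration:

```python
from scipy.stats import chisquare

# Hypothetical heavily skewed counts (90 total) against
# equal expected counts of 15 per category.
stat, p = chisquare(f_obs=[40, 5, 10, 10, 10, 15], f_exp=[15] * 6)

print(round(stat, 2))  # 53.33
print(p < 0.05)        # True: low p-value, poor fit
```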
Full Transcript
Goodness of fit evaluation checks if observed data matches an expected pattern. We start by collecting observed counts and defining expected counts. Then, we calculate the differences between observed and expected, square them, and divide by expected counts. Summing these gives the chi-square statistic. Using the chi-square distribution with the right degrees of freedom, we find the p-value. A high p-value means the observed data fits the expected distribution well. This process is shown step-by-step in the execution table and variable tracker. Key points include squaring differences to avoid negatives, normalizing by expected counts, and interpreting the p-value correctly.