Data Analysis Pythondata~10 mins

t-test with scipy.stats in Data Analysis Python - Step-by-Step Execution

Choose your learning style9 modes available

Learn Why Deep Visual Try Challenge Project Recall Time

Concept Flow - t-test with scipy.stats

Start: Two data samples

↓

Choose t-test type

↓

Calculate t-statistic and p-value

↓

Compare p-value to threshold

↓

Reject null

↓

End

We start with two data samples, choose the t-test type, calculate the t-statistic and p-value, then decide if the difference is significant by comparing p-value to a threshold.

Execution Sample

Data Analysis Python

from scipy import stats
sample1 = [5, 6, 7, 8, 9]
sample2 = [5, 5, 5, 5, 6]
t_stat, p_val = stats.ttest_ind(sample1, sample2)
print(t_stat, p_val)

This code runs an independent t-test on two small samples and prints the t-statistic and p-value.

Execution Table

Step	Action	Input	Output	Explanation
1	Import scipy.stats	None	scipy.stats module ready	Prepare to use t-test functions
2	Define sample1	[5,6,7,8,9]	sample1 list created	First data sample stored
3	Define sample2	[5,5,5,5,6]	sample2 list created	Second data sample stored
4	Call ttest_ind(sample1, sample2)	sample1, sample2	(t_stat=2.449, p_val=0.041)	Calculate t-statistic and p-value for independent samples
5	Print results	(2.449, 0.041)	2.449 0.041	Show t-statistic and p-value
6	Compare p_val < 0.05	0.041 < 0.05	True	p-value is less than 0.05, reject null hypothesis
7	End	Decision made	Reject null hypothesis	Samples differ significantly

💡 Execution stops after printing results and deciding significance based on p-value.

Variable Tracker

Variable	Start	After Step 2	After Step 3	After Step 4	Final
sample1	None	[5,6,7,8,9]	[5,6,7,8,9]	[5,6,7,8,9]	[5,6,7,8,9]
sample2	None	None	[5,5,5,5,6]	[5,5,5,5,6]	[5,5,5,5,6]
t_stat	None	None	None	2.449	2.449
p_val	None	None	None	0.041	0.041

Key Moments - 3 Insights

Why do we compare the p-value to 0.05?

What does the t-statistic number mean?

Can we use ttest_ind for samples that are related?

Visual Quiz - 3 Questions

Test your understanding

Look at the execution_table at step 4, what are the values of t_stat and p_val?

At_stat=5, p_val=6

Bt_stat=0.041, p_val=2.449

Ct_stat=2.449, p_val=0.041

Dt_stat=1, p_val=0.5

Concept Snapshot

t-test with scipy.stats:
- Use stats.ttest_ind(sample1, sample2) for independent samples
- Returns t-statistic and p-value
- If p-value < 0.05, reject null hypothesis (samples differ)
- Use ttest_rel for related samples
- Helps test if two groups differ significantly

Full Transcript

This visual execution shows how to perform a t-test using scipy.stats in Python. We start by importing the module, then define two data samples. We run the independent t-test function which returns a t-statistic and a p-value. The p-value tells us if the difference between samples is statistically significant. If the p-value is less than 0.05, we reject the null hypothesis, meaning the samples differ significantly. The variable tracker shows how sample data and results change step-by-step. Key moments clarify why we compare p-value to 0.05 and what the t-statistic means. The quiz tests understanding of the execution steps and decision points.