Challenge - 5 Problems

🎖️

Statistics Hypothesis Master

Get all challenges correct to earn this badge!

Test your skills under time pressure!

🧠 Conceptual

intermediate

2:00remaining

Understanding Hypothesis Testing Purpose

Why do we use statistics to validate hypotheses in data science?

ATo randomly select data points for analysis

BTo ignore data variability and focus only on averages

CTo measure the likelihood that observed data could happen if the hypothesis is false

DTo prove the hypothesis is absolutely true without any doubt

Attempts:

2 left

❓ Predict Output

intermediate

2:00remaining

Output of p-value calculation

What is the output of this Python code that calculates a p-value for a sample mean?

Data Analysis Python

import scipy.stats as stats
sample_mean = 105
population_mean = 100
std_dev = 15
sample_size = 30
z_score = (sample_mean - population_mean) / (std_dev / (sample_size ** 0.5))
p_value = 2 * (1 - stats.norm.cdf(abs(z_score)))
print(round(p_value, 4))

A0.0910

B0.0455

C0.9545

D0.1820

Attempts:

2 left

❓ data_output

advanced

2:00remaining

Result of Hypothesis Test on Sample Data

Given this sample data and a population mean of 50, what is the p-value from a one-sample t-test?

Data Analysis Python

import scipy.stats as stats
sample = [52, 48, 51, 49, 53, 47, 50, 54, 46, 55]
population_mean = 50
t_stat, p_value = stats.ttest_1samp(sample, population_mean)
print(round(p_value, 4))

A0.0543

B0.1234

C0.9876

D0.0027

Attempts:

2 left

🔧 Debug

advanced

2:00remaining

Identify the error in hypothesis test code

What error will this code raise when running a t-test?

Data Analysis Python

import scipy.stats as stats
sample = [5, 7, 8, 6, 9]
pop_mean = 7
result = stats.ttest_1samp(sample, pop_mean)
print(result.pvalue)

AAttributeError: 'tuple' object has no attribute 'pvalue'

BSyntaxError: invalid syntax

CTypeError: ttest_1samp() missing 1 required positional argument

DNo error, prints the p-value

Attempts:

2 left

🚀 Application

expert

3:00remaining

Choosing correct hypothesis test for data

You have two independent groups with small sample sizes and unknown population variances. Which test should you use to validate if their means differ?

AChi-square test

BTwo-sample t-test assuming equal variances

CPaired t-test

DTwo-sample t-test with Welch's correction

Attempts:

2 left