Challenge - 5 Problems
Pearson Correlation Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
❓ Predict Output
intermediate2:00remaining
Calculate Pearson correlation coefficient
What is the output of this code that calculates the Pearson correlation coefficient between two lists?
SciPy
from scipy.stats import pearsonr x = [1, 2, 3, 4, 5] y = [2, 4, 6, 8, 10] result = pearsonr(x, y) print(round(result[0], 2))
Attempts:
2 left
💡 Hint
Pearson correlation measures linear relationship strength between two variables.
✗ Incorrect
The lists x and y have a perfect positive linear relationship (y is exactly 2 times x). So, the Pearson correlation coefficient is 1.00.
❓ data_output
intermediate2:00remaining
Number of pairs with strong positive correlation
Given this DataFrame, how many pairs of columns have a Pearson correlation coefficient greater than 0.8?
SciPy
import pandas as pd import numpy as np np.random.seed(0) data = pd.DataFrame({ 'A': np.random.rand(10), 'B': np.random.rand(10), 'C': np.linspace(0, 1, 10), 'D': np.linspace(0, 1, 10) * 2 }) corr_matrix = data.corr() strong_pairs = (corr_matrix.where(np.triu(np.ones(corr_matrix.shape), k=1).astype(bool)) > 0.8).sum().sum() print(int(strong_pairs))
Attempts:
2 left
💡 Hint
Check pairs only once and ignore diagonal.
✗ Incorrect
Columns C and D are perfectly correlated (correlation 1). A and B have low correlation (~ -0.03), and other cross pairs are also low. The upper triangular mask counts unique off-diagonal pairs once, so only 1 pair (C, D).
🔧 Debug
advanced2:00remaining
Identify the error in Pearson correlation calculation
What error does this code raise when trying to calculate Pearson correlation?
SciPy
from scipy.stats import pearsonr x = [1, 2, 3, 4, 5] y = [2, 4, 6, 8] pearsonr(x, y)
Attempts:
2 left
💡 Hint
Check if both lists have the same number of elements.
✗ Incorrect
The lists x and y have different lengths (5 and 4). pearsonr requires equal length inputs and raises ValueError if lengths differ.
🚀 Application
advanced2:00remaining
Interpret Pearson correlation result
If the Pearson correlation coefficient between two variables is -0.85, what does this mean?
Attempts:
2 left
💡 Hint
Negative values indicate inverse relationships.
✗ Incorrect
A correlation of -0.85 means a strong negative linear relationship: as one variable increases, the other tends to decrease.
🧠 Conceptual
expert2:00remaining
Effect of outliers on Pearson correlation
Which statement best describes how outliers affect the Pearson correlation coefficient?
Attempts:
2 left
💡 Hint
Think about how extreme values influence averages and covariance.
✗ Incorrect
Pearson correlation is sensitive to outliers because it uses means and standard deviations. Extreme values can skew the correlation, making it misleading.