Challenge - 5 Problems
Exploratory Data Analysis Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
❓ Data output
intermediate
Calculate the mean and median of a dataset
You have a dataset of ages: [22, 25, 29, 24, 30, 22, 28]. What are the mean and median values?
ML Python
import numpy as np

ages = [22, 25, 29, 24, 30, 22, 28]
mean_age = np.mean(ages)      # arithmetic mean
median_age = np.median(ages)  # middle value of the sorted data
print(f"Mean: {mean_age}, Median: {median_age}")
❓ Visualization
intermediate
Identify the correct histogram plot for given data
Given the data points: [1, 2, 2, 3, 3, 3, 4, 4, 5], which histogram correctly shows the frequency of each number?
ML Python
import matplotlib.pyplot as plt

data = [1, 2, 2, 3, 3, 3, 4, 4, 5]
# Five equal-width bins spanning the data range, with outlined bars
plt.hist(data, bins=5, edgecolor='black')
plt.show()
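As a cross-check on the bar heights a correct histogram should show, the frequencies can be tallied directly. This is a minimal sketch using only the standard library; the name counts is introduced here for illustration and is not part of the original challenge.

```python
from collections import Counter

data = [1, 2, 2, 3, 3, 3, 4, 4, 5]

# Tally how many times each value occurs; these are the bar heights
# a one-bar-per-value histogram would show.
counts = Counter(data)

# Print a rough text histogram, one row per distinct value.
for value in sorted(counts):
    print(f"{value}: {'#' * counts[value]}")
```

With bins=5 over the range 1 to 5, each integer in this data should fall into its own bin, so the plotted bar heights are expected to match these tallies.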
🧠 Conceptual
advanced
Understanding correlation coefficients
Which statement correctly describes a Pearson correlation coefficient of -0.85 between two variables?
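To make the sign and magnitude of a Pearson coefficient concrete, here is a minimal sketch that computes it from scratch. The helper pearson_r and the x and y values are hypothetical, chosen only so that y tends to fall as x rises; they are not the data behind the -0.85 in the question and do not reproduce that value.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Numerator: sum of products of deviations from each mean.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Denominator: square root of the product of the sums of squares.
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(ss_x * ss_y)

# Hypothetical data: y tends to decrease as x increases.
x = [1, 2, 3, 4, 5]
y = [10, 8, 9, 5, 4]

r = pearson_r(x, y)
print(f"r = {r:.3f}")
```

The coefficient always lies between -1 and 1: the sign gives the direction of the linear relationship and the absolute value gives its strength.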
🔧 Debug
advanced
Identify the error in this code for calculating variance
What error will this code raise?
ML Python
import numpy as np

data = [1, 2, 3, 4, 5]
variance = np.var(data, ddof=1)  # ddof=1: divide by N - 1
print(variance)
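The ddof argument is the part of this call most worth understanding: NumPy divides the sum of squared deviations by N - ddof. The sketch below mirrors that arithmetic in plain Python. The function variance is a hypothetical stand-in written for illustration, not NumPy's implementation.

```python
def variance(values, ddof=0):
    """Sum of squared deviations from the mean, divided by (n - ddof)."""
    n = len(values)
    mean = sum(values) / n
    squared_devs = sum((v - mean) ** 2 for v in values)
    return squared_devs / (n - ddof)

data = [1, 2, 3, 4, 5]

population_var = variance(data)          # divides by n
sample_var = variance(data, ddof=1)      # divides by n - 1
print(population_var, sample_var)
```

Setting ddof=1 gives the sample variance, which is slightly larger than the population variance for the same data.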
🚀 Application
expert
Determine the number of clusters from a silhouette score plot
You run k-means clustering on a dataset with k values from 2 to 6. The silhouette scores are: {2: 0.45, 3: 0.52, 4: 0.48, 5: 0.40, 6: 0.35}. Which k should you choose for best cluster separation?
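One common heuristic is to pick the k with the highest silhouette score. A minimal sketch of that selection follows, using the scores given in the problem; the variable names are introduced here for illustration. In practice this is often weighed against domain knowledge and other diagnostics rather than applied mechanically.

```python
# Silhouette scores from the problem statement, keyed by k.
silhouette_scores = {2: 0.45, 3: 0.52, 4: 0.48, 5: 0.40, 6: 0.35}

# A higher silhouette score indicates tighter, better-separated clusters,
# so select the k whose score is largest.
best_k = max(silhouette_scores, key=silhouette_scores.get)
print(f"best k = {best_k}, score = {silhouette_scores[best_k]}")
```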