
Benchmark datasets in Prompt Engineering / GenAI - Practice Problems & Coding Challenges

Challenge - 5 Problems
🧠 Conceptual
intermediate
Understanding the purpose of benchmark datasets

Why do machine learning researchers use benchmark datasets?

A. To train models on random data without labels
B. To compare different models fairly using the same data
C. To avoid testing models on real-world data
D. To increase the size of training data by duplicating samples
💡 Hint

Think about why having a common dataset helps researchers.
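To make the idea of a shared dataset concrete, here is a minimal sketch that scores two toy classifiers on the same held-out examples. The dataset, the threshold models, and all names in it are hypothetical and chosen only for illustration; real benchmarks are far larger and come with fixed, published splits.

```python
# Hypothetical toy benchmark: (feature, label) pairs shared by every model.
BENCHMARK_TEST_SET = [
    (0.1, 0), (0.3, 0), (0.35, 1),
    (0.6, 1), (0.8, 1), (0.9, 1),
]

def threshold_model(threshold):
    """Return a classifier that predicts 1 when the feature exceeds the threshold."""
    def predict(x):
        return 1 if x > threshold else 0
    return predict

def accuracy(model, dataset):
    """Fraction of examples in the dataset that the model labels correctly."""
    correct = sum(1 for x, y in dataset if model(x) == y)
    return correct / len(dataset)

# Two illustrative models, evaluated on exactly the same examples.
model_a = threshold_model(0.5)
model_b = threshold_model(0.7)

scores = {
    "model_a": accuracy(model_a, BENCHMARK_TEST_SET),
    "model_b": accuracy(model_b, BENCHMARK_TEST_SET),
}
print(scores)
```

Because both scores come from identical examples, the difference between them reflects the models rather than the data each one happened to see.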

Predict Output
intermediate
1:30remaining
Output of loading a benchmark dataset

What is the output shape of the features when loading the Iris dataset using scikit-learn?

from sklearn.datasets import load_iris
iris = load_iris()
X = iris.data
print(X.shape)
A. (150, 4)
B. (3, 150)
C. (150, 3)
D. (4, 150)
💡 Hint

Check how many samples and features the Iris dataset has.
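scikit-learn stores feature data with one row per sample and one column per feature, so a shape reads as (n_samples, n_features). The sketch below illustrates that convention on a small made-up table using plain Python lists; the values and the `shape` helper are hypothetical and are not part of scikit-learn.

```python
# Hypothetical toy table: each inner list is one sample, each position one feature.
X = [
    [5.0, 1.2],
    [4.1, 0.9],
    [6.3, 2.0],
    [5.8, 1.7],
    [4.9, 1.1],
]

def shape(matrix):
    """Return (n_samples, n_features) for a rectangular list of lists."""
    n_samples = len(matrix)
    n_features = len(matrix[0]) if matrix else 0
    return (n_samples, n_features)

print(shape(X))  # (5, 2): five samples, two features
```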

Model Choice
advanced
Choosing a model for the MNIST dataset

You want to classify handwritten digits from the MNIST dataset. Which model is best suited for this task?

A. Convolutional Neural Network (CNN)
B. Linear Regression
C. K-Means Clustering
D. Decision Tree Regressor
💡 Hint

Consider the type of data and the task (image classification).
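Image data has spatial structure: neighbouring pixels are related. As a rough illustration of how a model can exploit that structure, the sketch below slides a small filter over a tiny made-up image in plain Python. The image, the kernel, and the `conv2d_valid` helper are assumptions for demonstration only; like most deep-learning libraries, the helper computes cross-correlation (no kernel flip), and a trained network would learn its filter values rather than have them written by hand.

```python
def conv2d_valid(image, kernel):
    """Slide the kernel over the image without padding and sum the products."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    output = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            total = 0
            for di in range(kh):
                for dj in range(kw):
                    total += image[i + di][j + dj] * kernel[di][dj]
            row.append(total)
        output.append(row)
    return output

# Hypothetical 4x4 image: dark left half (0), bright right half (1).
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]

# Hand-written filter that responds to a dark-to-bright vertical edge.
kernel = [
    [-1, 1],
    [-1, 1],
]

feature_map = conv2d_valid(image, kernel)
print(feature_map)  # [[0, 2, 0], [0, 2, 0], [0, 2, 0]]
```

The non-zero column in the result marks where the edge sits, which is the kind of local pattern that matters when recognising strokes in handwritten digits.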

Metrics
advanced
Evaluating model performance on benchmark datasets

Which metric is most appropriate to evaluate a classification model on the CIFAR-10 benchmark dataset?

A. R-squared
B. Mean Squared Error
C. Accuracy
D. Perplexity
💡 Hint

Think about the type of task CIFAR-10 represents.
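When predictions are discrete class labels, a natural way to score them is to count how often they match the true labels. The sketch below computes an overall match rate and a per-class breakdown in plain Python. The label lists are made up for illustration (the class names are borrowed from CIFAR-10, but the predictions are not real model output), and `per_class_recall` is a hypothetical helper.

```python
from collections import Counter

# Hypothetical true labels and predictions for eight images.
y_true = ["cat", "dog", "ship", "cat", "dog", "ship", "cat", "dog"]
y_pred = ["cat", "dog", "ship", "dog", "dog", "cat", "cat", "dog"]

def overall_accuracy(true_labels, predicted_labels):
    """Fraction of predictions that exactly match the true label."""
    matches = sum(1 for t, p in zip(true_labels, predicted_labels) if t == p)
    return matches / len(true_labels)

def per_class_recall(true_labels, predicted_labels):
    """For each class, the fraction of its true examples predicted correctly."""
    totals = Counter(true_labels)
    hits = Counter(t for t, p in zip(true_labels, predicted_labels) if t == p)
    return {label: hits[label] / totals[label] for label in totals}

print(overall_accuracy(y_true, y_pred))  # 0.75
print(per_class_recall(y_true, y_pred))
```

The per-class view is worth checking alongside the single overall number, since a model can score well overall while doing poorly on one class.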

🔧 Debug
expert
Identifying the error when loading a benchmark dataset

What error does this code raise when you try to load the Boston Housing dataset with scikit-learn 1.2 or later?

from sklearn.datasets import load_boston
boston = load_boston()
A. No error, dataset loads successfully
B. ValueError: Dataset not found
C. DeprecationWarning: load_boston is deprecated
D. ImportError: cannot import name 'load_boston'
💡 Hint

Check recent changes in scikit-learn about this dataset.
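The availability of `load_boston` depends on the installed scikit-learn version: it was deprecated in 1.0 and removed in 1.2. Code that has to run across versions can check for the loader before using it. The sketch below is one hedged way to do that; `boston_loader_status` is a hypothetical helper name, and the function only reports what it finds rather than loading any data.

```python
def boston_loader_status():
    """Report whether load_boston can be imported in the current environment."""
    try:
        # Fails on scikit-learn 1.2+ (loader removed) and when
        # scikit-learn is not installed at all.
        from sklearn.datasets import load_boston  # noqa: F401
    except ImportError as exc:
        return f"unavailable ({type(exc).__name__})"
    return "available"

status = boston_loader_status()
print(status)
```

`ModuleNotFoundError` is a subclass of `ImportError`, so the same `except` clause also covers environments without scikit-learn.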