Which of the following best explains why L2 regularization helps prevent overfitting in a neural network?
Think about how controlling the size of weights affects model complexity.
L2 regularization adds a penalty proportional to the squared magnitude of the weights, which discourages the model from fitting noise in the training data. Keeping weights small yields a smoother decision function and a simpler model that generalizes better to new data.
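A minimal Keras sketch of this idea (the 0.01 coefficient is an illustrative assumption, not part of the question):

```python
import tensorflow as tf
from tensorflow.keras import layers, models, regularizers

# A small binary classifier with an L2 penalty on the hidden layer's weights.
# kernel_regularizer adds 0.01 * sum(w**2) to the training loss, so large
# weights are penalized and the fit stays simpler.
model = models.Sequential([
    layers.Dense(64, activation='relu', input_shape=(20,),
                 kernel_regularizer=regularizers.l2(0.01)),
    layers.Dense(1, activation='sigmoid'),
])
model.compile(optimizer='adam', loss='binary_crossentropy',
              metrics=['accuracy'])
```

The same `kernel_regularizer` argument can be passed to any Dense or Conv layer; without it, the loss contains no weight penalty at all.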
Consider the following TensorFlow code snippet that trains a simple model with and without dropout. What will be the expected difference in training accuracy after 10 epochs?
import tensorflow as tf
from tensorflow.keras import layers, models

# Model without dropout
model_no_dropout = models.Sequential([
    layers.Dense(64, activation='relu', input_shape=(20,)),
    layers.Dense(1, activation='sigmoid')
])
model_no_dropout.compile(optimizer='adam', loss='binary_crossentropy',
                         metrics=['accuracy'])

# Model with dropout
model_dropout = models.Sequential([
    layers.Dense(64, activation='relu', input_shape=(20,)),
    layers.Dropout(0.5),
    layers.Dense(1, activation='sigmoid')
])
model_dropout.compile(optimizer='adam', loss='binary_crossentropy',
                      metrics=['accuracy'])

# Assume training data X_train, y_train are available
# Both models trained for 10 epochs
# What is the expected difference in training accuracy between the two models?
Dropout randomly disables neurons during training. How does this affect training accuracy?
Dropout prevents the model from relying too heavily on any single neuron by randomly disabling neurons during training. This usually lowers training accuracy, since the network trains on a noisier, thinned-out version of itself, but it improves generalization to unseen data.
You train two neural networks on the same dataset: one with L2 regularization and one without. After training, you observe the following validation losses:
- Model with L2 regularization: 0.35
- Model without regularization: 0.60
What does this difference in validation loss indicate?
Lower validation loss usually means better performance on unseen data.
The model with L2 regularization has a lower validation loss, indicating it generalizes better and is less prone to overfitting compared to the model without regularization.
Given the following TensorFlow model code, the model still overfits the training data. What is the most likely reason?
import tensorflow as tf
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Dense(128, activation='relu', input_shape=(30,)),
    layers.Dropout(0.2),
    layers.Dense(128, activation='relu'),
    layers.Dense(1, activation='sigmoid')
])
model.compile(optimizer='adam', loss='binary_crossentropy',
              metrics=['accuracy'])

# Model trained on small dataset for 50 epochs
# Overfitting observed: training accuracy much higher than validation accuracy
Consider how dropout rate affects neuron deactivation during training.
A dropout rate of 0.2 disables only 20% of neurons at each training step, which may be too little regularization for a 128-unit model trained for 50 epochs on a small dataset. Note also that only the first hidden layer has dropout at all; the second hidden layer is entirely unregularized.
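One hedged fix, keeping the same architecture: raise the rate to a more typical 0.5 and apply dropout after both hidden layers. The 0.5 value is a common starting point, not a prescription.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

# Same 30-feature binary classifier, but with stronger dropout (0.5)
# applied after each hidden layer instead of a single 0.2 layer.
model = models.Sequential([
    layers.Dense(128, activation='relu', input_shape=(30,)),
    layers.Dropout(0.5),
    layers.Dense(128, activation='relu'),
    layers.Dropout(0.5),
    layers.Dense(1, activation='sigmoid'),
])
model.compile(optimizer='adam', loss='binary_crossentropy',
              metrics=['accuracy'])
```

On a small dataset it may also help to shrink the hidden layers or train for fewer epochs; dropout alone cannot compensate for a model that is far too large for the data.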
You are training a deep convolutional neural network on a large image dataset. The model overfits despite using L2 regularization. Which additional regularization technique is most appropriate to try next?
Think about regularization methods that randomly deactivate neurons to prevent co-adaptation.
Adding dropout layers between convolutional layers helps prevent overfitting by randomly disabling neurons during training, forcing the network to learn more robust features.
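An illustrative sketch of such a network; the input shape, filter counts, and 0.25 rates are assumptions chosen for a small CIFAR-style input, not values from the question.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

# A small CNN with Dropout after each pooling stage, so each conv block
# must learn features that do not depend on any single activation.
model = models.Sequential([
    layers.Conv2D(32, 3, activation='relu', input_shape=(32, 32, 3)),
    layers.MaxPooling2D(),
    layers.Dropout(0.25),
    layers.Conv2D(64, 3, activation='relu'),
    layers.MaxPooling2D(),
    layers.Dropout(0.25),
    layers.Flatten(),
    layers.Dense(10, activation='softmax'),
])
```

For convolutional feature maps, Keras also provides SpatialDropout2D, which drops entire channels rather than individual activations and is often a better fit when adjacent pixels are strongly correlated.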