Practice

1. What activation function is commonly used in the output layer of a binary classification model in TensorFlow?

easy

A. Tanh

B. ReLU

C. Softmax

D. Sigmoid

Solution

Step 1: Understand output layer role in binary classification
The output layer must produce a single value between 0 and 1 that can be read as the probability of the positive class.
Step 2: Identify suitable activation function
Sigmoid squashes any real-valued input into the range (0, 1), which makes it suitable for binary decisions.
Final Answer:
Sigmoid -> Option D
Quick Check:
Binary output -> single unit with sigmoid [OK]

Hint: Binary output needs sigmoid activation [OK]

Common Mistakes:

Using softmax on a single output unit, which always returns 1
Using ReLU, which outputs unbounded non-negative values
Using tanh, which outputs values between -1 and 1
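The ranges named in the solution and in the common mistakes can be checked numerically. The sketch below is framework-free on purpose: the helper functions `sigmoid`, `relu`, and `softmax` are hand-written stand-ins for illustration, not TensorFlow's own implementations.

```python
import math
from typing import List


def sigmoid(x: float) -> float:
    """Logistic sigmoid: maps any real number into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def relu(x: float) -> float:
    """ReLU: zero for negative inputs, unbounded above."""
    return max(0.0, x)


def softmax(logits: List[float]) -> List[float]:
    """Softmax over one sample's logits; the outputs always sum to 1."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]  # shift for numerical stability
    total = sum(exps)
    return [e / total for e in exps]


# Compare what each activation does to the same raw output value (logit).
for z in [-4.0, -1.0, 0.0, 1.0, 4.0]:
    print(
        f"z={z:+.1f}  sigmoid={sigmoid(z):.3f}  tanh={math.tanh(z):+.3f}  "
        f"relu={relu(z):.1f}  softmax(single unit)={softmax([z])[0]:.1f}"
    )
```

Only the sigmoid column stays strictly between 0 and 1 for every input. The single-unit softmax column is always 1.0, because a one-element vector normalised to sum to 1 has nowhere else to go.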

2. Which of the following is the correct way to compile a binary classification model in TensorFlow?

easy

A. model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])

B. model.compile(optimizer='rmsprop', loss='hinge', metrics=['accuracy'])

C. model.compile(optimizer='sgd', loss='mean_squared_error', metrics=['accuracy'])

D. model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

Solution

Step 1: Identify appropriate loss for binary classification
With a single sigmoid output and 0/1 labels, 'binary_crossentropy' is the matching loss: it penalizes the predicted probability according to how far it is from the true label.
Step 2: Check optimizer and metrics
'adam' optimizer and 'accuracy' metric are standard choices for training and evaluation.
Final Answer:
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy']) -> Option A
Quick Check:
Binary loss = binary_crossentropy [OK]

Hint: Use binary_crossentropy loss for binary classification [OK]

Common Mistakes:

Using categorical_crossentropy, which expects one-hot labels across several output units
Using mean_squared_error, which is meant for regression
Choosing hinge loss, which is the SVM-style loss and expects labels of -1 and 1
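To see what the binary cross-entropy loss actually measures, here is a minimal hand-written version. The function name `binary_crossentropy`, the clipping constant `eps`, and the sample labels and predictions are all assumptions made for illustration; this is the textbook formula, not the Keras implementation.

```python
import math


def binary_crossentropy(y_true, y_prob, eps=1e-7):
    """Mean binary cross-entropy for 0/1 labels and predicted probabilities."""
    total = 0.0
    for y, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1.0 - eps)  # clip so that log(0) never occurs
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(y_true)


labels = [1, 0, 1, 0]                  # hypothetical ground truth
mostly_right = [0.9, 0.1, 0.8, 0.2]    # probabilities close to the labels
mostly_wrong = [0.1, 0.9, 0.2, 0.8]    # probabilities far from the labels

print("loss when predictions agree with labels:", binary_crossentropy(labels, mostly_right))
print("loss when predictions contradict labels:", binary_crossentropy(labels, mostly_wrong))
```

The loss is small when the predicted probabilities agree with the labels and grows quickly when the model is confidently wrong.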

3. Given the following TensorFlow model code, what will be the shape of the output layer?

import tensorflow as tf

model = tf.keras.Sequential([
  tf.keras.layers.Dense(10, activation='relu', input_shape=(5,)),
  tf.keras.layers.Dense(1, activation='sigmoid')
])

medium

A. (None, 1)

B. (None, 10)

C. (5, 1)

D. (1,)

Solution

Step 1: Analyze the last layer configuration
The last Dense layer has 1 unit and sigmoid activation, so output shape is (batch_size, 1).
Step 2: Understand batch dimension placeholder
Keras shows the batch dimension as None because the batch size is not fixed until data is passed in, so the reported output shape is (None, 1).
Final Answer:
(None, 1) -> Option A
Quick Check:
Output units = 1 means shape = (None, 1) [OK]

Hint: Output shape matches last layer units with batch size None [OK]

Common Mistakes:

Confusing input shape with output shape
Ignoring batch size dimension
Assuming output shape is (1,) without batch
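The shape reasoning above can be traced without TensorFlow. The sketch below re-creates the same two layer sizes (5 inputs, 10 hidden units, 1 output unit) with plain Python lists. The helpers `make_dense`, `dense`, `shape`, and `forward`, and the random placeholder weights, are assumptions for illustration only.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def relu(x):
    return max(0.0, x)


def make_dense(n_inputs, units, rng):
    """Placeholder layer: one random weight row per unit, zero biases."""
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(n_inputs)] for _ in range(units)]
    biases = [0.0] * units
    return weights, biases


def dense(batch, layer, activation):
    """Apply a fully connected layer to every row (sample) of the batch."""
    weights, biases = layer
    return [
        [activation(sum(x * w for x, w in zip(row, unit_w)) + b)
         for unit_w, b in zip(weights, biases)]
        for row in batch
    ]


def shape(batch):
    """(number of samples, values per sample)."""
    return (len(batch), len(batch[0]))


rng = random.Random(0)
hidden = make_dense(5, 10, rng)   # mirrors Dense(10, activation='relu', input_shape=(5,))
output = make_dense(10, 1, rng)   # mirrors Dense(1, activation='sigmoid')


def forward(batch):
    return dense(dense(batch, hidden, relu), output, sigmoid)


for batch_size in (3, 8):
    batch = [[rng.uniform(-1.0, 1.0) for _ in range(5)] for _ in range(batch_size)]
    print("batch size", batch_size,
          "hidden shape", shape(dense(batch, hidden, relu)),
          "output shape", shape(forward(batch)))
```

The second dimension of the output is always 1, while the first simply follows however many samples are fed in. That varying first dimension is what the None placeholder stands for.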

4. You trained a binary classification model but the accuracy stays around 50% after many epochs. Which fix is most likely to improve the model?

medium

A. Change the output activation to softmax

B. Use binary_crossentropy loss instead of categorical_crossentropy

C. Increase the batch size to 1024

D. Remove the activation function from the output layer

Solution

Step 1: Identify the cause of poor accuracy
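Accuracy stuck near 50% means the model is doing no better than guessing. Option B implies the model was compiled with categorical_crossentropy; paired with a single sigmoid output, that loss is computed incorrectly and gives the model no useful training signal.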
Step 2: Apply correct loss function
Switching to binary_crossentropy aligns loss with sigmoid output for binary classification.
Final Answer:
Use binary_crossentropy loss instead of categorical_crossentropy -> Option B
Quick Check:
Loss must match output activation [OK]

Hint: Match loss to output activation for correct training [OK]

Common Mistakes:

Using softmax on a single output unit, which always returns 1
Removing output activation causing invalid probabilities
Assuming batch size alone fixes accuracy
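One way to see why the mismatched loss stalls training is to apply the plain categorical cross-entropy formula, -sum(y_i * log(p_i)), to a single output column. The sketch below uses hand-written textbook formulas; it is not the Keras implementation, which may also normalise or clip predictions. The function names are assumptions for illustration.

```python
import math


def categorical_crossentropy(y_true_row, y_prob_row):
    """Textbook categorical cross-entropy for one sample: sum of -y_i * log(p_i)."""
    return sum(-y * math.log(p) for y, p in zip(y_true_row, y_prob_row))


def binary_crossentropy(y, p):
    """Textbook binary cross-entropy for one sample with a 0/1 label."""
    return -(y * math.log(p) + (1 - y) * math.log(1.0 - p))


# A negative example (label 0) scored with three different predicted probabilities.
for p in (0.1, 0.5, 0.9):
    cce = categorical_crossentropy([0], [p])
    bce = binary_crossentropy(0, p)
    print(f"label=0  p={p:.1f}  categorical={cce:.3f}  binary={bce:.3f}")
```

With only one column, the categorical formula has no term for the negative class, so a label of 0 contributes zero loss whatever the model predicts. The binary formula penalises a high probability on a negative example, which is the signal the model needs.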

5. You want to build a binary classification model to predict if an email is spam or not. Your dataset has 1000 samples with 20 features each. Which model architecture and compile settings are best?

hard

A. Sequential model with one Dense layer (1 unit, sigmoid), compile with binary_crossentropy and adam

B. Sequential model with one Dense layer (20 units, softmax), compile with categorical_crossentropy and sgd

C. Sequential model with two Dense layers (10 units relu, then 1 unit sigmoid), compile with binary_crossentropy and adam

D. Sequential model with three Dense layers (64 relu, 32 relu, 1 tanh), compile with mean_squared_error and rmsprop

Solution

Step 1: Choose model complexity for dataset size
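With 1000 samples and 20 features, a small network is enough. A 10-unit relu hidden layer lets the model learn non-linear combinations of the features, and the 1-unit sigmoid output produces the spam probability. A single sigmoid layer (Option A) is also a valid setup, but it can only learn a linear decision boundary.
Step 2: Select correct loss and optimizer
binary_crossentropy matches the single sigmoid output; adam is a common default optimizer that adapts the learning rate per parameter.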
Final Answer:
Sequential model with two Dense layers (10 units relu, then 1 unit sigmoid), compile with binary_crossentropy and adam -> Option C
Quick Check:
Two layers + sigmoid + binary_crossentropy = Best practice [OK]

Hint: Use relu hidden layers + sigmoid output + binary_crossentropy [OK]

Common Mistakes:

Using softmax for binary classification
Using tanh output activation
Using mean_squared_error loss for classification
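To put rough numbers on "model complexity for dataset size", the trainable parameters of each option can be counted by hand. A fully connected layer with bias has inputs * units + units parameters. The helper names `dense_params` and `count_params` are assumptions for illustration.

```python
def dense_params(n_inputs, units):
    """Weights plus biases of one fully connected layer."""
    return n_inputs * units + units


def count_params(n_features, layer_units):
    """Total trainable parameters of a stack of Dense layers."""
    total, n_in = 0, n_features
    for units in layer_units:
        total += dense_params(n_in, units)
        n_in = units  # this layer's outputs feed the next layer
    return total


n_features = 20   # from the question: 20 features per email
n_samples = 1000  # from the question: 1000 samples

options = {
    "A": [1],
    "B": [20],
    "C": [10, 1],
    "D": [64, 32, 1],
}

for name, units in options.items():
    params = count_params(n_features, units)
    print(f"Option {name}: layers={units}  parameters={params}  "
          f"samples per parameter={n_samples / params:.2f}")
```

Option C comes to 221 parameters, comfortably below the 1000 samples. Option D comes to 3457, more parameters than samples, on top of its mismatched tanh output and mean_squared_error loss.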

Epoch	Loss ↓	Accuracy ↑	Observation
1	0.65	0.60	Model starts learning, accuracy low
2	0.50	0.72	Loss decreases, accuracy improves
3	0.40	0.80	Model learns important patterns
4	0.32	0.85	Good improvement, model converging
5	0.28	0.88	Loss low, accuracy high, training stabilizes
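The "training stabilizes" observation can be read off the table by looking at how much each epoch improves on the previous one. The sketch below uses only the loss and accuracy values from the table above; the variable names are assumptions for illustration.

```python
# (epoch, loss, accuracy) rows copied from the table above.
history = [
    (1, 0.65, 0.60),
    (2, 0.50, 0.72),
    (3, 0.40, 0.80),
    (4, 0.32, 0.85),
    (5, 0.28, 0.88),
]

# Epoch-over-epoch change, rounded to two decimals to hide float noise.
loss_drops = [round(prev[1] - cur[1], 2) for prev, cur in zip(history, history[1:])]
acc_gains = [round(cur[2] - prev[2], 2) for prev, cur in zip(history, history[1:])]

for (epoch, _, _), drop, gain in zip(history[1:], loss_drops, acc_gains):
    print(f"epoch {epoch}: loss fell by {drop:.2f}, accuracy rose by {gain:.2f}")
```

Both the loss drop and the accuracy gain shrink at every epoch, which is the usual sign that training is converging.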

Binary classification model in TensorFlow - Model Pipeline Trace
