Practice

What activation function is commonly used in the last layer of a multi-class classification model in TensorFlow?

easy

A. Sigmoid

B. ReLU

C. Softmax

D. Tanh

Solution

Step 1: Understand the purpose of the last layer in multi-class classification
The last layer outputs probabilities for each class, so the activation must convert raw scores to probabilities.
Step 2: Identify the activation function that outputs probabilities summing to 1
Softmax converts logits into probabilities that sum to 1, suitable for multi-class classification.
Final Answer:
Softmax -> Option C
Quick Check:
Softmax = last layer activation [OK]

Hint: Use softmax for multi-class output probabilities [OK]

Common Mistakes:

Using sigmoid, which is meant for binary (or multi-label) outputs
Using ReLU, which does not output probabilities
Using tanh, which outputs values between -1 and 1
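To make Step 2 concrete, here is a minimal plain-Python sketch of what softmax does to a vector of logits. It is not TensorFlow's implementation; the function name softmax and the example logits are assumptions chosen for illustration.

```python
import math

def softmax(logits):
    """Convert raw scores (logits) into probabilities that sum to 1."""
    # Subtracting the max keeps exp() from overflowing; the result is unchanged.
    shift = max(logits)
    exps = [math.exp(z - shift) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.1]  # hypothetical raw scores for 3 classes
probs = softmax(logits)
predicted_class = probs.index(max(probs))

print(probs)            # three values, each strictly between 0 and 1
print(sum(probs))       # approximately 1.0
print(predicted_class)  # index of the largest logit
```

The predicted class is simply the index of the largest probability, which is also the index of the largest logit.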

Which loss function should you use in TensorFlow for a multi-class classification model with integer labels?

easy

A. binary_crossentropy

B. sparse_categorical_crossentropy

C. mean_squared_error

D. hinge

Solution

Step 1: Identify the label format
Labels are integer class IDs, not one-hot encoded vectors.
Step 2: Choose loss function matching integer labels for multi-class
Sparse categorical crossentropy works with integer labels directly, whereas categorical crossentropy expects one-hot labels.
Final Answer:
sparse_categorical_crossentropy -> Option B
Quick Check:
Integer labels = sparse_categorical_crossentropy [OK]

Hint: Use sparse_categorical_crossentropy for integer class labels [OK]

Common Mistakes:

Using binary_crossentropy, which is for two classes
Using mean_squared_error, which is for regression
Using hinge loss, which is a margin-based loss associated with SVMs
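The difference between the two crossentropy variants is only the label format. The sketch below computes both for a single example in plain Python, assuming the model output is already a softmax probability vector; the helper names and the sample values are illustrative, not the TensorFlow API.

```python
import math

def sparse_categorical_crossentropy(label, probs):
    """Loss for one example when the label is an integer class ID."""
    return -math.log(probs[label])

def categorical_crossentropy(one_hot, probs):
    """Loss for one example when the label is a one-hot vector."""
    return -sum(t * math.log(p) for t, p in zip(one_hot, probs))

probs = [0.7, 0.2, 0.1]    # hypothetical softmax output for 3 classes
label = 1                  # integer label
one_hot = [0.0, 1.0, 0.0]  # the same label, one-hot encoded

loss_sparse = sparse_categorical_crossentropy(label, probs)
loss_dense = categorical_crossentropy(one_hot, probs)
print(loss_sparse, loss_dense)  # both equal -log(0.2)
```

Both functions return the same value, so choosing between them is a matter of matching the loss to how the labels are stored.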

What will be the shape of the output tensor from the last layer of this TensorFlow model for multi-class classification with 4 classes?

import tensorflow as tf

model = tf.keras.Sequential([
  tf.keras.layers.Dense(10, activation='relu'),
  tf.keras.layers.Dense(4, activation='softmax')
])
inputs = tf.random.uniform((5, 8))
outputs = model(inputs)
print(outputs.shape)

medium

A. (4, 5)

B. (8, 4)

C. (5, 10)

D. (5, 4)

Solution

Step 1: Understand input and output shapes
The input batch has 5 samples with 8 features each. The first Dense layer maps it to shape (5, 10); the last Dense layer has 4 units, one per class.
Step 2: Determine output shape from last layer
Output shape is (batch_size, number_of_classes) = (5, 4).
Final Answer:
(5, 4) -> Option D
Quick Check:
Batch size 5, classes 4 = (5, 4) [OK]

Hint: Output shape = (batch_size, number_of_classes) [OK]

Common Mistakes:

Confusing batch size and feature dimensions
Swapping rows and columns in output shape
Assuming output shape matches input feature size
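The shape reasoning in the solution can be traced without running a model. This is a small sketch, assuming 2-D inputs of shape (batch_size, features); dense_output_shape is a hypothetical helper, not a TensorFlow function.

```python
def dense_output_shape(input_shape, units):
    """A Dense layer keeps the leading dimensions and replaces the last one."""
    return input_shape[:-1] + (units,)

shape = (5, 8)         # (batch_size, features), as in the question
for units in (10, 4):  # Dense(10) followed by Dense(4)
    shape = dense_output_shape(shape, units)
    print(shape)
# prints (5, 10) and then (5, 4)
```

The batch dimension passes through every layer unchanged; only the last dimension is replaced by the layer's unit count.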

Identify the error in this TensorFlow multi-class classification model code:

import tensorflow as tf

model = tf.keras.Sequential([
  tf.keras.layers.Dense(16, activation='relu'),
  tf.keras.layers.Dense(3, activation='sigmoid')
])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])

medium

A. Last layer activation should be softmax, not sigmoid

B. Loss function should be binary_crossentropy

C. Optimizer 'adam' is invalid

D. Dense layer units must be 1 for multi-class

Solution

Step 1: Check last layer activation for multi-class
Sigmoid scores each class independently, so the outputs do not sum to 1. That is unsuitable for multi-class problems, where the classes are mutually exclusive.
Step 2: Correct activation for multi-class classification
Softmax outputs probabilities summing to 1, appropriate for multi-class classification.
Final Answer:
Last layer activation should be softmax, not sigmoid -> Option A
Quick Check:
Multi-class needs softmax activation [OK]

Hint: Use softmax activation for multi-class last layer [OK]

Common Mistakes:

Using sigmoid activation for multi-class output
Confusing loss functions for classification types
Thinking optimizer name 'adam' is invalid
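The contrast between the two activations shows up when both are applied to the same logits. The following plain-Python sketch uses hypothetical logits for 3 classes; the helper functions are illustrative and not TensorFlow code.

```python
import math

def sigmoid(z):
    """Squash one score into (0, 1), ignoring the other scores."""
    return 1.0 / (1.0 + math.exp(-z))

def softmax(logits):
    """Normalize all scores jointly so they sum to 1."""
    shift = max(logits)
    exps = [math.exp(z - shift) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.1]  # hypothetical raw scores for 3 classes
sig = [sigmoid(z) for z in logits]
soft = softmax(logits)

print(sum(sig))   # greater than 1: not a distribution over classes
print(sum(soft))  # approximately 1.0
```

Because each sigmoid output depends only on its own logit, the three values cannot be read as a single probability distribution over exclusive classes.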

You want to build a multi-class classification model with 5 classes. Your labels are integers from 0 to 4. Which of the following code snippets correctly defines and compiles the model?

Option A:
model = tf.keras.Sequential([
  tf.keras.layers.Dense(32, activation='relu'),
  tf.keras.layers.Dense(5, activation='softmax')
])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])

Option B:
model = tf.keras.Sequential([
  tf.keras.layers.Dense(32, activation='relu'),
  tf.keras.layers.Dense(5, activation='sigmoid')
])
model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

Option C:
model = tf.keras.Sequential([
  tf.keras.layers.Dense(32, activation='relu'),
  tf.keras.layers.Dense(1, activation='softmax')
])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])

Option D:
model = tf.keras.Sequential([
  tf.keras.layers.Dense(32, activation='relu'),
  tf.keras.layers.Dense(5, activation='softmax')
])
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])

hard

A. Option A

B. Option B

C. Option C

D. Option D

Solution

Step 1: Check output layer units and activation
For 5 classes, output units must be 5 with softmax activation to get class probabilities.
Step 2: Check loss function matches label format
Labels are integers, so sparse_categorical_crossentropy is the correct loss.
Step 3: Verify optimizer and metrics
Adam optimizer and accuracy metric are appropriate choices.
Final Answer:
Dense(5, activation='softmax') with sparse_categorical_crossentropy -> Option A
Quick Check:
Correct output units, activation, and loss for integer labels [OK]

Hint: Match output units and loss to label format [OK]

Common Mistakes:

Using sigmoid activation for multi-class output
Using binary_crossentropy loss for multi-class
Setting output units to 1 instead of number of classes
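The three checks from the solution can be written as a small rule-based helper and applied to all four options. This is only a sketch: check_output_config is a hypothetical function that encodes the rules stated above, and it does not cover other valid setups, such as a logits output paired with a loss configured with from_logits=True.

```python
def check_output_config(num_classes, units, activation, loss, labels_are_integers):
    """Return a list of problems with a multi-class output configuration."""
    problems = []
    if units != num_classes:
        problems.append("output units should equal the number of classes")
    if activation != "softmax":
        problems.append("last-layer activation should be softmax")
    expected_loss = ("sparse_categorical_crossentropy" if labels_are_integers
                     else "categorical_crossentropy")
    if loss != expected_loss:
        problems.append("loss should be " + expected_loss)
    return problems

# (output units, activation, loss) for each option in the question
options = {
    "A": (5, "softmax", "sparse_categorical_crossentropy"),
    "B": (5, "sigmoid", "categorical_crossentropy"),
    "C": (1, "softmax", "sparse_categorical_crossentropy"),
    "D": (5, "softmax", "binary_crossentropy"),
}
results = {name: check_output_config(5, units, act, loss, True)
           for name, (units, act, loss) in options.items()}
for name, problems in results.items():
    print(name, problems or "OK")
```

Only Option A passes all three checks; each of the other options fails at least one.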

Epoch	Loss ↓	Accuracy ↑	Observation
1	1.10	0.45	Model starts learning, accuracy low
2	0.85	0.60	Loss decreases, accuracy improves
3	0.65	0.72	Model learns important patterns
4	0.50	0.80	Good progress, accuracy rising
5	0.40	0.85	Model converging well

Multi-class classification model in TensorFlow - Model Pipeline Trace
