TensorFlowml~20 mins

Input shape specification in TensorFlow - ML Experiment: Train & Evaluate

Choose your learning style10 modes available

Learn Why Deep Model Try Challenge Experiment Recall Metrics

Start learning this pattern below

Jump into concepts and practice - no test required

Recommended

Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong

Experiment - Input shape specification

Problem:You are building a neural network to classify images of size 28x28 pixels in grayscale. The current model does not specify the input shape correctly, causing errors or poor training.

Current Metrics:Model training fails or accuracy is very low due to incorrect input shape.

Issue:The input shape is not properly defined in the first layer of the model, leading to shape mismatch errors or inability to train.

Your Task

Correctly specify the input shape in the model so it matches the data shape (28x28 grayscale images). The model should train successfully and achieve at least 80% accuracy on the test set.

Do not change the dataset or model architecture except for input shape.

Use TensorFlow and Keras only.

Hint 1

Hint 2

Hint 3

Solution

TensorFlow

import tensorflow as tf
from tensorflow.keras import layers, models

# Load MNIST dataset
(train_images, train_labels), (test_images, test_labels) = tf.keras.datasets.mnist.load_data()

# Reshape data to add channel dimension (grayscale = 1 channel)
train_images = train_images.reshape((-1, 28, 28, 1)).astype('float32') / 255.0
test_images = test_images.reshape((-1, 28, 28, 1)).astype('float32') / 255.0

# Build model with correct input shape
model = models.Sequential([
    layers.Conv2D(32, (3, 3), activation='relu', input_shape=(28, 28, 1)),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(64, activation='relu'),
    layers.Dense(10, activation='softmax')
])

model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])

# Train model
model.fit(train_images, train_labels, epochs=5, batch_size=64, validation_split=0.2)

# Evaluate model
test_loss, test_acc = model.evaluate(test_images, test_labels)
print(f'Test accuracy: {test_acc:.4f}')

Added input_shape=(28, 28, 1) to the first Conv2D layer to match grayscale image shape.

Reshaped training and test images to have shape (num_samples, 28, 28, 1).

Normalized pixel values to range 0-1 by dividing by 255.

Results Interpretation

Before: Model training failed or accuracy was very low due to input shape mismatch.

After: Model trains successfully and achieves about 85% accuracy on test data.

Specifying the correct input shape is essential for the model to understand the data format and train properly. For image data, the input shape must include height, width, and channels.

Bonus Experiment

Try changing the input shape to (28, 28) without the channel dimension and observe what error or behavior occurs.

💡 Hint

TensorFlow Conv2D layers expect 3D input (height, width, channels). Omitting channels causes shape mismatch errors.

Practice

(1/5)

1. What does the input_shape parameter specify in a TensorFlow Keras model?

easy

A. The size and format of the input data the model expects

B. The number of layers in the model

C. The learning rate for training

D. The number of output classes

Input shape specification in TensorFlow - ML Experiment: Train & Evaluate

Start learning this pattern below

Practice

Solution

Step 1: Understand the role of input_shape

Step 2: Differentiate from other parameters

Final Answer:

Quick Check:

Solution

Step 1: Identify the correct shape for grayscale images

Step 2: Check each option

Final Answer:

Quick Check:

Solution

Step 1: Understand Conv2D output shape calculation

Step 2: Calculate output dimensions

Final Answer:

Quick Check:

Solution

Step 1: Check the syntax of shape argument

Step 2: Verify other options

Final Answer:

Quick Check:

Solution

Step 1: Understand variable-length sequences

Step 2: Identify feature dimension position

Step 3: Match shape to (sequence_length, features)

Final Answer:

Quick Check: