Bird
Raised Fist0
TensorFlowml~12 mins

First neural network in TensorFlow - Model Pipeline Trace

Choose your learning style10 modes available

Start learning this pattern below

Jump into concepts and practice - no test required

or
Recommended
Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong
Model Pipeline - First neural network

This pipeline shows how a simple neural network learns to classify handwritten digits from images. It starts with raw image data, processes it, trains a small neural network, and then makes predictions.

Data Flow - 5 Stages
1Load Data
70000 rows x 28 x 28 pixelsLoad MNIST handwritten digits dataset70000 rows x 28 x 28 pixels
Image of digit '5' as 28x28 grayscale pixels
2Preprocessing
70000 rows x 28 x 28 pixelsNormalize pixel values from 0-255 to 0-170000 rows x 28 x 28 pixels
Pixel value 0 becomes 0.0, 255 becomes 1.0
3Flatten Images
70000 rows x 28 x 28 pixelsConvert 2D images to 1D vectors70000 rows x 784 columns
28x28 image becomes a list of 784 pixel values
4Train/Test Split
70000 rows x 784 columnsSplit data into training (60000) and test (10000) setsTraining: 60000 rows x 784 columns, Test: 10000 rows x 784 columns
Training set image vector and label for digit '3'
5Model Training
60000 rows x 784 columnsTrain neural network with one hidden layerTrained model with weights and biases
Model learns to map input vectors to digit labels
Training Trace - Epoch by Epoch
Loss
0.5 |****
0.4 |***
0.3 |**
0.2 |*
0.1 |
    +---------
     1 2 3 4 5 Epochs
EpochLoss ↓Accuracy ↑Observation
10.450.87Model starts learning, accuracy improves quickly
20.300.92Loss decreases, accuracy increases as model fits data
30.250.94Model continues to improve with more training
40.220.95Loss decreases steadily, accuracy nears 95%
50.200.96Training converges with high accuracy
Prediction Trace - 4 Layers
Layer 1: Input Layer
Layer 2: Hidden Layer (Dense + ReLU)
Layer 3: Output Layer (Dense + Softmax)
Layer 4: Prediction
Model Quiz - 3 Questions
Test your understanding
What does the 'Flatten Images' step do to the data?
AConverts 2D images into 1D vectors
BSplits data into training and test sets
CNormalizes pixel values between 0 and 1
DApplies activation functions to neurons
Key Insight
A simple neural network learns by adjusting weights to reduce error (loss) and improve accuracy. Flattening images and normalizing pixels prepare data for the model. Activation functions like ReLU and softmax help the network learn complex patterns and output probabilities.

Practice

(1/5)
1. What is the main purpose of the compile method in a TensorFlow neural network model?
easy
A. To set the optimizer, loss function, and metrics for training
B. To add layers to the model
C. To train the model on data
D. To make predictions on new data

Solution

  1. Step 1: Understand the role of compile

    The compile method prepares the model for training by specifying how it learns, including the optimizer, loss function, and metrics.
  2. Step 2: Differentiate from other methods

    Adding layers is done before compiling, training is done with fit, and predictions use predict.
  3. Final Answer:

    To set the optimizer, loss function, and metrics for training -> Option A
  4. Quick Check:

    compile sets training details = A [OK]
Hint: Compile sets how the model learns before training [OK]
Common Mistakes:
  • Confusing compile with fit (training)
  • Thinking compile adds layers
  • Mixing compile with prediction
2. Which of the following is the correct way to add a dense hidden layer with 10 neurons and ReLU activation in TensorFlow?
easy
A. model.add(tf.keras.Dense(10, activation='relu'))
B. model.add(Dense(activation='relu', 10))
C. model.add(tf.keras.layers.Dense(10, activation='relu'))
D. model.add(tf.layers.Dense(activation='relu', units=10))

Solution

  1. Step 1: Recall correct TensorFlow syntax for adding layers

    The correct way is to use tf.keras.layers.Dense with units first, then activation as a named argument.
  2. Step 2: Check each option

    model.add(tf.keras.layers.Dense(10, activation='relu')) matches the correct syntax. model.add(Dense(activation='relu', 10)) has wrong argument order. model.add(tf.layers.Dense(activation='relu', units=10)) uses deprecated tf.layers. model.add(tf.keras.Dense(10, activation='relu')) misses layers in the path.
  3. Final Answer:

    model.add(tf.keras.layers.Dense(10, activation='relu')) -> Option C
  4. Quick Check:

    Correct layer syntax = D [OK]
Hint: Use tf.keras.layers.Dense(units, activation='relu') [OK]
Common Mistakes:
  • Wrong argument order in Dense layer
  • Using deprecated tf.layers instead of tf.keras.layers
  • Missing 'layers' in the import path
3. What will be the output shape of the model after adding these layers?
model = tf.keras.Sequential()
model.add(tf.keras.layers.Dense(5, input_shape=(3,), activation='relu'))
model.add(tf.keras.layers.Dense(2, activation='softmax'))
print(model.output_shape)
medium
A. (None, 5)
B. (None, 2)
C. (None, 3)
D. (3, 2)

Solution

  1. Step 1: Understand input and output shapes

    The input shape is (3,), first layer outputs 5 units, second layer outputs 2 units.
  2. Step 2: Determine final output shape

    The model output shape is (None, 2) where None is batch size, 2 is output units.
  3. Final Answer:

    (None, 2) -> Option B
  4. Quick Check:

    Output units = 2 means shape (None, 2) [OK]
Hint: Output shape matches last layer units with batch size None [OK]
Common Mistakes:
  • Confusing input shape with output shape
  • Ignoring batch size dimension None
  • Mixing layer units and input dimensions
4. Identify the error in this code snippet for creating a simple neural network:
model = tf.keras.Sequential()
model.add(tf.keras.layers.Dense(10, activation='relu'))
model.compile(optimizer='adam', loss='mse')
model.summary()
model.fit(x_train, y_train, epochs=5)
medium
A. Optimizer 'adam' is not supported
B. Loss function 'mse' is invalid
C. fit method requires batch_size argument
D. Missing input shape in the first layer

Solution

  1. Step 1: Check layer definition

    The first Dense layer lacks an input shape, which is required for the model to know input dimensions.
  2. Step 2: Verify other parts

    Loss 'mse' and optimizer 'adam' are valid. Batch size is optional in fit.
  3. Final Answer:

    Missing input shape in the first layer -> Option D
  4. Quick Check:

    Input shape needed in first layer = C [OK]
Hint: Always specify input shape in first layer [OK]
Common Mistakes:
  • Skipping input_shape in first layer
  • Thinking batch_size is mandatory in fit
  • Confusing loss and optimizer names
5. You want to build a neural network to classify images into 3 categories. Which model setup is best?
model = tf.keras.Sequential([
  tf.keras.layers.Flatten(input_shape=(28,28)),
  tf.keras.layers.Dense(64, activation='relu'),
  tf.keras.layers.Dense(3, activation='softmax')
])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])
hard
A. Correct setup for multi-class classification
B. Use sigmoid activation in last layer instead of softmax
C. Use mean squared error loss for classification
D. Missing Flatten layer before Dense layers

Solution

  1. Step 1: Analyze model layers

    Flatten converts 2D image to 1D, Dense with 64 units and ReLU is hidden layer, final Dense with 3 units and softmax outputs class probabilities.
  2. Step 2: Check compile settings

    Optimizer 'adam' is good, loss 'sparse_categorical_crossentropy' fits multi-class with integer labels, metrics include accuracy.
  3. Final Answer:

    Correct setup for multi-class classification -> Option A
  4. Quick Check:

    Softmax + sparse_categorical_crossentropy = B [OK]
Hint: Use softmax and sparse_categorical_crossentropy for multi-class [OK]
Common Mistakes:
  • Using sigmoid for multi-class output
  • Using MSE loss for classification
  • Skipping Flatten for image input