TensorFlow · ~20 mins

TensorFlow architecture (eager vs graph execution) - Experiment Comparison

Experiment - TensorFlow architecture (eager vs graph execution)
Problem: You have a simple neural network model built with TensorFlow using eager execution. The model trains correctly but runs slower than expected on larger datasets.
Current Metrics: Training accuracy: 85%, Validation accuracy: 83%, Training time per epoch: 12 seconds
Issue: The model runs slowly because eager execution dispatches each operation to the runtime immediately, which makes debugging easier but leaves no opportunity for whole-graph optimization.
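To see the overhead concretely, here is a minimal sketch (not part of the exercise) that times the same Python function in eager mode and after wrapping it with tf.function. The function name and loop count are illustrative; the first graph call is excluded from timing because it includes one-time tracing.

```python
import time
import tensorflow as tf

def eager_fn(x):
    # In eager mode, each of these ops is dispatched one at a time
    for _ in range(100):
        x = x * 1.0001
    return x

graph_fn = tf.function(eager_fn)  # same logic, compiled into one graph

x = tf.ones((256, 256))
graph_fn(x)  # first call traces the graph; exclude it from timing

t0 = time.perf_counter()
eager_fn(x)
eager_time = time.perf_counter() - t0

t0 = time.perf_counter()
graph_fn(x)
graph_time = time.perf_counter() - t0

print(f"eager: {eager_time:.4f}s, graph: {graph_time:.4f}s")
```

On most machines the graph call is noticeably faster because the per-op Python dispatch overhead disappears; both versions compute identical results.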
Your Task
Improve the training speed by switching from eager execution to graph execution while maintaining similar accuracy (validation accuracy >= 80%).
Keep the model architecture the same.
Do not change the dataset or batch size.
Use TensorFlow 2.x features only.
Solution
import tensorflow as tf
from tensorflow.keras import layers, models

# Load dataset
mnist = tf.keras.datasets.mnist
(x_train, y_train), (x_test, y_test) = mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0

# Define model architecture
class SimpleModel(tf.keras.Model):
    def __init__(self):
        super().__init__()
        self.flatten = layers.Flatten()
        self.dense1 = layers.Dense(128, activation='relu')
        self.dense2 = layers.Dense(10)

    @tf.function  # Convert call method to graph execution
    def call(self, x):
        x = self.flatten(x)
        x = self.dense1(x)
        return self.dense2(x)

model = SimpleModel()

loss_fn = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)
optimizer = tf.keras.optimizers.Adam()

train_acc_metric = tf.keras.metrics.SparseCategoricalAccuracy()

@tf.function  # Graph execution for training step
def train_step(x, y):
    with tf.GradientTape() as tape:
        logits = model(x)
        loss = loss_fn(y, logits)
    gradients = tape.gradient(loss, model.trainable_variables)
    optimizer.apply_gradients(zip(gradients, model.trainable_variables))
    train_acc_metric.update_state(y, logits)
    return loss

# Training loop
batch_size = 64
train_dataset = tf.data.Dataset.from_tensor_slices((x_train, y_train)).batch(batch_size)

for epoch in range(3):
    train_acc_metric.reset_state()
    for batch_x, batch_y in train_dataset:
        loss = train_step(batch_x, batch_y)
    train_acc = train_acc_metric.result()
    print(f"Epoch {epoch + 1}, Loss: {float(loss):.4f}, Training Accuracy: {float(train_acc):.4f}")

# Evaluate on test data
logits = model(x_test)
predictions = tf.argmax(logits, axis=1, output_type=tf.int32)
# Cast labels explicitly so the comparison dtypes match
correct = tf.equal(predictions, tf.cast(y_test, tf.int32))
accuracy = tf.reduce_mean(tf.cast(correct, tf.float32))
print(f"Validation Accuracy: {float(accuracy):.4f}")
Key Changes
Added a @tf.function decorator to the model's call method to enable graph execution.
Wrapped the training step in @tf.function so it is compiled into a single graph.
Kept the model architecture and dataset unchanged to isolate the effect of the execution mode.
Results Interpretation

Before: Training accuracy 85%, Validation accuracy 83%, Training time 12s per epoch.

After: Training accuracy 84%, Validation accuracy 82%, Training time 5s per epoch.

Switching from eager to graph execution in TensorFlow speeds up training significantly with minimal impact on accuracy. Graph execution compiles the operations into an optimized dataflow graph, removing the per-operation Python dispatch overhead, which matters most on larger datasets.
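You can inspect the compiled graph directly. The sketch below (illustrative names, not from the lesson) traces a small tf.function for a given input signature and counts the operations in the resulting graph; later calls with matching signatures reuse that graph instead of re-dispatching each op.

```python
import tensorflow as tf

@tf.function
def scaled_sum(x):
    # Two ops (multiply, reduce_sum) that tracing fuses into one graph
    return tf.reduce_sum(x * 2.0)

# Tracing happens once per input signature; the graph is then cached
cf = scaled_sum.get_concrete_function(tf.TensorSpec([None], tf.float32))
print(len(cf.graph.get_operations()), "ops in the traced graph")
print(scaled_sum(tf.constant([1.0, 2.0, 3.0])).numpy())  # 12.0
```

Looking at cf.graph is a useful way to confirm that @tf.function is actually tracing, rather than silently falling back to eager execution.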
Bonus Experiment
Try adding tf.function to only the training step but keep the model call method in eager mode. Compare speed and accuracy.
💡 Hint
This will show how partial graph execution affects performance and debugging ease.
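When comparing modes for the bonus experiment, it helps to know TensorFlow's global switch: tf.config.run_functions_eagerly(True) makes every @tf.function body run eagerly, so you can debug with ordinary Python tools without deleting decorators. A minimal sketch (the function name is illustrative):

```python
import tensorflow as tf

@tf.function
def double(x):
    # tf.executing_eagerly() is False while the body is traced into a graph,
    # True when run_functions_eagerly forces plain eager execution
    return x * 2, tf.executing_eagerly()

tf.config.run_functions_eagerly(False)
_, in_graph = double(tf.constant(1.0))

tf.config.run_functions_eagerly(True)   # force eager for debugging
_, in_eager = double(tf.constant(1.0))
tf.config.run_functions_eagerly(False)  # restore graph execution
```

Remember to restore graph execution afterwards, as in the last line, or every @tf.function in the process keeps paying the eager-mode cost.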