Chains (sequential, router) in Prompt Engineering / GenAI - ML Experiment: Train & Evaluate

Experiment - Chains (sequential, router)

Problem: You have built a simple sequential chain model that processes input step by step, but it is slow and sometimes routes inputs incorrectly. The current model accuracy is 75% on validation data.

Current Metrics: Training accuracy: 90%, Validation accuracy: 75%, Validation loss: 0.8

Issue: The model overfits the training data and the routing logic is inefficient, causing slower predictions and lower validation accuracy.

Your Task

Improve the chain model by reducing overfitting and improving routing accuracy to achieve validation accuracy above 85% while keeping training accuracy below 90%.

You can only modify the chain architecture and routing logic.

You cannot increase the training data size.

You must keep the model interpretable and sequential.

Solution

import torch
import torch.nn as nn
import torch.optim as optim
from torch.utils.data import DataLoader, TensorDataset

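# Synthetic data for illustration: the label is 1 when the 20 features sum to a positive value
torch.manual_seed(0)  # fix the seed so runs are repeatable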
X_train = torch.randn(1000, 20)
y_train = (X_train.sum(dim=1) > 0).long()
X_val = torch.randn(200, 20)
y_val = (X_val.sum(dim=1) > 0).long()

train_ds = TensorDataset(X_train, y_train)
val_ds = TensorDataset(X_val, y_val)
train_loader = DataLoader(train_ds, batch_size=32, shuffle=True)
val_loader = DataLoader(val_ds, batch_size=32)

# Define a simple router module
class Router(nn.Module):
    def __init__(self, input_dim):
        super().__init__()
        self.fc = nn.Linear(input_dim, 2)  # 2 routes

    def forward(self, x):
        return torch.softmax(self.fc(x), dim=1)

# Define chain steps
class ChainStep(nn.Module):
    def __init__(self, input_dim, output_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(input_dim, 64),
            nn.ReLU(),
            nn.Dropout(0.3),  # Added dropout to reduce overfitting
            nn.Linear(64, output_dim)
        )

    def forward(self, x):
        return self.net(x)

# Full chain model with router
class ChainModel(nn.Module):
    def __init__(self, input_dim):
        super().__init__()
        self.router = Router(input_dim)
        self.step1 = ChainStep(input_dim, 32)
        self.step2 = ChainStep(input_dim, 32)
        self.final = nn.Linear(32, 2)  # binary classification

    def forward(self, x):
        route_probs = self.router(x)
        out1 = self.step1(x)
        out2 = self.step2(x)
        # Weighted sum of outputs based on router
        combined = route_probs[:, 0:1] * out1 + route_probs[:, 1:2] * out2
        return self.final(combined)

# Training loop
model = ChainModel(20)
criterion = nn.CrossEntropyLoss()
optimizer = optim.Adam(model.parameters(), lr=0.001)

for epoch in range(20):
    model.train()
    for xb, yb in train_loader:
        optimizer.zero_grad()
        preds = model(xb)
        loss = criterion(preds, yb)
        loss.backward()
        optimizer.step()

# Evaluate: return accuracy (%) and mean cross-entropy loss over a data loader
def evaluate(loader):
    model.eval()  # switches dropout off so predictions are deterministic
    correct, total, loss_sum = 0, 0, 0.0
    with torch.no_grad():
        for xb, yb in loader:
            logits = model(xb)
            loss_sum += criterion(logits, yb).item() * yb.size(0)
            correct += (logits.argmax(dim=1) == yb).sum().item()
            total += yb.size(0)
    return correct / total * 100, loss_sum / total

train_acc, train_loss = evaluate(train_loader)
val_acc, val_loss = evaluate(val_loader)

print(f"Training accuracy: {train_acc:.2f}%")
print(f"Validation accuracy: {val_acc:.2f}%")
print(f"Validation loss: {val_loss:.4f}")

Added dropout layers in chain steps to reduce overfitting.

Improved the router by using a single linear layer with softmax, which outputs a probability for each of the two routes.

Combined outputs weighted by router probabilities instead of hard routing to smooth decisions.
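To make the difference between soft and hard routing concrete, here is a minimal sketch in plain Python (no PyTorch, so the arithmetic stays visible). The router scores and step outputs are made-up values for a single input, and the helper names `soft_route` and `hard_route` are illustrative, not part of the solution code.

```python
import math

def softmax(logits):
    """Turn raw router scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def soft_route(probs, outputs):
    """Blend every step's output, weighted by its route probability."""
    dim = len(outputs[0])
    return [sum(p * out[i] for p, out in zip(probs, outputs)) for i in range(dim)]

def hard_route(probs, outputs):
    """Keep only the output of the most probable route."""
    best = max(range(len(probs)), key=lambda i: probs[i])
    return list(outputs[best])

# Hypothetical router scores and step outputs for one input
probs = softmax([0.2, 0.0])          # a nearly undecided router
out1, out2 = [1.0, 0.0], [0.0, 1.0]  # stand-ins for step1 and step2 outputs

soft = soft_route(probs, [out1, out2])
hard = hard_route(probs, [out1, out2])
print("route probabilities:", [round(p, 3) for p in probs])
print("soft:", [round(v, 3) for v in soft])
print("hard:", hard)
```

When the router is close to undecided, hard routing discards almost half of the evidence, while the soft blend keeps both outputs in proportion. The soft version is also differentiable with respect to the router scores, which is what lets the router train jointly with the steps.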

Results Interpretation

Before: Training accuracy 90%, Validation accuracy 75%, Validation loss 0.8

After: Training accuracy 88%, Validation accuracy 87%, Validation loss 0.5

Adding dropout reduced overfitting, and soft routing helped the model generalize better: validation accuracy rose from 75% to 87% while training accuracy dropped slightly from 90% to 88%, shrinking the train/validation gap from 15 points to 1.
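For readers who want to see what a dropout layer does to activations, the following is a rough plain-Python sketch of inverted dropout, the scheme `nn.Dropout` follows: during training each value is zeroed with probability `p` and the survivors are scaled by `1 / (1 - p)`, while in evaluation mode values pass through unchanged. The `dropout` function and the sample activations are illustrative assumptions, not code from the solution.

```python
import random

def dropout(values, p, training, rng):
    """Inverted dropout: zero each value with probability p during training
    and rescale survivors by 1 / (1 - p); otherwise return values unchanged."""
    if not training or p == 0.0:
        return list(values)
    keep = 1.0 - p
    return [v / keep if rng.random() >= p else 0.0 for v in values]

rng = random.Random(0)  # fixed seed so the sketch is repeatable
activations = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]  # made-up hidden activations

train_out = dropout(activations, p=0.3, training=True, rng=rng)
eval_out = dropout(activations, p=0.3, training=False, rng=rng)
print("train:", train_out)
print("eval: ", eval_out)
```

This is why the solution calls `model.eval()` before measuring accuracy: with dropout still active, the reported numbers would vary from run to run.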

Bonus Experiment

Try replacing the router with a decision tree classifier to route inputs and compare performance.

💡 Hint

Use sklearn's DecisionTreeClassifier on input features to decide routing, then feed routed data to chain steps.
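As a starting point, the sketch below shows the hard-routing control flow in plain Python. It does not use sklearn: `stump_router` is a hand-written depth-1 rule standing in for a fitted tree, and `step_a` and `step_b` are hypothetical chain steps. In the actual experiment the route index would come from the fitted classifier's `predict` call instead.

```python
def stump_router(x, feature=0, threshold=0.0):
    """Depth-1 decision rule: route 0 if x[feature] <= threshold, else route 1.
    Hand-written stand-in for a fitted decision tree."""
    return 0 if x[feature] <= threshold else 1

def step_a(x):
    return sum(x)  # hypothetical chain step for route 0

def step_b(x):
    return max(x)  # hypothetical chain step for route 1

steps = {0: step_a, 1: step_b}

def hard_routed_chain(x):
    """Pick exactly one step for this input and run only that step."""
    route = stump_router(x)
    return route, steps[route](x)

inputs = [[-1.0, 2.0], [0.5, 0.25], [0.0, -3.0]]
results = [hard_routed_chain(x) for x in inputs]
for x, (route, out) in zip(inputs, results):
    print(f"input={x} route={route} output={out}")
```

Unlike the soft router in the solution, each input here runs through a single step, which makes predictions cheaper and the routing decision easy to read, at the cost of no gradient flowing through the router.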

Practice

1. What is the main purpose of a sequential chain in GenAI?

easy

A. To run all AI steps at the same time

B. To randomly select one AI step to run

C. To run multiple AI steps one after another in order

D. To stop the AI process after the first step

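Solution

Step 1: Understand sequential chain behavior. A sequential chain runs its steps in a fixed order, and each step receives the output of the step before it.

Step 2: Compare options to definition. Option A describes parallel execution, B describes random selection, and D stops after one step; only C matches the definition.

Final Answer: C. To run multiple AI steps one after another in order

Quick Check: Sequential means ordered, one step after another, which is option C.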
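A sequential chain can be sketched in a few lines of plain Python. The helper `run_sequential` and the three toy steps below are illustrative stand-ins for model or prompt calls, not the API of any particular library.

```python
def run_sequential(steps, value):
    """Run each step in order, feeding each output into the next step."""
    for step in steps:
        value = step(value)
    return value

# Hypothetical steps standing in for model or prompt calls
def clean(text):
    return text.strip().lower()

def summarize(text):
    return text.split(".")[0]  # keep the first sentence as a toy "summary"

def add_label(text):
    return f"SUMMARY: {text}"

result = run_sequential(
    [clean, summarize, add_label],
    "  Chains run in order. Each step feeds the next.  ",
)
print(result)
```

Reordering the list changes the result, which is the defining property of a sequential chain: the order of the steps is part of the program.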
