Experiment - Zeroing gradients
Problem:You are training a simple neural network on a small dataset. The model's loss is not decreasing as expected during training.
Current Metrics:Epoch 1: loss=0.85, Epoch 2: loss=0.83, Epoch 3: loss=0.82 (minimal improvement)
Issue:The gradients are not being zeroed before each backward pass, causing gradient accumulation and incorrect weight updates.