Complete the code to define the input shape for the YOLO model.
input_shape = ([1], [2], 3)
The YOLO model commonly uses 416x416 pixel input images with 3 color channels (RGB).
Complete the code to create a convolutional layer in YOLO with 32 filters.
conv_layer = Conv2D([1], (3, 3), padding='same', activation='leaky_relu')
YOLO starts with a convolutional layer of 32 filters to extract features.
Fix the error in the YOLO output layer code to predict bounding boxes.
output = Conv2D([1], (1, 1), activation='linear')(previous_layer)
The output layer predicts 255 values per grid cell for 3 bounding boxes: 3 × (4 box coords + 1 objectness + 80 class probabilities) = 255.
Fill both blanks to complete the YOLO loss function components.
loss = [1] + [2]
YOLO loss combines localization loss (for bounding box accuracy) and classification loss (for object class prediction).
Fill all three blanks to complete the YOLO detection pipeline steps.
detections = [1](image) filtered = [2](detections, threshold=0.5) final_boxes = [3](filtered)
YOLO pipeline: predict boxes, apply non-max suppression to remove overlaps, then scale boxes to original image size.