0
0
Computer Visionml~20 mins

Depth estimation basics in Computer Vision - Practice Problems & Coding Challenges

Choose your learning style9 modes available
Challenge - 5 Problems
🎖️
Depth Estimation Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
🧠 Conceptual
intermediate
1:00remaining
What does depth estimation predict?
In simple terms, what is the main output of a depth estimation model when given a single image?
AA map showing the distance of each pixel from the camera
BA classification label for the type of scene in the image
CA color-enhanced version of the original image
DA set of bounding boxes around detected objects
Attempts:
2 left
💡 Hint
Think about what 'depth' means in a photo.
Predict Output
intermediate
1:30remaining
Output shape of depth map from CNN
Given a convolutional neural network that takes a 256x256 RGB image and outputs a depth map, what is the most likely shape of the output tensor?
Computer Vision
import torch
import torch.nn as nn

class SimpleDepthNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 1, kernel_size=3, padding=1)
    def forward(self, x):
        return self.conv(x)

model = SimpleDepthNet()
input_tensor = torch.randn(1, 3, 256, 256)
output = model(input_tensor)
output.shape
Atorch.Size([1, 3, 256, 256])
Btorch.Size([1, 1, 254, 254])
Ctorch.Size([1, 1, 256, 256])
Dtorch.Size([256, 256])
Attempts:
2 left
💡 Hint
Check the padding and output channels of the convolution.
Model Choice
advanced
1:30remaining
Best model type for monocular depth estimation
Which model architecture is most suitable for estimating depth from a single RGB image?
AA simple fully connected network without convolution
BA convolutional encoder-decoder network with skip connections
CA recurrent neural network designed for text sequences
DA clustering algorithm like K-means
Attempts:
2 left
💡 Hint
Think about preserving spatial details and image features.
Metrics
advanced
1:30remaining
Choosing the right metric for depth estimation
Which metric is commonly used to measure the accuracy of predicted depth maps compared to ground truth?
AMean Absolute Error (MAE) between predicted and true depth values
BIntersection over Union (IoU) for object detection
CAccuracy of classification labels
DBLEU score for language translation
Attempts:
2 left
💡 Hint
The metric should measure how close predicted distances are to actual distances.
🔧 Debug
expert
2:00remaining
Why does the depth model output all zeros?
You trained a depth estimation model but the output depth map is all zeros for every input image. Which is the most likely cause?
AThe optimizer learning rate is too high
BThe input images are grayscale instead of RGB
CThe batch size is set to 1 during training
DThe final activation function is ReLU and the model weights are initialized to zero
Attempts:
2 left
💡 Hint
Consider how ReLU and zero weights affect output values.