When a hand landmark detection model processes an image, what does its output typically represent?
Think about what 'landmark' means in this context.
Landmark detection models output the coordinates of specific keypoints, such as fingertips on a hand or the corners of the eyes and mouth on a face, so that the positions of those features in the image can be tracked.
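As a concrete sketch, suppose a hand landmark model returns 21 points with (x, y, z) coordinates, as MediaPipe Hands does; other models may use a different count or indexing convention:

```python
import numpy as np

# Hypothetical output of a hand landmark model: 21 keypoints, each (x, y, z).
# 21 is the count used by MediaPipe Hands; other models may differ.
landmarks = np.random.rand(21, 3)

# In the MediaPipe Hands convention, index 8 is the index fingertip.
index_tip = landmarks[8]
print(landmarks.shape)  # (21, 3)
```

Each row is one keypoint, so downstream code (gesture recognition, tracking) works with positions rather than raw pixels.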
Given a face landmark detection model that detects 468 points per face, what is the shape of the output tensor for a batch of 5 images?
batch_size = 5
num_landmarks = 468
output_shape = (batch_size, num_landmarks, 3)  # x, y, z coordinates
Consider batch size first, then landmarks, then coordinates.
The output tensor shape is (batch_size, number_of_landmarks, coordinates_per_point). For 5 images and 468 landmarks with 3D coordinates, it is (5, 468, 3).
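A quick check of that shape with a simulated batch output (NumPy used here purely for illustration):

```python
import numpy as np

batch_size = 5       # images in the batch
num_landmarks = 468  # points detected per face
coords = 3           # x, y, z per point

# Simulated model output for the whole batch
output = np.zeros((batch_size, num_landmarks, coords))
print(output.shape)  # (5, 468, 3)
```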
You want to build a mobile app that detects hand landmarks in real-time video. Which model architecture is best suited?
Think about balancing speed and accuracy on mobile devices.
MobileNet-based models are designed to be lightweight and fast, making them ideal for real-time applications on mobile devices.
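The efficiency comes largely from depthwise separable convolutions. A rough parameter count for one layer, using example channel sizes chosen here for illustration, shows the saving:

```python
# Parameter counts for a standard 3x3 convolution versus the
# depthwise separable version MobileNet uses (illustrative arithmetic).
k, c_in, c_out = 3, 64, 128

standard = k * k * c_in * c_out                    # full 3x3 conv
depthwise_separable = k * k * c_in + c_in * c_out  # depthwise + 1x1 pointwise

print(standard, depthwise_separable)  # 73728 8768
```

Roughly an 8x reduction in this configuration, which is why such layers run at video frame rates on phones.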
Which metric best measures the accuracy of predicted hand landmarks compared to ground truth points?
Focus on how close predicted points are to actual points.
Mean squared error (MSE) averages the squared differences between predicted and ground-truth landmark coordinates, so lower values mean the predicted points lie closer to the actual points.
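A minimal MSE computation over a couple of 2D landmarks (the values are made up for the example):

```python
import numpy as np

# Mean squared error over landmark coordinates.
pred = np.array([[0.1, 0.2], [0.4, 0.6]])  # predicted (x, y) points
true = np.array([[0.1, 0.3], [0.5, 0.6]])  # ground-truth points

mse = np.mean((pred - true) ** 2)
print(mse)  # 0.005
```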
You notice that your face landmark detection model sometimes predicts landmarks outside the face region. What is the most likely cause?
Think about input data quality and consistency.
If input images are not normalized or preprocessed the same way at inference time as during training, the model receives out-of-distribution inputs and may predict landmarks that fall outside the face region.
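A sketch of the kind of consistent preprocessing step this implies; the exact scaling (here, [0, 255] pixels mapped to [-1, 1]) is an assumption and must match whatever the model was trained with:

```python
import numpy as np

def preprocess(image: np.ndarray) -> np.ndarray:
    """Scale uint8 pixels to [-1, 1]; must match training-time preprocessing."""
    img = image.astype(np.float32) / 255.0  # scale to [0, 1]
    return (img - 0.5) / 0.5                # map to [-1, 1]

raw = np.array([[0, 128, 255]], dtype=np.uint8)
print(preprocess(raw))  # all values now lie in [-1, 1]
```

Applying the identical function in both the training pipeline and the app removes one common source of wildly misplaced landmarks.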