0
0
Computer Visionml~5 mins

Mask R-CNN overview in Computer Vision - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is Mask R-CNN?
Mask R-CNN is a computer vision model that detects objects in images and also creates a mask to show the exact shape of each object.
Click to reveal answer
beginner
What are the main tasks Mask R-CNN performs?
Mask R-CNN performs three main tasks: 1) Detect objects and their locations, 2) Classify each object, 3) Generate a pixel-level mask for each object.
Click to reveal answer
intermediate
How does Mask R-CNN extend Faster R-CNN?
Mask R-CNN adds a branch to Faster R-CNN that predicts masks for each detected object, enabling precise segmentation along with detection and classification.
Click to reveal answer
intermediate
What is the role of the Region Proposal Network (RPN) in Mask R-CNN?
The RPN suggests candidate regions in the image where objects might be, so the model can focus on these areas for detection and mask prediction.
Click to reveal answer
beginner
Why is mask prediction important in Mask R-CNN?
Mask prediction helps to find the exact shape of each object, not just a box around it, which is useful for tasks like image editing, medical imaging, and robotics.
Click to reveal answer
What additional output does Mask R-CNN produce compared to Faster R-CNN?
APixel-level masks for each object
BOnly bounding boxes
CImage captions
DEdge detection maps
Which part of Mask R-CNN suggests where objects might be in the image?
ARegion Proposal Network (RPN)
BMask branch
CFully connected layers
DConvolutional filters
Mask R-CNN is mainly used for which type of computer vision task?
AImage generation
BInstance segmentation
CImage classification
DOptical character recognition
What does the mask branch in Mask R-CNN output?
AA heatmap of edges
BA bounding box
CA binary mask for each detected object
DA class label
Why is Mask R-CNN preferred over just bounding box detection in some applications?
AIt requires no training data
BIt runs faster
CIt uses less memory
DIt provides precise object shapes, not just boxes
Explain how Mask R-CNN detects and segments objects in an image.
Think about the steps from finding objects to drawing their shapes.
You got /4 concepts.
    Describe the difference between Mask R-CNN and Faster R-CNN.
    Focus on what extra output Mask R-CNN provides.
    You got /4 concepts.