Which statement correctly describes the difference between semantic segmentation and instance segmentation?
Think about whether the method separates individual objects or just labels the type of object.
Semantic segmentation assigns a class label to every pixel but does not separate different objects of the same class. Instance segmentation does both: it labels pixels and separates individual objects.
Given an input image of size 256x256 pixels, what is the shape of the output mask for semantic segmentation and instance segmentation respectively?
Semantic segmentation outputs a class label per pixel; instance segmentation outputs masks per object.
Semantic segmentation outputs a mask with one channel per class, so shape is (height, width, num_classes). Instance segmentation outputs a mask per detected object, so shape is (num_instances, height, width).
You want to detect and separate multiple objects of the same class in images. Which model architecture is best suited for this task?
Look for a model that can detect objects and segment them individually.
Mask R-CNN performs instance segmentation by detecting objects and generating masks for each instance. U-Net and FCN are mainly for semantic segmentation. ResNet is for classification.
Which metric is commonly used to evaluate both semantic and instance segmentation performance?
Think about a metric that compares predicted and true pixel areas.
Mean Intersection over Union (mIoU) measures overlap between predicted and true masks and is widely used for both semantic and instance segmentation. Bounding box accuracy is for detection, BLEU is for language, MSE is for regression.
You trained an instance segmentation model but the output masks overlap heavily, merging different objects into one mask. What is the most likely cause?
Consider how overlapping predictions are filtered in instance segmentation.
Non-maximum suppression (NMS) removes overlapping detections. If its threshold is too high, masks that should be separate merge. Learning rate and image size affect training quality but not mask merging. Using semantic segmentation would not produce instance masks.