0
0
Computer Visionml~12 mins

OpenPose overview in Computer Vision - Model Pipeline Trace

Choose your learning style9 modes available
Model Pipeline - OpenPose overview

OpenPose is a system that detects human body parts and their positions in images or videos. It finds key points like elbows, knees, and wrists to understand human poses.

Data Flow - 5 Stages
1Input Image
1 image x 368 height x 368 width x 3 channelsResize and normalize the input image1 image x 368 height x 368 width x 3 channels
A photo of a person standing in a room
2Feature Extraction
1 image x 368 x 368 x 3Pass image through CNN layers to extract features1 image x 46 x 46 x 128 feature maps
Feature maps highlighting edges and textures of the person
3Part Confidence Maps
1 image x 46 x 46 x 128Predict heatmaps showing likelihood of each body part at each location1 image x 46 x 46 x 18 (body parts)
Heatmap with bright spots where wrists and elbows likely are
4Part Affinity Fields
1 image x 46 x 46 x 128Predict vector fields showing connections between body parts1 image x 46 x 46 x 36 (connections)
Vector fields pointing from elbow to wrist
5Pose Assembly
Confidence maps and affinity fieldsCombine detected parts and connections to form full body posesList of poses with keypoint coordinates
Coordinates of detected wrists, elbows, knees for each person
Training Trace - Epoch by Epoch
Loss:
2.5 |*****
1.2 |****
0.7 |***
0.4 |**
0.3 |*

Epochs ->
EpochLoss ↓Accuracy ↑Observation
12.50.30Model starts learning basic body part locations
51.20.55Confidence maps and affinity fields improve
100.70.75Model detects body parts more accurately
150.40.85Pose assembly becomes reliable
200.30.90Model converges with good pose detection
Prediction Trace - 5 Layers
Layer 1: Input Image
Layer 2: Feature Extraction CNN
Layer 3: Part Confidence Maps Prediction
Layer 4: Part Affinity Fields Prediction
Layer 5: Pose Assembly
Model Quiz - 3 Questions
Test your understanding
What does the Part Confidence Maps stage output?
ARaw input images
BHeatmaps showing where body parts are likely located
CVector fields connecting body parts
DFinal pose coordinates
Key Insight
OpenPose uses a two-step approach: first detecting body parts with confidence maps, then connecting them with affinity fields. This helps it accurately find human poses even in complex images.