Image-to-image transformation means changing one image into another, like coloring a black-and-white photo or turning a sketch into a photo. To check how well this works, we use metrics that compare the output image to the target image.
Common metrics are:
- Mean Squared Error (MSE): Measures average squared difference between pixels. Lower is better.
- Peak Signal-to-Noise Ratio (PSNR): Shows how clear the output image is compared to noise. Higher is better.
- Structural Similarity Index (SSIM): Checks if the output image looks similar in structure and texture to the target. Values close to 1 mean very similar.
- Frechet Inception Distance (FID): Measures how close the output images are to real images in a learned feature space. Lower is better.
We pick metrics that match the goal: if we want pixel accuracy, MSE or PSNR help. If we want realistic or natural images, SSIM or FID are better.