Practice

(1/5)

What is the main goal of image-to-image transformation in AI?

easy

A. To change an input image into a different output image automatically

B. To classify images into categories

C. To detect objects inside an image

D. To generate text from an image

Solution

Step 1: Understand the purpose of image-to-image transformation
This technique changes one image into another, like coloring or style transfer.
Step 2: Compare with other image tasks
Classification, detection, and text generation are different tasks, not image transformation.
Final Answer:
To change an input image into a different output image automatically -> Option A
Quick Check:
Image-to-image transformation = change image [OK]

Hint: Image-to-image means input image changes to output image [OK]

Common Mistakes:

Confusing transformation with classification
Thinking it detects objects instead of changing images
Mixing it up with text generation from images

Which of the following is the correct way to describe an image-to-image model's input and output?

Input: ?
Output: ?

easy

A. Input: Image, Output: Image

B. Input: Text, Output: Image

C. Input: Image, Output: Text

D. Input: Number, Output: Image

Solution

Step 1: Identify input type for image-to-image models
These models take an image as input to transform it.
Step 2: Identify output type for image-to-image models
The output is also an image, changed in style, color, or content.
Final Answer:
Input: Image, Output: Image -> Option A
Quick Check:
Input and output both images [OK]

Hint: Both input and output are images in image-to-image tasks [OK]

Common Mistakes:

Confusing input as text or numbers
Thinking output is text instead of image
Mixing input/output types

Consider this simplified Python code using a model for image-to-image transformation:

input_image = load_image('sketch.png')
output_image = model.transform(input_image)
save_image(output_image, 'colorized.png')
print(type(output_image))

What will be printed?

medium

A. <class 'str'>

B. <class 'numpy.ndarray'>

C. <class 'PIL.Image.Image'>

D. Error: model.transform is not defined

Solution

Step 1: Understand typical output type of image-to-image models
Most models output images as numpy arrays representing pixel data.
Step 2: Check code for output type
Since model.transform returns an image, it is usually a numpy.ndarray, not a PIL Image or string.
Final Answer:
<class 'numpy.ndarray'> -> Option B
Quick Check:
Model output image = numpy array [OK]

Hint: Model outputs image arrays, not strings or PIL objects [OK]

Common Mistakes:

Assuming output is a string filename
Confusing PIL Image with numpy array
Expecting error without context

Look at this code snippet for image-to-image transformation:

def transform_image(model, img_path):
    img = load_image(img_path)
    result = model.transform(img)
    return result

output = transform_image(my_model, 12345)
print(type(output))

What is the main error here?

medium

A. The function returns None instead of an image

B. The model.transform method does not exist

C. The image path should be a string, not a number

D. The print statement is missing parentheses

Solution

Step 1: Check the argument passed to load_image
load_image expects a file path string, but 12345 is a number, causing an error.
Step 2: Verify other code parts
model.transform and print syntax are correct; function returns result properly.
Final Answer:
The image path should be a string, not a number -> Option C
Quick Check:
Image path must be string [OK]

Hint: File paths must be strings, not numbers [OK]

Common Mistakes:

Thinking model.transform is missing
Ignoring argument type for image path
Confusing print syntax in Python 3

You want to build an image-to-image model that converts black-and-white sketches into colored images. Which approach is best?

A dataset has pairs of sketches and their colored versions.

hard

A. Train a text-to-image model with sketch descriptions

B. Use unsupervised clustering on sketches only

C. Apply image classification on sketches

D. Train a supervised model using paired sketch and color images

Solution

Step 1: Identify the task type
Converting sketches to colored images is a paired image-to-image translation task.
Step 2: Choose the right training method
Supervised learning with paired data (sketch and color image) is best to learn direct mapping.
Step 3: Evaluate other options
Unsupervised clustering, text-to-image, and classification do not fit this paired transformation task.
Final Answer:
Train a supervised model using paired sketch and color images -> Option D
Quick Check:
Paired data needs supervised training [OK]

Hint: Use paired images for supervised training in image-to-image tasks [OK]

Common Mistakes:

Choosing unsupervised methods without paired data
Confusing text-to-image with image-to-image
Using classification instead of transformation

Epoch	Loss ↓	Accuracy ↑	Observation
1	1.2	0.45	Initial training, loss high, accuracy low
2	0.9	0.6	Model starts learning image features
3	0.7	0.72	Better style transfer, loss decreasing
4	0.5	0.8	Model improving, clearer output images
5	0.35	0.87	Good style transfer, loss low, accuracy high

Image-to-image transformation in Prompt Engineering / GenAI - Model Pipeline Trace

Start learning this pattern below

Practice

Solution

Step 1: Understand the purpose of image-to-image transformation

Step 2: Compare with other image tasks

Final Answer:

Quick Check:

Solution

Step 1: Identify input type for image-to-image models

Step 2: Identify output type for image-to-image models

Final Answer:

Quick Check:

Solution

Step 1: Understand typical output type of image-to-image models

Step 2: Check code for output type

Final Answer:

Quick Check:

Solution

Step 1: Check the argument passed to load_image

Step 2: Verify other code parts

Final Answer:

Quick Check:

Solution

Step 1: Identify the task type

Step 2: Choose the right training method

Step 3: Evaluate other options

Final Answer:

Quick Check: