Imagine you have a smart assistant that looks at pictures and tells you what it sees in simple sentences. What is the main job of this assistant?
Think about what it means to 'describe' an image in words.
An image captioning model looks at an image and produces a sentence or phrase that describes what is in the image. This helps people understand the image content through text.
Given the following simplified code that uses a pre-trained image captioning model, what will be printed?
image = load_image('dog_park.jpg')
caption = model.generate_caption(image)
print(caption)
The model generates a text description of the image, not just the filename.
The code loads an image and uses the model to generate a caption describing the image content. The print statement outputs that caption.
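To make the idea concrete, here is a minimal runnable sketch of that snippet. `load_image` and `CaptionModel` are hypothetical stand-ins, not a real library API; a real model would produce the caption from the image pixels rather than a lookup table.

```python
# Toy stand-in for a pre-trained captioning model (hypothetical API).
class CaptionModel:
    """Maps known images to fixed captions, for illustration only."""
    def __init__(self, captions):
        self.captions = captions

    def generate_caption(self, image):
        # A real model would run the image through a neural network.
        return self.captions.get(image, "an image")

def load_image(path):
    # Stand-in loader: a real one would return pixel data.
    return path

model = CaptionModel({"dog_park.jpg": "a dog playing in a park"})

image = load_image("dog_park.jpg")
caption = model.generate_caption(image)
print(caption)  # a dog playing in a park
```

The printed output is the generated description of the image content, not the filename.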
You want to build a system that looks at images and writes sentences describing them. Which model type is most appropriate?
Think about how images and sentences are processed differently and how to combine them.
A CNN encodes the image into feature vectors, and an RNN decodes those features into a sequence of words. Combining a CNN encoder with an RNN decoder lets the model both understand the image and produce a text description.
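The encoder–decoder pattern can be sketched with toy stand-ins. A real system would use an actual CNN (e.g. a ResNet) and an RNN/LSTM; here `cnn_encode` and `rnn_decode` are hypothetical functions that only mimic the shape of the pipeline: encode the image to a feature, then emit words one at a time conditioned on the previous word.

```python
def cnn_encode(image_pixels):
    """Stand-in 'CNN': summarize the image as a single feature value."""
    return sum(image_pixels) / len(image_pixels)

def rnn_decode(feature, max_words=5):
    """Stand-in 'RNN': emit words step by step, each conditioned on
    the image feature and the previously generated word."""
    next_word = {"bright": "sunny", "sunny": "park"}
    word = "bright" if feature > 0.5 else "dark"
    words = [word]
    while len(words) < max_words and words[-1] in next_word:
        words.append(next_word[words[-1]])
    return " ".join(words)

feature = cnn_encode([0.9, 0.8, 0.7])
print(rnn_decode(feature))  # bright sunny park
```

The key design point this mirrors is the split of responsibilities: the encoder handles spatial image structure, the decoder handles sequential word generation.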
After training an image captioning model, you want to measure how good its descriptions are compared to human-written captions. Which metric should you use?
Think about metrics that compare text similarity.
BLEU score measures how closely generated captions match human-written references by counting overlapping n-grams (clipped word-sequence matches), making it a standard metric for caption quality evaluation.
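The core idea behind BLEU can be illustrated with clipped unigram precision. This is a simplification: real BLEU combines precisions over several n-gram sizes and adds a brevity penalty (libraries such as NLTK's `nltk.translate.bleu_score` implement the full metric).

```python
def unigram_precision(candidate, reference):
    """Fraction of candidate words that appear in the reference,
    with each reference word usable at most once (clipping)."""
    ref_counts = {}
    for w in reference.split():
        ref_counts[w] = ref_counts.get(w, 0) + 1
    cand = candidate.split()
    matches = 0
    for w in cand:
        if ref_counts.get(w, 0) > 0:
            matches += 1
            ref_counts[w] -= 1  # clip: consume the matched reference word
    return matches / len(cand)

generated = "a dog in the park"
human = "a dog runs in the park"
print(unigram_precision(generated, human))  # 1.0: every generated word matches
```

Higher overlap with the human caption yields a higher score, which is exactly the behavior a caption-quality metric needs.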
Consider this simplified code snippet, where the model generates a caption that repeats the same word:
caption = model.generate_caption(image)
print(caption)  # Output: "dog dog dog dog dog"
What is the most likely cause?
Think about how the model chooses words during caption generation.
If beam search or greedy decoding is faulty (for example, the decoder is fed the same input token at every step instead of the last generated word), the model can get stuck emitting one word over and over. This is a common decoding bug in sequence generation models.
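A minimal sketch of this failure mode, using a hypothetical toy "model" that simply maps the previous word to the next word. The buggy loop never updates the decoder input, so it repeats the first word; the fixed loop feeds each generated word back in.

```python
# Hypothetical next-word table standing in for the decoder.
next_word = {"<start>": "dog", "dog": "runs", "runs": "fast"}

def buggy_decode(steps=5):
    prev = "<start>"
    out = []
    for _ in range(steps):
        out.append(next_word.get(prev, "<end>"))
        # BUG: `prev` is never updated, so the same word repeats.
    return " ".join(out)

def fixed_decode(steps=5):
    prev = "<start>"
    out = []
    for _ in range(steps):
        word = next_word.get(prev)
        if word is None:
            break
        out.append(word)
        prev = word  # feed the generated word back as the next input
    return " ".join(out)

print(buggy_decode())  # dog dog dog dog dog
print(fixed_decode())  # dog runs fast
```

The symptom in the question ("dog dog dog dog dog") is exactly what the buggy loop produces: the decoder's state is not advanced between steps.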