Complete the code to load an image using PIL.
from PIL import Image
img = Image.[1]('example.jpg')
The Image.open function loads an image file into memory for processing.
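A minimal self-contained sketch of the call in question: since 'example.jpg' from the exercise may not exist on disk, this builds a small JPEG in memory first and opens it the same way Image.open would open a file path.

```python
from io import BytesIO
from PIL import Image

# Build a tiny in-memory JPEG so the sketch runs without 'example.jpg'.
buf = BytesIO()
Image.new('RGB', (64, 48), color=(255, 0, 0)).save(buf, format='JPEG')
buf.seek(0)

# Same call as Image.open('example.jpg'), just with a file-like object.
img = Image.open(buf)
print(img.size, img.mode)  # (64, 48) RGB
```

Image.open is lazy: it reads the header immediately but decodes pixel data only when it is first needed.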
Complete the code to convert an image to a tensor for model input.
import torchvision.transforms as transforms
transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225])
])
tensor_img = transform([1])
The variable img holds the loaded image, which we convert to a tensor.
Fix the error in the code to generate image captions using a pretrained model.
from transformers import VisionEncoderDecoderModel, ViTImageProcessor, AutoTokenizer
model = VisionEncoderDecoderModel.from_pretrained('nlpconnect/vit-gpt2-image-captioning')
processor = ViTImageProcessor.from_pretrained('nlpconnect/vit-gpt2-image-captioning')
tokenizer = AutoTokenizer.from_pretrained('nlpconnect/vit-gpt2-image-captioning')
pixel_values = processor(images=img, return_tensors='pt').[1]
output_ids = model.generate(pixel_values)
caption = tokenizer.decode(output_ids[0], skip_special_tokens=True)
print(caption)
The processor returns a BatchFeature, not a tensor; accessing its pixel_values attribute extracts the tensor, which already has a batch dimension because return_tensors='pt' was used.
Fill both blanks to create a dictionary of image features and their lengths.
features = {img_id: [1] for img_id, img in images.items() if len([2]) > 0}
We extract pixel values from each image and keep only entries whose pixel data is non-empty.
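A sketch of one plausible completion, using a hypothetical images dict whose values mimic loaded images as lists of pixel tuples (the real exercise presumably uses PIL image objects): the feature kept is the image data itself, and empty images are filtered out.

```python
# Hypothetical stand-in for the exercise's `images` dict: each value
# is a list of pixel tuples, standing in for a loaded image's data.
images = {
    'img1': [(255, 0, 0), (0, 255, 0)],
    'img2': [],                          # empty image, should be dropped
    'img3': [(10, 20, 30)],
}

# Keep the pixel data as the feature; skip entries with no pixels.
features = {img_id: img for img_id, img in images.items() if len(img) > 0}
print(sorted(features))  # ['img1', 'img3']
```

The len() check in the comprehension's condition is what drops 'img2' before it ever enters the result dict.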
Fill all three blanks to filter captions longer than 5 words and create a summary dictionary.
summary = {img_id: caption for img_id, caption in captions.items() if len(caption.[1]()) > [2] and caption.[3](' ') > 0}
We split captions into words, check whether the word count is greater than 5, and count spaces to ensure the caption has multiple words.
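A sketch of one plausible completion over a hypothetical captions dict: split() tokenizes on whitespace, 5 is the word-count threshold, and count(' ') confirms the caption contains more than one word.

```python
# Hypothetical stand-in for the exercise's `captions` dict.
captions = {
    'img1': 'a red car parked on a quiet street at night',
    'img2': 'a cat',  # too short, should be filtered out
    'img3': 'two dogs playing fetch in a sunny park today',
}

summary = {img_id: caption for img_id, caption in captions.items()
           if len(caption.split()) > 5 and caption.count(' ') > 0}
print(sorted(summary))  # ['img1', 'img3']
```

Strictly speaking the count(' ') check is redundant here, since any caption with more than 5 words necessarily contains spaces, but it matches the structure the exercise asks for.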