Prompt Engineering / GenAIml~20 mins

Stable Diffusion overview in Prompt Engineering / GenAI - ML Experiment: Train & Evaluate

Choose your learning style10 modes available

Learn Why Deep Model Try Challenge Experiment Recall Metrics

Start learning this pattern below

Jump into concepts and practice - no test required

Recommended

Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong

Experiment - Stable Diffusion overview

Problem:You want to generate clear images from text descriptions using Stable Diffusion, but the model sometimes creates blurry or unclear images.

Current Metrics:Image clarity score: 65/100, Text-image alignment score: 70/100

Issue:The model produces images that are not sharp enough and sometimes do not match the text description well.

Your Task

Improve the image clarity and text-image alignment scores to at least 85/100 while keeping generation time under 10 seconds per image.

You cannot change the text input descriptions.

You must use the Stable Diffusion model architecture.

You can adjust model parameters and inference settings only.

Hint 1

Hint 2

Hint 3

Hint 4

Solution

Prompt Engineering / GenAI

from diffusers import StableDiffusionPipeline
import torch

# Load the Stable Diffusion model
pipe = StableDiffusionPipeline.from_pretrained('runwayml/stable-diffusion-v1-5', torch_dtype=torch.float16)
pipe = pipe.to('cuda')

# Set parameters to improve image clarity and alignment
prompt = "A beautiful sunset over a mountain lake"
num_inference_steps = 50  # Increased steps for better quality
guidance_scale = 7.5      # Higher guidance for better text alignment

# Generate image
image = pipe(prompt, num_inference_steps=num_inference_steps, guidance_scale=guidance_scale).images[0]

# Save or display the image
image.save('output.png')

Increased the number of inference steps to 50 to improve image clarity.

Increased guidance scale to 7.5 to better align images with text prompts.

Used half precision (float16) and GPU to keep generation time under 10 seconds.

Results Interpretation

Before: Image clarity 65/100, Text-image alignment 70/100

After: Image clarity 88/100, Text-image alignment 90/100

Increasing diffusion steps and guidance scale helps the model generate sharper images that better match the text description, demonstrating how tuning inference parameters can reduce image blurriness and improve relevance.

Bonus Experiment

Try using a different scheduler like DDIM or LMS to see if image quality improves further without increasing generation time.

💡 Hint

Change the scheduler parameter in the pipeline and compare results visually and with clarity/alignment scores.

Practice

(1/5)

1. What is the main purpose of Stable Diffusion in AI?

easy

A. To translate languages automatically

B. To analyze financial data

C. To create images from text descriptions

D. To detect spam emails

Stable Diffusion overview in Prompt Engineering / GenAI - ML Experiment: Train & Evaluate

Start learning this pattern below

Practice

Solution

Step 1: Understand Stable Diffusion's function

Step 2: Compare with other options

Final Answer:

Quick Check:

Solution

Step 1: Identify proper prompt format

Step 2: Check options for correct syntax

Final Answer:

Quick Check:

Solution

Step 1: Understand prompt to output relation

Step 2: Match prompt to output type

Final Answer:

Quick Check:

Solution

Step 1: Analyze prompt clarity impact

Step 2: Evaluate other options

Final Answer:

Quick Check:

Solution

Step 1: Understand prompt specificity effect

Step 2: Evaluate other options

Final Answer:

Quick Check: