Practice

(1/5)

1. What is the main purpose of output guardrails in AI systems?

easy

A. To speed up AI training time

B. To guide AI to give safe and useful answers

C. To increase the size of AI models

D. To reduce the number of AI layers

Solution

Step 1: Understand output guardrails
Output guardrails are rules that help AI give answers that are safe and useful.
Step 2: Identify the main goal
The main goal is to guide AI responses to be helpful and respectful, avoiding harmful or irrelevant content.
Final Answer:
To guide AI to give safe and useful answers -> Option B
Quick Check:
Output guardrails = safe and useful answers [OK]

Hint: Guardrails keep AI answers safe and helpful [OK]

Common Mistakes:

Confusing guardrails with training speed
Thinking guardrails increase model size
Assuming guardrails reduce AI layers

2. Which of the following is a correct example of an output guardrail rule?

easy

A. Block certain harmful words from AI responses

B. Allow AI to generate any length of text without limits

C. Train AI with more data to improve accuracy

D. Increase AI model layers for better output

Solution

Step 1: Identify output guardrail examples
Output guardrails include rules like blocking harmful words or limiting response length.
Step 2: Match the correct rule
Blocking harmful words is a direct guardrail to keep AI responses safe.
Final Answer:
Block certain harmful words from AI responses -> Option A
Quick Check:
Guardrail = block harmful words [OK]

Hint: Guardrails block harmful words, not increase model size [OK]

Common Mistakes:

Confusing training improvements with guardrails
Thinking guardrails allow unlimited text
Mixing model architecture changes with guardrails

3. Given this simple AI output guardrail code snippet in Python:

blocked_words = ['badword']
def filter_output(text):
    for word in blocked_words:
        if word in text:
            return 'Content blocked due to policy.'
    return text

print(filter_output('This is a badword example.'))

What will be the printed output?

medium

A. This is a badword example.

B. Error: blocked_words not defined

C. None

D. Content blocked due to policy.

Solution

Step 1: Analyze the filter_output function
The function checks if any blocked word is in the input text. If found, it returns a block message.
Step 2: Check the input text
The input text contains 'badword', which is in blocked_words, so the function returns the block message.
Final Answer:
Content blocked due to policy. -> Option D
Quick Check:
Blocked word found = block message [OK]

Hint: If blocked word in text, output block message [OK]

Common Mistakes:

Ignoring the blocked word check
Assuming original text prints always
Confusing variable scope errors

4. Consider this Python code meant to limit AI output length:

def limit_length(text, max_len=10):
    if len(text) > max_len:
        return text[:max_len]
    else:
        return text

print(limit_length('Hello, world!'))

What is the output and is there any bug?

medium

A. 'Hello, world!' and no bug

B. Error due to missing return

C. 'Hello, worl' and no bug

D. 'Hello, wor' and no bug

Solution

Step 1: Check the function logic
If text length is more than 10, it returns first 10 characters; else returns full text.
Step 2: Apply to input 'Hello, world!'
Input length is 13, so it returns text[:10] which is 'Hello, worl'.
Final Answer:
'Hello, worl' and no bug -> Option C
Quick Check:
Length limit applied correctly [OK]

Hint: Slice text to max length if too long [OK]

Common Mistakes:

Counting 11 characters instead of 10
Assuming no slicing happens
Thinking code has syntax errors

5. You want to create an output guardrail that blocks any AI response containing both 'error' and 'fail' words, but allows responses with only one of them. Which Python code snippet correctly implements this?

hard

A. def guard(text): if 'error' in text and 'fail' in text: return 'Response blocked.' return text

B. def guard(text): if 'error' in text or 'fail' in text: return 'Response blocked.' return text

C. def guard(text): if 'error' not in text and 'fail' not in text: return 'Response blocked.' return text

D. def guard(text): if 'error' in text and 'fail' not in text: return 'Response blocked.' return text

Solution

Step 1: Understand the condition
The guardrail should block only if both 'error' and 'fail' appear together.
Step 2: Check each option logic
def guard(text): if 'error' in text and 'fail' in text: return 'Response blocked.' return text uses 'and' to check both words, blocking only when both are present, which matches the requirement.
Final Answer:
def guard(text): if 'error' in text and 'fail' in text: return 'Response blocked.' return text -> Option A
Quick Check:
Block if both words present = def guard(text): if 'error' in text and 'fail' in text: return 'Response blocked.' return text [OK]

Hint: Use 'and' to require both words for blocking [OK]

Common Mistakes:

Using 'or' blocks if either word appears
Negating conditions incorrectly
Blocking only one word instead of both

Epoch	Loss ↓	Accuracy ↑	Observation
1	1.2	0.45	Model starts learning basic language patterns.
2	0.9	0.6	Model improves in generating coherent text.
3	0.7	0.75	Model begins to produce relevant and safe outputs.
4	0.5	0.85	Output guardrails help reduce unsafe or irrelevant outputs.
5	0.4	0.9	Model converges with high-quality, safe outputs.

Output guardrails in Prompt Engineering / GenAI - Model Pipeline Trace

Start learning this pattern below

Practice

Solution

Step 1: Understand output guardrails

Step 2: Identify the main goal

Final Answer:

Quick Check:

Solution

Step 1: Identify output guardrail examples

Step 2: Match the correct rule

Final Answer:

Quick Check:

Solution

Step 1: Analyze the filter_output function

Step 2: Check the input text

Final Answer:

Quick Check:

Solution

Step 1: Check the function logic

Step 2: Apply to input 'Hello, world!'

Final Answer:

Quick Check:

Solution

Step 1: Understand the condition

Step 2: Check each option logic

Final Answer:

Quick Check: