Practice - 5 Tasks

Answer the questions below

1fill in blank

easy

Complete the code to define a simple human evaluation metric function that returns the average score.

Prompt Engineering / GenAI

def average_score(scores):
    return sum(scores) / [1]

Drag options to blanks, or click blank then click option'

Alen(scores)

Bsum(scores)

Cmax(scores)

Dmin(scores)

Attempts:

3 left

2fill in blank

medium

Complete the code to calculate the inter-rater agreement using Cohen's kappa formula denominator.

Prompt Engineering / GenAI

def cohen_kappa_denominator(p0, pe):
    return [1] - pe

Drag options to blanks, or click blank then click option'

Bpe

Cp0

Attempts:

3 left

3fill in blank

hard

Fix the error in the code to compute the average human evaluation score from a dictionary of scores.

Prompt Engineering / GenAI

def average_human_score(scores_dict):
    total = sum(scores_dict.values())
    count = len([1])
    return total / count

Drag options to blanks, or click blank then click option'

Ascores_dict.keys()

Bscores_dict

Cscores_dict.items()

Dscores_dict.values()

Attempts:

3 left

4fill in blank

hard

Fill both blanks to create a dictionary comprehension that filters human evaluation scores above 3.

Prompt Engineering / GenAI

filtered_scores = [1]: score for [2], score in scores.items() if score > 3}

Drag options to blanks, or click blank then click option'

Arater

Bscore

Crater_id

Dscore_value

Attempts:

3 left

5fill in blank

hard

Fill all three blanks to compute the weighted average human evaluation score.

Prompt Engineering / GenAI

def weighted_average(scores, weights):
    total_weighted = sum(scores[i] * [1] for i in range(len(scores)))
    total_weight = sum([2])
    return total_weighted / [3]

Drag options to blanks, or click blank then click option'

Aweights[i]

Bweights

Ctotal_weight

Dscores[i]

Attempts:

3 left

Practice

(1/5)

1. What is the main purpose of human evaluation frameworks in AI?

easy

A. To have people judge AI outputs for quality

B. To replace all automatic scoring methods

C. To train AI models faster

D. To collect data without human input

Human evaluation frameworks in Prompt Engineering / GenAI - Interactive Code Practice

Start learning this pattern below

Practice

Solution

Step 1: Understand the role of human evaluation

Step 2: Compare with other options

Final Answer:

Quick Check:

Solution

Step 1: Identify common human evaluation methods

Step 2: Eliminate unrelated options

Final Answer:

Quick Check:

Solution

Step 1: Sum the scores given by raters

Step 2: Calculate the average score

Final Answer:

Quick Check:

Solution

Step 1: Trace the code execution for invalid input

Step 2: Identify the error

Final Answer:

Quick Check:

Solution

Step 1: Consider evaluation goals

Step 2: Evaluate options

Final Answer:

Quick Check: