Challenge - 5 Problems
LangSmith Evaluator Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
❓ Component Behavior
intermediate · 2:00 remaining
What is the output of this LangSmith evaluator code?
Consider this LangSmith evaluator snippet that scores a model's response based on keyword presence. What score does it produce?
```python
from langsmith.evaluation import Evaluator

class KeywordEvaluator(Evaluator):
    def evaluate(self, prediction: str, reference: str) -> float:
        keywords = reference.split()
        score = sum(1 for kw in keywords if kw in prediction) / len(keywords)
        return score

# Usage
evaluator = KeywordEvaluator()
prediction = "The quick brown fox jumps"
reference = "quick fox jumps high"
result = evaluator.evaluate(prediction, reference)
print(result)
```
Attempts: 2 left
💡 Hint
Count how many keywords from the reference appear in the prediction, then divide by total keywords.
✗ Incorrect
The reference has 4 keywords: 'quick', 'fox', 'jumps', 'high'. The prediction contains 'quick', 'fox', and 'jumps' but not 'high'. So 3 out of 4 keywords match, resulting in 0.75.
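The arithmetic can be checked with a standalone sketch of the same keyword-matching logic (plain Python, no langsmith import needed):

```python
def keyword_score(prediction: str, reference: str) -> float:
    # Fraction of reference keywords found in the prediction
    # (substring match, as in the snippet above).
    keywords = reference.split()
    return sum(1 for kw in keywords if kw in prediction) / len(keywords)

print(keyword_score("The quick brown fox jumps", "quick fox jumps high"))  # 3 of 4 keywords -> 0.75
```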
📝 Syntax
intermediate · 2:00 remaining
Which option causes a syntax error in defining a LangSmith evaluator?
Identify the code snippet that will raise a syntax error when defining a custom LangSmith evaluator class.
Attempts: 2 left
💡 Hint
Check the function parameter type annotations carefully.
✗ Incorrect
Option A omits the colon between 'prediction' and 'str' in the parameter list, so Python raises a SyntaxError.
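For reference, a correctly annotated signature puts a colon between each parameter name and its type (a minimal sketch, independent of the LangSmith API):

```python
# Correct: each parameter is annotated as `name: type`.
def evaluate(prediction: str, reference: str) -> float:
    return float(prediction == reference)

# Incorrect (SyntaxError): def evaluate(prediction str, reference: str) -> float:
```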
❓ State Output
advanced · 2:00 remaining
What is the value of 'score' after running this LangSmith evaluator code?
Given this evaluator code that uses a weighted scoring system, what is the final score returned?
```python
from langsmith.evaluation import Evaluator

class WeightedEvaluator(Evaluator):
    def evaluate(self, prediction: str, reference: str) -> float:
        weights = {'good': 2, 'bad': -1}
        score = 0
        for word in prediction.split():
            score += weights.get(word, 0)
        return score

# Usage
result = WeightedEvaluator().evaluate('good good bad unknown', 'reference')
```
Attempts: 2 left
💡 Hint
Sum the weights of each word in the prediction: 'good' adds 2, 'bad' adds -1, and unknown words add 0.
✗ Incorrect
The prediction has two 'good' words (2*2=4), one 'bad' (-1), and one 'unknown' (0). Total score = 4 - 1 + 0 = 3.
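The same weighted sum can be reproduced with a plain-Python sketch of the logic, independent of the Evaluator class:

```python
def weighted_score(prediction: str, weights: dict[str, int]) -> int:
    # Unknown words fall back to a weight of 0 via dict.get.
    return sum(weights.get(word, 0) for word in prediction.split())

print(weighted_score('good good bad unknown', {'good': 2, 'bad': -1}))  # 2 + 2 - 1 + 0 = 3
```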
🔧 Debug
advanced · 2:00 remaining
Which option causes a runtime error when using LangSmith evaluator?
Identify the code snippet that will raise a runtime error during evaluation.
Attempts: 2 left
💡 Hint
Check for division by zero errors.
✗ Incorrect
Option C divides by len(reference); when the reference string is empty, this is division by zero and raises a ZeroDivisionError at runtime.
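One way to defend against this failure mode is to guard the division before scoring — a minimal sketch, assuming the same keyword-matching logic as the earlier snippet:

```python
def safe_keyword_score(prediction: str, reference: str) -> float:
    keywords = reference.split()
    if not keywords:  # avoid ZeroDivisionError on an empty reference
        return 0.0
    return sum(1 for kw in keywords if kw in prediction) / len(keywords)

print(safe_keyword_score("quick fox", ""))       # 0.0 instead of ZeroDivisionError
print(safe_keyword_score("quick fox", "quick"))  # 1.0
```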
🧠 Conceptual
expert · 2:00 remaining
Which option best describes the role of LangSmith evaluators in model development?
Select the statement that correctly explains what LangSmith evaluators do.
Attempts: 2 left
💡 Hint
Think about evaluation and scoring roles in model workflows.
✗ Incorrect
LangSmith evaluators are designed to score model outputs by comparing them to references, helping improve model quality.
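As an illustration of that role, the simplest possible reference-comparison scorer — a plain-Python sketch, not the LangSmith API itself — looks like:

```python
def exact_match(prediction: str, reference: str) -> float:
    # Score 1.0 when the model output matches the reference exactly, else 0.0.
    return float(prediction.strip() == reference.strip())

print(exact_match("Paris", "Paris "))  # 1.0
print(exact_match("Paris", "London"))  # 0.0
```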