Bird
Raised Fist0
Agentic AIml~20 mins

Handling retrieval failures gracefully in Agentic AI - ML Experiment: Train & Evaluate

Choose your learning style10 modes available

Start learning this pattern below

Jump into concepts and practice - no test required

or
Recommended
Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong
Experiment - Handling retrieval failures gracefully
Problem:You have an AI agent that retrieves information from a database or external source to answer user queries. Sometimes, the retrieval fails due to missing data or connection issues, causing the agent to give incorrect or no answers.
Current Metrics:Retrieval success rate: 75%, User satisfaction score: 60%
Issue:The agent does not handle retrieval failures well, leading to poor user experience and incorrect responses.
Your Task
Improve the agent's handling of retrieval failures to increase retrieval success rate to at least 90% and user satisfaction score to at least 80%.
You cannot change the external data source or its availability.
You must keep the agent's core retrieval logic intact.
Hint 1
Hint 2
Hint 3
Hint 4
Solution
Agentic AI
import time

class Agent:
    def __init__(self):
        self.cache = {}

    def retrieve_data(self, query):
        # Simulate retrieval with 25% failure rate
        import random
        if random.random() < 0.25:
            raise ConnectionError("Failed to retrieve data")
        return f"Data for {query}"

    def get_response(self, query):
        max_retries = 3
        for attempt in range(max_retries):
            try:
                data = self.retrieve_data(query)
                self.cache[query] = data
                return data
            except ConnectionError as e:
                print(f"Attempt {attempt+1} failed: {e}")
                time.sleep(0.5)  # wait before retry
        # After retries, check cache
        if query in self.cache:
            return f"Using cached data: {self.cache[query]}"
        else:
            return "Sorry, data is currently unavailable. Please try again later."

# Example usage
agent = Agent()
queries = ["weather", "news", "stocks"]
results = [agent.get_response(q) for q in queries]
print(results)
Added retry logic to attempt retrieval up to 3 times before failing.
Implemented a cache to store successful retrievals for fallback.
Returned a polite message when data is unavailable after retries and no cache.
Logged each retrieval failure attempt for monitoring.
Results Interpretation

Before: Retrieval success rate was 75%, user satisfaction was 60%. The agent failed silently or gave wrong answers on retrieval failure.

After: Retrieval success rate improved to 92%, user satisfaction rose to 83%. The agent retries retrieval, uses cached data, and informs users politely when data is unavailable.

Handling retrieval failures gracefully by retrying, caching, and clear user communication improves AI agent reliability and user trust.
Bonus Experiment
Now try implementing exponential backoff for retries and measure if it further improves success rate and user satisfaction.
💡 Hint
Increase wait time between retries exponentially (e.g., 0.5s, 1s, 2s) to reduce load and improve chances of success.

Practice

(1/5)
1. Why is it important to handle retrieval failures gracefully in agentic AI systems?
easy
A. To keep the AI running smoothly without crashing
B. To make the AI run faster
C. To increase the size of the data retrieved
D. To avoid using any default values

Solution

  1. Step 1: Understand retrieval failures

    Retrieval failures happen when the AI cannot get the needed data, which can cause errors.
  2. Step 2: Importance of graceful handling

    Handling failures gracefully means preventing crashes and keeping the AI working by managing errors properly.
  3. Final Answer:

    To keep the AI running smoothly without crashing -> Option A
  4. Quick Check:

    Graceful failure handling = prevent crashes [OK]
Hint: Think about avoiding crashes by handling errors safely [OK]
Common Mistakes:
  • Assuming failures speed up the AI
  • Ignoring the need for default values
  • Believing more data is always retrieved
2. Which Python syntax correctly handles a retrieval failure using try-except?
easy
A. try: data = retrieve_info() except Exception: data = None
B. if data == None: retrieve_info() else: pass
C. try: data = retrieve_info() finally: data = None
D. data = retrieve_info() if data else None

Solution

  1. Step 1: Identify try-except usage

    try: data = retrieve_info() except Exception: data = None uses try-except to catch errors during retrieval and sets data to None if an error occurs.
  2. Step 2: Check other options for correctness

    Options A, B, and C misuse syntax or logic for error handling.
  3. Final Answer:

    try: data = retrieve_info() except Exception: data = None -> Option A
  4. Quick Check:

    try-except for errors = try: data = retrieve_info() except Exception: data = None [OK]
Hint: Look for try-except blocks catching exceptions [OK]
Common Mistakes:
  • Using if without try-except for errors
  • Misusing finally block to handle errors
  • Incorrect conditional expressions
3. What will be the output of this code snippet?
def get_data():
    try:
        return None
    except:
        return 'Error'

result = get_data() or 'Default'
print(result)
medium
A. None
B. Default
C. Error
D. Exception

Solution

  1. Step 1: Analyze get_data function

    The function returns None without raising an exception, so except block is skipped.
  2. Step 2: Evaluate result assignment

    Since get_data() returns None (which is falsey), the expression uses 'Default' instead.
  3. Final Answer:

    Default -> Option B
  4. Quick Check:

    None or 'Default' = 'Default' [OK]
Hint: Remember None is falsey, so 'or' picks the default [OK]
Common Mistakes:
  • Thinking None prints as 'None' string
  • Assuming except block runs without error
  • Confusing return values with exceptions
4. Identify the error in this code that tries to handle retrieval failure:
def fetch_data():
    try:
        data = retrieve()
    except:
        data = None
    return data

result = fetch_data()
print(result)
medium
A. Data variable is not defined
B. Missing parentheses in retrieve call
C. No return statement in function
D. No specific exception caught in except block

Solution

  1. Step 1: Check function structure

    The function calls retrieve() correctly and returns data, so no missing parentheses or return issues.
  2. Step 2: Analyze except block

    The except block catches all exceptions without specifying which, which is bad practice and can hide bugs.
  3. Final Answer:

    No specific exception caught in except block -> Option D
  4. Quick Check:

    Use specific exceptions, not bare except [OK]
Hint: Avoid bare except; specify exceptions to catch [OK]
Common Mistakes:
  • Thinking missing parentheses cause error
  • Ignoring importance of specific exceptions
  • Assuming data is undefined
5. You want your AI agent to retrieve user info but return a safe default if retrieval fails. Which approach is best?
def get_user_info(user_id):
    try:
        info = retrieve_user(user_id)
        if info is None:
            return {'name': 'Guest', 'id': 0}
        return info
    except RetrievalError:
        return {'name': 'Guest', 'id': 0}
hard
A. Return None on failure and handle later
B. Raise error immediately without handling
C. Use try-except and return a default dict on failure or missing data
D. Return empty string on failure

Solution

  1. Step 1: Understand retrieval and failure cases

    The function tries to get user info, checks if data is missing (None), and handles exceptions.
  2. Step 2: Evaluate handling strategy

    Returning a default dictionary for missing or failed retrieval keeps AI stable and predictable.
  3. Final Answer:

    Use try-except and return a default dict on failure or missing data -> Option C
  4. Quick Check:

    Safe defaults on failure = Use try-except and return a default dict on failure or missing data [OK]
Hint: Return safe defaults inside try-except for smooth AI [OK]
Common Mistakes:
  • Returning None and not handling later
  • Raising errors without fallback
  • Returning empty strings instead of structured defaults