Challenge - 5 Problems

🎖️

RAG Source Citation Master

Get all challenges correct to earn this badge!

Test your skills under time pressure!

🧠 Conceptual

intermediate

1:30remaining

Why is source citation important in Retrieval-Augmented Generation (RAG)?

In RAG systems, the model generates answers based on retrieved documents. Why is it important to include source citations in the generated responses?

ATo improve the model's training speed by referencing documents

BTo increase the randomness of the generated text

CTo reduce the size of the retrieval database

DTo provide users with evidence and allow verification of the generated information

Attempts:

2 left

❓ Predict Output

intermediate

2:00remaining

What is the output of this LangChain snippet adding source citations?

Consider this Python code using LangChain to generate a RAG response with source citations:

from langchain.chains import RetrievalQA
from langchain.llms import OpenAI

retriever = ...  # pre-configured retriever
llm = OpenAI(temperature=0)

qa = RetrievalQA.from_chain_type(llm=llm, retriever=retriever, return_source_documents=True)

query = "What is the capital of France?"
result = qa.run(query)
print(result)

What will print(result) output?

AA list of retrieved documents without any answer

BA dictionary with keys 'result' (answer string) and 'source_documents' (list of documents)

CA string answer only, e.g., 'The capital of France is Paris.'

DAn error because 'return_source_documents' is not a valid argument

Attempts:

2 left

❓ Model Choice

advanced

1:30remaining

Which LangChain chain type best supports source citation in RAG?

You want to build a RAG system that returns both the answer and the source documents for citation. Which LangChain chain type should you use?

ARetrievalQA with <code>return_source_documents=True</code>

BLLMChain with a prompt template only

CConversationChain without retrieval

DSimple SequentialChain without retriever

Attempts:

2 left

❓ Metrics

advanced

1:30remaining

Which metric best evaluates the quality of source citations in RAG responses?

When assessing how well a RAG system cites sources, which metric is most appropriate?

ARecall of retrieved documents that contain correct information

BBLEU score comparing generated text to reference answers

CPerplexity of the language model on generated text

DTraining loss of the underlying LLM

Attempts:

2 left

🔧 Debug

expert

2:00remaining

Why does this LangChain RAG code fail to show source citations?

Given this code snippet:

from langchain.chains import RetrievalQA
from langchain.llms import OpenAI

retriever = ...
llm = OpenAI(temperature=0)
qa = RetrievalQA.from_chain_type(llm=llm, retriever=retriever)

query = "Explain photosynthesis"
result = qa(query)
print(result)

The output shows only the answer text without any source citations. What is the most likely reason?

AThe retriever is not connected to any documents

BThe <code>return_source_documents</code> parameter was not set to True when creating the RetrievalQA chain

CThe OpenAI LLM does not support source citation

DThe query string is too short to retrieve documents

Attempts:

2 left