from transformers import RagTokenizer, RagRetriever, RagSequenceForGeneration tokenizer = RagTokenizer.from_pretrained('facebook/rag-token-nq') retriever = RagRetriever.from_pretrained('facebook/rag-token-nq') model = RagSequenceForGeneration.from_pretrained('facebook/rag-token-nq', retriever=[1])

Practice

(1/5)

1. What is the main purpose of the retriever component in a RAG architecture?

easy

A. To find relevant documents or information from a large dataset

B. To generate natural language answers from scratch

C. To train the model on labeled data

D. To evaluate the accuracy of the answers

Solution

Step 1: Understand the role of retriever in RAG
The retriever searches a large collection of documents to find relevant information related to the question.
Step 2: Differentiate retriever from generator
The generator uses the retrieved information to create a natural language answer, not to find documents.
Final Answer:
To find relevant documents or information from a large dataset -> Option A
Quick Check:
Retriever = Find info [OK]

Hint: Retriever searches data; generator writes answers [OK]

Common Mistakes:

Confusing retriever with generator
Thinking retriever generates answers
Assuming retriever evaluates answers

2. Which of the following correctly describes the sequence of operations in a RAG model?

easy

A. Generate answer first, then retrieve documents

B. Retrieve documents first, then generate answer

C. Train model, then retrieve documents

D. Evaluate answer, then generate documents

Solution

Step 1: Recall RAG workflow
RAG first retrieves relevant documents to provide context for the answer.
Step 2: Understand generation step
After retrieval, the generator uses the documents to produce a final answer.
Final Answer:
Retrieve documents first, then generate answer -> Option B
Quick Check:
Retrieve before generate [OK]

Hint: Retrieve info before writing answer [OK]

Common Mistakes:

Thinking generation happens before retrieval
Mixing training with retrieval steps
Confusing evaluation with generation

3. Consider this simplified Python pseudocode for a RAG-like process:

retrieved_docs = retriever.search(query)
answer = generator.generate(retrieved_docs, query)
print(answer)

What will be printed if the retriever returns an empty list?

medium

A. An answer generated without context, possibly generic or incorrect

B. A runtime error because generator cannot handle empty input

C. The original query string printed

D. An empty string printed

Solution

Step 1: Analyze retriever output
The retriever returns an empty list, meaning no documents found.
Step 2: Understand generator behavior
The generator tries to create an answer without context, so it may produce a generic or less accurate answer, but no error occurs.
Final Answer:
An answer generated without context, possibly generic or incorrect -> Option A
Quick Check:
Empty retrieval leads to generic answer [OK]

Hint: Empty retrieval means generic answer, not error [OK]

Common Mistakes:

Assuming empty retrieval causes error
Thinking query is printed directly
Expecting empty string output

4. You have a RAG model that always returns irrelevant answers. Which of these is the most likely cause?

medium

A. The model is overfitting on training data

B. Generator is not trained on any data

C. Retriever is returning unrelated documents

D. The evaluation metric is incorrect

Solution

Step 1: Identify cause of irrelevant answers
If answers are irrelevant, the source documents are likely unrelated to the question.
Step 2: Check retriever role
The retriever finds documents; if it returns unrelated ones, the generator has poor context to answer.
Final Answer:
Retriever is returning unrelated documents -> Option C
Quick Check:
Bad retrieval causes irrelevant answers [OK]

Hint: Check retriever output first for relevance [OK]

Common Mistakes:

Blaming generator without checking retrieval
Confusing overfitting with retrieval errors
Ignoring data quality issues

5. In a RAG system designed for a constantly updated news database, which advantage does RAG provide compared to a standard language model?

hard

A. It generates answers faster by skipping retrieval

B. It always produces shorter answers

C. It requires no training data at all

D. It can access fresh news by retrieving documents without retraining

Solution

Step 1: Understand RAG with dynamic data
RAG retrieves documents from an external source, so it can use new data without retraining the generator.
Step 2: Compare with standard language models
Standard models need retraining to learn new info, but RAG updates answers by searching fresh documents.
Final Answer:
It can access fresh news by retrieving documents without retraining -> Option D
Quick Check:
RAG updates answers via retrieval [OK]

Hint: RAG uses retrieval to handle new data easily [OK]

Common Mistakes:

Thinking RAG skips retrieval
Assuming no training data needed
Believing RAG limits answer length

RAG architecture overview in Prompt Engineering / GenAI - Interactive Code Practice

Start learning this pattern below

Practice

Solution

Step 1: Understand the role of retriever in RAG

Step 2: Differentiate retriever from generator

Final Answer:

Quick Check:

Solution

Step 1: Recall RAG workflow

Step 2: Understand generation step

Final Answer:

Quick Check:

Solution

Step 1: Analyze retriever output

Step 2: Understand generator behavior

Final Answer:

Quick Check:

Solution

Step 1: Identify cause of irrelevant answers

Step 2: Check retriever role

Final Answer:

Quick Check:

Solution

Step 1: Understand RAG with dynamic data

Step 2: Compare with standard language models

Final Answer:

Quick Check: