from langchain.embeddings import HuggingFaceEmbeddings

embedding_model = HuggingFaceEmbeddings(model_name='sentence-transformers/all-MiniLM-L6-v2')

text = "Hello, world!"
vector = embedding_model.embed_query(text)
print(len(vector))
The 'sentence-transformers/all-MiniLM-L6-v2' model produces 384-dimensional embeddings, so embed_query returns a list of 384 floats and the snippet above prints 384.
The HuggingFaceEmbeddings class is configured through keyword arguments, and the model is selected with the model_name keyword. Because the class is a pydantic model, passing the model name as a positional argument raises a TypeError, and using an unrecognized keyword fails validation.
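A minimal sketch of the keyword-only behavior described above. It assumes langchain is installed and skips gracefully if it is not; the positional call fails before any model weights are downloaded.

```python
outcome = None
try:
    from langchain.embeddings import HuggingFaceEmbeddings
    try:
        # Positional argument: rejected, because HuggingFaceEmbeddings is a
        # pydantic model whose constructor accepts keyword arguments only.
        HuggingFaceEmbeddings('sentence-transformers/all-MiniLM-L6-v2')
        outcome = 'accepted'  # not expected to happen
    except TypeError:
        outcome = 'rejected'  # the keyword form model_name=... is required
except ImportError:
    outcome = 'langchain missing'

print(outcome)
```

The TypeError is raised by pydantic's constructor itself, so this check is cheap: no model is loaded and no network access is needed.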
from langchain.embeddings import HuggingFaceEmbeddings

embedding_model = HuggingFaceEmbeddings(model_name='sentence-transformers/all-MiniLM-L6-v2')
vector = embedding_model.embed_query('Test input')
HuggingFace embedding models rely on the 'sentence-transformers' package (which in turn depends on 'transformers' and 'torch'). If it is not installed, constructing or using the model raises an ImportError.
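One way to surface the dependency issue early is a guarded import check, sketched below; it only probes whether the sentence-transformers backend can be imported, without loading any model.

```python
# Check whether the sentence-transformers backend (required by
# HuggingFaceEmbeddings) is importable in this environment.
try:
    import sentence_transformers  # noqa: F401
    HAS_BACKEND = True
except ImportError:
    HAS_BACKEND = False

print("sentence-transformers available:", HAS_BACKEND)
```

If HAS_BACKEND is False, installing the package (for example with pip install sentence-transformers) resolves the ImportError.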
If you call embed_query twice with the same input text on the same HuggingFace embedding model instance, what will be the relationship between the two output vectors?

embedding_model = HuggingFaceEmbeddings(model_name='sentence-transformers/all-MiniLM-L6-v2')
vec1 = embedding_model.embed_query('Repeat this sentence')
vec2 = embedding_model.embed_query('Repeat this sentence')
print(vec1 == vec2)  # True: the two vectors are identical
Open-source embedding models like 'all-MiniLM-L6-v2' produce deterministic embeddings for the same input text when called multiple times.
The 'sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2' model is trained to handle multiple languages well, making it suitable for multilingual embedding tasks.
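A sketch of a cross-lingual check with the multilingual model: it embeds an English sentence and its German translation (example sentences chosen here for illustration) and computes their cosine similarity, which should be high for semantically equivalent text. The whole block is guarded, since it needs langchain, sentence-transformers, and network access to download the model.

```python
import math

cos = None
try:
    from langchain.embeddings import HuggingFaceEmbeddings

    model = HuggingFaceEmbeddings(
        model_name='sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2')

    # Same meaning, two languages.
    en = model.embed_query('I love cats')
    de = model.embed_query('Ich liebe Katzen')

    # Cosine similarity between the two embedding vectors.
    dot = sum(a * b for a, b in zip(en, de))
    norm_en = math.sqrt(sum(a * a for a in en))
    norm_de = math.sqrt(sum(b * b for b in de))
    cos = dot / (norm_en * norm_de)
    print(f'cross-lingual cosine similarity: {cos:.3f}')
except Exception as exc:  # missing dependency or no network for the download
    print('skipped:', exc)
```

For a multilingual model, translated pairs like this typically score much closer to 1.0 than unrelated sentences do, which is what makes the model suitable for cross-lingual retrieval.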