Q3 of 15 · Syntax · Easy
NLP - Sequence Models for NLP
Which Python code snippet correctly computes attention weights using the dot product of query Q and keys K before applying softmax?
A. scores = np.dot(K, Q); weights = np.exp(scores) * np.sum(np.exp(scores))
B. scores = np.dot(Q.T, K); weights = np.sum(np.exp(scores))
C. scores = np.dot(Q, K.T); weights = np.exp(scores) / np.sum(np.exp(scores))
D. scores = np.dot(Q, K); weights = np.exp(scores) / np.exp(np.sum(scores))
Step-by-Step Solution
Solution:
  1. Step 1: Compute dot product

    Attention scores are computed as the dot product of Q with K transposed: scores = np.dot(Q, K.T).
  2. Step 2: Apply softmax

    Softmax exponentiates each score and divides by the sum of all exponentiated scores, so the weights are positive and sum to 1.
  3. Final Answer:

    scores = np.dot(Q, K.T)
    weights = np.exp(scores) / np.sum(np.exp(scores))
    -> Option C
  4. Quick Check:

    Dot Q with K.T, then apply softmax [OK]
Quick Trick: Dot Q with K.T, then softmax-normalize [OK]
Common Mistakes:
  • Not transposing K before dot product
  • Incorrect normalization in softmax
  • Using multiplication instead of division for softmax
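The two steps above can be sketched as a runnable function. This is a minimal illustration of Option C, not code from the quiz itself; the max-subtraction line is an extra numerical-stability step beyond what the option shows (it does not change the result, since softmax is invariant to a constant shift).

```python
import numpy as np

def attention_weights(Q, K):
    """Dot Q with K.T to get scores, then softmax-normalize each row."""
    scores = np.dot(Q, K.T)                       # shape: (n_queries, n_keys)
    scores = scores - scores.max(axis=-1, keepdims=True)  # stability shift
    exp_scores = np.exp(scores)
    return exp_scores / exp_scores.sum(axis=-1, keepdims=True)

# One query attending over three keys (toy 2-dimensional vectors).
Q = np.array([[1.0, 0.0]])
K = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [0.5, 0.5]])
weights = attention_weights(Q, K)
print(weights)  # one weight per key; each row sums to 1
```

Note the shapes: Q is (queries, d) and K is (keys, d), so K must be transposed for the dot product to line up, which is exactly the first common mistake listed above.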
