Practice - 5 Tasks

Answer the questions below

1. Fill in the blank

easy

Complete the code to create the input embedding layer for a Transformer model.

Prompt Engineering / GenAI

embedding_layer = nn.Embedding(num_tokens, [1])

A. embedding_dim

B. num_heads

C. num_layers

D. dropout_rate
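
For intuition, an embedding layer can be pictured as a lookup table with one row per token and one column per embedding dimension, which is why `nn.Embedding` takes the vocabulary size first and the vector size second. The sketch below uses plain Python lists as a stand-in for the PyTorch layer; `make_embedding` and `embed` are hypothetical helper names, not part of any library.

```python
import random

def make_embedding(num_tokens, embedding_dim, seed=0):
    """Build a num_tokens x embedding_dim table of random floats (hypothetical helper)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(embedding_dim)]
            for _ in range(num_tokens)]

def embed(table, token_ids):
    """Look up one row of the table per token ID."""
    return [table[t] for t in token_ids]

table = make_embedding(num_tokens=10, embedding_dim=4)
vectors = embed(table, [3, 7, 3])  # the same ID always maps to the same vector
```

Each token ID selects a row, so the output has one vector of length `embedding_dim` per input token.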

2. Fill in the blank

medium

Complete the code to apply multi-head attention in the Transformer encoder block.

attention_output, _ = multihead_attn(query, key, value, [1]=key_padding_mask)

A. bias

B. dropout

C. attn_mask

D. key_padding_mask
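
To see what a padding mask does to attention weights, here is a minimal plain-Python sketch of a masked softmax. It assumes the boolean convention in which `True` marks a padded key position to be ignored, and that at least one position is unmasked. `masked_softmax` is a hypothetical helper, not a PyTorch function.

```python
import math

def masked_softmax(scores, key_padding_mask):
    """Softmax over scores, giving zero weight where the mask is True.

    Assumes at least one position is not masked.
    """
    masked = [float("-inf") if pad else s
              for s, pad in zip(scores, key_padding_mask)]
    peak = max(masked)                      # subtract the max for stability
    exps = [math.exp(s - peak) for s in masked]
    total = sum(exps)
    return [e / total for e in exps]

scores = [2.0, 1.0, 0.5, 0.5]
mask = [False, False, True, True]           # last two keys are padding
weights = masked_softmax(scores, mask)
```

The padded positions receive exactly zero weight, and the remaining weights still sum to one.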

3. Fill in the blank

hard

Complete the Transformer feed-forward network layer by filling in the missing activation function.

ffn_output = linear2([1](linear1(x)))

A. sigmoid

B. relu

C. softmax

D. tanh
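
As a rough illustration of the linear, activation, linear pattern in a position-wise feed-forward block, the sketch below applies it to a single token vector in plain Python. It assumes ReLU as the activation, as in the original Transformer design; the weights are made-up values, and `linear`, `relu`, and `feed_forward` are hypothetical stand-ins for the PyTorch modules.

```python
def relu(v):
    """Element-wise max(0, x)."""
    return [max(0.0, x) for x in v]

def linear(v, weight, bias):
    """Affine map; weight holds one row per output feature."""
    return [sum(w * x for w, x in zip(row, v)) + b
            for row, b in zip(weight, bias)]

def feed_forward(x, w1, b1, w2, b2):
    """Expand, apply the non-linearity, then project back."""
    return linear(relu(linear(x, w1, b1)), w2, b2)

w1 = [[1.0, -1.0], [-1.0, 1.0], [0.5, 0.5]]   # 2 -> 3 features (made-up values)
b1 = [0.0, 0.0, 0.0]
w2 = [[1.0, 1.0, 1.0], [1.0, -1.0, 0.0]]      # 3 -> 2 features (made-up values)
b2 = [0.0, 0.0]
out = feed_forward([2.0, 1.0], w1, b1, w2, b2)
```

Without the non-linearity between them, the two linear maps would collapse into a single linear map.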

4. Fill in the blank

hard

Fill both blanks to build a sinusoidal positional encoding that adds position information to token embeddings. Blank [2] appears twice and takes the same value both times.

positional_encoding = torch.zeros(seq_len, [1])
for pos in range(seq_len):
    for i in range(0, [2], 2):
        positional_encoding[pos, i] = math.sin(pos / (10000 ** (i / [2])))

A. embedding_dim

B. seq_len

C. num_heads

D. batch_size
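
The loop in this task fills only the even columns with sines. A fuller sketch, assuming the standard sinusoidal scheme in which each odd column holds the cosine of the same angle, might look like the following. It uses plain Python lists in place of `torch.zeros`, and the cosine branch is an addition that is not part of the task snippet.

```python
import math

def positional_encoding(seq_len, embedding_dim):
    """Return a seq_len x embedding_dim table of sinusoidal position codes."""
    pe = [[0.0] * embedding_dim for _ in range(seq_len)]
    for pos in range(seq_len):
        for i in range(0, embedding_dim, 2):
            angle = pos / (10000 ** (i / embedding_dim))
            pe[pos][i] = math.sin(angle)          # even column: sine
            if i + 1 < embedding_dim:
                pe[pos][i + 1] = math.cos(angle)  # odd column: cosine (assumed)
    return pe

pe = positional_encoding(seq_len=4, embedding_dim=6)
```

Position 0 encodes as alternating 0 and 1, since sin(0) = 0 and cos(0) = 1. The table has the same shape as the token embeddings, so the two can be added element-wise.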

5. Fill in the blank

hard

Fill all three blanks to complete the Transformer encoder layer with normalization and residual connections.

x = x + [1](multihead_attn(x, x, x))
x = [2](x)
residual = x
x = x + [3](feed_forward(x))

A. layer_norm

B. dropout

D. relu
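
The residual-and-normalize pattern can be sketched on a single vector in plain Python. This is a simplified illustration under several assumptions: a post-norm ordering (normalize after each residual sum), a second normalization after the feed-forward step that the task snippet does not show, no learned scale or shift in `layer_norm`, and toy stand-ins for the attention and feed-forward sublayers. All function names here are hypothetical.

```python
import math
import random

def layer_norm(v, eps=1e-5):
    """Normalize a vector to zero mean and (nearly) unit variance."""
    mean = sum(v) / len(v)
    var = sum((x - mean) ** 2 for x in v) / len(v)
    return [(x - mean) / math.sqrt(var + eps) for x in v]

def dropout(v, p=0.1, training=False, rng=random):
    """Identity in eval mode; random zeroing with rescaling when training."""
    if not training:
        return list(v)
    return [0.0 if rng.random() < p else x / (1.0 - p) for x in v]

def encoder_sublayers(x, attention, feed_forward):
    """Residual + dropout around each sublayer, then layer norm (post-norm)."""
    x = [a + b for a, b in zip(x, dropout(attention(x)))]
    x = layer_norm(x)
    x = [a + b for a, b in zip(x, dropout(feed_forward(x)))]
    return layer_norm(x)  # second norm is an assumption, not in the snippet

# Toy stand-ins for the real sublayers (hypothetical)
toy_attention = lambda v: [0.5 * t for t in v]
toy_feed_forward = lambda v: [max(0.0, t) for t in v]

out = encoder_sublayers([1.0, 2.0, 3.0, 4.0], toy_attention, toy_feed_forward)
```

The residual sums keep the original signal available to later layers, while normalization keeps the activations on a stable scale.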

Practice

1. What is the main purpose of the attention mechanism in a Transformer model?

easy

A. To increase the size of the model

B. To focus on important parts of the input data

C. To reduce the number of layers

D. To store data permanently

Transformer architecture overview in Prompt Engineering / GenAI - Interactive Code Practice

Solution

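Step 1: Understand attention mechanism role

Attention lets the model weight each part of the input by its relevance when computing a token's representation.

Step 2: Compare options with attention purpose

Options A, C, and D describe model size, depth, and storage, none of which is what attention is for. Option B matches.

Final Answer:

B. To focus on important parts of the input data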

Quick Check:
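
To make the idea of "focusing" concrete, here is a minimal plain-Python sketch of scaled dot-product attention for a single query. It assumes the usual formulation, softmax of query-key dot products divided by the square root of the key dimension; the helper names and the example vectors are made up for illustration.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, keys, values):
    """Return (weights, output) for one query over a list of keys and values."""
    d_k = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d_k)
              for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    output = [sum(w * v[j] for w, v in zip(weights, values))
              for j in range(dim)]
    return weights, output

keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
values = [[10.0, 0.0], [0.0, 10.0], [5.0, 5.0]]
weights, output = attend([4.0, 0.0], keys, values)
```

Keys that align with the query get larger weights, so their values dominate the output; the second key is orthogonal to the query and contributes the least.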

Solution

Step 1: Recall Transformer encoder layer structure

Step 2: Match the correct sequence

Final Answer:

Quick Check:

Solution

Step 1: Understand masking in decoder attention

Step 2: Evaluate options against masking purpose

Final Answer:

Quick Check:

Solution

Step 1: Check expected input shape for nn.MultiheadAttention

Step 2: Verify input tensor shape

Final Answer:

Quick Check:

Solution

Step 1: Identify components needed for translation

Step 2: Match components to translation needs

Final Answer:

Quick Check: