Practice - 5 Tasks

Answer the questions below

1fill in blank

easy

Complete the code to create a Transformer encoder layer using PyTorch.

NLP

import torch.nn as nn
encoder_layer = nn.TransformerEncoderLayer(d_model=512, nhead=[1])

Drag options to blanks, or click blank then click option'

A16

D32

Attempts:

3 left

2fill in blank

medium

Complete the code to apply positional encoding to the input embeddings.

NLP

pos_encoded = embeddings + [1]

Drag options to blanks, or click blank then click option'

Ann.Embedding(num_positions, embedding_dim)

Bpositional_encoding

Cnn.Linear(embedding_dim, embedding_dim)

Dnn.LayerNorm(embedding_dim)

Attempts:

3 left

3fill in blank

hard

Fix the error in the multi-head attention call by filling the correct argument.

NLP

output, weights = multihead_attn(query, key, [1])

Drag options to blanks, or click blank then click option'

Avalue

Bmask

Ckey_padding_mask

Dattn_mask

Attempts:

3 left

4fill in blank

hard

Fill both blanks to complete the Transformer decoder layer initialization.

NLP

decoder_layer = nn.TransformerDecoderLayer(d_model=[1], nhead=[2])

Drag options to blanks, or click blank then click option'

A512

B256

Attempts:

3 left

5fill in blank

hard

Fill all three blanks to create a dictionary comprehension that maps each token to its embedding size if the size is greater than 300.

NLP

embedding_sizes = {token: [1] for token, size in token_sizes.items() if size [2] 300 and size == [3]

Drag options to blanks, or click blank then click option'

Asize

C512

Dtoken

Attempts:

3 left

Practice

(1/5)

1. What is the main purpose of the self-attention mechanism in a Transformer model?

easy

A. To increase the number of layers in the model

B. To reduce the size of the input data

C. To convert words into numbers

D. To let the model focus on different words in the sentence at the same time

Transformer architecture in NLP - Interactive Code Practice

Start learning this pattern below

Practice

Solution

Step 1: Understand self-attention role

Step 2: Match purpose with options

Final Answer:

Quick Check:

Solution

Step 1: Recall Transformer structure

Step 2: Compare options with structure

Final Answer:

Quick Check:

Solution

Step 1: Understand input shape and MultiheadAttention

Step 2: Output shape matches input shape

Final Answer:

Quick Check:

Solution

Step 1: Check shapes of tgt and memory

Step 2: Identify batch size mismatch

Step 3: Re-examine options carefully

Final Answer:

Quick Check:

Solution

Step 1: Understand summarization task

Step 2: Match task with Transformer parts

Final Answer:

Quick Check: