Practice - 5 Tasks
Answer the questions below
1. Fill in the blank (easy)
Complete the code to create a TransformerDecoderLayer with 8 attention heads.
PyTorch
import torch.nn as nn

decoder_layer = nn.TransformerDecoderLayer(d_model=512, nhead=[1])
Common Mistakes
Using a number of heads that does not divide d_model evenly.
Confusing nhead with number of layers.
The TransformerDecoderLayer requires the number of attention heads as nhead. 8 is a common choice for d_model=512.
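A minimal runnable sketch of the completed line, with the blank filled as nhead=8 per the explanation above:

```python
import torch.nn as nn

# nhead=8 divides d_model=512 evenly (64 dimensions per head).
decoder_layer = nn.TransformerDecoderLayer(d_model=512, nhead=8)
```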
2. Fill in the blank (medium)
Complete the code to pass the memory tensor to the TransformerDecoder.
PyTorch
import torch
import torch.nn as nn

memory = torch.rand(10, 32, 512)  # (sequence_length, batch_size, d_model)
decoder_layer = nn.TransformerDecoderLayer(d_model=512, nhead=8)
decoder = nn.TransformerDecoder(decoder_layer, num_layers=6)
output = decoder(tgt=torch.rand(20, 32, 512), memory=[1])
Common Mistakes
Passing the target tensor instead of memory.
Passing an undefined variable.
The memory argument is the output from the encoder and must be passed to the decoder as the memory parameter.
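A runnable sketch of the completed code, with the blank filled by the encoder output `memory` as the explanation describes:

```python
import torch
import torch.nn as nn

memory = torch.rand(10, 32, 512)  # encoder output: (src_len, batch, d_model)
tgt = torch.rand(20, 32, 512)     # target sequence: (tgt_len, batch, d_model)

decoder_layer = nn.TransformerDecoderLayer(d_model=512, nhead=8)
decoder = nn.TransformerDecoder(decoder_layer, num_layers=6)

# The blank takes the encoder output, passed as the memory parameter.
output = decoder(tgt=tgt, memory=memory)  # output shape matches tgt
```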
3. Fill in the blank (hard)
Fix the error in the code by selecting the correct mask to prevent the decoder from attending to future tokens.
PyTorch
import torch
import torch.nn as nn

decoder_layer = nn.TransformerDecoderLayer(d_model=512, nhead=8)
decoder = nn.TransformerDecoder(decoder_layer, num_layers=6)
tgt = torch.rand(20, 32, 512)
memory = torch.rand(10, 32, 512)
size = tgt.size(0)
mask = torch.triu(torch.ones(size, size), diagonal=[1]).bool()
output = decoder(tgt, memory, tgt_mask=mask)
Common Mistakes
Using diagonal=0, which masks the current token as well.
Using a negative diagonal, which also masks the current token and earlier positions.
The mask should block attention to future tokens by masking the upper triangle above the main diagonal, so diagonal=1 is correct.
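A small sketch showing what the mask looks like with diagonal=1, the value given by the explanation above:

```python
import torch

# With diagonal=1 the main diagonal stays False (unmasked): each position
# can attend to itself and to earlier positions, while the True entries
# above the diagonal block attention to future tokens.
size = 4
mask = torch.triu(torch.ones(size, size), diagonal=1).bool()
# mask[0] is [False, True, True, True]: token 0 attends only to itself.
```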
4. Fill in the blank (hard)
Fill both blanks to create a TransformerDecoderLayer with dropout and ReLU activation.
PyTorch
import torch.nn as nn

decoder_layer = nn.TransformerDecoderLayer(d_model=512, nhead=8, dropout=[1], activation=[2])
Common Mistakes
Passing dropout as an integer instead of a float probability.
Passing the activation name without quotes instead of as a string.
A dropout of 0.1 is common, and ReLU activation is specified as the string "relu".
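A runnable sketch with both blanks filled as the explanation suggests (dropout=0.1, activation="relu"):

```python
import torch.nn as nn

# dropout takes a float probability; activation takes the string "relu".
decoder_layer = nn.TransformerDecoderLayer(
    d_model=512, nhead=8, dropout=0.1, activation="relu"
)
```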
5. Fill in the blank (hard)
Fill all three blanks to create a TransformerDecoder, pass the target and memory, and apply the correct mask.
PyTorch
import torch
import torch.nn as nn

memory = torch.rand(15, 64, 256)
tgt = torch.rand(30, 64, 256)
decoder_layer = nn.TransformerDecoderLayer(d_model=256, nhead=4)
decoder = nn.TransformerDecoder(decoder_layer, num_layers=[1])
size = tgt.size(0)
mask = torch.triu(torch.ones(size, size), diagonal=[2]).bool()
output = decoder(tgt=[3], memory=memory, tgt_mask=mask)
Common Mistakes
Using the wrong number of layers.
Setting mask diagonal to 0 or 2.
Passing memory as tgt.
The decoder has 5 layers, the mask diagonal is 1 to block future tokens, and tgt is passed as the target tensor.
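A runnable sketch of the fully completed pipeline, with all three blanks filled per the explanation above (num_layers=5, diagonal=1, tgt=tgt):

```python
import torch
import torch.nn as nn

memory = torch.rand(15, 64, 256)  # encoder output
tgt = torch.rand(30, 64, 256)     # target sequence

decoder_layer = nn.TransformerDecoderLayer(d_model=256, nhead=4)
decoder = nn.TransformerDecoder(decoder_layer, num_layers=5)  # 5 stacked layers

size = tgt.size(0)
# diagonal=1 blocks attention to future tokens while keeping the current one.
mask = torch.triu(torch.ones(size, size), diagonal=1).bool()

output = decoder(tgt=tgt, memory=memory, tgt_mask=mask)  # output shape matches tgt
```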