Complete the code to define the encoder embedding layer.
self.embedding = nn.[1](input_dim, embed_dim)
The encoder uses an embedding layer to convert input tokens into dense vectors.
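A minimal sketch of the completed line: the blank resolves to nn.Embedding, which maps integer token ids to learned dense vectors. The sizes below are illustrative; in the exercise, input_dim is the vocabulary size and embed_dim the embedding width.

```python
import torch
import torch.nn as nn

input_dim, embed_dim = 1000, 64            # illustrative vocabulary and embedding sizes
embedding = nn.Embedding(input_dim, embed_dim)

tokens = torch.tensor([[5, 42, 7]])        # (batch=1, seq_len=3) of token ids
vectors = embedding(tokens)                # (1, 3, 64): one vector per token
print(vectors.shape)                       # torch.Size([1, 3, 64])
```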
Complete the code to compute attention weights using softmax.
attn_weights = F.[1](scores, dim=1)
Softmax converts the raw scores into probabilities that sum to 1; these probabilities are the attention weights.
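A short sketch, assuming scores has shape (batch, seq_len) with one raw score per encoder position; softmax along dim=1 normalizes across those positions.

```python
import torch
import torch.nn.functional as F

scores = torch.tensor([[2.0, 1.0, 0.1]])   # (batch=1, seq_len=3) raw attention scores
attn_weights = F.softmax(scores, dim=1)    # probabilities over encoder positions
print(attn_weights.sum(dim=1))             # each row sums to 1
```

Note that dim=1 is the sequence dimension here; softmax over the wrong dimension would normalize across the batch instead.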
Fix the error in the decoder's context vector calculation.
context = torch.bmm(attn_weights.unsqueeze(1), encoder_outputs).[1](1)
After the batch matrix multiplication, squeeze(1) removes the length-1 dimension, leaving a (batch, hidden) context vector.
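A sketch of the corrected line, assuming attn_weights is (batch, seq_len) and encoder_outputs is (batch, seq_len, hidden); the sizes are illustrative.

```python
import torch

batch, seq_len, hidden = 2, 5, 8
attn_weights = torch.softmax(torch.randn(batch, seq_len), dim=1)
encoder_outputs = torch.randn(batch, seq_len, hidden)

# unsqueeze(1): (batch, 1, seq_len); bmm -> (batch, 1, hidden); squeeze(1) -> (batch, hidden)
context = torch.bmm(attn_weights.unsqueeze(1), encoder_outputs).squeeze(1)
print(context.shape)                       # torch.Size([2, 8])
```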
Fill both blanks to complete the attention score calculation using dot product.
scores = torch.bmm(encoder_outputs, decoder_hidden.[1](1, 2, 0)).[2](2)
Permute rearranges the decoder hidden state's dimensions to align it for batch matrix multiplication, and squeeze drops the trailing length-1 dimension so the scores have shape (batch, seq_len).
Fill all three blanks to complete the decoder forward pass with attention.
embedded = self.embedding(input_step)
attn_weights = F.softmax(torch.bmm(encoder_outputs, decoder_hidden.[1](1, 2, 0)).[2](2), dim=1)
context = torch.bmm(attn_weights.unsqueeze(1), encoder_outputs)
rnn_input = torch.cat((embedded, context), dim=[3])
Permute rearranges the decoder hidden state for the batch matrix multiplication, squeeze removes the trailing length-1 dimension from the raw scores, and concatenation happens on dimension 2 (the feature dimension).
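The full step can be sketched as a small module; this is one plausible completion (permute, squeeze, and dim=2), assuming batch-first tensors, a single-layer GRU, and illustrative sizes. The class name and dimensions are hypothetical, not from the exercise.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttnDecoderStep(nn.Module):
    """One decoder step with dot-product attention (illustrative sketch)."""
    def __init__(self, vocab_size=100, embed_dim=16, hidden_dim=16):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        # RNN input is the embedded token concatenated with the context vector
        self.gru = nn.GRU(embed_dim + hidden_dim, hidden_dim, batch_first=True)

    def forward(self, input_step, decoder_hidden, encoder_outputs):
        embedded = self.embedding(input_step)                        # (B, 1, E)
        scores = torch.bmm(encoder_outputs,
                           decoder_hidden.permute(1, 2, 0))          # (B, T, 1)
        attn_weights = F.softmax(scores.squeeze(2), dim=1)           # (B, T)
        context = torch.bmm(attn_weights.unsqueeze(1),
                            encoder_outputs)                         # (B, 1, H)
        rnn_input = torch.cat((embedded, context), dim=2)            # (B, 1, E+H)
        return self.gru(rnn_input, decoder_hidden)

B, T, H = 2, 5, 16
dec = AttnDecoderStep()
out, hid = dec(torch.tensor([[3], [7]]),      # one input token per batch element
               torch.zeros(1, B, H),          # initial decoder hidden state
               torch.randn(B, T, H))          # encoder outputs
print(out.shape, hid.shape)                   # torch.Size([2, 1, 16]) torch.Size([1, 2, 16])
```

Dot-product attention requires the encoder output size and decoder hidden size to match (both H here); otherwise a learned projection would be needed.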