Practice

(1/5)

1. What is the main advantage of using an LSTM model for text data?

easy

A. It converts text directly into images.

B. It removes all punctuation from the text.

C. It remembers the order of words in a sentence.

D. It translates text into multiple languages.

Solution

Step 1: Understand LSTM's role in text
LSTM models are designed to remember sequences, which means they keep track of word order in sentences.
Step 2: Compare options with LSTM function
Only It remembers the order of words in a sentence. correctly describes LSTM's ability to remember word order. Other options describe unrelated tasks.
Final Answer:
It remembers the order of words in a sentence. -> Option C
Quick Check:
LSTM remembers word order = B [OK]

Hint: LSTM = memory for word order in text [OK]

Common Mistakes:

Thinking LSTM translates languages
Confusing LSTM with image processing
Assuming LSTM removes punctuation

2. Which of the following is the correct way to add an LSTM layer in Keras for text input?

easy

A. model.add(LSTM(128, input_shape=(timesteps, features)))

B. model.add(Dense(128, input_shape=(timesteps, features)))

C. model.add(Conv2D(128, kernel_size=3))

D. model.add(Embedding(128, input_shape=(timesteps, features)))

Solution

Step 1: Identify LSTM layer syntax in Keras
The LSTM layer is added with LSTM(units, input_shape=(timesteps, features)). model.add(LSTM(128, input_shape=(timesteps, features))) matches this syntax.
Step 2: Check other options for correctness
model.add(Dense(128, input_shape=(timesteps, features))) is a Dense layer, not LSTM. model.add(Conv2D(128, kernel_size=3)) is a Conv2D layer for images. model.add(Embedding(128, input_shape=(timesteps, features))) is an Embedding layer, not LSTM.
Final Answer:
model.add(LSTM(128, input_shape=(timesteps, features))) -> Option A
Quick Check:
LSTM layer syntax = D [OK]

Hint: LSTM layer uses LSTM(), not Dense or Conv2D [OK]

Common Mistakes:

Using Dense instead of LSTM for sequence data
Confusing Embedding with LSTM layer
Applying Conv2D for text input

3. Given this code snippet, what will be the shape of the output from the LSTM layer?

model = Sequential()
model.add(Embedding(input_dim=1000, output_dim=64, input_length=10))
model.add(LSTM(32))
output = model.output_shape

medium

A. (None, 10, 32)

B. (None, 32)

C. (None, 64)

D. (10, 32)

Solution

Step 1: Understand Embedding and LSTM output shapes
The Embedding layer outputs (batch_size, 10, 64). The LSTM with 32 units returns (batch_size, 32) by default (last output only).
Step 2: Match output shape with options
(None, 32) matches (None, 32) where None is batch size. Other options are incorrect shapes.
Final Answer:
(None, 32) -> Option B
Quick Check:
LSTM output shape = (None, 32) [OK]

Hint: LSTM returns (batch, units) by default, not sequence [OK]

Common Mistakes:

Assuming LSTM outputs full sequence by default
Confusing embedding output with LSTM output
Ignoring batch size dimension

4. Identify the error in this LSTM model code for text classification:

model = Sequential()
model.add(LSTM(64, input_shape=(100,)))
model.add(Dense(1, activation='sigmoid'))
model.compile(optimizer='adam', loss='binary_crossentropy')

medium

A. Optimizer 'adam' is not suitable for LSTM models

B. Dense layer activation should be 'relu' for binary classification

C. Loss function should be 'categorical_crossentropy' for binary output

D. Input shape should be 2D, e.g., (timesteps, features), not (100,)

Solution

Step 1: Check input shape for LSTM layer
LSTM expects input shape as (timesteps, features). Here, (100,) is 1D, missing feature dimension.
Step 2: Validate other components
Binary classification uses sigmoid activation and binary_crossentropy loss correctly. Adam optimizer is suitable.
Final Answer:
Input shape should be 2D, e.g., (timesteps, features), not (100,) -> Option D
Quick Check:
LSTM input shape must be 2D = A [OK]

Hint: LSTM input shape needs (timesteps, features) [OK]

Common Mistakes:

Using 1D input shape for LSTM
Changing activation incorrectly for binary tasks
Mixing loss functions for binary classification

5. You want to build an LSTM model to classify movie reviews as positive or negative. Which approach best improves model understanding of word meaning before LSTM processing?

hard

A. Add an Embedding layer to convert words into dense vectors before the LSTM.

B. Use a Dense layer directly on raw text input before LSTM.

C. Apply a Conv2D layer to the text input before LSTM.

D. Skip preprocessing and feed raw text strings directly to LSTM.

Solution

Step 1: Understand preprocessing for text in LSTM models
Embedding layers convert words into meaningful numeric vectors, helping LSTM understand word relationships.
Step 2: Evaluate other options
Dense layers expect numeric input, not raw text. Conv2D is for images. Feeding raw strings to LSTM causes errors.
Final Answer:
Add an Embedding layer to convert words into dense vectors before the LSTM. -> Option A
Quick Check:
Embedding before LSTM = C [OK]

Hint: Use Embedding layer to convert words before LSTM [OK]

Common Mistakes:

Feeding raw text directly to LSTM
Using Dense or Conv2D layers on raw text
Skipping word vector conversion

Epoch	Loss ↓	Accuracy ↑	Observation
1	1.2	0.45	Model starts learning basic word patterns
2	0.9	0.60	Loss decreases, accuracy improves as model learns context
3	0.7	0.72	Model captures longer dependencies in text
4	0.55	0.80	Good convergence, model predicts text better
5	0.45	0.85	Training stabilizes with high accuracy

LSTM for text in NLP - Model Pipeline Trace

Start learning this pattern below

Practice

Solution

Step 1: Understand LSTM's role in text

Step 2: Compare options with LSTM function

Final Answer:

Quick Check:

Solution

Step 1: Identify LSTM layer syntax in Keras

Step 2: Check other options for correctness

Final Answer:

Quick Check:

Solution

Step 1: Understand Embedding and LSTM output shapes

Step 2: Match output shape with options

Final Answer:

Quick Check:

Solution

Step 1: Check input shape for LSTM layer

Step 2: Validate other components

Final Answer:

Quick Check:

Solution

Step 1: Understand preprocessing for text in LSTM models

Step 2: Evaluate other options

Final Answer:

Quick Check: