Hard · Conceptual · Q10 of 15
NLP - Text Generation
Why might an RNN-based text generation model struggle with very long sequences, and what is a common solution?
A. RNNs cannot process sequences longer than 10 tokens; use CNNs instead
B. RNNs overfit on long sequences; reduce training data size
C. RNNs forget long-term dependencies; use LSTM or GRU cells instead
D. RNNs require one-hot encoding for long sequences; switch to embeddings
Step-by-Step Solution
Solution:
  1. Step 1: Identify RNN limitation

    Standard RNNs struggle to retain information over long sequences because gradients shrink exponentially as they are propagated back through many time steps (the vanishing-gradient problem).
  2. Step 2: Recognize common solution

    LSTM and GRU cells use gating mechanisms that preserve long-term dependencies, mitigating the vanishing-gradient problem.
  3. Final Answer:

    RNNs forget long-term dependencies; use LSTM or GRU cells instead -> Option C
  4. Quick Check:

    Use LSTM/GRU to handle long-term dependencies [OK]
Quick Trick: Use LSTM/GRU to remember long sequences [OK]
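The two steps above can be illustrated with a minimal pure-Python sketch. The decay factor and gate value below are illustrative assumptions, not numbers from a trained model:

```python
# Hedged sketch: why gradients vanish in a plain RNN, and how a gated
# cell avoids it. The per-step factor and gate value are illustrative.

def rnn_gradient_magnitude(factor, steps):
    """Backpropagation through time multiplies the gradient by a
    per-step factor (recurrent weight times activation derivative).
    With a factor below 1, the signal from early tokens decays
    exponentially with sequence length."""
    grad = 1.0
    for _ in range(steps):
        grad *= factor
    return grad

def gated_state_update(z, h_prev, h_candidate):
    """GRU-style interpolation: the update gate z blends the old state
    with the new candidate. With z near 0, the old state passes through
    almost unchanged, giving gradients a near-identity path."""
    return (1 - z) * h_prev + z * h_candidate

print(rnn_gradient_magnitude(0.9, 10))     # ~0.35: still usable
print(rnn_gradient_magnitude(0.9, 100))    # ~2.7e-5: effectively vanished
print(gated_state_update(0.05, 1.0, 0.0))  # 0.95: old state largely preserved
```

This is why option C is correct: the fix is an architectural change (gated cells), not less data, CNNs, or a different input encoding.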
Common Mistakes:
  • Thinking RNNs have a hard token limit (they can process long sequences, just poorly)
  • Confusing overfitting with forgetting
  • Believing one-hot encoding solves long-term memory
