NLP · ~20 mins

How Text Generation Creates Content in NLP - An Experiment to Prove It

Experiment - How text generation creates content
Problem: You want to understand how a text generation model creates new content from input prompts.
Current Metrics: The model generates text that is often repetitive and sometimes irrelevant to the prompt.
Issue: The model repeats the same phrases and lacks diversity in its output, showing limited creativity.
Your Task
Improve the text generation model so it produces more diverse and relevant content without losing coherence.
Keep the model architecture the same (a simple LSTM-based text generator).
Only adjust training parameters and sampling methods during generation.
Solution
import numpy as np
import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dense, Embedding
from tensorflow.keras.utils import to_categorical

# Sample text data
text = "hello world hello machine learning world hello ai world"

# Create character mapping
chars = sorted(list(set(text)))
char_to_idx = {c:i for i,c in enumerate(chars)}
idx_to_char = {i:c for i,c in enumerate(chars)}

# Prepare sequences
seq_length = 5
sequences = []
next_chars = []
for i in range(len(text) - seq_length):
    sequences.append(text[i:i+seq_length])
    next_chars.append(text[i+seq_length])

X = np.zeros((len(sequences), seq_length), dtype=int)
y = np.zeros((len(sequences), len(chars)), dtype=int)
for i, seq in enumerate(sequences):
    for t, char in enumerate(seq):
        X[i, t] = char_to_idx[char]
    y[i, char_to_idx[next_chars[i]]] = 1

# Build model: a simple LSTM-based character generator
# (input_length is deprecated in recent Keras versions; the input
# shape is inferred from the data on the first call to fit)
model = Sequential([
    Embedding(len(chars), 10),
    LSTM(50),
    Dense(len(chars), activation='softmax')
])
model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=['accuracy'])

# Train model
model.fit(X, y, epochs=50, batch_size=8, verbose=0)

# Sampling function with temperature scaling and top-k filtering

def sample(preds, temperature=1.0, top_k=None):
    """Sample a character index from the model's output distribution."""
    preds = np.asarray(preds).astype('float64')
    # Temperature scaling: values < 1 sharpen the distribution
    # (more predictable text), values > 1 flatten it (more random text)
    preds = np.log(preds + 1e-8) / temperature
    exp_preds = np.exp(preds)
    preds = exp_preds / np.sum(exp_preds)
    if top_k is not None:
        # Keep only the top_k most probable characters, then renormalize
        indices_to_remove = preds < np.sort(preds)[-top_k]
        preds[indices_to_remove] = 0
        preds = preds / np.sum(preds)
    probas = np.random.multinomial(1, preds, 1)
    return np.argmax(probas)

# Generate text
seed_text = "hello"
generated = seed_text
for _ in range(50):
    input_seq = [char_to_idx[c] for c in generated[-seq_length:]]
    input_seq = np.array(input_seq).reshape(1, seq_length)
    preds = model.predict(input_seq, verbose=0)[0]
    next_index = sample(preds, temperature=0.8, top_k=3)
    next_char = idx_to_char[next_index]
    generated += next_char

print("Generated text:", generated)
Added temperature parameter to control randomness in predictions.
Implemented top-k sampling to limit choices to the most probable characters.
Trained the model for 50 epochs with a small batch size to balance learning.
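To see the top-k step in isolation, here is a standalone sketch of the filtering logic used inside sample() above (the helper name top_k_filter is illustrative, not part of the solution code):

```python
import numpy as np

def top_k_filter(preds, k):
    """Zero out all but the k most probable entries, then renormalize.

    This mirrors the top-k filtering step inside sample(): any
    probability below the k-th largest value is removed, and the
    surviving mass is rescaled to sum to 1.
    """
    preds = np.asarray(preds, dtype="float64")
    kth = np.sort(preds)[-k]            # k-th largest probability
    filtered = np.where(preds >= kth, preds, 0.0)
    return filtered / filtered.sum()

# Example: with k=2, only the two most probable entries survive
result = top_k_filter([0.5, 0.25, 0.15, 0.1], k=2)
print(result)  # the kept entries are renormalized: [2/3, 1/3, 0, 0]
```

Note that sampling then happens only among the surviving entries, which is why repetitive low-probability loops become less likely.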
Results Interpretation

Before: The model generated repetitive and less relevant text.

After: With temperature and top-k sampling, the text is more varied and coherent.

Adjusting sampling methods like temperature and top-k helps text generation models create more diverse and meaningful content without changing the model itself.
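The effect of temperature can be checked on a toy distribution, independent of the trained model (a minimal sketch; the helper name apply_temperature is illustrative):

```python
import numpy as np

def apply_temperature(probs, temperature):
    """Rescale a probability distribution by temperature.

    T < 1 sharpens the distribution (the top choice dominates);
    T > 1 flattens it (rare choices become more likely).
    """
    logits = np.log(np.asarray(probs, dtype="float64") + 1e-8) / temperature
    scaled = np.exp(logits)
    return scaled / scaled.sum()

probs = np.array([0.6, 0.3, 0.1])
sharp = apply_temperature(probs, 0.5)  # top probability rises above 0.6
flat = apply_temperature(probs, 2.0)   # top probability drops below 0.6
```

This is why the solution uses temperature=0.8: slightly below 1, it trims very unlikely characters while still allowing some variation.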
Bonus Experiment
Try using nucleus (top-p) sampling instead of top-k to see if it improves text diversity further.
💡 Hint
Nucleus sampling selects from the smallest set of words whose cumulative probability exceeds a threshold p, balancing diversity and coherence.
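As a starting point for the bonus experiment, here is one possible sketch of nucleus sampling that could replace the sample() function above (the name top_p_sample and the default p=0.9 are assumptions, not part of the original solution):

```python
import numpy as np

def top_p_sample(preds, p=0.9, temperature=1.0, rng=None):
    """Nucleus (top-p) sampling: draw from the smallest set of
    characters whose cumulative probability reaches p."""
    rng = np.random.default_rng() if rng is None else rng
    preds = np.asarray(preds, dtype="float64")
    # Same temperature scaling as in the top-k version
    preds = np.log(preds + 1e-8) / temperature
    preds = np.exp(preds) / np.sum(np.exp(preds))
    # Sort probabilities in descending order and find the nucleus
    order = np.argsort(preds)[::-1]
    cumulative = np.cumsum(preds[order])
    cutoff = np.searchsorted(cumulative, p) + 1  # smallest set reaching p
    keep = order[:cutoff]
    # Zero out everything outside the nucleus and renormalize
    probs = np.zeros_like(preds)
    probs[keep] = preds[keep]
    probs /= probs.sum()
    return rng.choice(len(preds), p=probs)
```

Unlike top-k, the number of candidates here adapts to the shape of the distribution: when the model is confident, the nucleus is small; when it is uncertain, more characters stay in play.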