Temperature and sampling in NLP - ML Experiment: Train & Evaluate

Experiment - Temperature and sampling

Problem: You have a text generation model that uses sampling with temperature to create sentences. Currently, the model uses a temperature of 1.0 and produces repetitive or dull text.

Current Metrics: Sampled text shows low diversity and repetitiveness; qualitative evaluation indicates low creativity.

Issue: The model's output lacks variety and creativity due to suboptimal temperature settings during sampling.

Your Task

Adjust the temperature parameter during sampling to increase the diversity and creativity of generated text without making it nonsensical.

Do not change the underlying language model architecture.

Only modify the temperature parameter and sampling method.

Keep the sampling code runnable and simple.

Solution

import numpy as np

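def sample_with_temperature(logits, temperature=1.0):
    """Sample one token index from softmax(logits / temperature)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Scale the logits: T < 1 sharpens the distribution, T > 1 flattens it
    scaled_logits = np.asarray(logits, dtype=float) / temperature
    # Subtract the max before exponentiating for numerical stability
    exp_logits = np.exp(scaled_logits - np.max(scaled_logits))
    probs = exp_logits / np.sum(exp_logits)
    # Sample from the probability distribution; return a plain Python int
    return int(np.random.choice(len(probs), p=probs))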

# Example logits for a vocabulary of 5 tokens
logits = np.array([2.0, 1.0, 0.1, 0.5, 1.5])

# Sample tokens with different temperatures
for temp in [0.5, 1.0, 1.5]:
    print(f"Sampling with temperature={temp}:")
    samples = [sample_with_temperature(logits, temperature=temp) for _ in range(10)]
    print(samples)

Added a temperature parameter to scale logits before converting to probabilities.

Implemented sampling from the adjusted probability distribution.

Demonstrated sampling at temperatures 0.5, 1.0, and 1.5 to show effect on output diversity.

Results Interpretation

Before: Sampling at temperature 1.0 produced repetitive tokens with low diversity.

After: Sampling at temperature 0.5 reduced randomness, focusing on likely tokens, while temperature 1.5 increased randomness, producing more diverse but sometimes less sensible tokens.

Adjusting temperature during sampling controls randomness: lower temperature makes output more predictable, higher temperature increases creativity but risks nonsense. This helps balance diversity and coherence in text generation.
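Because ten random draws can look similar from run to run, it can help to inspect the probabilities directly. The sketch below reuses the example logits from the solution and prints the top-token probability and the Shannon entropy at each temperature. The helper names softmax_with_temperature and entropy are illustrative and not part of the original solution.

```python
import numpy as np

def softmax_with_temperature(logits, temperature=1.0):
    """Return softmax probabilities of logits / temperature."""
    scaled = np.asarray(logits, dtype=float) / temperature
    exp = np.exp(scaled - np.max(scaled))  # subtract max for numerical stability
    return exp / exp.sum()

def entropy(probs):
    """Shannon entropy in nats; higher means a flatter distribution."""
    return float(-np.sum(probs * np.log(probs)))

# Same example logits as in the solution above
logits = np.array([2.0, 1.0, 0.1, 0.5, 1.5])

for temp in [0.5, 1.0, 1.5]:
    probs = softmax_with_temperature(logits, temp)
    print(f"T={temp}: top prob={probs.max():.3f}, entropy={entropy(probs):.3f}")
```

As the temperature rises, the top-token probability should fall and the entropy should rise, which is the flattening effect described above.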

Bonus Experiment

Try implementing top-k sampling combined with temperature to further control output diversity.

💡 Hint

Limit sampling to the top k tokens with highest probabilities after applying temperature scaling, then sample from this smaller set.
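One possible way to follow this hint is sketched below. It is a minimal illustration, not a reference implementation: the function name sample_top_k, the default k=3, and the optional rng argument are assumptions made for this example. Since dividing by a positive temperature does not change the ranking of tokens, selecting the k largest scaled logits is equivalent to selecting the k most probable tokens.

```python
import numpy as np

def sample_top_k(logits, temperature=1.0, k=3, rng=None):
    """Sample a token index from the k most likely tokens (hypothetical helper)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    rng = np.random.default_rng() if rng is None else rng
    logits = np.asarray(logits, dtype=float)
    k = max(1, min(k, logits.size))  # clamp k to a valid range
    scaled = logits / temperature
    # Indices of the k largest scaled logits (order within the set is irrelevant)
    top_idx = np.argpartition(scaled, -k)[-k:]
    top_scaled = scaled[top_idx]
    # Softmax over the surviving tokens only, so their probabilities sum to 1
    exp = np.exp(top_scaled - top_scaled.max())
    probs = exp / exp.sum()
    return int(rng.choice(top_idx, p=probs))

logits = np.array([2.0, 1.0, 0.1, 0.5, 1.5])
rng = np.random.default_rng(0)
samples = [sample_top_k(logits, temperature=1.5, k=3, rng=rng) for _ in range(10)]
print(samples)  # only indices 0, 1, and 4 can appear
```

With k=3 the two lowest-scoring tokens are never sampled, so a higher temperature can add variety among the plausible tokens without letting the unlikely ones through.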

Practice

1. What does increasing the temperature parameter in text generation usually do?

easy

A. Makes the output more predictable and repetitive

B. Stops the model from generating any text

C. Makes the output more random and creative

D. Always selects the most probable next word

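Solution

Step 1: Understand temperature effect on randomness. Dividing the logits by a higher temperature flattens the probability distribution, so less likely tokens are sampled more often.

Step 2: Relate temperature to creativity. More randomness in token choice gives more varied, less predictable text.

Final Answer: C. Makes the output more random and creative

Quick Check: Higher temperature means more randomness.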
