Recall & Review

beginner

What is the role of temperature in sampling from a language model?

Temperature controls how random or focused the model's predictions are. A low temperature (<1) makes the model more confident and conservative, picking high-probability words. A high temperature (>1) makes the model more creative and random by flattening the probabilities.

Click to reveal answer

beginner

Explain sampling in the context of generating text from a language model.

Sampling means picking the next word based on the model's predicted probabilities instead of always choosing the most likely word. This adds variety and creativity to the generated text.

Click to reveal answer

intermediate

How does increasing temperature affect the probability distribution during sampling?

Increasing temperature makes the probability distribution more even, so less likely words have a higher chance to be picked. This leads to more diverse and surprising outputs.

Click to reveal answer

intermediate

What happens if temperature is set to 0 during sampling?

Setting temperature to 0 means always picking the word with the highest probability (greedy decoding). This removes randomness and can make the output repetitive or dull.

Click to reveal answer

beginner

Why might you want to use a moderate temperature (e.g., 0.7) instead of very low or very high?

A moderate temperature balances creativity and coherence. It allows some randomness for interesting text but keeps the output sensible and relevant.

Click to reveal answer

What does a temperature of 1.0 mean when sampling from a language model?

AProbabilities are flattened to be more even

BThe original predicted probabilities are used without change

COnly the highest probability word is chosen

DProbabilities are sharpened to favor the top word

What is the effect of setting temperature to a very high value (e.g., 5)?

AModel always picks the most likely word

BOutput becomes very predictable and repetitive

CModel ignores probabilities and picks words randomly

DOutput becomes more random and diverse

Which sampling method removes randomness completely?

AGreedy decoding (temperature = 0)

BRandom sampling without temperature

CSampling with temperature = 1

DSampling with temperature > 1

Why is sampling preferred over always picking the highest probability word?

AIt makes the output more creative and less repetitive

BIt guarantees the most accurate output

CIt speeds up the generation process

DIt reduces the model size

What does lowering temperature below 1 do to the output?

AMakes output more random

BMakes output longer

CMakes output more focused and conservative

DMakes output shorter

Describe how temperature affects the randomness of text generated by a language model.

Explain why sampling is used instead of always choosing the most likely word in text generation.

Practice

(1/5)

1. What does increasing the temperature parameter in text generation usually do?

easy

A. Makes the output more predictable and repetitive

B. Stops the model from generating any text

C. Makes the output more random and creative

D. Always selects the most probable next word

Temperature and sampling in NLP - Cheat Sheet & Quick Revision

Start learning this pattern below

Practice

Solution

Step 1: Understand temperature effect on randomness

Step 2: Relate temperature to creativity

Final Answer:

Quick Check:

Solution

Step 1: Recall temperature scaling formula

Step 2: Identify correct operation

Final Answer:

Quick Check:

Solution

Step 1: Scale logits by dividing by temperature

Step 2: Calculate softmax probabilities

Final Answer:

Quick Check:

Solution

Step 1: Identify temperature scaling mistake

Step 2: Explain effect of wrong scaling

Final Answer:

Quick Check:

Solution

Step 1: Understand temperature impact on creativity

Step 2: Choose sampling method for balance

Final Answer:

Quick Check: