0
0
Prompt Engineering / GenAIml~5 mins

Text splitters in Prompt Engineering / GenAI - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is a text splitter in the context of machine learning?
A text splitter is a tool or method that breaks long text into smaller parts or chunks. This helps models handle text better by focusing on manageable pieces instead of very long text all at once.
Click to reveal answer
beginner
Why do we need to split text before feeding it to AI models?
Splitting text helps avoid overwhelming the model with too much information at once. It also helps keep context clear and allows the model to process and understand text more effectively.
Click to reveal answer
beginner
Name two common ways to split text.
1. Splitting by sentences or paragraphs. 2. Splitting by fixed length chunks (like every 100 words). Both help keep text pieces easy to handle.
Click to reveal answer
intermediate
What is an overlap in text splitting and why is it useful?
Overlap means some words from the end of one chunk appear again at the start of the next chunk. This helps keep context between chunks so the model doesn’t lose important connections.
Click to reveal answer
intermediate
How does chunk size affect model performance?
If chunks are too small, the model might miss context. If too large, the model might get overwhelmed or run into limits. Choosing the right chunk size balances understanding and performance.
Click to reveal answer
What is the main purpose of using a text splitter?
ATo remove punctuation from text
BTo translate text into another language
CTo break long text into smaller parts for easier processing
DTo generate new text from old text
Which of these is a common way to split text?
ABy sentence or paragraph
BBy font size
CBy color of words
DBy word meaning
What does overlap in text splitting help with?
AChanging the text language
BAdding more words to the text
CRemoving repeated words
DKeeping context between chunks
What happens if text chunks are too large?
AModel might get overwhelmed or hit limits
BModel will understand better
CText becomes shorter
DText becomes easier to read
Why not make chunks too small?
ASmall chunks confuse the model
BSmall chunks can lose important context
CSmall chunks increase text length
DSmall chunks are harder to process
Explain what a text splitter is and why it is important in AI text processing.
Think about how breaking text into smaller parts helps AI understand better.
You got /3 concepts.
    Describe how overlap works in text splitting and why it helps maintain context.
    Overlap is like sharing some words between chunks to keep the story connected.
    You got /3 concepts.