Performance: Token-based splitting
MEDIUM IMPACT
Token-based splitting affects how quickly and efficiently large text inputs are broken down for processing, impacting initial load and interaction speed.
from langchain.text_splitter import TokenTextSplitter def split_text_good(text): splitter = TokenTextSplitter(chunk_size=1000, chunk_overlap=100) return splitter.split_text(text)
def split_text_bad(text): # Splitting by characters or simple whitespace without token awareness return text.split(' ')
| Pattern | DOM Operations | Reflows | Paint Cost | Verdict |
|---|---|---|---|---|
| Simple whitespace split | N/A | N/A | N/A | [X] Bad |
| Token-aware splitting with Langchain TokenTextSplitter | N/A | N/A | N/A | [OK] Good |