Experiment - Text splitters
Problem: You have a long text document that you want to split into smaller chunks so that a language model can process it more easily. The current text splitter divides the text into fixed-size chunks and ignores sentence boundaries.
Current Metrics: Chunk size: 1000 characters; Overlap: 0; Number of chunks: 15; Issue: some chunks cut sentences in half, causing loss of context.
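To make the baseline concrete, here is a minimal sketch of a fixed-size splitter using the settings listed above (1000 characters, no overlap). The function name `split_fixed` and the sample text are assumptions for illustration only; the sample is not the document used in the experiment, and the actual splitter may be implemented differently.

```python
def split_fixed(text: str, chunk_size: int = 1000, overlap: int = 0) -> list[str]:
    """Cut text into chunks of at most chunk_size characters.

    Hypothetical helper: it counts characters only and never looks at
    sentence boundaries, which is the behavior described above.
    """
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last chunk already reaches the end of the text
        start += step
    return chunks


# Made-up sample text (an assumption, not the experiment's document).
sample = "The splitter counts characters and nothing else. " * 300
fixed_chunks = split_fixed(sample, chunk_size=1000, overlap=0)

print(len(fixed_chunks))      # number of chunks produced
print(repr(fixed_chunks[0][-30:]))  # tail of the first chunk
```

With zero overlap the chunk count is simply the text length divided by the chunk size, rounded up, so the cut points depend only on character positions and can land anywhere inside a sentence.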
Issue: The fixed-size splitter cuts sentences mid-way, so a chunk can begin or end with a fragment that has lost its context. This can confuse the language model and reduce the quality of downstream tasks such as summarization or question answering.
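One possible remedy is to split on sentence boundaries first and then pack whole sentences into chunks up to the size limit. The sketch below is an illustration under stated assumptions, not a definitive implementation: the name `split_by_sentence` is hypothetical, sentence detection is a naive regular expression (it will mis-split abbreviations such as "e.g."), and a single sentence longer than the limit is kept whole rather than cut.

```python
import re


def split_by_sentence(text: str, chunk_size: int = 1000) -> list[str]:
    """Greedily pack whole sentences into chunks of at most chunk_size characters.

    Hypothetical helper. Assumption: a sentence ends at '.', '!' or '?'
    followed by whitespace. A sentence longer than chunk_size becomes its
    own oversized chunk instead of being cut.
    """
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= chunk_size or not current:
            current = candidate       # sentence still fits (or starts a chunk)
        else:
            chunks.append(current)    # close the chunk at a sentence boundary
            current = sentence
    if current:
        chunks.append(current)
    return chunks


# Made-up sample text (an assumption, not the experiment's document).
sentence_sample = "Splitting on sentence boundaries keeps each thought intact. " * 300
sentence_chunks = split_by_sentence(sentence_sample, chunk_size=1000)

print(len(sentence_chunks))           # number of chunks produced
print(repr(sentence_chunks[0][-30:]))  # tail of the first chunk
```

The trade-off is that chunks are no longer all the same length: each one stops at the last sentence that fits, so the chunk count can be somewhat higher than with fixed-size splitting at the same limit.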