Experiment - Text chunking strategies
Problem:You want to split long text documents into smaller chunks for better processing by a language model. The current method splits text into fixed-size chunks without considering sentence boundaries.
Current Metrics:Chunk coherence score: 0.65, Overlap redundancy: 0.30
Issue:Chunks often break sentences in the middle, causing loss of meaning and reducing model understanding. This leads to lower chunk coherence and higher redundancy.