Experiment - OpenAI embeddings API
Problem:You want to create a model that converts text into numerical vectors (embeddings) to compare text similarity. Currently, you use OpenAI embeddings API but the similarity scores between related texts are low and inconsistent.
Current Metrics:Average cosine similarity between related text pairs: 0.45 (on scale 0 to 1, where 1 means very similar)
Issue:The embeddings do not capture semantic similarity well enough, causing poor similarity scores for related texts.