Sentence transformers map sentences to fixed-size vectors (embeddings). A good model produces vectors that capture meaning, so we evaluate it by checking whether semantically similar sentences land close together in the vector space and dissimilar sentences land far apart.
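Cosine similarity is the usual measure of how close two vectors are. For retrieval tasks, Recall@K measures how many of the correct matches appear among the top K results, and Mean Reciprocal Rank (MRR) averages 1/rank of the first correct match over all queries. Together, these metrics show whether the model ranks the right sentences at or near the top.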
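
To make these metrics concrete, here is a minimal sketch using only NumPy. It assumes the embeddings are already computed and that each query has exactly one correct candidate. The toy vectors, the `relevant` index list, and all function names are made up for illustration and do not come from any particular library.

```python
import numpy as np

def cosine_similarity_matrix(a, b):
    """Cosine similarity between every row of a and every row of b."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    a_unit = a / np.linalg.norm(a, axis=1, keepdims=True)
    b_unit = b / np.linalg.norm(b, axis=1, keepdims=True)
    return a_unit @ b_unit.T

def rank_of_relevant(sims, relevant_idx):
    """1-based rank of the single correct candidate for each query."""
    # Sort candidates from most to least similar for each query.
    order = np.argsort(-sims, axis=1)
    relevant_idx = np.asarray(relevant_idx)
    # Position of the correct candidate in that ordering.
    return np.argmax(order == relevant_idx[:, None], axis=1) + 1

def recall_at_k(ranks, k):
    """Fraction of queries whose correct candidate is in the top k.

    Assumes one correct candidate per query (illustrative simplification).
    """
    return float(np.mean(ranks <= k))

def mean_reciprocal_rank(ranks):
    """Average of 1/rank of the correct candidate over all queries."""
    return float(np.mean(1.0 / ranks))

# Toy 2-D "embeddings" (hypothetical data, not real model output).
queries = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
candidates = [[1.0, 0.1], [0.1, 1.0], [1.0, 0.9], [-1.0, 0.0]]
relevant = [0, 2, 2]  # index of the correct candidate for each query

sims = cosine_similarity_matrix(queries, candidates)
ranks = rank_of_relevant(sims, relevant)

print("ranks:", ranks.tolist())             # [1, 2, 1]
print("Recall@1:", recall_at_k(ranks, 1))   # 2 of 3 queries hit at rank 1
print("Recall@2:", recall_at_k(ranks, 2))   # 1.0
print("MRR:", mean_reciprocal_rank(ranks))  # (1 + 1/2 + 1) / 3
```

In practice the vectors would come from the model's encoder, and a query may have several correct matches, in which case Recall@K is computed per query as the share of its correct matches found in the top K.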