NLP - Text Similarity and Search

Why can cosine similarity sometimes give misleading results when comparing very sparse vectors in NLP tasks?

A. Because cosine similarity ignores vector length completely
B. Because sparse vectors may have little overlap but still high cosine similarity due to normalization
C. Because cosine similarity is sensitive to vector magnitude differences
D. Because cosine similarity requires dense vectors only
Step-by-Step Solution

Step 1: Understand sparse vector behavior
Sparse vectors consist mostly of zeros. Cosine similarity divides the dot product by the product of the two vector norms, and when each vector has only a few non-zero entries those norms are small, so a single shared feature can account for a large share of the score.

Step 2: Identify why cosine can mislead
Two vectors that share only a few features can therefore still receive a relatively high cosine similarity, even though the texts they represent have little in common.

Final Answer: Because sparse vectors may have little overlap but still high cosine similarity due to normalization -> Option B

Quick Check: Sparse vectors can mislead cosine similarity [OK]
Quick Trick: Normalization can inflate similarity for sparse vectors [OK]

Common Mistakes:
Attributing the problem to cosine ignoring vector length (Option A)
Assuming cosine requires dense vectors (Option D)
Confusing the normalization effect with sensitivity to magnitude (Option C); cosine similarity does not change when a vector is rescaled
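The normalization effect described in the solution can be checked with a small sketch. The dictionaries below are made-up bag-of-words vectors, not data from the quiz, and `cosine_similarity` is a hypothetical helper written for this illustration. Both pairs share exactly one term, yet the pair with fewer non-zero entries scores much higher.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity of two sparse vectors stored as {term: weight} dicts."""
    # Only terms present in both vectors contribute to the dot product.
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen here for empty vectors
    return dot / (norm_a * norm_b)

# Hypothetical very short texts: 2 terms each, sharing only "bank".
short_a = {"bank": 1, "river": 1}
short_b = {"bank": 1, "loan": 1}

# Hypothetical longer texts: 8 terms each, still sharing only "bank".
long_a = {t: 1 for t in ["bank", "river", "water", "fish", "boat", "shore", "mud", "reed"]}
long_b = {t: 1 for t in ["bank", "loan", "credit", "rate", "debt", "cash", "fee", "fund"]}

short_score = cosine_similarity(short_a, short_b)  # 1 / (sqrt(2) * sqrt(2))
long_score = cosine_similarity(long_a, long_b)     # 1 / (sqrt(8) * sqrt(8))

print(f"short pair: {short_score:.3f}")
print(f"long pair:  {long_score:.3f}")
```

In both pairs the only shared term is the ambiguous word "bank", so neither pair is topically related. The short pair still scores four times higher because its norms are smaller, which is the kind of inflation Option B refers to.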