0
0
ML Pythonml~5 mins

t-SNE for visualization in ML Python - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is t-SNE used for in machine learning?
t-SNE is used to reduce high-dimensional data to two or three dimensions to help visualize complex data patterns in a simple, easy-to-understand way.
Click to reveal answer
intermediate
How does t-SNE preserve data structure during visualization?
t-SNE preserves local relationships by keeping similar points close together in the low-dimensional space, making clusters visible while allowing some distortion of global structure.
Click to reveal answer
intermediate
What is the main difference between PCA and t-SNE?
PCA is a linear method that preserves global variance, while t-SNE is a nonlinear method focused on preserving local similarities for better cluster visualization.
Click to reveal answer
advanced
What role does the 'perplexity' parameter play in t-SNE?
Perplexity controls the balance between local and global aspects of the data; it roughly sets how many neighbors each point influences during the mapping.
Click to reveal answer
advanced
Why should t-SNE results be interpreted carefully?
Because t-SNE can distort global distances and is sensitive to parameters, its plots show clusters well but should not be used to infer exact distances or relationships between clusters.
Click to reveal answer
What is the primary goal of t-SNE?
AIncrease the number of features in data
BVisualize high-dimensional data in 2D or 3D
CTrain a classification model
DGenerate synthetic data
Which aspect does t-SNE focus on preserving?
ALocal similarities between nearby points
BGlobal distances between all points
CExact numeric values of features
DData labels
What does the 'perplexity' parameter affect in t-SNE?
ANumber of output dimensions
BNumber of iterations
CLearning rate speed
DBalance between local and global data structure
Why is t-SNE not suitable for very large datasets without adjustments?
AIt is computationally expensive and slow
BIt increases data dimensionality
CIt only works with images
DIt requires labeled data
Which method is a linear alternative to t-SNE?
ARandom Forest
BK-means
CPCA
DNeural Networks
Explain how t-SNE helps visualize complex data and what makes it different from other dimensionality reduction methods.
Think about how t-SNE shows clusters clearly by focusing on neighbors.
You got /4 concepts.
    Describe the importance of the 'perplexity' parameter in t-SNE and how changing it might affect the visualization.
    Consider how many neighbors each point 'sees' during mapping.
    You got /4 concepts.