R Programmingprogramming~10 mins

Correlation analysis in R Programming - Step-by-Step Execution

Choose your learning style9 modes available

Learn Why Deep Visual Try Challenge Project Recall Time

Concept Flow - Correlation analysis

Start with two numeric vectors

↓

Calculate means of each vector

↓

Calculate deviations from means

↓

Multiply deviations element-wise

↓

Sum multiplied deviations

↓

Calculate standard deviations

↓

Divide sum by product of std devs and n-1

↓

End

Correlation analysis calculates a number showing how two sets of numbers move together, step-by-step computing means, deviations, and combining them into a coefficient.

Execution Sample

R Programming

x <- c(1, 2, 3, 4, 5)
y <- c(2, 4, 6, 8, 10)
correlation <- cor(x, y)
print(correlation)

This code calculates the correlation coefficient between two numeric vectors x and y and prints the result.

Execution Table

Step	Action	Value/Calculation	Result
1	Input vectors	x = (1,2,3,4,5), y = (2,4,6,8,10)	Vectors ready
2	Calculate means	mean(x) = 3, mean(y) = 6	Means computed
3	Calculate deviations	x_dev = (-2,-1,0,1,2), y_dev = (-4,-2,0,2,4)	Deviations computed
4	Multiply deviations element-wise	x_dev * y_dev = (8,2,0,2,8)	Products computed
5	Sum multiplied deviations	sum = 20	Sum = 20
6	Calculate standard deviations	sd(x) = 1.58, sd(y) = 3.16	Std devs computed
7	Calculate correlation	r = 20 / ((5-1)1.583.16) = 1	Correlation coefficient = 1
8	Print result	print(1)	Output: 1
9	End	Correlation calculated successfully	Stop

💡 All steps completed; correlation coefficient calculated and printed.

Variable Tracker

Variable	Start	After Step 2	After Step 3	After Step 4	After Step 5	After Step 6	After Step 7	Final
x	(1,2,3,4,5)	(1,2,3,4,5)	(-2,-1,0,1,2)	(-2,-1,0,1,2)	(-2,-1,0,1,2)	(1,2,3,4,5)	(1,2,3,4,5)	(1,2,3,4,5)
y	(2,4,6,8,10)	(2,4,6,8,10)	(-4,-2,0,2,4)	(-4,-2,0,2,4)	(-4,-2,0,2,4)	(2,4,6,8,10)	(2,4,6,8,10)	(2,4,6,8,10)
mean_x	NA	3	3	3	3	3	3	3
mean_y	NA	6	6	6	6	6	6	6
x_dev	NA	NA	(-2,-1,0,1,2)	(-2,-1,0,1,2)	(-2,-1,0,1,2)	NA	NA	NA
y_dev	NA	NA	(-4,-2,0,2,4)	(-4,-2,0,2,4)	(-4,-2,0,2,4)	NA	NA	NA
prod_dev	NA	NA	NA	(8,2,0,2,8)	(8,2,0,2,8)	NA	NA	NA
sum_prod_dev	NA	NA	NA	NA	20	NA	NA	NA
sd_x	NA	NA	NA	NA	NA	1.58	1.58	1.58
sd_y	NA	NA	NA	NA	NA	3.16	3.16	3.16
correlation	NA	NA	NA	NA	NA	NA	1	1

Key Moments - 3 Insights

Why do we subtract the mean from each value before multiplying?

Why do we divide by (n-1) times the product of standard deviations?

What does a correlation of 1 mean in this example?

Visual Quiz - 3 Questions

Test your understanding

Look at the execution_table at step 3. What are the deviations of x from its mean?

A(2, 1, 0, -1, -2)

B(1, 2, 3, 4, 5)

C(-2, -1, 0, 1, 2)

D(0, 0, 0, 0, 0)

Concept Snapshot

Correlation analysis in R:
- Use cor(x, y) to find correlation coefficient r
- r ranges from -1 (perfect negative) to 1 (perfect positive)
- Steps: compute means, deviations, multiply, sum, divide by std dev product
- Measures linear relationship strength and direction
- Perfect correlation means one variable predicts the other exactly

Full Transcript

Correlation analysis calculates how two numeric vectors relate linearly. We start with two vectors, find their means, then find how each value deviates from its mean. We multiply these deviations element-wise and sum them. Then we calculate the standard deviations of each vector. Finally, we divide the sum of multiplied deviations by the product of standard deviations and n-1 to get the correlation coefficient. This coefficient tells us if the vectors move together positively, negatively, or not at all. In the example, vectors x and y have a perfect positive correlation of 1.