0
0
R Programmingprogramming~10 mins

Correlation analysis in R Programming - Step-by-Step Execution

Choose your learning style9 modes available
Concept Flow - Correlation analysis
Start with two numeric vectors
Calculate means of each vector
Calculate deviations from means
Multiply deviations element-wise
Sum multiplied deviations
Calculate standard deviations
Divide sum by product of std devs and n-1
End
Correlation analysis calculates a number showing how two sets of numbers move together, step-by-step computing means, deviations, and combining them into a coefficient.
Execution Sample
R Programming
x <- c(1, 2, 3, 4, 5)
y <- c(2, 4, 6, 8, 10)
correlation <- cor(x, y)
print(correlation)
This code calculates the correlation coefficient between two numeric vectors x and y and prints the result.
Execution Table
StepActionValue/CalculationResult
1Input vectorsx = (1,2,3,4,5), y = (2,4,6,8,10)Vectors ready
2Calculate meansmean(x) = 3, mean(y) = 6Means computed
3Calculate deviationsx_dev = (-2,-1,0,1,2), y_dev = (-4,-2,0,2,4)Deviations computed
4Multiply deviations element-wisex_dev * y_dev = (8,2,0,2,8)Products computed
5Sum multiplied deviationssum = 20Sum = 20
6Calculate standard deviationssd(x) = 1.58, sd(y) = 3.16Std devs computed
7Calculate correlationr = 20 / ((5-1)*1.58*3.16) = 1Correlation coefficient = 1
8Print resultprint(1)Output: 1
9EndCorrelation calculated successfullyStop
💡 All steps completed; correlation coefficient calculated and printed.
Variable Tracker
VariableStartAfter Step 2After Step 3After Step 4After Step 5After Step 6After Step 7Final
x(1,2,3,4,5)(1,2,3,4,5)(-2,-1,0,1,2)(-2,-1,0,1,2)(-2,-1,0,1,2)(1,2,3,4,5)(1,2,3,4,5)(1,2,3,4,5)
y(2,4,6,8,10)(2,4,6,8,10)(-4,-2,0,2,4)(-4,-2,0,2,4)(-4,-2,0,2,4)(2,4,6,8,10)(2,4,6,8,10)(2,4,6,8,10)
mean_xNA3333333
mean_yNA6666666
x_devNANA(-2,-1,0,1,2)(-2,-1,0,1,2)(-2,-1,0,1,2)NANANA
y_devNANA(-4,-2,0,2,4)(-4,-2,0,2,4)(-4,-2,0,2,4)NANANA
prod_devNANANA(8,2,0,2,8)(8,2,0,2,8)NANANA
sum_prod_devNANANANA20NANANA
sd_xNANANANANA1.581.581.58
sd_yNANANANANA3.163.163.16
correlationNANANANANANA11
Key Moments - 3 Insights
Why do we subtract the mean from each value before multiplying?
Subtracting the mean centers the data around zero, so we measure how values move relative to their average, as shown in step 3 of the execution_table.
Why do we divide by (n-1) times the product of standard deviations?
Dividing by (n-1) and the product of standard deviations normalizes the sum of products to get a value between -1 and 1, explained in step 7 of the execution_table.
What does a correlation of 1 mean in this example?
A correlation of 1 means perfect positive linear relationship; as x increases, y increases proportionally, as seen in the final result in step 8.
Visual Quiz - 3 Questions
Test your understanding
Look at the execution_table at step 3. What are the deviations of x from its mean?
A(2, 1, 0, -1, -2)
B(1, 2, 3, 4, 5)
C(-2, -1, 0, 1, 2)
D(0, 0, 0, 0, 0)
💡 Hint
Check the 'Value/Calculation' column at step 3 in execution_table.
At which step does the correlation coefficient get calculated?
AStep 7
BStep 3
CStep 5
DStep 2
💡 Hint
Look for the step where division by standard deviations and n-1 happens in execution_table.
If the vectors x and y were identical, what would the correlation coefficient be according to variable_tracker?
A-1
B1
C0
DUndefined
💡 Hint
Correlation of identical vectors is shown in the final correlation value in variable_tracker.
Concept Snapshot
Correlation analysis in R:
- Use cor(x, y) to find correlation coefficient r
- r ranges from -1 (perfect negative) to 1 (perfect positive)
- Steps: compute means, deviations, multiply, sum, divide by std dev product
- Measures linear relationship strength and direction
- Perfect correlation means one variable predicts the other exactly
Full Transcript
Correlation analysis calculates how two numeric vectors relate linearly. We start with two vectors, find their means, then find how each value deviates from its mean. We multiply these deviations element-wise and sum them. Then we calculate the standard deviations of each vector. Finally, we divide the sum of multiplied deviations by the product of standard deviations and n-1 to get the correlation coefficient. This coefficient tells us if the vectors move together positively, negatively, or not at all. In the example, vectors x and y have a perfect positive correlation of 1.