Factors in R are used to store data that belongs to categories or groups. They help R understand that the data is not just numbers or text, but categories that can be ordered or unordered.
0
0
Why factors represent categorical data in R Programming
Introduction
When you have survey answers like 'Yes', 'No', and 'Maybe'.
When you want to group data by colors like 'Red', 'Green', and 'Blue'.
When you have data about days of the week and want to keep their order.
When you want to save memory by storing repeated category names efficiently.
When you want to perform statistical analysis on groups or categories.
Syntax
R Programming
factor(x, levels = NULL, ordered = FALSE)
x is the vector of data you want to convert to categories.
levels lets you set the possible categories explicitly.
Examples
This creates a factor from a vector of color names.
R Programming
colors <- c('Red', 'Green', 'Blue', 'Green') factor_colors <- factor(colors)
This creates an ordered factor where sizes have a specific order.
R Programming
sizes <- c('Small', 'Large', 'Medium', 'Small') factor_sizes <- factor(sizes, levels = c('Small', 'Medium', 'Large'), ordered = TRUE)
Sample Program
This program creates an ordered factor from survey responses and prints the factor and its levels.
R Programming
responses <- c('Yes', 'No', 'Maybe', 'Yes', 'No') factor_responses <- factor(responses, levels = c('No', 'Maybe', 'Yes'), ordered = TRUE) print(factor_responses) print(levels(factor_responses))
OutputSuccess
Important Notes
Factors store categories as numbers internally but show the category names.
Ordering factors helps when categories have a natural order, like sizes or ratings.
Summary
Factors represent categorical data by grouping values into categories.
They help R treat data as categories, not just text or numbers.
Ordering factors allows working with categories that have a natural sequence.