0
0
R-programmingHow-ToBeginner · 3 min read

How to Use readr Package in R for Fast Data Import

The readr package in R is used to read data files quickly and easily with functions like read_csv() for CSV files. You load the package with library(readr) and then call the appropriate function to import your data into a data frame.
📐

Syntax

The readr package provides simple functions to read different file types. The most common syntax is:

  • read_csv(file, col_names = TRUE, locale = default_locale(), na = c("", "NA"), skip = 0) reads CSV files.
  • read_tsv() reads tab-separated files.
  • read_delim() reads files with any delimiter.

Parameters explained:

  • file: path to your data file.
  • col_names: whether the first row has column names.
  • locale: controls encoding and other locale settings.
  • na: strings to treat as missing values.
  • skip: number of lines to skip before reading data.
r
library(readr)
data <- read_csv("data.csv", col_names = TRUE, na = c("", "NA"))
💻

Example

This example shows how to load a CSV file using read_csv() and view the first few rows.

r
library(readr)

# Create a sample CSV file
writeLines(c("name,age,score", "Alice,30,85", "Bob,25,90", "Carol,27,88"), "sample.csv")

# Read the CSV file
data <- read_csv("sample.csv")

# Print the data
print(data)
Output
# A tibble: 3 × 3 name age score <chr> <dbl> <dbl> 1 Alice 30 85 2 Bob 25 90 3 Carol 27 88
⚠️

Common Pitfalls

Common mistakes when using readr include:

  • Not loading the package with library(readr) before calling functions.
  • Using base R functions like read.csv() instead of read_csv() which is faster and more consistent.
  • Forgetting to set col_names = FALSE if the file has no header row, causing the first row to be treated as column names.
  • Not specifying the correct file path or working directory.

Example of a common mistake and fix:

r
# Wrong: forgetting to load readr
# data <- read_csv("file.csv") # Error: could not find function "read_csv"

# Correct:
library(readr)
data <- read_csv("file.csv")
📊

Quick Reference

FunctionPurpose
read_csv()Read comma-separated values (CSV) files
read_tsv()Read tab-separated values (TSV) files
read_delim()Read files with any delimiter
write_csv()Write data frames to CSV files
parse_number()Extract numbers from strings

Key Takeaways

Load the readr package with library(readr) before using its functions.
Use read_csv() to quickly read CSV files into tibbles with better performance than base R.
Set col_names = FALSE if your file does not have header names to avoid misreading data.
Check your file path and working directory to avoid file not found errors.
readr functions return tibbles, which are easier to work with than base data frames.