Overview - R6 classes

What is it?

R6 classes are a way to create objects in R that can hold both data and functions together. They let you build objects with properties (called fields) and actions (called methods) that can change over time. Unlike regular R functions or lists, R6 classes support object-oriented programming with encapsulation and inheritance. This helps organize code better when working on bigger projects.

Why it matters

Without R6 classes, managing complex data and behaviors in R can get messy and repetitive. R6 classes solve this by bundling data and related actions into one object, making code easier to read, reuse, and maintain. This is especially important for building larger programs, simulations, or tools where you want clear structure and control over how data changes.

Where it fits

Before learning R6 classes, you should understand basic R programming, including lists and functions. Knowing about simpler object-oriented systems in R like S3 or S4 helps but is not required. After R6 classes, you can explore advanced object-oriented design, package development, or other R programming paradigms.

Mental Model

Core Idea

R6 classes are blueprints for objects that combine data and behavior, letting you create and control living things in your code.

Think of it like...

Think of an R6 class like a recipe for baking a cake: it tells you what ingredients (data) you need and the steps (methods) to make and change the cake. Each cake you bake from the recipe is an object that can be decorated or sliced differently without changing the recipe itself.

┌───────────────┐
│   R6 Class    │
│───────────────│
│ Fields (data) │
│ Methods (func)│
└──────┬────────┘
       │
       ▼
┌───────────────┐
│  Object 1     │
│  (instance)   │
│ Fields values │
│ Methods work  │
└───────────────┘

Build-Up - 7 Steps

1

FoundationUnderstanding Basic Objects in R

Concept: Learn what objects are in R and how data is stored simply.

In R, everything is an object like numbers, vectors, or lists. Lists can hold different types of data together, like a mini container. But lists don't have built-in ways to attach functions that act on their data.

Result

You know that R stores data in objects and that lists can group data but lack behavior.

Understanding that R's basic objects hold data but not behavior sets the stage for why R6 classes are useful.

2

FoundationWhat is Object-Oriented Programming?

3

IntermediateCreating Your First R6 Class

4

IntermediateUsing Fields and Methods in R6

5

IntermediateInheritance and Extending R6 Classes

6

AdvancedEncapsulation and Private Members

7

ExpertAdvanced R6: Active Bindings and Performance

Under the Hood

R6 classes work by creating environments that hold fields and methods together. Each object is an environment with its own data and functions. Methods use 'self' to refer to the current object. Private members are stored in a separate environment inaccessible from outside. This design uses R's reference semantics, so objects are modified in place without copying, unlike normal R objects.

Why designed this way?

R6 was designed to bring true object-oriented programming to R with mutable objects and encapsulation, which older systems like S3 and S4 lacked or handled inefficiently. Using environments and reference semantics avoids costly copying and makes R6 faster and more intuitive for programmers from other OOP languages.

┌─────────────────────────────┐
│        R6 Object            │
│ ┌───────────────┐           │
│ │ Public Env    │◄──────────┤
│ │ - fields      │           │
│ │ - methods     │           │
│ └───────────────┘           │
│ ┌───────────────┐           │
│ │ Private Env   │           │
│ │ - private data│           │
│ │ - private meth│           │
│ └───────────────┘           │
└─────────────┬───────────────┘
              │
              ▼
        Reference semantics
        (modifies in place)

Myth Busters - 4 Common Misconceptions

Quick: Do you think R6 objects are copied when assigned to a new variable? Commit to yes or no.

Common Belief:R6 objects behave like normal R objects and get copied when assigned or passed around.

Tap to reveal reality

Quick: Can you access private fields of an R6 object directly using $ from outside? Commit yes or no.

Common Belief:Private fields in R6 are just a naming convention and can be accessed like public fields.

Tap to reveal reality

Quick: Do you think R6 classes are slower than S3 or S4 classes? Commit yes or no.

Common Belief:Because R6 is more complex, it must be slower than older R object systems.

Tap to reveal reality

Quick: Do you think methods in R6 are just normal R functions outside the class? Commit yes or no.

Common Belief:R6 methods are just regular functions stored inside the object without special behavior.

Tap to reveal reality

Expert Zone

1

R6 objects can be serialized and saved, but private environments require careful handling to preserve privacy after reload.

2

Active bindings can introduce subtle bugs if they have side effects or depend on external state, so use them sparingly.

3

Multiple inheritance is not supported directly in R6, so complex hierarchies require composition or mixins.

When NOT to use

Avoid R6 classes when you need immutable objects or functional programming style; use base R lists or S3 classes instead. For formal class definitions with strict validation, S4 or R's newer 'R7' system may be better.

Production Patterns

In production, R6 classes are used to build modular components like database connectors, simulation models, or GUI elements. Developers often combine R6 with packages like 'shiny' for interactive apps or 'plumber' for APIs, leveraging encapsulation and mutable state.

Connections

Reference Semantics

R6 classes implement reference semantics in R, unlike most R objects which use value semantics.

Understanding reference semantics in R6 helps grasp why objects change in place and how this differs from usual R behavior.

Encapsulation in Software Engineering

R6's private members embody the encapsulation principle from software engineering.

Knowing encapsulation from software design clarifies why hiding data inside objects improves code safety and clarity.

Biological Cell Model

Both R6 objects and biological cells contain internal components and perform actions to maintain state.

Seeing R6 objects like cells helps appreciate how data and behavior combine to form self-contained units.

Common Pitfalls

#1Trying to copy an R6 object by simple assignment expecting a new independent copy.

Wrong approach:obj2 <- obj1 obj2$field <- 10 # expecting obj1$field unchanged

Correct approach:obj2 <- obj1$clone(deep = TRUE) obj2$field <- 10 # obj1$field remains unchanged

Root cause:Misunderstanding that R6 objects use reference semantics and assignment copies only the reference, not the object.

#2Accessing private fields directly from outside the object.

Wrong approach:print(obj$private_field) # NULL or error

Correct approach:Use public methods to access private data, e.g., obj$get_private_field()

Root cause:Not knowing private members are stored in a separate environment inaccessible from outside.

#3Defining methods without using 'self' to refer to fields or other methods.

Wrong approach:greet = function() { paste0("Hi, ", name) } # 'name' undefined

Correct approach:greet = function() { paste0("Hi, ", self$name) }

Root cause:Confusing local function scope with object scope; forgetting to use 'self' to access object data.

Key Takeaways

R6 classes bring true object-oriented programming to R by combining data and behavior in mutable objects.

They use reference semantics, so objects are modified in place and assigned variables point to the same object unless cloned.

Encapsulation with private members helps protect internal data and design clear interfaces.

Inheritance allows building new classes from existing ones, promoting code reuse and organization.

Advanced features like active bindings enable dynamic properties but require careful use to avoid bugs.