Subgraph definition in GraphQL - Deep Dive

Overview - Subgraph definition

What is it?

A subgraph definition in GraphQL is a way to describe a part of a larger graph schema. It defines types, fields, and relationships that belong to a specific service or domain. This allows multiple teams or services to own different parts of the overall graph, which can then be combined into a single unified API.
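As a rough sketch, a subgraph definition for a hypothetical products service might look like the following. The type and field names are assumptions made up for illustration, and the '@key' directive follows the federation convention used later in this lesson; a federation library normally supplies the directive definitions.

```graphql
# Hypothetical "products" subgraph: this service owns the Product type.
# All names here (Product, sku, price, product) are illustrative assumptions.
type Product @key(fields: "sku") {
  sku: String!   # identifier other subgraphs can use to reference a Product
  name: String
  price: Float
}

type Query {
  # Entry point this subgraph contributes to the unified API
  product(sku: String!): Product
}
```

Other services would publish their own definitions in the same way, each covering only the types and fields they are responsible for.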

Why it matters

Without subgraph definitions, every team working on a large GraphQL API has to coordinate changes to one shared schema, which slows releases and invites conflicting edits. Subgraphs let each team develop and deploy its part independently, and the parts are then combined into one API that clients query as a single graph.

Where it fits

Before learning subgraph definitions, you should understand basic GraphQL schemas and queries. After mastering subgraphs, you can learn about schema federation, which is how multiple subgraphs are combined into a single graph. Later, you might explore advanced federation features like directives and query planning.

Mental Model

Core Idea

A subgraph definition is a focused piece of a larger GraphQL schema owned by one service, describing its data and how it connects to others.

Think of it like...

Imagine a city map divided into neighborhoods. Each neighborhood map shows streets and landmarks inside it, maintained by local teams. The full city map is made by combining all neighborhood maps, letting you navigate the whole city easily.

┌─────────────┐   ┌─────────────┐   ┌─────────────┐
│ Subgraph A  │   │ Subgraph B  │   │ Subgraph C  │
│ (Service 1) │   │ (Service 2) │   │ (Service 3) │
│ Types &     │   │ Types &     │   │ Types &     │
│ Fields      │   │ Fields      │   │ Fields      │
└─────┬───────┘   └─────┬───────┘   └─────┬───────┘
      │                 │                 │
      └───────┬─────────┴─────────┬───────┘
              │                   │
        ┌─────▼───────────────────▼─────┐
        │    Federated Graph Schema     │
        │  (Unified API combining all)  │
        └───────────────────────────────┘

Build-Up - 7 Steps

Foundation: Understanding GraphQL Schema Basics

Concept: Learn what a GraphQL schema is and how it defines data types and queries.

A GraphQL schema describes the shape of data you can ask for. It defines types like objects, their fields, and how clients can query them. For example, a 'User' type might have fields like 'id' and 'name'. This schema acts like a contract between client and server.
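To make the 'User' example concrete, here is a minimal sketch of such a schema. The 'user' query field and its 'id' argument are assumptions added for illustration; the lesson itself only names the 'User' type and its 'id' and 'name' fields.

```graphql
# The contract: what data exists and how clients can ask for it.
type User {
  id: ID!        # non-null identifier
  name: String
}

type Query {
  # Hypothetical entry point for looking up one user
  user(id: ID!): User
}

# A client could then send a query such as:
#
#   query {
#     user(id: "1") {
#       id
#       name
#     }
#   }
```

The response mirrors the shape of the query, which is why the schema works as a contract between client and server.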

Result

You can write simple GraphQL queries and understand the data structure they return.

Understanding schemas is essential because subgraphs are just parts of these schemas owned by different services.

Foundation: What is a Subgraph in GraphQL?

Intermediate: Defining Types and Fields in a Subgraph

Intermediate: Extending Types Across Subgraphs

Intermediate: Using Directives for Federation

Advanced: Subgraph Schema Validation and Composition

Expert: Advanced Subgraph Design and Performance Considerations

Under the Hood

Subgraph definitions are GraphQL schemas annotated with federation directives. Each subgraph runs as an independent GraphQL service exposing its schema. A gateway service fetches these schemas and uses a composition algorithm to merge them into a single federated schema. This algorithm resolves type ownership, merges fields, and validates keys and references. At runtime, the gateway splits client queries into sub-queries sent to relevant subgraphs, then combines their results.
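The flow described above can be sketched with two small subgraphs. The service names, the 'Review' type, and the 'reviews' field are hypothetical, and the query plan in the comments is only an approximation of what a gateway might do, not the output of a real planner. In practice each subgraph lives in its own schema file served by its own service; they are shown together here for brevity.

```graphql
# --- Subgraph A (hypothetical "users" service) ---
type User @key(fields: "id") {
  id: ID!
  name: String
}

type Query {
  user(id: ID!): User
}

# --- Subgraph B (hypothetical "reviews" service) ---
extend type User @key(fields: "id") {
  id: ID! @external     # key field, owned by Subgraph A
  reviews: [Review]     # field contributed by Subgraph B
}

type Review {
  body: String
}

# --- Client query sent to the gateway ---
#   query { user(id: "1") { name reviews { body } } }
#
# Roughly how the gateway might split it:
#   1. Ask Subgraph A for user(id: "1") { id name }
#   2. Ask Subgraph B for that User's reviews, identified by its key (id)
#   3. Merge both results into one response
```

The shared '@key' on 'id' is what lets the gateway hand the same 'User' from one service to the other.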

Why designed this way?

Subgraphs were designed to solve the problem of scaling GraphQL APIs across multiple teams and services. Instead of one monolithic schema, subgraphs allow decentralized ownership and independent deployment. Federation directives and composition enable these independent schemas to work together seamlessly. Alternatives like stitching schemas lacked strong ownership and had runtime inefficiencies, so federation with subgraphs became the preferred approach.

┌───────────────┐       ┌───────────────┐       ┌───────────────┐
│ Subgraph A    │       │ Subgraph B    │       │ Subgraph C    │
│ (GraphQL svc) │       │ (GraphQL svc) │       │ (GraphQL svc) │
└───────┬───────┘       └───────┬───────┘       └───────┬───────┘
        │                       │                       │
        │ Schema with directives│ Schema with directives│ Schema with directives
        └──────────────┬────────┴───────────────┬───────┘
                       │                        │
               ┌───────▼────────┐       ┌───────▼────────┐
               │ Federation     │       │ Gateway        │
               │ Composition    │──────▶│ (Query Router) │
               └────────────────┘       └────────────────┘
                       │                        │
                       │Unified Federated Schema│
                       └────────────────────────┘

Myth Busters - 4 Common Misconceptions

Quick: Do you think a subgraph must define all fields of a type it owns? Commit yes or no.

Common Belief: A subgraph must define every field of a type it owns completely.

Quick: Do you think subgraph directives like '@key' are optional for federation? Commit yes or no.

Common Belief: Directives like '@key' are optional and only for documentation.

Quick: Do you think subgraphs can be combined manually without tools? Commit yes or no.

Common Belief: You can manually merge subgraph schemas without special tools.

Quick: Do you think more subgraphs always mean better modularity and performance? Commit yes or no.

Common Belief: Splitting into many small subgraphs always improves modularity and performance.

Expert Zone

Subgraphs can share ownership of types by carefully coordinating '@key' fields and extensions, enabling flexible data ownership models.

The choice of which fields to mark as '@external' affects query planning and can optimize data fetching across services.

Schema composition errors often stem from subtle mismatches in key fields or type definitions that are hard to spot without deep schema inspection.
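As one illustration of how '@external' can influence data fetching, the sketch below pairs it with '@requires', a federation directive this lesson does not otherwise cover. The shipping service and the 'weight' and 'shippingEstimate' fields are hypothetical names chosen for the example.

```graphql
# Hypothetical "shipping" subgraph extending a Product entity owned elsewhere.
extend type Product @key(fields: "sku") {
  sku: String! @external     # key field, owned by the products subgraph
  weight: Float @external    # owned elsewhere; declared so it can be required below

  # Resolving this field needs `weight`, so the gateway fetches it from
  # the owning subgraph first and passes it along with the key.
  shippingEstimate: Float @requires(fields: "weight")
}
```

Each field listed in '@requires' can add a fetch to the owning subgraph, which is one reason the choice of external fields matters for query planning.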

When NOT to use

Subgraph definitions and federation are not ideal for very small APIs or when all data is owned by a single service. In those cases, a monolithic GraphQL schema is simpler and more efficient. Also, if services cannot coordinate schema changes or use federation directives properly, schema stitching or separate APIs might be better alternatives.

Production Patterns

In production, teams define subgraphs per domain or microservice, each with clear ownership. They use CI pipelines to validate subgraph schemas and composition before deployment. The gateway caches federated schemas and query plans for performance. Monitoring tools track query latency across subgraphs to identify bottlenecks. Incremental adoption allows migrating parts of a monolith to subgraphs gradually.

Connections

Microservices Architecture

Subgraphs map directly to microservices owning specific data domains.

Understanding microservices helps grasp why subgraphs isolate schema parts and how independent teams manage them.

Modular Programming

Subgraph definitions embody modular design by separating concerns into distinct schema modules.

Knowing modular programming principles clarifies why splitting schemas improves maintainability and scalability.

Distributed Systems

Subgraphs run as distributed services that must coordinate to serve unified queries.

Recognizing distributed system challenges like latency and consistency helps understand federation's design trade-offs.

Common Pitfalls

#1 Defining the same type with conflicting fields in multiple subgraphs.

Wrong approach:

    # Subgraph A
    type User {
      id: ID!
      name: String
    }
    # In another subgraph
    type User {
      id: ID!
      email: String
      name: Int   # Conflict: different type
    }

Correct approach:

    # Subgraph A
    type User @key(fields: "id") {
      id: ID!
      name: String
    }
    # In another subgraph
    extend type User @key(fields: "id") {
      id: ID! @external
      email: String
    }

Root cause: Not realizing that an entity type should be defined once and extended in other subgraphs, rather than redefined with conflicting fields.

#2 Omitting the '@key' directive on entity types in subgraphs.

Wrong approach:

    type Product {
      sku: String!
      name: String
    }

Correct approach:

    type Product @key(fields: "sku") {
      sku: String!
      name: String
    }

Root cause: Not realizing '@key' is required to identify entities across subgraphs for federation.

#3 Marking fields as '@external' without matching '@key' fields.

Wrong approach:

    extend type User {
      email: String @external
    }

Correct approach:

    extend type User @key(fields: "id") {
      id: ID! @external          # key field that identifies the entity
      email: String @external    # declared here, resolved by the owning subgraph
    }

Root cause: Failing to declare the key fields on extended types causes federation to lose track of entity identity.

Key Takeaways

A subgraph definition is a partial GraphQL schema owned by one service, describing its data and how it connects to others.

Subgraphs use special directives like '@key' and '@external' to enable federation and link data across services.

Subgraph schemas are composed by tooling into a unified federated graph; composition validates them and rejects inconsistent definitions.

Designing subgraphs requires balancing modularity with performance to build scalable and maintainable APIs.

Understanding subgraphs connects deeply with concepts in microservices, modular programming, and distributed systems.

Practice

1. (Easy) What is the main purpose of defining a subgraph in a GraphQL architecture?

A. To split a large graph into smaller, manageable parts

B. To increase the number of queries sent to the server

C. To combine multiple databases into one

D. To replace the need for a schema in GraphQL

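Solution

Step 1: Understand the concept of subgraphs. A subgraph describes the part of a larger graph schema that one service or domain owns.

Step 2: Identify the main purpose. Subgraphs let a large graph be divided so that teams can own and evolve their parts independently.

Final Answer: A. To split a large graph into smaller, manageable parts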
