Overview - HTTP(S) Load Balancer (Layer 7)

What is it?

An HTTP(S) Load Balancer is a service that distributes incoming web traffic across multiple servers based on the content of the requests. It works at Layer 7, the application layer, which means it can make smart decisions using details like URLs, headers, and cookies. This helps websites handle many visitors smoothly and stay available even if some servers fail. It also supports secure connections using HTTPS.

Why it matters

Without an HTTP(S) Load Balancer, websites could slow down or crash when too many people visit at once. It solves the problem of sharing work among servers so no single server gets overwhelmed. This keeps websites fast, reliable, and secure, which is important for user experience and business success. Without it, users might face slow pages, errors, or security risks.

Where it fits

Before learning about HTTP(S) Load Balancers, you should understand basic networking concepts like IP addresses, DNS, and how web servers work. After this, you can learn about other load balancing types like TCP/UDP (Layer 4) Load Balancers and advanced traffic management like CDN and auto-scaling.

Mental Model

Core Idea

An HTTP(S) Load Balancer acts like a smart traffic officer that directs web requests to the best server based on the request details to keep websites fast and reliable.

Think of it like...

Imagine a busy restaurant with many chefs. The host (load balancer) looks at each customer's order and sends it to the chef best suited to prepare that dish quickly, ensuring no chef is overwhelmed and customers get their food fast.

┌─────────────────────────────┐
│       Client Requests        │
└─────────────┬───────────────┘
              │
              ▼
┌─────────────────────────────┐
│   HTTP(S) Load Balancer      │
│  (Reads URL, headers, etc.)  │
└─────────────┬───────────────┘
              │
   ┌──────────┴───────────┐
   │                      │
┌───────┐             ┌───────┐
│Server1│             │Server2│
└───────┘             └───────┘

Build-Up - 7 Steps

1

FoundationBasics of Web Traffic and Servers

Concept: Understanding how web requests travel from users to servers.

When you type a website address, your browser sends a request to a server that holds the website. The server processes the request and sends back the webpage. If many people visit at once, one server might get too busy and slow down.

Result

You see how web requests and responses flow, and why a single server can become a bottleneck.

Knowing how web traffic works helps you understand why distributing requests is necessary.

2

FoundationWhat is Load Balancing?

3

IntermediateLayer 7 Load Balancing Explained

4

IntermediateHow GCP HTTP(S) Load Balancer Works

5

IntermediateBackend Services and URL Maps

6

AdvancedHealth Checks and Failover Handling

7

ExpertAdvanced Features: SSL Offloading and CDN Integration

Under the Hood

The HTTP(S) Load Balancer operates as a reverse proxy that terminates client connections, inspects HTTP headers and URLs, and then opens new connections to backend servers. It maintains session affinity if configured and performs health checks by sending periodic requests to backends. It uses global anycast IP addresses to route user requests to the nearest Google edge location, reducing latency. SSL certificates are managed at the load balancer, which handles encryption and decryption, offloading this work from backend servers.

Why designed this way?

This design separates client-facing traffic management from backend server logic, improving scalability and security. Using global anycast and edge locations reduces latency and improves availability worldwide. SSL offloading centralizes certificate management and reduces backend complexity. Alternatives like Layer 4 load balancing are simpler but lack application-level routing, which is essential for modern web apps.

┌───────────────┐       ┌─────────────────────┐       ┌───────────────┐
│   Client      │──────▶│ Google Edge Location │──────▶│ HTTP(S) Load  │
│ (Browser)     │       │ (Anycast IP)         │       │ Balancer      │
└───────────────┘       └─────────┬───────────┘       └──────┬────────┘
                                      │                        │
                                      ▼                        ▼
                            ┌─────────────────┐       ┌───────────────┐
                            │ Health Checks   │       │ Backend       │
                            │ (Monitor Server │◀──────│ Servers       │
                            │  Status)        │       │ (Web Apps)    │
                            └─────────────────┘       └───────────────┘

Myth Busters - 4 Common Misconceptions

Quick: Does an HTTP(S) Load Balancer only distribute traffic based on IP addresses? Commit to yes or no.

Common Belief:Load balancers only look at IP addresses and ports to distribute traffic.

Tap to reveal reality

Quick: Do you think SSL encryption always happens on backend servers? Commit to yes or no.

Common Belief:SSL/TLS encryption must be handled by each backend server individually.

Tap to reveal reality

Quick: Does a load balancer send traffic to unhealthy servers until they recover? Commit to yes or no.

Common Belief:Load balancers do not check server health and keep sending traffic regardless.

Tap to reveal reality

Quick: Can GCP HTTP(S) Load Balancer only balance traffic within one region? Commit to yes or no.

Common Belief:Load balancing is limited to servers in the same geographic region.

Tap to reveal reality

Expert Zone

1

Session affinity can be configured using cookies or client IP, but improper use can reduce load distribution efficiency.

2

Backend services can have different capacity weights, allowing fine-tuned traffic distribution based on server power or cost.

3

URL maps support complex routing rules including rewrites and redirects, enabling sophisticated traffic control without backend changes.

When NOT to use

HTTP(S) Load Balancer is not suitable for non-HTTP protocols or very low-latency internal traffic. For TCP/UDP or internal-only traffic, use Network Load Balancer or Internal Load Balancer instead.

Production Patterns

In production, HTTP(S) Load Balancers are combined with auto-scaling backend groups, CDN integration for static content, and managed SSL certificates for security. They are also used with monitoring and logging to detect and respond to traffic anomalies.

Connections

Content Delivery Network (CDN)

Builds-on

Understanding HTTP(S) Load Balancing helps grasp how CDNs cache and serve content closer to users, improving speed and reducing load on origin servers.

Reverse Proxy

Same pattern

HTTP(S) Load Balancers act as reverse proxies, a concept used in web servers and security appliances to control and filter incoming traffic.

Traffic Control in Road Networks

Analogy in a different field

Studying how traffic lights and signs manage vehicle flow helps understand how load balancers direct data traffic to avoid congestion and improve flow.

Common Pitfalls

#1Sending all traffic to one backend server defeats load balancing.

Wrong approach:urlMap { defaultService = backendService1 } // Only one backend service defined, no traffic split

Correct approach:urlMap { defaultService = backendService1 pathMatchers { paths = ["/api/*"] service = backendService2 } } // Traffic split based on URL paths

Root cause:Misunderstanding URL maps and backend service configuration leads to ineffective load balancing.

#2Not enabling health checks causes traffic to be sent to down servers.

Wrong approach:backendService { name = "my-backend" // No health check configured }

Correct approach:backendService { name = "my-backend" healthChecks = ["my-health-check"] } healthCheck { name = "my-health-check" checkIntervalSec = 5 timeoutSec = 5 healthyThreshold = 2 unhealthyThreshold = 2 httpHealthCheck { port = 80 requestPath = "/health" } }

Root cause:Ignoring health checks causes the load balancer to send requests to unhealthy backends, causing errors.

#3Configuring SSL certificates only on backend servers increases complexity.

Wrong approach:Each backend server has its own SSL certificate and handles HTTPS termination.

Correct approach:Configure SSL certificates on the HTTP(S) Load Balancer to perform SSL offloading centrally.

Root cause:Not leveraging SSL offloading leads to duplicated effort and harder certificate management.

Key Takeaways

HTTP(S) Load Balancers distribute web traffic based on request content, improving speed and reliability.

They operate at Layer 7, allowing smart routing using URLs, headers, and cookies.

GCP's HTTP(S) Load Balancer supports global traffic distribution, health checks, and SSL offloading.

Proper configuration of backend services, URL maps, and health checks is essential for effective load balancing.

Advanced features like CDN integration and session affinity optimize performance and user experience.