
GAN for image generation in Computer Vision

Introduction
GANs (Generative Adversarial Networks) help computers create new images that look real by learning from many example pictures. Common uses include:
Creating new artwork or photos that don't exist yet.
Improving blurry or low-quality images.
Making realistic faces for games or movies.
Generating images for training other AI models when data is limited.
Syntax
class Generator(nn.Module):
    def __init__(self):
        super().__init__()
        # layers to create images from noise

class Discriminator(nn.Module):
    def __init__(self):
        super().__init__()
        # layers to decide if image is real or fake

# Training loop:
for epoch in range(num_epochs):
    # Train discriminator on real and fake images
    # Train generator to fool discriminator
Generator creates fake images from random noise.
Discriminator learns to tell real images from fake ones.
Examples
Generate fake images by giving random noise to the generator.
noise = torch.randn(batch_size, noise_dim)
fake_images = generator(noise)
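A self-contained sketch of this step, assuming the same 100-dimensional noise and 784-pixel (28x28, MNIST-sized) image layout used in the Sample Model section:

```python
import torch
import torch.nn as nn

# Minimal generator for illustration: 100-dim noise -> 784 flat pixels
generator = nn.Sequential(
    nn.Linear(100, 256),
    nn.ReLU(True),
    nn.Linear(256, 784),
    nn.Tanh(),  # outputs land in (-1, 1)
)

batch_size, noise_dim = 16, 100
noise = torch.randn(batch_size, noise_dim)   # one random noise vector per image
fake_images = generator(noise)               # shape: (16, 784)

# Reshape the flat vectors into 1-channel 28x28 images for viewing
images = fake_images.view(batch_size, 1, 28, 28)
print(images.shape)  # torch.Size([16, 1, 28, 28])
```

Before training, these images are just structured noise; the generator only learns to produce realistic pictures through the adversarial training loop.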
Discriminator predicts if images are real or fake.
real_preds = discriminator(real_images)
fake_preds = discriminator(fake_images.detach())
Generator tries to make discriminator think fake images are real.
loss_gen = loss_function(discriminator(fake_images), real_labels)
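Putting this loss into a complete generator update looks roughly like the following sketch (the tiny two-layer networks and 784-pixel images are illustrative assumptions, not a recommended architecture). Note that `fake_images` is not detached here, so the gradients flow back through the discriminator into the generator:

```python
import torch
import torch.nn as nn
import torch.optim as optim

# Toy networks for illustration only
G = nn.Sequential(nn.Linear(100, 256), nn.ReLU(True), nn.Linear(256, 784), nn.Tanh())
D = nn.Sequential(nn.Linear(784, 256), nn.LeakyReLU(0.2), nn.Linear(256, 1), nn.Sigmoid())

loss_function = nn.BCELoss()
optimizer_G = optim.Adam(G.parameters(), lr=0.001)

batch_size = 16
real_labels = torch.ones(batch_size, 1)  # generator wants D to say "real" (1)

# One generator update step
optimizer_G.zero_grad()
fake_images = G(torch.randn(batch_size, 100))
loss_gen = loss_function(D(fake_images), real_labels)  # no .detach() here
loss_gen.backward()
optimizer_G.step()
print(f"generator loss: {loss_gen.item():.4f}")
```

Compare this with the discriminator step above, where `fake_images.detach()` deliberately blocks those same gradients.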
Sample Model
This code creates a simple GAN with a generator and discriminator. It trains both for one step using random data to show how losses change.
import torch
import torch.nn as nn
import torch.optim as optim

# Simple Generator
class Generator(nn.Module):
    def __init__(self):
        super().__init__()
        self.model = nn.Sequential(
            nn.Linear(100, 256),
            nn.ReLU(True),
            nn.Linear(256, 784),
            nn.Tanh()
        )
    def forward(self, x):
        return self.model(x)

# Simple Discriminator
class Discriminator(nn.Module):
    def __init__(self):
        super().__init__()
        self.model = nn.Sequential(
            nn.Linear(784, 256),
            nn.LeakyReLU(0.2, inplace=True),
            nn.Linear(256, 1),
            nn.Sigmoid()
        )
    def forward(self, x):
        return self.model(x)

# Initialize models
G = Generator()
D = Discriminator()

# Loss and optimizers
criterion = nn.BCELoss()
optimizer_G = optim.Adam(G.parameters(), lr=0.001)
optimizer_D = optim.Adam(D.parameters(), lr=0.001)

# Training data: random real images (for demo)
real_images = torch.randn(16, 784)
real_labels = torch.ones(16, 1)
fake_labels = torch.zeros(16, 1)

# Training loop for 1 epoch
for epoch in range(1):
    # Train Discriminator
    optimizer_D.zero_grad()
    outputs_real = D(real_images)
    loss_real = criterion(outputs_real, real_labels)

    noise = torch.randn(16, 100)
    fake_images = G(noise)
    outputs_fake = D(fake_images.detach())
    loss_fake = criterion(outputs_fake, fake_labels)

    loss_D = loss_real + loss_fake
    loss_D.backward()
    optimizer_D.step()

    # Train Generator
    optimizer_G.zero_grad()
    outputs = D(fake_images)
    loss_G = criterion(outputs, real_labels)
    loss_G.backward()
    optimizer_G.step()

    print(f"Epoch {epoch+1} | Loss D: {loss_D.item():.4f} | Loss G: {loss_G.item():.4f}")
Important Notes
GAN training can be tricky; losses may not always decrease smoothly.
Use real datasets such as MNIST or CIFAR-10 for meaningful results; the random "real" images above only demonstrate the mechanics.
Normalize images between -1 and 1 when using Tanh activation.
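The normalization note above can be shown with plain tensor arithmetic, assuming pixel values start in [0, 1] (as they do after `torchvision`'s `ToTensor`):

```python
import torch

# Fake batch of images with pixel values in [0, 1]
images = torch.rand(16, 784)

# Scale to [-1, 1] so real images match the generator's Tanh output range
normalized = images * 2.0 - 1.0

# Undo the scaling before displaying or saving generated samples
restored = (normalized + 1.0) / 2.0
print(torch.allclose(restored, images))  # True
```

With torchvision, the equivalent transform for single-channel images is `transforms.Normalize((0.5,), (0.5,))` applied after `ToTensor`.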
Summary
GANs have two parts: a generator that makes images and a discriminator that checks them.
They learn together by competing, improving image quality over time.
You can use GANs to create new, realistic images from random noise.