Practice

(1/5)

1. Why is it important to plan GPU infrastructure before starting a GenAI project?

easy

A. To reduce the size of the AI model automatically

B. To ensure the GPU has enough memory and speed for the AI model

C. Because GPUs are always cheaper than CPUs

D. To avoid using any GPUs and rely only on CPUs

Solution

Step 1: Understand GPU role in AI projects
GPUs speed up AI model training and need enough memory to handle data.
Step 2: Importance of matching GPU specs to model needs
Choosing a GPU with insufficient memory or speed will slow down or fail the project.
Final Answer:
To ensure the GPU has enough memory and speed for the AI model -> Option B
Quick Check:
GPU specs must match AI needs = D [OK]

Hint: Match GPU memory and speed to your AI model size [OK]

Common Mistakes:

Thinking CPUs can replace GPUs for heavy AI tasks
Assuming all GPUs have the same performance
Ignoring GPU memory limits

2. Which of the following is the correct way to check GPU memory using Python's PyTorch library?

easy

A. torch.cuda.memory_size()

B. torch.gpu.memory.total()

C. torch.cuda.get_device_properties(0).total_memory

D. torch.device.memory()

Solution

Step 1: Recall PyTorch GPU memory query syntax
The correct method is torch.cuda.get_device_properties(device_id).total_memory.
Step 2: Check each option for correctness
Only torch.cuda.get_device_properties(0).total_memory uses the correct PyTorch function and attribute.
Final Answer:
torch.cuda.get_device_properties(0).total_memory -> Option C
Quick Check:
Correct PyTorch GPU memory call = C [OK]

Hint: Use torch.cuda.get_device_properties(0).total_memory to check GPU memory [OK]

Common Mistakes:

Using non-existent PyTorch functions
Confusing device and memory functions
Missing the device index argument

3. Given this Python code snippet using PyTorch, what will be printed?

import torch
if torch.cuda.is_available():
    mem = torch.cuda.get_device_properties(0).total_memory
    print(mem > 8_000_000_000)
else:
    print(False)

medium

A. True if GPU memory is more than 8GB, else False

B. Always True

C. Always False

D. Raises an error if no GPU

Solution

Step 1: Understand the code logic
The code checks if a GPU is available, then compares its memory to 8GB (8 billion bytes).
Step 2: Determine output based on GPU memory
If GPU memory is greater than 8GB, it prints True; otherwise, False. If no GPU, prints False.
Final Answer:
True if GPU memory is more than 8GB, else False -> Option A
Quick Check:
GPU memory check > 8GB = A [OK]

Hint: Check GPU memory size condition to predict output [OK]

Common Mistakes:

Assuming always True regardless of GPU
Expecting error if no GPU instead of False
Confusing bytes with gigabytes

4. Identify the error in this GPU memory check code and select the fix:

import torch
if torch.cuda.is_available():
    mem = torch.cuda.get_device_properties().total_memory
    print(mem)
else:
    print('No GPU')

medium

A. Add device index 0 in get_device_properties: get_device_properties(0)

B. Replace torch.cuda.is_available() with torch.has_cuda()

C. Use torch.cuda.memory_allocated() instead of get_device_properties()

D. No error, code is correct

Solution

Step 1: Check get_device_properties usage
The function requires a device index argument, e.g., 0 for the first GPU.
Step 2: Identify the fix
Adding (0) fixes the error. Other options are incorrect or unnecessary.
Final Answer:
Add device index 0 in get_device_properties: get_device_properties(0) -> Option A
Quick Check:
Missing device index causes error = B [OK]

Hint: Always provide device index to get_device_properties() [OK]

Common Mistakes:

Omitting device index argument
Using non-existent torch.has_cuda()
Confusing memory functions

5. You plan to train a large GenAI model requiring 24GB GPU memory. Your local GPUs have 16GB each. Which is the best GPU infrastructure planning choice?

hard

A. Ignore memory limits and expect training to succeed

B. Reduce the model size to fit 16GB GPU and train locally

C. Train on CPU only to avoid GPU memory limits

D. Use multiple GPUs with model parallelism or switch to cloud GPUs with 24GB+ memory

Solution

Step 1: Analyze GPU memory requirement vs available hardware
The model needs 24GB, but local GPUs have only 16GB, so one GPU is insufficient.
Step 2: Consider solutions for insufficient GPU memory
Using multiple GPUs with model parallelism or cloud GPUs with enough memory solves the problem effectively.
Final Answer:
Use multiple GPUs with model parallelism or switch to cloud GPUs with 24GB+ memory -> Option D
Quick Check:
Match GPU memory to model needs with parallelism or cloud = A [OK]

Hint: Use multi-GPU or cloud GPUs for models needing more memory [OK]

Common Mistakes:

Trying to train large models on insufficient GPU memory
Ignoring cloud GPU options
Assuming CPU can replace GPU for large models

Epoch	Loss ↓	Accuracy ↑	Observation
1	0.85	0.60	Initial training with high loss and moderate accuracy
2	0.65	0.72	Loss decreased, accuracy improved
3	0.50	0.80	Model learning well, loss dropping steadily
4	0.40	0.85	Good convergence, accuracy increasing
5	0.35	0.88	Training stabilizing with low loss and high accuracy

GPU infrastructure planning in Prompt Engineering / GenAI - Model Pipeline Trace

Start learning this pattern below

Practice

Solution

Step 1: Understand GPU role in AI projects

Step 2: Importance of matching GPU specs to model needs

Final Answer:

Quick Check:

Solution

Step 1: Recall PyTorch GPU memory query syntax

Step 2: Check each option for correctness

Final Answer:

Quick Check:

Solution

Step 1: Understand the code logic

Step 2: Determine output based on GPU memory

Final Answer:

Quick Check:

Solution

Step 1: Check get_device_properties usage

Step 2: Identify the fix

Final Answer:

Quick Check:

Solution

Step 1: Analyze GPU memory requirement vs available hardware

Step 2: Consider solutions for insufficient GPU memory

Final Answer:

Quick Check: