Practice


1. What is the main purpose of compute resource management in MLOps?

easy

A. To write machine learning model code

B. To store data permanently on disk

C. To create user interfaces for ML applications

D. To control CPU, memory, and GPU usage for efficient job execution

Solution

Step 1: Understand resource management role
Compute resource management controls hardware resources like CPU, memory, and GPU.
Step 2: Identify its purpose in MLOps
It ensures jobs run efficiently and avoid crashes by managing these resources.
Final Answer:
To control CPU, memory, and GPU usage for efficient job execution -> Option D
Quick Check:
Resource management = control CPU, memory, GPU [OK]

Hint: Think about what hardware resources need managing [OK]

Common Mistakes:

Confusing resource management with coding tasks
Thinking it manages data storage only
Assuming it builds user interfaces

2. Which command correctly allocates GPU resources for a job in Kubernetes?

easy

A. kubectl run job --gpu=2

B. kubectl run job --requests=nvidia.com/gpu=2

C. kubectl run job --memory=2Gi

D. kubectl run job --cpu=2

Solution

Step 1: Recall Kubernetes resource request syntax
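Kubernetes exposes NVIDIA GPUs as the extended resource nvidia.com/gpu, and older kubectl versions let kubectl run set resource requests with --requests=<resource>=<amount>.
Step 2: Match correct GPU allocation command
Only kubectl run job --requests=nvidia.com/gpu=2 names the GPU resource; --gpu is not a valid flag, and the memory and CPU options do not allocate GPUs. Note that recent kubectl releases no longer accept --requests on kubectl run, and Kubernetes expects GPUs to be listed under resources.limits, so in practice GPU jobs are usually defined in a pod manifest.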
Final Answer:
kubectl run job --requests=nvidia.com/gpu=2 -> Option B
Quick Check:
GPU allocation uses --requests=nvidia.com/gpu [OK]

Hint: Look for --requests with nvidia.com/gpu key [OK]

Common Mistakes:

Using --gpu directly (not valid syntax)
Confusing memory or CPU flags with GPU
Missing the resource request keyword
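
As a rough illustration of the manifest route, the sketch below builds a minimal Pod manifest as a Python dict and prints it as JSON, which kubectl apply -f accepts alongside YAML. The function name, pod name, container name, and image are hypothetical placeholders, and the sketch assumes the cluster runs the NVIDIA device plugin so that nvidia.com/gpu is a known resource.

```python
import json


def gpu_pod_manifest(name: str, image: str, gpus: int) -> dict:
    """Build a minimal Pod manifest that asks for `gpus` NVIDIA GPUs."""
    if gpus < 1:
        raise ValueError("gpus must be a positive integer")
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": "trainer",  # hypothetical container name
                    "image": image,
                    "resources": {
                        # GPUs are requested in whole units under limits
                        "limits": {"nvidia.com/gpu": gpus},
                    },
                }
            ],
        },
    }


if __name__ == "__main__":
    # Placeholder pod name and image, for illustration only
    manifest = gpu_pod_manifest("train-job", "example.com/trainer:latest", 2)
    print(json.dumps(manifest, indent=2))
```

Keeping the GPU count under limits only is enough here, because Kubernetes uses the limit as the request for extended resources when no request is given.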

3. Given this Kubernetes pod spec snippet, what is the CPU limit set for the container?

resources:
  limits:
    cpu: "4"
  requests:
    cpu: "2"

medium

A. 4 CPUs

B. 6 CPUs

C. No CPU limit set

D. 2 CPUs

Solution

Step 1: Identify CPU limit in pod spec
The 'limits' section sets the maximum CPU the container may use; here cpu: "4" means 4 CPUs.
Step 2: Understand difference between requests and limits
Requests are the amount the scheduler reserves for the container (2 CPUs); limits are the maximum it is allowed to use (4 CPUs).
Final Answer:
4 CPUs -> Option A
Quick Check:
CPU limit = 4 CPUs [OK]

Hint: Limits set max CPU, requests set minimum [OK]

Common Mistakes:

Confusing requests with limits
Ignoring quotes around CPU values
Assuming no CPU limit is set
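
To make the request/limit distinction concrete, here is a small sketch that reads both values from a resources mapping shaped like the snippet in the question. The helper names parse_cpu and cpu_bounds are made up for this example, and the parser only handles plain core counts and millicore values such as "500m".

```python
def parse_cpu(quantity: str) -> float:
    """Convert a Kubernetes CPU quantity such as "4" or "500m" to cores."""
    q = quantity.strip()
    if q.endswith("m"):
        return int(q[:-1]) / 1000  # millicores to cores
    return float(q)


def cpu_bounds(resources: dict) -> tuple:
    """Return (request, limit) in cores; None where a value is not set."""
    def get(section: str):
        value = resources.get(section, {}).get("cpu")
        return None if value is None else parse_cpu(str(value))

    return get("requests"), get("limits")


# Same values as the pod spec snippet in the question
resources = {"limits": {"cpu": "4"}, "requests": {"cpu": "2"}}
request, limit = cpu_bounds(resources)
print(f"request={request} cores, limit={limit} cores")
```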

4. You see this error when submitting a job: Insufficient cpu resources. What is the most likely cause?

medium

A. The job is missing GPU allocation

B. The job has no CPU requests set

C. The job requests more CPU than available on the cluster

D. The job memory limit is too high

Solution

Step 1: Interpret the error message
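'Insufficient cpu resources' means the scheduler could not find enough unreserved CPU to satisfy the job's CPU request.
Step 2: Identify cause from options
Only option C describes that situation: the job asks for more CPU than the cluster can currently provide. Missing GPU allocation, unset CPU requests, and high memory limits do not produce a CPU shortage error.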
Final Answer:
The job requests more CPU than available on the cluster -> Option C
Quick Check:
Insufficient CPU = request > available [OK]

Hint: Error means requested CPU > cluster CPU [OK]

Common Mistakes:

Assuming missing CPU requests cause this error
Confusing CPU and GPU errors
Blaming memory limits for CPU shortage
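
The check behind this error can be sketched as a toy model: a pod fits on a node only if that node's unreserved CPU covers the pod's request. The Node class, node names, and numbers below are invented for illustration and are not the real scheduler's data structures.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    allocatable_cpu: float  # cores the node can hand out to pods
    requested_cpu: float    # cores already requested by scheduled pods

    @property
    def free_cpu(self) -> float:
        return self.allocatable_cpu - self.requested_cpu


def nodes_that_fit(nodes, cpu_request: float):
    """Return names of nodes whose unreserved CPU covers the request."""
    return [n.name for n in nodes if n.free_cpu >= cpu_request]


# node-a has 2.0 cores free, node-b has 2.5 cores free
nodes = [Node("node-a", 8.0, 6.0), Node("node-b", 4.0, 1.5)]
print(nodes_that_fit(nodes, 2.0))  # both nodes can host the pod
print(nodes_that_fit(nodes, 3.0))  # empty list: no single node has 3 cores free
```

Note that the second request fails even though 4.5 cores are free across the cluster in total, because a pod must fit on one node.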

5. You want to run multiple ML training jobs on a GPU cluster. Which strategy best manages GPU resources to avoid conflicts?

hard

A. Allocate GPUs explicitly per job and release after completion

B. Run all jobs without GPU limits and share GPUs freely

C. Assign CPU limits only and ignore GPU allocation

D. Use only CPU resources to avoid GPU conflicts

Solution

Step 1: Understand GPU resource management needs
Explicit allocation prevents multiple jobs from using the same GPU simultaneously.
Step 2: Evaluate options for best practice
Option A gives each job exclusive GPUs while it runs and returns them when it finishes, so later jobs can reuse them. Sharing GPUs without limits invites contention, and the CPU-only options either ignore the GPUs or leave them unmanaged.
Final Answer:
Allocate GPUs explicitly per job and release after completion -> Option A
Quick Check:
Explicit GPU allocation avoids conflicts [OK]

Hint: Always allocate and release GPUs per job [OK]

Common Mistakes:

Ignoring GPU allocation causing conflicts
Assuming CPU limits control GPU usage
Avoiding GPUs when cluster has them
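
The allocate-then-release pattern can be sketched with a context manager that hands out GPU indices and always returns them, even if the job fails. GpuPool is a hypothetical toy class for a single machine, not part of Kubernetes or any GPU library.

```python
from contextlib import contextmanager


class GpuPool:
    """Tracks which GPU indices are free on a single machine (toy model)."""

    def __init__(self, gpu_count: int):
        self._free = set(range(gpu_count))

    @contextmanager
    def allocate(self, count: int):
        if count > len(self._free):
            raise RuntimeError(f"need {count} GPUs, only {len(self._free)} free")
        granted = sorted(self._free)[:count]
        self._free.difference_update(granted)
        try:
            yield granted
        finally:
            # Release even if the job raises, so GPUs are not leaked
            self._free.update(granted)

    @property
    def free_count(self) -> int:
        return len(self._free)


pool = GpuPool(4)
with pool.allocate(2) as job_a:
    with pool.allocate(2) as job_b:
        # The two jobs hold disjoint GPU indices; none are left over
        print(job_a, job_b, pool.free_count)
# Both jobs have finished, so every GPU is back in the pool
print(pool.free_count)
```

A third job asking for GPUs while both are held would get an error instead of silently sharing a device, which is the conflict explicit allocation is meant to prevent.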

