Practice - 5 Tasks

Answer the questions below

1fill in blank

easy

Complete the code to select the device for inference based on availability.

MLOps

device = 'cuda' if torch.cuda.is_available() else [1]

Drag options to blanks, or click blank then click option'

A'cpu'

B'tpu'

C'gpu'

D'fpga'

Attempts:

3 left

2fill in blank

medium

Complete the code to set batch size for CPU inference to avoid overload.

MLOps

batch_size = [1] if device == 'cpu' else 64

Drag options to blanks, or click blank then click option'

A16

B512

C256

D128

Attempts:

3 left

3fill in blank

hard

Fix the error in the code that measures inference time on CPU.

MLOps

start = time.time()
output = model(input.to([1]))
end = time.time()

Drag options to blanks, or click blank then click option'

A'gpu'

B'cpu'

C'cuda'

D'tpu'

Attempts:

3 left

4fill in blank

hard

Fill both blanks to create a dictionary showing inference speed tradeoffs.

MLOps

inference_speed = {'CPU': [1], 'GPU': [2]  # in milliseconds

Drag options to blanks, or click blank then click option'

A100

B10

C50

Attempts:

3 left

5fill in blank

hard

Fill all three blanks to filter models suitable for CPU inference with low memory.

MLOps

suitable_models = {m: mem for m, mem in models.items() if mem [1] 4 and 'light' [2] m and mem [3] 1}

Drag options to blanks, or click blank then click option'

A<=

Bin

C>=

D==

Attempts:

3 left

Practice

(1/5)

1. Which of the following is a main advantage of using a GPU over a CPU for machine learning inference?

easy

A. Lower power consumption for small tasks

B. Cheaper hardware cost

C. Better performance on single-threaded tasks

D. Faster processing for large batches of data

GPU vs CPU inference tradeoffs in MLOps - Interactive Practice

Start learning this pattern below

Practice

Solution

Step 1: Understand GPU design for parallelism

Step 2: Compare CPU and GPU strengths

Final Answer:

Quick Check:

Solution

Step 1: Understand CUDA_VISIBLE_DEVICES usage

Step 2: Check each option's effect

Final Answer:

Quick Check:

Solution

Step 1: Understand timing code output

Step 2: Match CPU inference time to output

Final Answer:

Quick Check:

Solution

Step 1: Identify GPU performance factors

Step 2: Evaluate options for improving GPU speed

Final Answer:

Quick Check:

Solution

Step 1: Analyze model size and input volume impact

Step 2: Consider budget and batch size tradeoffs

Final Answer:

Quick Check: