Practice - 5 Tasks

Answer the questions below

1. Fill in the blank

easy

Complete the code to define the minimum number of instances for auto-scaling.

MLOps

auto_scaling_config = {
    "MinInstances": [1],
    "MaxInstances": 10
}

Options:

B. 10


2. Fill in the blank

medium

Complete the code to set the target CPU utilization percentage for scaling.

MLOps

auto_scaling_config = {
    "TargetCPUUtilization": [1]
}

Options:

A. 50

B. 10

C. 90

D. 110

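To illustrate what a target CPU utilization setting does, here is a minimal sketch of a proportional target-tracking calculation: the instance count is scaled by the ratio of observed to target utilization and rounded up. This mirrors the formula documented for the Kubernetes Horizontal Pod Autoscaler; whether the service behind this exercise works the same way is an assumption. The function name `desired_instances` and its parameters are hypothetical.

```python
import math

def desired_instances(current_instances: int, current_cpu: float, target_cpu: float) -> int:
    """Estimate the instance count needed to bring CPU back to the target.

    Illustrative assumption: proportional scaling, rounded up, never below 1.
    Real services also apply cooldowns and min/max limits.
    """
    if not 0 < target_cpu <= 100:
        # A utilization percentage outside (0, 100] cannot be reached.
        raise ValueError("target_cpu must be in the range (0, 100]")
    return max(1, math.ceil(current_instances * current_cpu / target_cpu))

print(desired_instances(4, 80.0, 50.0))  # 4 * 80 / 50 = 6.4, rounded up to 7
print(desired_instances(4, 25.0, 50.0))  # 4 * 25 / 50 = 2.0, so 2
```

A lower target leaves more headroom per instance but keeps more instances running; a higher target does the opposite.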

3. Fill in the blank

hard

Fix the error in the auto-scaling policy by completing the missing field.

MLOps

auto_scaling_policy = {
    "PolicyName": "ScaleOutPolicy",
    "AdjustmentType": [1],
    "ScalingAdjustment": 2
}

Options:

A. PercentChangeInCapacity

B. ChangeInCapacity

C. ExactCapacity

D. InvalidType

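As a rough guide to how an adjustment type changes the meaning of `ScalingAdjustment`, the sketch below applies the same number under each type name listed in the options. The semantics follow the usual meaning of these names in AWS-style scaling policies; the function `apply_adjustment` is hypothetical, and the floor rounding used for percentages is a simplification (real services define their own rounding rules).

```python
def apply_adjustment(current: int, adjustment_type: str, scaling_adjustment: int) -> int:
    """Return the new capacity after applying one scaling adjustment.

    Hypothetical helper for illustration only, not a cloud SDK call.
    """
    if adjustment_type == "ChangeInCapacity":
        # Add (or subtract) a fixed number of instances.
        return current + scaling_adjustment
    if adjustment_type == "PercentChangeInCapacity":
        # Change capacity by a percentage of the current size (floor rounding, simplified).
        return current + (current * scaling_adjustment) // 100
    if adjustment_type == "ExactCapacity":
        # Ignore the current size and set capacity to the given value.
        return scaling_adjustment
    raise ValueError(f"unknown adjustment type: {adjustment_type!r}")

print(apply_adjustment(4, "ChangeInCapacity", 2))          # 4 + 2 = 6
print(apply_adjustment(4, "PercentChangeInCapacity", 50))  # 4 + 50% of 4 = 6
print(apply_adjustment(4, "ExactCapacity", 2))             # set to 2
```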

4. Fill in the blank

hard

Fill both blanks to create a scaling rule that triggers when CPU usage is above 70%.

MLOps

scaling_rule = {
    "MetricName": [1],
    "Threshold": [2]
}

Options:

A. "CPUUtilization"

B. "MemoryUsage"

C. 70

D. 30

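A scaling rule of this shape is typically evaluated by looking up the named metric and comparing it with the threshold. The sketch below shows one way that check might look; `rule_triggers`, the `QueueDepth` metric, and the sample readings are all hypothetical and chosen only for illustration.

```python
def rule_triggers(rule: dict, metrics: dict) -> bool:
    """Return True when the rule's metric is strictly above its threshold."""
    value = metrics.get(rule["MetricName"])
    if value is None:
        # A metric that was not reported never triggers scaling in this sketch.
        return False
    return value > rule["Threshold"]

# Hypothetical rule and readings, not taken from the exercise.
example_rule = {"MetricName": "QueueDepth", "Threshold": 100}

print(rule_triggers(example_rule, {"QueueDepth": 140}))     # True: 140 > 100
print(rule_triggers(example_rule, {"QueueDepth": 100}))     # False: not strictly above
print(rule_triggers(example_rule, {"CPUUtilization": 95}))  # False: metric not reported
```

Using a strict comparison matches the wording "above" in the task; a service that triggers at "greater than or equal" would use `>=` instead.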

5. Fill in the blank

hard

Fill all three blanks to define a complete auto-scaling configuration with min, max, and target CPU utilization.

MLOps

auto_scaling_config = {
    "MinInstances": [1],
    "MaxInstances": [2],
    "TargetCPUUtilization": [3]
}

Options:

B. 50

C. 10

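When all three settings appear together, they constrain one another: the minimum must not exceed the maximum, and the target must be a reachable percentage. The sketch below checks those constraints and clamps a requested instance count to the configured limits. The helper names and the example values are assumptions for illustration, not the exercise's answer.

```python
def validate_config(config: dict) -> None:
    """Raise ValueError if the auto-scaling settings are inconsistent."""
    min_n = config["MinInstances"]
    max_n = config["MaxInstances"]
    target = config["TargetCPUUtilization"]
    if min_n < 0:
        raise ValueError("MinInstances must be non-negative")
    if max_n < min_n:
        raise ValueError("MaxInstances must be at least MinInstances")
    if not 0 < target <= 100:
        raise ValueError("TargetCPUUtilization must be in the range (0, 100]")

def clamp_to_limits(desired: int, config: dict) -> int:
    """Keep a desired instance count within [MinInstances, MaxInstances]."""
    return max(config["MinInstances"], min(desired, config["MaxInstances"]))

# Hypothetical values chosen only to exercise the helpers.
example_config = {"MinInstances": 2, "MaxInstances": 8, "TargetCPUUtilization": 60}
validate_config(example_config)

print(clamp_to_limits(12, example_config))  # 8: capped at the maximum
print(clamp_to_limits(1, example_config))   # 2: raised to the minimum
print(clamp_to_limits(5, example_config))   # 5: already within limits
```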

Practice

(1/5)

1. What is the main purpose of auto-scaling inference endpoints in ML services?

easy

A. To automatically adjust the number of servers based on traffic

B. To manually add servers when traffic increases

C. To reduce the accuracy of ML models during high traffic

D. To store more data for training models

Auto-scaling inference endpoints in MLOps - Interactive Code Practice


Solution

Step 1: Understand auto-scaling concept

Step 2: Identify the purpose in ML inference

Final Answer: A. To automatically adjust the number of servers based on traffic

Quick Check:

Solution

Step 1: Identify minimum server setting

Step 2: Differentiate from other settings

Final Answer:

Quick Check:

Solution

Step 1: Compare current usage to target utilization

Step 2: Determine scaling action

Final Answer:

Quick Check:

Solution

Step 1: Analyze scaling limits

Step 2: Check target utilization impact

Final Answer:

Quick Check:

Solution

Step 1: Set minimum and maximum servers correctly

Step 2: Set target utilization to 60%

Step 3: Verify options

Final Answer:

Quick Check: