Practice

(1/5)

1. What is the main idea behind Gradient Boosting for regression?

easy

A. Combining many simple models step-by-step to improve predictions

B. Using a single complex model to predict values

C. Randomly guessing values and selecting the best guess

D. Using only one decision tree without updates

Solution

Step 1: Understand Gradient Boosting concept
Gradient Boosting builds a strong model by adding simple models one after another, each fixing errors of the previous.
Step 2: Compare options with this idea
Only Combining many simple models step-by-step to improve predictions describes combining many simple models step-by-step to improve predictions.
Final Answer:
Combining many simple models step-by-step to improve predictions -> Option A
Quick Check:
Gradient Boosting = Combining simple models [OK]

Hint: Remember: Gradient Boosting adds models one by one [OK]

Common Mistakes:

Thinking it uses only one model
Confusing with random guessing
Assuming it uses a single complex model

2. Which of the following is the correct way to create a Gradient Boosting Regressor in Python using scikit-learn?

easy

import GradientBoostingRegressor
model = GradientBoostingRegressor()

from sklearn.linear_model import GradientBoostingRegressor
model = GradientBoostingRegressor(learning_rate=0.1)

from sklearn.ensemble import GradientBoostingClassifier
model = GradientBoostingClassifier(n_estimators=100)

from sklearn.ensemble import GradientBoostingRegressor
model = GradientBoostingRegressor(n_estimators=100, learning_rate=0.1)

Solution

Step 1: Identify correct import and class for regression
GradientBoostingRegressor is in sklearn.ensemble, not sklearn.linear_model or a classifier.
Step 2: Check syntax correctness
from sklearn.ensemble import GradientBoostingRegressor model = GradientBoostingRegressor(n_estimators=100, learning_rate=0.1) correctly imports and creates the model with parameters n_estimators and learning_rate.
Final Answer:
Correct import and model creation with sklearn.ensemble.GradientBoostingRegressor -> Option D
Quick Check:
Correct import and class = from sklearn.ensemble import GradientBoostingRegressor model = GradientBoostingRegressor(n_estimators=100, learning_rate=0.1) [OK]

Hint: Use sklearn.ensemble for GradientBoostingRegressor [OK]

Common Mistakes:

Importing from wrong module
Using classifier instead of regressor
Missing parameters or wrong syntax

3. What will be the output of the following code snippet?

from sklearn.ensemble import GradientBoostingRegressor
import numpy as np

X = np.array([[1], [2], [3], [4], [5]])
y = np.array([1.5, 3.5, 5.5, 7.5, 9.5])
model = GradientBoostingRegressor(n_estimators=10, learning_rate=0.5)
model.fit(X, y)
pred = model.predict(np.array([[6]]))
print(round(pred[0], 1))

medium

A. 10.0

B. 11.5

C. 12.0

D. 9.5

Solution

Step 1: Understand training data pattern
y roughly equals 2*x - 0.5 (1.5, 3.5, 5.5, 7.5, 9.5). So for x=6, expected y ~ 11.5.
Step 2: Predict with Gradient Boosting model
Model with 10 estimators and learning rate 0.5 fits this pattern well, predicting close to 11.5 for input 6.
Final Answer:
11.5 -> Option B
Quick Check:
Prediction for 6 ≈ 11.5 [OK]

Hint: Check pattern in y to guess prediction quickly [OK]

Common Mistakes:

Ignoring the linear pattern in data
Confusing classifier with regressor output
Rounding errors or wrong rounding

4. Identify the error in this Gradient Boosting regression code and fix it:

from sklearn.ensemble import GradientBoostingRegressor
X = [[1], [2], [3]]
y = [2, 4, 6]
model = GradientBoostingRegressor(n_estimators=50)
model.fit(X, y)
print(model.predict([4]))

medium

A. Import GradientBoostingClassifier instead

B. Change n_estimators to 1

C. Change predict input to [[4]] instead of [4]

D. Change y to a numpy array

Solution

Step 1: Check input shape for predict method
Model expects 2D array for predict, but [4] is 1D. It should be [[4]] to match training input shape.
Step 2: Fix predict input shape
Changing predict input to [[4]] fixes the error and allows prediction.
Final Answer:
Change predict input to [[4]] instead of [4] -> Option C
Quick Check:
Predict input shape must match training input [OK]

Hint: Always use 2D array for predict input in scikit-learn [OK]

Common Mistakes:

Passing 1D array to predict
Changing unrelated parameters
Using classifier instead of regressor

5. You want to improve your Gradient Boosting regression model's accuracy on a dataset but notice it overfits. Which combination of parameter changes is best to reduce overfitting?

hard

A. Decrease n_estimators and decrease learning_rate

B. Decrease n_estimators and increase learning_rate

C. Increase n_estimators and decrease learning_rate

D. Increase n_estimators and increase learning_rate

Solution

Step 1: Understand overfitting in Gradient Boosting
Overfitting means model fits training data too closely, losing generalization.
Step 2: Adjust parameters to reduce overfitting
Decreasing n_estimators reduces model complexity; decreasing learning_rate slows learning, both help reduce overfitting.
Final Answer:
Decrease n_estimators and decrease learning_rate -> Option A
Quick Check:
Lower complexity and slower learning reduce overfitting [OK]

Hint: Lower n_estimators and learning_rate to fight overfitting [OK]

Common Mistakes:

Increasing both parameters causing more overfitting
Increasing learning_rate alone
Ignoring parameter effects on overfitting

Why Gradient Boosting for regression in ML Python? - Purpose & Use Cases

Start learning this pattern below

Practice

Solution

Step 1: Understand Gradient Boosting concept

Step 2: Compare options with this idea

Final Answer:

Quick Check:

Solution

Step 1: Identify correct import and class for regression

Step 2: Check syntax correctness

Final Answer:

Quick Check:

Solution

Step 1: Understand training data pattern

Step 2: Predict with Gradient Boosting model

Final Answer:

Quick Check:

Solution

Step 1: Check input shape for predict method

Step 2: Fix predict input shape

Final Answer:

Quick Check:

Solution

Step 1: Understand overfitting in Gradient Boosting

Step 2: Adjust parameters to reduce overfitting

Final Answer:

Quick Check: