Practice

(1/5)

1. What is the main idea behind boosting in machine learning?

easy

A. Randomly selecting features for training

B. Using a single complex model to fit data

C. Reducing the size of the dataset

D. Combining many weak models to create a strong model

Solution

Step 1: Understand boosting concept
Boosting builds a strong model by combining many simple (weak) models.
Step 2: Compare options with definition
Only Combining many weak models to create a strong model correctly describes this idea; others describe different techniques.
Final Answer:
Combining many weak models to create a strong model -> Option D
Quick Check:
Boosting = Combining weak models [OK]

Hint: Boosting = many weak models combined [OK]

Common Mistakes:

Thinking boosting uses one complex model
Confusing boosting with feature selection
Believing boosting reduces dataset size

2. Which of the following is the correct syntax to create an AdaBoost classifier in Python using scikit-learn?

easy

A. from sklearn.ensemble import AdaBoostClassifier model = AdaBoostClassifier()

B. from sklearn.ensemble import AdaBoost model = AdaBoost()

C. from sklearn.boost import AdaBoostClassifier model = AdaBoostClassifier()

D. import AdaBoost from sklearn.ensemble model = AdaBoost()

Solution

Step 1: Recall correct import path
In scikit-learn, AdaBoostClassifier is in sklearn.ensemble module.
Step 2: Check syntax correctness
from sklearn.ensemble import AdaBoostClassifier model = AdaBoostClassifier() uses correct import and class name; others have wrong module or syntax.
Final Answer:
from sklearn.ensemble import AdaBoostClassifier\nmodel = AdaBoostClassifier() -> Option A
Quick Check:
Correct import and class name = from sklearn.ensemble import AdaBoostClassifier model = AdaBoostClassifier() [OK]

Hint: AdaBoostClassifier is in sklearn.ensemble [OK]

Common Mistakes:

Using wrong module like sklearn.boost
Incorrect import syntax
Wrong class name without 'Classifier'

3. Consider this Python code using AdaBoost:

from sklearn.datasets import load_iris
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

iris = load_iris()
X_train, X_test, y_train, y_test = train_test_split(iris.data, iris.target, random_state=42)
model = AdaBoostClassifier(n_estimators=10, random_state=42)
model.fit(X_train, y_train)
preds = model.predict(X_test)
print(round(accuracy_score(y_test, preds), 2))

What is the printed output?

medium

A. 0.85

B. 0.75

C. 0.97

D. 0.60

Solution

Step 1: Understand the dataset and model
Iris dataset is simple; AdaBoost with 10 estimators usually achieves accuracy around 0.85 on this split.
Step 2: Check typical AdaBoost accuracy on iris
Common results show accuracy near 85% on test split with random_state=42 and 10 estimators.
Final Answer:
0.85 -> Option A
Quick Check:
Typical AdaBoost iris accuracy = 0.85 [OK]

Hint: AdaBoost on iris usually scores ~0.85 accuracy [OK]

Common Mistakes:

Assuming low accuracy for simple dataset
Confusing accuracy with training score
Ignoring random_state effect

4. The following code tries to train an AdaBoost model but raises an error:

from sklearn.ensemble import AdaBoostClassifier
model = AdaBoostClassifier(n_estimators='ten')
model.fit(X_train, y_train)

What is the cause of the error?

medium

A. Model cannot be trained without specifying 'learning_rate'

B. Missing import for 'X_train' and 'y_train'

C. 'n_estimators' must be an integer, not a string

D. AdaBoostClassifier does not have 'n_estimators' parameter

Solution

Step 1: Check parameter types
n_estimators expects an integer number of weak learners, not a string.
Step 2: Identify error cause
Passing 'ten' as string causes a type error; other options are incorrect because imports or learning_rate are not mandatory.
Final Answer:
'n_estimators' must be an integer, not a string -> Option C
Quick Check:
n_estimators type error = 'n_estimators' must be an integer, not a string [OK]

Hint: n_estimators must be int, not string [OK]

Common Mistakes:

Thinking learning_rate is required
Ignoring parameter type requirements
Assuming missing imports cause this error

5. You want to improve a weak decision tree model using boosting. Which approach best fits this goal?

hard

A. Increase the depth of a single decision tree

B. Use Gradient Boosting to sequentially correct errors of weak trees

C. Use random forests to average many deep trees

D. Apply PCA to reduce features before training the tree

Solution

Step 1: Understand boosting application
Boosting improves weak models by sequentially correcting their errors.
Step 2: Match approach to boosting
Gradient Boosting fits this by building trees one after another to fix mistakes.
Final Answer:
Use Gradient Boosting to sequentially correct errors of weak trees -> Option B
Quick Check:
Boosting = sequential error correction [OK]

Hint: Boosting fixes errors step-by-step [OK]

Common Mistakes:

Confusing boosting with random forests
Trying to fix with one big tree
Using PCA unrelated to boosting

Boosting concept in ML Python - ML Experiment: Train & Evaluate

Start learning this pattern below

Practice

Solution

Step 1: Understand boosting concept

Step 2: Compare options with definition

Final Answer:

Quick Check:

Solution

Step 1: Recall correct import path

Step 2: Check syntax correctness

Final Answer:

Quick Check:

Solution

Step 1: Understand the dataset and model

Step 2: Check typical AdaBoost accuracy on iris

Final Answer:

Quick Check:

Solution

Step 1: Check parameter types

Step 2: Identify error cause

Final Answer:

Quick Check:

Solution

Step 1: Understand boosting application

Step 2: Match approach to boosting

Final Answer:

Quick Check: