Practice

(1/5)

What is the main purpose of using polynomial regression instead of simple linear regression?

easy

A. To fit curved relationships between variables

B. To reduce the number of features

C. To speed up training time

D. To handle missing data automatically

Solution

Step 1: Understand linear regression limitation
Linear regression fits straight lines, which cannot capture curves in data.
Step 2: Role of polynomial regression
Polynomial regression fits curved lines by adding powers of features, capturing non-linear patterns.
Final Answer:
To fit curved relationships between variables -> Option A
Quick Check:
Polynomial regression = curved fit [OK]

Hint: Polynomial regression fits curves, not just straight lines [OK]

Common Mistakes:

Thinking polynomial regression reduces features
Assuming it speeds up training
Believing it handles missing data automatically

Which of the following is the correct way to create a polynomial regression pipeline in Python using sklearn?

from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression

pipeline = Pipeline([
    ('poly', PolynomialFeatures(degree=2)),
    ('linear', LinearRegression())
])

easy

A. pipeline = Pipeline([('poly', PolynomialFeatures(degree=2)), ('linear', LinearRegression())])

B. pipeline = Pipeline([('linear', LinearRegression()), ('poly', PolynomialFeatures(degree=2))])

C. pipeline = Pipeline([('poly', LinearRegression()), ('linear', PolynomialFeatures(degree=2))])

D. pipeline = Pipeline([('poly', PolynomialFeatures()), ('linear', LinearRegression(degree=2))])

Solution

Step 1: Order of pipeline steps
PolynomialFeatures must come before LinearRegression to transform data first.
Step 2: Correct usage of classes and parameters
PolynomialFeatures takes degree parameter; LinearRegression does not take degree.
Final Answer:
pipeline = Pipeline([('poly', PolynomialFeatures(degree=2)), ('linear', LinearRegression())]) -> Option A
Quick Check:
PolynomialFeatures before LinearRegression [OK]

Hint: Put PolynomialFeatures before LinearRegression in pipeline [OK]

Common Mistakes:

Swapping order of pipeline steps
Passing degree to LinearRegression
Omitting degree in PolynomialFeatures

Given the following code, what will print(y_pred) output?

import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression

X = np.array([[1], [2], [3]])
y = np.array([1, 4, 9])

pipeline = Pipeline([
    ('poly', PolynomialFeatures(degree=2)),
    ('linear', LinearRegression())
])
pipeline.fit(X, y)
y_pred = pipeline.predict(np.array([[4]]))
print(np.round(y_pred, 2))

medium

A. [10.0]

B. [8.0]

C. [4.0]

D. [16.0]

Solution

Step 1: Understand data and model
X = [[1],[2],[3]] with y = [1,4,9] fits y = x^2 perfectly.
Step 2: Predict for X=4 using polynomial degree 2
Model learns y = x^2, so prediction at 4 is 4^2 = 16.
Final Answer:
[16.0] -> Option D
Quick Check:
4 squared = 16 [OK]

Hint: Polynomial degree 2 fits squares; predict 4^2 = 16 [OK]

Common Mistakes:

Ignoring polynomial transformation
Predicting linear value instead of squared
Rounding errors without np.round

Identify the error in this polynomial regression pipeline code:

from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression

pipeline = Pipeline([
    ('linear', LinearRegression()),
    ('poly', PolynomialFeatures(degree=3))
])

pipeline.fit(X_train, y_train)

medium

A. LinearRegression should not be used in pipeline

B. The order of pipeline steps is incorrect

C. PolynomialFeatures degree must be 2, not 3

D. Missing import for X_train and y_train

Solution

Step 1: Check pipeline step order
PolynomialFeatures must come before LinearRegression to transform data first.
Step 2: Confirm degree and imports
Degree 3 is valid; imports for data are assumed outside snippet.
Final Answer:
The order of pipeline steps is incorrect -> Option B
Quick Check:
PolynomialFeatures before LinearRegression [OK]

Hint: PolynomialFeatures must be first in pipeline [OK]

Common Mistakes:

Swapping order of steps
Thinking degree must be 2
Confusing missing data imports with pipeline error

You want to model a dataset with a complex curve. You try polynomial regression with degree=2 but the fit is poor. What is the best next step?

hard

A. Remove polynomial features and use linear regression only

B. Decrease the polynomial degree to avoid overfitting

C. Increase the polynomial degree to capture more complexity

D. Use degree=2 but reduce training data size

Solution

Step 1: Understand model complexity and fit
Degree 2 polynomial may be too simple for complex curves, causing poor fit.
Step 2: Adjust polynomial degree
Increasing degree allows model to fit more complex patterns, improving fit quality.
Final Answer:
Increase the polynomial degree to capture more complexity -> Option C
Quick Check:
Higher degree = better complex fit [OK]

Hint: Raise degree to fit complex curves better [OK]

Common Mistakes:

Lowering degree when fit is poor
Removing polynomial features unnecessarily
Reducing data size instead of model complexity

Why Polynomial regression pipeline in ML Python? - Purpose & Use Cases

Start learning this pattern below

Practice

Solution

Step 1: Understand linear regression limitation

Step 2: Role of polynomial regression

Final Answer:

Quick Check:

Solution

Step 1: Order of pipeline steps

Step 2: Correct usage of classes and parameters

Final Answer:

Quick Check:

Solution

Step 1: Understand data and model

Step 2: Predict for X=4 using polynomial degree 2

Final Answer:

Quick Check:

Solution

Step 1: Check pipeline step order

Step 2: Confirm degree and imports

Final Answer:

Quick Check:

Solution

Step 1: Understand model complexity and fit

Step 2: Adjust polynomial degree

Final Answer:

Quick Check: