Challenge - 5 Problems
SciPy Pipeline Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
❓ Predict Output
intermediate · 2:00 remaining
Output of a simple pipeline with StandardScaler and LogisticRegression
What is the output of the following code snippet that creates a pipeline with a scaler and logistic regression, then fits and predicts on a test sample?
Python
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
import numpy as np

X_train = np.array([[1, 2], [2, 3], [3, 4], [4, 5]])
y_train = np.array([0, 0, 1, 1])

pipeline = Pipeline([
    ('scaler', StandardScaler()),
    ('logreg', LogisticRegression(random_state=0))
])
pipeline.fit(X_train, y_train)

X_test = np.array([[1.5, 2.5]])
prediction = pipeline.predict(X_test)
print(prediction)
Attempts: 2 left
💡 Hint
Think about how StandardScaler transforms the input and how logistic regression predicts based on training labels.
✗ Incorrect
The pipeline first scales the input features; the logistic regression model, trained on the scaled training data, then predicts class 0 for the test input [1.5, 2.5]. The printed output is [0].
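The result can be checked directly. A minimal sketch that re-runs the snippet and also inspects the intermediate scaled value (accessing the scaler via named_steps is an addition, not part of the original snippet):

```python
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
import numpy as np

X_train = np.array([[1, 2], [2, 3], [3, 4], [4, 5]])
y_train = np.array([0, 0, 1, 1])

pipeline = Pipeline([
    ('scaler', StandardScaler()),
    ('logreg', LogisticRegression(random_state=0))
])
pipeline.fit(X_train, y_train)

# The scaler maps [1.5, 2.5] below each column's training mean (2.5 and 3.5),
# i.e. into the region occupied by the class-0 training samples.
scaled = pipeline.named_steps['scaler'].transform([[1.5, 2.5]])
print(scaled)                           # both values negative (about -0.894)
print(pipeline.predict([[1.5, 2.5]]))   # [0]
```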
❓ Data Output
intermediate · 2:00 remaining
Shape of transformed data after applying PCA in a pipeline
Given the following pipeline that applies PCA to reduce dimensionality, what is the shape of the transformed data after calling transform on X_test?
Python
from sklearn.pipeline import Pipeline
from sklearn.decomposition import PCA
import numpy as np

X_train = np.random.rand(10, 5)
X_test = np.random.rand(3, 5)

pipeline = Pipeline([
    ('pca', PCA(n_components=2))
])
pipeline.fit(X_train)
X_transformed = pipeline.transform(X_test)
print(X_transformed.shape)
Attempts: 2 left
💡 Hint
PCA reduces the number of features to n_components but keeps the number of samples the same.
✗ Incorrect
The transform method returns data with the same number of samples (3) but reduced features (2). So the shape is (3, 2).
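A sketch confirming this shape rule (a seeded generator replaces np.random.rand so the run is reproducible; the shapes are what matter):

```python
from sklearn.pipeline import Pipeline
from sklearn.decomposition import PCA
import numpy as np

rng = np.random.default_rng(0)
X_train = rng.random((10, 5))
X_test = rng.random((3, 5))

# PCA keeps the sample axis unchanged and shrinks the feature axis
# from 5 columns down to n_components=2.
pipeline = Pipeline([('pca', PCA(n_components=2))])
pipeline.fit(X_train)
print(pipeline.transform(X_test).shape)  # (3, 2)
```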
🔧 Debug
advanced · 2:00 remaining
Identify the error in pipeline usage with SciPy function
What error will this code raise when trying to use a SciPy function inside a scikit-learn pipeline step?
Python
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import FunctionTransformer
from scipy.special import expit
import numpy as np

X = np.array([[0, 1], [2, 3]])

pipeline = Pipeline([
    ('sigmoid', FunctionTransformer(expit)),
])
pipeline.fit(X)
output = pipeline.transform(X)
print(output)
Attempts: 2 left
💡 Hint
FunctionTransformer wraps a function to apply it during transform. Check if expit works element-wise.
✗ Incorrect
No error is raised. FunctionTransformer correctly applies the SciPy expit function element-wise during transform, so the output is the sigmoid-transformed array.
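A quick way to see why: FunctionTransformer is stateless by default, so fit() is essentially a no-op and transform() just calls the wrapped function on the array. The sketch below verifies the output matches calling expit directly:

```python
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import FunctionTransformer
from scipy.special import expit
import numpy as np

X = np.array([[0, 1], [2, 3]])

# expit is vectorized, so it applies element-wise to the 2x2 array;
# fit() learns nothing here and transform() simply forwards to expit.
pipeline = Pipeline([('sigmoid', FunctionTransformer(expit))])
pipeline.fit(X)
output = pipeline.transform(X)
print(output)  # same as expit(X); note expit(0) == 0.5
```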
🚀 Application
advanced · 2:00 remaining
Using a custom SciPy statistical function in a pipeline step
You want to add a pipeline step that replaces each feature with its z-score using SciPy's zscore function. Which pipeline step code correctly applies this transformation?
Attempts: 2 left
💡 Hint
zscore needs axis=0 to standardize features column-wise.
✗ Incorrect
Option C correctly wraps scipy.stats.zscore in a FunctionTransformer with axis=0 so that each feature (column) is standardized. Of the incorrect options: one omits the axis argument (zscore does default to axis=0, but relying on the default is fragile when column-wise standardization is the explicit intent); one uses StandardScaler, which behaves similarly but is not a SciPy function as the task requires; and one passes axis=1, which standardizes rows rather than features.
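A sketch of the kind of step the correct option describes — the exact option text is not reproduced above, so the use of kw_args to forward axis=0 is an assumption about its shape:

```python
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import FunctionTransformer
from scipy.stats import zscore
import numpy as np

X = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]])

# kw_args forwards axis=0 to zscore, so each column (feature) is
# standardized independently to mean 0 and (population) std 1.
pipeline = Pipeline([
    ('zscore', FunctionTransformer(zscore, kw_args={'axis': 0}))
])
Z = pipeline.fit_transform(X)
print(Z.mean(axis=0))  # ~[0, 0]
print(Z.std(axis=0))   # ~[1, 1]
```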
🧠 Conceptual
expert · 2:00 remaining
Why integrate SciPy functions in scikit-learn pipelines?
What is the main advantage of integrating SciPy functions inside scikit-learn pipelines using FunctionTransformer?
Attempts: 2 left
💡 Hint
Think about how pipelines help organize multiple steps in machine learning workflows.
✗ Incorrect
FunctionTransformer wraps SciPy functions so they can be used as pipeline steps, allowing smooth chaining of preprocessing and modeling steps in scikit-learn.
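For instance, a SciPy function can sit alongside scikit-learn transformers and an estimator in one pipeline, so a single fit/predict call runs the whole chain and the combined object can be cross-validated or grid-searched as one estimator. A minimal sketch with illustrative data:

```python
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import FunctionTransformer, StandardScaler
from sklearn.linear_model import LogisticRegression
from scipy.special import expit
import numpy as np

X = np.array([[0.0, 1.0], [2.0, 3.0], [4.0, 5.0], [6.0, 7.0]])
y = np.array([0, 0, 1, 1])

# The SciPy function becomes just another named step: preprocessing
# and modeling travel together, which prevents train/test leakage and
# keeps the whole workflow reusable as a single object.
clf = Pipeline([
    ('sigmoid', FunctionTransformer(expit)),
    ('scaler', StandardScaler()),
    ('logreg', LogisticRegression(random_state=0)),
])
clf.fit(X, y)
print(clf.predict(X))
```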