Practice - 5 Tasks

Answer the questions below

1fill in blank

easy

Complete the code to read a CSV file into a DataFrame using pandas.

MLOps

import pandas as pd
data = pd.read_csv([1])

Drag options to blanks, or click blank then click option'

Adata.csv

B'data.csv'

Cread.csv

Dcsv.read

Attempts:

3 left

2fill in blank

medium

Complete the code to split data into training and testing sets using scikit-learn.

MLOps

from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=[1], random_state=42)

Drag options to blanks, or click blank then click option'

A0.2

B0.5

D20

Attempts:

3 left

3fill in blank

hard

Fix the error in the code to automate data scaling with StandardScaler.

MLOps

from sklearn.preprocessing import StandardScaler
scaler = StandardScaler()
X_scaled = scaler.[1](X_train)

Drag options to blanks, or click blank then click option'

Ascale

Btransform

Cfit

Dfit_transform

Attempts:

3 left

4fill in blank

hard

Fill both blanks to create a pipeline that scales data and fits a logistic regression model.

MLOps

from sklearn.pipeline import Pipeline
from sklearn.linear_model import LogisticRegression
pipeline = Pipeline([('scaler', [1]()), ('model', [2]())])

Drag options to blanks, or click blank then click option'

AStandardScaler

BMinMaxScaler

CLogisticRegression

DRandomForestClassifier

Attempts:

3 left

5fill in blank

hard

Fill all three blanks to automate training, prediction, and accuracy calculation.

MLOps

pipeline.fit([1], [2])
predictions = pipeline.predict([3])
from sklearn.metrics import accuracy_score
accuracy = accuracy_score(y_test, predictions)

Drag options to blanks, or click blank then click option'

AX_train

By_train

CX_test

Dy_test

Attempts:

3 left

Practice

(1/5)

1. What is the main benefit of automating a training data pipeline in machine learning?

easy

A. It saves time and reduces human errors during data preparation.

B. It makes the model training faster by using GPUs.

C. It increases the size of the training dataset automatically.

D. It guarantees 100% accuracy of the machine learning model.

Training data pipeline automation in MLOps - Interactive Code Practice

Start learning this pattern below

Practice

Solution

Step 1: Understand the purpose of automation in data pipelines

Step 2: Identify the key benefits of automation

Final Answer:

Quick Check:

Solution

Step 1: Identify correct Python function syntax

Step 2: Check indentation and syntax correctness

Final Answer:

Quick Check:

Solution

Step 1: Calculate mean and standard deviation of the sample

Step 2: Normalize each value and round to 2 decimals

Final Answer:

Quick Check:

Solution

Step 1: Understand the error message

Step 2: Fix by importing pandas with alias 'pd'

Final Answer:

Quick Check:

Solution

Step 1: Identify requirements for automation and monitoring

Step 2: Evaluate options for pipeline automation

Final Answer:

Quick Check: