ML Pythonml~10 mins

Target encoding in ML Python - Interactive Code Practice

Choose your learning style10 modes available

Learn Why Deep Model Try Challenge Experiment Recall Metrics

Start learning this pattern below

Jump into concepts and practice - no test required

Recommended

Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong

Practice - 5 Tasks

Answer the questions below

1fill in blank

easy

Complete the code to calculate the mean target value for each category.

ML Python

mean_target = df.groupby('category')['target'].[1]()

Drag options to blanks, or click blank then click option'

Asum

Bmax

Cmean

Dcount

Attempts:

3 left

2fill in blank

medium

Complete the code to map the target mean to the original dataframe's category column.

ML Python

df['category_encoded'] = df['category'].[1](mean_target)

Drag options to blanks, or click blank then click option'

Amap

Bapply

Creplace

Dfillna

Attempts:

3 left

3fill in blank

hard

Fix the error in the code to avoid data leakage by computing target encoding only on the training set.

ML Python

mean_target_train = train_df.groupby('category')['target'].[1]()
train_df['category_encoded'] = train_df['category'].map(mean_target_train)
test_df['category_encoded'] = test_df['category'].map(mean_target_train).fillna([2])

Drag options to blanks, or click blank then click option'

Amean

Bsum

Ccount

Attempts:

3 left

4fill in blank

hard

Fill both blanks to create a target encoding function that fits on training data and transforms new data.

ML Python

def target_encode(train, test, cat_col, target_col):
    mean_target = train.groupby([1])[[2]].mean()
    train_encoded = train[cat_col].map(mean_target)
    test_encoded = test[cat_col].map(mean_target).fillna(mean_target.mean())
    return train_encoded, test_encoded

Drag options to blanks, or click blank then click option'

A'category'

B'target'

C'category_encoded'

D'target_encoded'

Attempts:

3 left

5fill in blank

hard

Fill all three blanks to apply target encoding with smoothing to reduce noise for categories with few samples.

ML Python

def smooth_target_encode(train, test, cat_col, target_col, m=5):
    mean = train[target_col].mean()
    agg = train.groupby([1])[[2]].agg(['mean', 'count'])
    smooth = (agg['count'] * agg['mean'] + m * mean) / (agg['count'] + m)
    train_encoded = train[cat_col].map(smooth)
    test_encoded = test[cat_col].map(smooth).fillna(mean)
    return train_encoded, test_encoded

Drag options to blanks, or click blank then click option'

A'category'

B'target'

Attempts:

3 left

Practice

(1/5)

1. What is the main purpose of target encoding in machine learning?

easy

A. Remove missing values from the dataset

B. Normalize numerical features to a 0-1 scale

C. Create new categorical features by combining existing ones

D. Convert categorical variables into numbers using the average target value

Target encoding in ML Python - Interactive Code Practice

Start learning this pattern below

Practice

Solution

Step 1: Understand what target encoding does

Step 2: Compare with other options

Final Answer:

Quick Check:

Solution

Step 1: Identify correct target encoding method

Step 2: Check code correctness

Final Answer:

Quick Check:

Solution

Step 1: Calculate mean target per category from training data

Step 2: Map test categories and fill missing

Final Answer:

Quick Check:

Solution

Step 1: Understand overfitting cause in target encoding

Step 2: Identify mistake in data leakage

Final Answer:

Quick Check:

Solution

Step 1: Understand overfitting from rare categories

Step 2: Apply smoothing to reduce noise

Final Answer:

Quick Check: