0
0
ML Pythonml~5 mins

CatBoost in ML Python - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is CatBoost?
CatBoost is a machine learning algorithm that builds decision trees and is especially good at handling categorical data without needing to convert it manually.
Click to reveal answer
intermediate
How does CatBoost handle categorical features?
CatBoost automatically converts categorical features into numbers using a special technique called 'ordered target statistics' which helps avoid overfitting.
Click to reveal answer
intermediate
What is the main advantage of using CatBoost over other gradient boosting methods?
CatBoost reduces prediction shift and overfitting by using ordered boosting and supports categorical data natively, making it faster and more accurate on many datasets.
Click to reveal answer
beginner
What metric can you use to evaluate a CatBoost model for classification?
You can use accuracy, AUC (Area Under the Curve), or log-loss to evaluate a CatBoost classification model depending on your problem.
Click to reveal answer
advanced
Explain the concept of 'ordered boosting' in CatBoost.
Ordered boosting is a technique where CatBoost builds trees using a special permutation of data to prevent target leakage and reduce overfitting during training.
Click to reveal answer
What type of data does CatBoost handle natively without manual preprocessing?
AOnly numerical data
BCategorical data
CText data only
DImage data
Which technique does CatBoost use to avoid overfitting when processing categorical features?
AOrdered target statistics
BLabel encoding
COne-hot encoding
DRandom sampling
What is the main purpose of ordered boosting in CatBoost?
APrevent target leakage and reduce overfitting
BSpeed up training
CIncrease model size
DSimplify data preprocessing
Which of the following is NOT a typical evaluation metric for CatBoost classification models?
ALog-loss
BAccuracy
CAUC
DMean Squared Error
CatBoost is a type of which machine learning method?
ANeural network
BK-nearest neighbors
CGradient boosting
DSupport vector machine
Describe how CatBoost handles categorical features and why this is beneficial.
Think about how CatBoost avoids manual steps and overfitting with categorical data.
You got /4 concepts.
    Explain the concept of ordered boosting and its role in CatBoost's training process.
    Consider how training order affects model learning and overfitting.
    You got /4 concepts.