Which of the following best describes the main goal of a churn prediction model in digital marketing?
Think about what 'churn' means in customer behavior.
Churn prediction models aim to find customers who might leave soon so companies can act to keep them.
You have a dataset with customer demographics, usage patterns, and past churn labels. Which model is most suitable for predicting churn?
Churn prediction is a yes/no problem.
Logistic Regression is used for binary classification problems like churn (yes/no).
Which metric is most important when you want to minimize false negatives (missing customers who will churn) in a churn prediction model?
False negatives mean churners predicted as non-churners.
Recall measures how many actual churners are correctly identified, minimizing false negatives.
What error will this Python code raise when training a churn prediction model?
from sklearn.linear_model import LogisticRegression X = [[1, 2], [3, 4]] y = [0, 1, 0] model = LogisticRegression() model.fit(X, y)
Check if the number of samples in X and y match.
The feature matrix X has 2 samples but label vector y has 3, causing a mismatch error.
You are tuning a Random Forest model for churn prediction. Which hyperparameter adjustment is most likely to reduce overfitting?
Overfitting happens when trees are too complex.
Reducing max_depth limits tree complexity, helping prevent overfitting.