
Consider a setting in which I have an unbalanced dataset where the target class takes the value 1 in 0.01% of observations and the value 0 in the remaining 99.99% of observations.

I train a classification model, say `XGBClassifier`, and obtain `predict_proba`, which the documentation describes as:

probability of each X example being of a given class.
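For concreteness, a minimal sketch of what I mean, using a synthetic dataset in place of my real one (`make_classification`, the sample sizes, the class ratio, and the hyperparameters below are just illustrative):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from xgboost import XGBClassifier

# Highly imbalanced synthetic data (the exact ratio is just for illustration).
X, y = make_classification(
    n_samples=200_000, n_features=20, weights=[0.999],
    flip_y=0, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, stratify=y, test_size=0.25, random_state=0
)

model = XGBClassifier(n_estimators=200, max_depth=4, eval_metric="logloss")
model.fit(X_train, y_train)

# predict_proba returns one column per class; column 1 is P(class = 1).
proba_pos = model.predict_proba(X_test)[:, 1]
```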

Now, suppose I want to rebalance the classes somewhat by undersampling, and train a second model on data where the target has the value 1 in 10% of cases and the value 0 in the remaining 90% of observations.
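The rebalanced second model would look roughly like this (continuing from the snippet above; the 9:1 undersampling ratio is again just illustrative):

```python
rng = np.random.default_rng(0)

pos_idx = np.flatnonzero(y_train == 1)
neg_idx = np.flatnonzero(y_train == 0)

# Keep all positives and 9x as many negatives -> ~10% positive rate.
neg_keep = rng.choice(neg_idx, size=9 * len(pos_idx), replace=False)
keep = np.concatenate([pos_idx, neg_keep])

model_balanced = XGBClassifier(n_estimators=200, max_depth=4, eval_metric="logloss")
model_balanced.fit(X_train[keep], y_train[keep])

# Same kind of output, but the model was trained on a 10/90 class mix.
proba_pos_balanced = model_balanced.predict_proba(X_test)[:, 1]
```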

Is the interpretation of the predicted probabilities affected by this rebalancing?

Can I still say that if observation x_i gets a predicted probability of 0.4, then it is 40% likely to belong to class 1?

