
I have two sample groups of customers; each customer has hundreds of features. For a single sample, I would use decision trees to find sub-groups that have a high churn rate. That's easy.

However, my requirement is: between two samples (below), find segment(s) whose churn rate is high in one sample and low in the other. In other words, find the sub-group with the largest difference in churn rate between the two samples.

What is an appropriate algorithm to solve this?

Thanks.

enter image description here

Arslán

1 Answer


You can frame this problem as one of feature importance: which features have the greatest influence on the target, churn?

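There are many ways to approach feature importance. With decision trees, one option is permutation importance: shuffle the values of one feature at a time and measure how much the model's performance drops. The technique is model-agnostic, so it is not limited to trees.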
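To make the idea concrete, here is a minimal pure-Python sketch of permutation importance. Everything in it is an assumption for illustration: the data is synthetic (churn depends only on feature 0), the "model" is a hand-rolled depth-1 decision tree standing in for a real fitted tree, and the function names `fit_stump`, `accuracy`, and `permutation_importance` are hypothetical. In practice you would likely use a library implementation such as scikit-learn's `sklearn.inspection.permutation_importance` on a properly trained model.

```python
import random

def fit_stump(X, y):
    """Fit a depth-1 decision tree (hypothetical helper): pick the
    (feature, threshold) pair whose rule 'value > threshold' has the
    best training accuracy."""
    best_acc, best_j, best_t = 0.0, 0, 0.0
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            acc = sum((row[j] > t) == label for row, label in zip(X, y)) / len(y)
            if acc > best_acc:
                best_acc, best_j, best_t = acc, j, t
    return lambda row: row[best_j] > best_t

def accuracy(model, X, y):
    """Fraction of rows where the model's prediction matches the label."""
    return sum(model(row) == label for row, label in zip(X, y)) / len(y)

def permutation_importance(model, X, y, n_repeats=10, seed=0):
    """Mean drop in accuracy when each feature column is shuffled."""
    rng = random.Random(seed)
    baseline = accuracy(model, X, y)
    importances = []
    for j in range(len(X[0])):
        drops = []
        for _ in range(n_repeats):
            column = [row[j] for row in X]
            rng.shuffle(column)  # break the link between feature j and the label
            X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, column)]
            drops.append(baseline - accuracy(model, X_perm, y))
        importances.append(sum(drops) / n_repeats)
    return importances

# Synthetic data (assumption): 3 features, churn driven only by feature 0.
data_rng = random.Random(42)
X = [[data_rng.random() for _ in range(3)] for _ in range(300)]
y = [row[0] > 0.6 for row in X]

model = fit_stump(X, y)
importances = permutation_importance(model, X, y)
for j, imp in enumerate(importances):
    print(f"feature {j}: mean accuracy drop = {imp:.3f}")
```

Shuffling a feature the model relies on should produce a clear accuracy drop, while shuffling a feature the model ignores leaves accuracy unchanged. For the two-sample setting in the question, one possible (untested) way to apply this would be to compute importances on each sample separately and compare them.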

Brian Spiering