I have dataset with two label class (good and bad), I want to apply K Means on my dataset using python, should I use that label dataset or I have to delete the label class column ?
Asked
Active
Viewed 74 times
1 Answers
1
Delete the label column.
Assuming that you want to compare the clusters to the labels later, then the labels must not be part of the data passed to k-means.
And k-means only works well on continuous variables anyway.
Has QUIT--Anony-Mousse
- 8,134
- 1
- 16
- 31