I have a dataset with many ordered features, most of which have 3 levels (e.g., 0, 1, 2), and my outcome variable is censored. I’m debating whether to treat these ordinal features as numeric or categorical.
If I treat them as categorical, I’m considering options like one-hot encoding or target encoding. However, I’m unsure what factors to take into account to help me decide the best way to handle these features. Should I focus on preserving their ordinal nature, or does encoding them as categorical variables provide better flexibility for modeling? Any guidance would be greatly appreciated.