As far as I know, tree models (such as those trained using xgboost/lightgbm) makes reasonable prediction only if the input feature vector is similar to the train set data. If the feature vector looks like an outlier, then the prediction result is not reliable.
So my question is: how to determine the feasibility of applying the model to make prediction given a feature vector?
I initially tried to use the one-class SVM to determined whether the given feature vectors is close to the train set, but it turned out that SVM cannot handle big data set (a few million samples and ~1000 features).