I have a dataset with two classes and am interested in learning which features are 'important' for predicting the class. There are a lot of features available and I want to find subset(s) that lead to a good classifier.
Suppose I have found two classifiers clf1 and clf2 that are close in performance (but clf1 > clf2). Now suppose that the pipeline used for clf1 involved feature selection, and call the subset of features found feat_sub_1 and similar for classifier 2. Presumably these feature subsets will be different -- is it correct to use those from clf1 as it performed better?