5. Conclusion
In this paper, we have proposed an efficient Multi Filtration Feature Selection (MFFS) method applicable to medical data mining. Empirical study on 6 synthetic medical datasets suggests that MFFS gives better over-all performance than the existing counterparts in terms of all three evaluation criteria, i.e., number of selected features, classification accuracy, and computational time. The comparison to other methods in the literature also suggests MFFS has competitive performance. MFFS is capable of eliminating irrelevant and redundant features based on both feature subset selection and ranking models effectively, thus providing a small set of reliable features for the physicians to prescribe further medications. For simplicity, several key points are collected as follows. (1) It seems that the classification performance is necessarily proportional to the removal of redundant features, heavily dependent on the inclusion of relevant features and the ‘‘Accuracy’’ metric is observed maximum with minimum number of features. (2) The proposed MFFS algorithm operates invariably well on any type of classifier model. This shows the generalization ability and applicability of the proposed system. (3) Our training and test database collects the popular and benchmark medical datasets. However, the proposed method can be tested and applied on real-world dataset too. (4) The best accuracy rate achieved by our proposed system is superior to the existing schemes. To make our system more practical, future work could include the following. (a) Fitting the proposed system to classify any other realworld dataset. (b) Applying the proposed method for a multi-label dataset, where a record may belong to many classes simultaneously.