Stable and Accurate Feature Selection


Gulgezen G., Cataltepe Z. , Yu L.

Joint European Conference on Machine Learning (ECML)/European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD), Bled, Slovenia, 7 - 11 September 2009, vol.5781, pp.455-456 identifier

  • Publication Type: Conference Paper / Full Text
  • Volume: 5781
  • City: Bled
  • Country: Slovenia
  • Page Numbers: pp.455-456

Abstract

In addition to accuracy, stability is also a measure of success for a feature selection algorithm. Stability could especially be a concern when the number of samples in a data set is small and the dimensionality is high. In this study, we introduce a stability measure, and perform both accuracy and stability measurements of MRMR (Minimum Redundancy Maximum Relevance) feature selection algorithm on different data sets. The two feature evaluation criteria used by MRMR, MID (Mutual Information Difference) and MIQ (Mutual Information Quotient), result in similar accuracies, but MID is more stable. We also introduce a new feature selection criterion, MID alpha, where redundancy and relevance of selected features are controlled by parameter alpha.