arXiv Analytics

Sign in

arXiv:2009.12075 [stat.ML]AbstractReferencesReviewsResources

Adjusted Measures for Feature Selection Stability for Data Sets with Similar Features

Andrea Bommert, Jörg Rahnenführer

Published 2020-09-25Version 1

For data sets with similar features, for example highly correlated features, most existing stability measures behave in an undesired way: They consider features that are almost identical but have different identifiers as different features. Existing adjusted stability measures, that is, stability measures that take into account the similarities between features, have major theoretical drawbacks. We introduce new adjusted stability measures that overcome these drawbacks. We compare them to each other and to existing stability measures based on both artificial and real sets of selected features. Based on the results, we suggest using one new stability measure that considers highly similar features as exchangeable.

Related articles: Most relevant | Search more
arXiv:2106.08105 [stat.ML] (Published 2021-06-15)
Employing an Adjusted Stability Measure for Multi-Criteria Model Fitting on Data Sets with Similar Features
arXiv:1905.02515 [stat.ML] (Published 2019-05-07)
Guided Visual Exploration of Relations in Data Sets
arXiv:2302.03931 [stat.ML] (Published 2023-02-08)
Fast Linear Model Trees by PILOT