SOTAVerified

On the Selection Stability of Stability Selection and Its Applications

2024-11-14Code Available0· sign in to hype

Mahdi Nouraie, Samuel Muller

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Stability selection is a widely adopted resampling-based framework for high-dimensional variable selection. This paper seeks to broaden the use of an established stability estimator to evaluate the overall stability of the stability selection results, moving beyond single-variable analysis. We suggest that the stability estimator offers two advantages: it can serve as a reference to reflect the robustness of the results obtained, and it can help identify a Pareto optimal regularization value to improve stability. By determining the regularization value, we calibrate key stability selection parameters, namely, the decision-making threshold and the expected number of falsely selected variables, within established theoretical bounds. In addition, the convergence of stability values over successive sub-samples sheds light on the required number of sub-samples addressing a notable gap in prior studies. The stabplot R package is developed to facilitate the use of the methodology featured in this paper.

Tasks

Reproductions