Feature Selection with the Boruta Package
2010-09-01 · Journal of Statistical Software 2010
Miron B. Kursa, Witold R. Rudnicki
Code
- github.com/scikit-learn-contrib/boruta_py
Abstract
This article describes an R package, Boruta, that implements a novel feature selection algorithm for finding all relevant variables. The algorithm is designed as a wrapper around a Random Forest classification algorithm. It iteratively removes the features that a statistical test shows to be less relevant than random probes. The Boruta package provides a convenient interface to the algorithm. A short description of the algorithm and examples of its application are presented.
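The random-probe idea in the abstract can be illustrated with a small, self-contained Python sketch. It is not the Boruta package's implementation: to stay dependency-free it swaps the Random Forest importance for the absolute Pearson correlation with the target, and the function names (`abs_corr`, `binom_tail`, `probe_select`), the `alpha` threshold, and the toy data are all hypothetical. In each round every remaining feature is compared against the best shuffled copy (the probe), and a two-sided binomial test on the accumulated "hits" decides whether to confirm or remove it. No multiple-testing correction is applied.

```python
import math
import random


def abs_corr(x, y):
    """Absolute Pearson correlation; a stand-in for Random Forest importance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return abs(cov) / math.sqrt(vx * vy)


def binom_tail(hits, n, p=0.5):
    """P(X >= hits) for X ~ Binomial(n, p)."""
    return sum(math.comb(n, k) * p**k * (1 - p) ** (n - k)
               for k in range(hits, n + 1))


def probe_select(columns, y, max_rounds=50, alpha=0.01, seed=0):
    """Label each feature 'confirmed', 'rejected', or 'tentative' (assumed names)."""
    rng = random.Random(seed)
    hits = {name: 0 for name in columns}
    status = {name: "tentative" for name in columns}
    for rounds in range(1, max_rounds + 1):
        # Rejected features are removed from later rounds.
        active = [n for n in columns if status[n] != "rejected"]
        if all(status[n] != "tentative" for n in active):
            break
        # Random probes: shuffled copies of the active features.
        probe_max = 0.0
        for name in active:
            probe = list(columns[name])
            rng.shuffle(probe)
            probe_max = max(probe_max, abs_corr(probe, y))
        # A feature scores a hit when it beats the best probe.
        for name in active:
            if abs_corr(columns[name], y) > probe_max:
                hits[name] += 1
        # Two-sided binomial test against the null of 50% hits.
        for name in active:
            if status[name] != "tentative":
                continue
            if binom_tail(hits[name], rounds) < alpha:
                status[name] = "confirmed"
            elif binom_tail(rounds - hits[name], rounds) < alpha:
                status[name] = "rejected"
    return status


# Toy data (hypothetical): 'signal' is a linear function of y,
# 'noise' is constructed to have exactly zero correlation with y.
y = [0.0, 1.0] * 100
columns = {
    "signal": [2.0 * v + 1.0 for v in y],
    "noise": [0.0, 0.0, 1.0, 1.0] * 50,
}
decisions = probe_select(columns, y)
print(decisions)
```

On this toy data `signal` should end up confirmed and `noise` rejected, since the former beats every probe and the latter never does. With a Random Forest importance in place of the correlation, the same loop would also pick up non-linear and interaction effects, which this simplified stand-in cannot see.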