SOTAVerified

Feature Selection with the Boruta Package

2010-09-01Journal of Statistical Software 2010 2010Code Available0· sign in to hype

Miron B. Kursa, Witold R. Rudnicki

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

This article describes a R package Boruta, implementing a novel feature selection algorithm for finding all relevant variables. The algorithm is designed as a wrapper around a Random Forest classification algorithm. It iteratively removes the features which are proved by a statistical test to be less relevant than random probes. The Boruta package provides a convenient interface to the algorithm. The short description of the algorithm and examples of its application are presented.

Tasks

Reproductions