SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 76100 of 1706 papers

TitleStatusHype
DriveML: An R Package for Driverless Machine LearningCode1
A Data-Centric Perspective on Evaluating Machine Learning Models for Tabular DataCode1
Efficient End-to-End AutoML via Scalable Search Space DecompositionCode1
Context-Aware Deep Learning for Multi Modal Depression DetectionCode1
Deep & Cross Network for Ad Click PredictionsCode1
CodeCMR: Cross-Modal Retrieval For Function-Level Binary Source Code MatchingCode1
A Survey of Information Cascade Analysis: Models, Predictions, and Recent AdvancesCode1
Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global InferenceCode1
Attention-Based Deep Learning Framework for Human Activity Recognition with User AdaptationCode1
Fine-Tuning Self-Supervised Learning Models for End-to-End Pronunciation ScoringCode1
fseval: A Benchmarking Framework for Feature Selection and Feature Ranking AlgorithmsCode1
Cognitive Evolutionary Search to Select Feature Interactions for Click-Through Rate PredictionCode1
AutoGL: A Library for Automated Graph LearningCode1
Anomaly Detection for Solder Joints Using β-VAECode1
Understanding the Dynamics of DNNs Using Graph ModularityCode1
AutoML: A Survey of the State-of-the-ArtCode1
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?Code1
Itsy Bitsy SpiderNet: Fully Connected Residual Network for Fraud DetectionCode1
A Hybrid Rule-Based and Neural Coreference Resolution System with an Evaluation on Dutch LiteratureCode1
Discovering Neural WiringsCode1
Blending gradient boosted trees and neural networks for point and probabilistic forecasting of hierarchical time seriesCode1
AutoSmart: An Efficient and Automatic Machine Learning framework for Temporal Relational DataCode1
Bayesian Optimization of Catalysis With In-Context LearningCode1
Binary Black-box Evasion Attacks Against Deep Learning-based Static Malware Detectors with Adversarial Byte-Level Language ModelCode1
Compatible deep neural network framework with financial time series data, including data preprocessor, neural network model and trading strategyCode1
Show:102550
← PrevPage 4 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified