SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 2650 of 1706 papers

TitleStatusHype
GSDFuse: Capturing Cognitive Inconsistencies from Multi-Dimensional Weak Signals in Social Media SteganalysisCode0
Text embedding models can be great data engineers0
Time to Embed: Unlocking Foundation Models for Time Series with Channel Descriptions0
Deep Learning-Based Forecasting of Boarding Patient Counts to Address ED Overcrowding0
Enhancing Abstractive Summarization of Scientific Papers Using Structure InformationCode0
Machine Learning-Based Prediction of Mortality in Geriatric Traumatic Brain Injury Patients0
A Hybrid Quantum Classical Pipeline for X Ray Based Fracture Diagnosis0
Lightweight Spatio-Temporal Attention Network with Graph Embedding and Rotational Position Encoding for Traffic Forecasting0
IISE PG&E Energy Analytics Challenge 2025: Hourly-Binned Regression Models Beat Transformers in Load Forecasting0
NeurIPS 2024 Ariel Data Challenge: Characterisation of Exoplanetary Atmospheres Using a Data-Centric Approach0
Benchmarking Graph Neural Networks for Document Layout Analysis in Public Affairs0
Machine Learning-Based Detection of DDoS Attacks in VANETs for Emergency Vehicle Communication0
QoSBERT: An Uncertainty-Aware Approach based on Pre-trained Language Models for Service Quality Prediction0
Latte: Transfering LLMs` Latent-level Knowledge for Few-shot Tabular Learning0
Rethinking Multimodal Sentiment Analysis: A High-Accuracy, Simplified Fusion Architecture0
Wide & Deep Learning for Node ClassificationCode0
MPEC: Manifold-Preserved EEG Classification via an Ensemble of Clustering-Based Classifiers0
LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection0
FLARE: Feature-based Lightweight Aggregation for Robust Evaluation of IoT Intrusion Detection0
Word Embedding Techniques for Classification of Star Ratings0
HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection0
Morphing-based Compression for Data-centric ML Pipelines0
Beyond Glucose-Only Assessment: Advancing Nocturnal Hypoglycemia Prediction in Children with Type 1 Diabetes0
Bringing Structure to Naturalness: On the Naturalness of ASTs0
Boosting Relational Deep Learning with Pretrained Tabular ModelsCode0
Show:102550
← PrevPage 2 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified