SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 5175 of 1706 papers

TitleStatusHype
Feature Engineering on LMS Data to Optimize Student Performance Prediction0
Unleashing the Power of Pre-trained Encoders for Universal Adversarial Attack Detection0
SPIO: Ensemble and Selective Strategies via LLM-Based Multi-Agent Planning in Automated Data Science0
FeRG-LLM : Feature Engineering by Reason Generation Large Language Models0
RocketPPA: Code-Level Power, Performance, and Area Prediction via LLM and Mixture of Experts0
Embedding Domain-Specific Knowledge from LLMs into the Feature Engineering Pipeline0
Feature-Enhanced Machine Learning for All-Cause Mortality Prediction in Healthcare Data0
Asset price movement prediction using empirical mode decomposition and Gaussian mixture models0
Machine Learning - Driven Materials Discovery: Unlocking Next-Generation Functional Materials -- A minireview0
NeuralFoil: An Airfoil Aerodynamics Analysis Tool Using Physics-Informed Machine LearningCode3
LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary OptimizersCode2
CoDet-M4: Detecting Machine-Generated Code in Multi-Lingual, Multi-Generator and Multi-Domain Settings0
Applications of Large Language Model Reasoning in Feature Generation0
VORTEX: Challenging CNNs at Texture Recognition by using Vision Transformers with Orderless and Randomized Token EncodingsCode0
Exploring LLM Agents for Cleaning Tabular Machine Learning Datasets0
Bridging the Semantic Gap in Virtual Machine Introspection and Forensic Memory Analysis0
YARE-GAN: Yet Another Resting State EEG-GANCode0
Efficient or Powerful? Trade-offs Between Machine Learning and Deep Learning for Mental Illness Detection on Social Media0
Integrating convolutional layers and biformer network with forward-forward and backpropagation trainingCode0
Improving Representation Learning of Complex Critical Care Data with ICU-BERT0
Edge Training and Inference with Analog ReRAM Technology for Hand Gesture Recognition0
Mitigating Attrition: Data-Driven Approach Using Machine Learning and Data Engineering0
TabulaTime: A Novel Multimodal Deep Learning Framework for Advancing Acute Coronary Syndrome Prediction through Environmental and Clinical Data Integration0
ML-Driven Approaches to Combat Medicare Fraud: Advances in Class Imbalance Solutions, Feature Engineering, Adaptive Learning, and Business Impact0
A Defensive Framework Against Adversarial Attacks on Machine Learning-Based Network Intrusion Detection Systems0
Show:102550
← PrevPage 3 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified