SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 51100 of 1706 papers

TitleStatusHype
Feature Engineering on LMS Data to Optimize Student Performance Prediction0
Unleashing the Power of Pre-trained Encoders for Universal Adversarial Attack Detection0
SPIO: Ensemble and Selective Strategies via LLM-Based Multi-Agent Planning in Automated Data Science0
FeRG-LLM : Feature Engineering by Reason Generation Large Language Models0
RocketPPA: Code-Level Power, Performance, and Area Prediction via LLM and Mixture of Experts0
Embedding Domain-Specific Knowledge from LLMs into the Feature Engineering Pipeline0
Feature-Enhanced Machine Learning for All-Cause Mortality Prediction in Healthcare Data0
Asset price movement prediction using empirical mode decomposition and Gaussian mixture models0
Machine Learning - Driven Materials Discovery: Unlocking Next-Generation Functional Materials -- A minireview0
NeuralFoil: An Airfoil Aerodynamics Analysis Tool Using Physics-Informed Machine LearningCode3
LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary OptimizersCode2
CoDet-M4: Detecting Machine-Generated Code in Multi-Lingual, Multi-Generator and Multi-Domain Settings0
Applications of Large Language Model Reasoning in Feature Generation0
VORTEX: Challenging CNNs at Texture Recognition by using Vision Transformers with Orderless and Randomized Token EncodingsCode0
Exploring LLM Agents for Cleaning Tabular Machine Learning Datasets0
Bridging the Semantic Gap in Virtual Machine Introspection and Forensic Memory Analysis0
YARE-GAN: Yet Another Resting State EEG-GANCode0
Efficient or Powerful? Trade-offs Between Machine Learning and Deep Learning for Mental Illness Detection on Social Media0
Integrating convolutional layers and biformer network with forward-forward and backpropagation trainingCode0
Improving Representation Learning of Complex Critical Care Data with ICU-BERT0
Mitigating Attrition: Data-Driven Approach Using Machine Learning and Data Engineering0
Edge Training and Inference with Analog ReRAM Technology for Hand Gesture Recognition0
TabulaTime: A Novel Multimodal Deep Learning Framework for Advancing Acute Coronary Syndrome Prediction through Environmental and Clinical Data Integration0
ML-Driven Approaches to Combat Medicare Fraud: Advances in Class Imbalance Solutions, Feature Engineering, Adaptive Learning, and Business Impact0
A Defensive Framework Against Adversarial Attacks on Machine Learning-Based Network Intrusion Detection Systems0
Feature Engineering Approach to Building Load Prediction: A Case Study for Commercial Building Chiller Plant Optimization in Tropical Weather0
EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models0
SEM-CLIP: Precise Few-Shot Learning for Nanoscale Defect Detection in Scanning Electron Microscope Image0
PainDECOG: Machine Learning-Based Identification of Pain Biomarkers from sEEG Signals0
Recent Advances in Malware Detection: Graph Learning and Explainability0
Chronic Diseases Prediction Using ML0
LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search0
A Survey on Data-Centric AI: Tabular Learning from Reinforcement Learning and Generative AI Perspective0
Decision Tree Based Wrappers for Hearing Loss0
Exploring Patterns Behind Sports0
Enhancing Physics-Informed Neural Networks Through Feature Engineering0
Application of quantum machine learning using quantum kernel algorithms on multiclass neuron M type classification0
Agentic AI Systems Applied to tasks in Financial Services: Modeling and model risk management crews0
Decision Trees That Remember: Gradient-Based Learning of Recurrent Decision Trees with Memory0
From Features to Transformers: Redefining Ranking for Scalable Impact0
Benchmarking Time Series Forecasting Models: From Statistical Techniques to Foundation Models in Real-World Applications0
Year-over-Year Developments in Financial Fraud Detection via Deep Learning: A Systematic Literature Review0
CAAT-EHR: Cross-Attentional Autoregressive Transformer for Multimodal Electronic Health Record EmbeddingsCode0
RAINER: A Robust Ensemble Learning Grid Search-Tuned Framework for Rainfall Patterns Prediction0
360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation0
Sample-Efficient Behavior Cloning Using General Domain Knowledge0
A Transferable Physics-Informed Framework for Battery Degradation Diagnosis, Knee-Onset Detection and Knee Prediction0
EvoGP: A GPU-accelerated Framework for Tree-based Genetic ProgrammingCode7
Distributed Multi-Head Learning Systems for Power Consumption Prediction0
DLinear-based Prediction of Remaining Useful Life of Lithium-Ion Batteries: Feature Engineering through Explainable Artificial Intelligence0
Show:102550
← PrevPage 2 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified