SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 201250 of 1706 papers

TitleStatusHype
Augmenting train maintenance technicians with automated incident diagnostic suggestions0
Understanding Generative AI Content with Embedding Models0
EEG Right & Left Voluntary Hand Movement-based Virtual Brain-Computer Interfacing Keyboard Using Hybrid Deep Learning Approach0
Detecting Unsuccessful Students in Cybersecurity Exercises in Two Different Learning EnvironmentsCode0
Improving VTE Identification through Language Models from Radiology Reports: A Comparative Study of Mamba, Phi-3 Mini, and BERT0
LOLgorithm: Integrating Semantic,Syntactic and Contextual Elements for Humor Classification0
Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach0
Classification of Raw MEG/EEG Data with Detach-Rocket Ensemble: An Improved ROCKET Algorithm for Multivariate Time Series AnalysisCode1
IBB Traffic Graph Data: Benchmarking and Road Traffic Prediction Model0
Improving Machine Learning Based Sepsis Diagnosis Using Heart Rate Variability0
AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language ModelsCode0
RelBench: A Benchmark for Deep Learning on Relational DatabasesCode3
Stochastic Parrots or ICU Experts? Large Language Models in Critical Care Medicine: A Scoping Review0
An Efficient and Flexible Deep Learning Method for Signal Delineation via Keypoints Estimation0
Self-Reasoning Assistant Learning for non-Abelian Gauge Fields Design0
Fever Detection with Infrared Thermography: Enhancing Accuracy through Machine Learning Techniques0
Temperature Distribution Prediction in Laser Powder Bed Fusion using Transferable and Scalable Graph Neural Networks0
Evaluating Large Language Models for Anxiety and Depression Classification using Counseling and Psychotherapy TranscriptsCode0
GraphGuard: Contrastive Self-Supervised Learning for Credit-Card Fraud Detection in Multi-Relational Dynamic Graphs0
Molecular Topological Profile (MOLTOP) -- Simple and Strong Baseline for Molecular Graph ClassificationCode0
GPT Assisted Annotation of Rhetorical and Linguistic Features for Interpretable Propaganda Technique Detection in News Text0
Deep-Graph-Sprints: Accelerated Representation Learning in Continuous-Time Dynamic Graphs0
Advancing Automated Deception Detection: A Multimodal Approach to Feature Extraction and AnalysisCode0
MERGE -- A Bimodal Audio-Lyrics Dataset for Static Music Emotion Recognition0
Automating Venture Capital: Founder assessment using LLM-powered segmentation, feature engineering and automated labeling techniques0
GraphCNNpred: A stock market indices prediction using a Graph based deep learning system0
OSPC: Artificial VLM Features for Hateful Meme Detection0
A Data-Centric Perspective on Evaluating Machine Learning Models for Tabular DataCode1
The Remarkable Robustness of LLMs: Stages of Inference?Code1
TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning BenchmarksCode4
PharmaGPT: Domain-Specific Large Language Models for Bio-Pharmaceutical and Chemistry0
Comparing fingers and gestures for bci control using an optimized classical machine learning decoder0
Horseshoe-type Priors for Independent Component Estimation0
LightGBM robust optimization algorithm based on topological data analysis0
PathoLM: Identifying pathogenicity from the DNA sequence through the Genome Foundation ModelCode0
Retrieval-Augmented Feature Generation for Domain-Specific Classification0
Deep Learning Domain Adaptation to Understand Physico-Chemical Processes from Fluorescence Spectroscopy Small Datasets: Application to Ageing of Olive Oil0
Explainable AI for Comparative Analysis of Intrusion Detection ModelsCode0
Optimized Feature Generation for Tabular Data via LLMs with Decision Tree ReasoningCode1
Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy0
Learned Feature Importance Scores for Automated Feature Engineering0
Dynamic and Adaptive Feature Generation with LLM0
DeepMol: An Automated Machine and Deep Learning Framework for Computational ChemistrCode2
Iterative Feature Boosting for Explainable Speech Emotion RecognitionCode0
Network Analytics for Anti-Money Laundering -- A Systematic Literature Review and Experimental EvaluationCode1
Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson's Disease Severity in Walking SequencesCode1
Advancements in Tactile Hand Gesture Recognition for Enhanced Human-Machine Interaction0
Transitional Uncertainty with Layered Intermediate Predictions0
Maintaining and Managing Road Quality:Using MLP and DNN0
Wearable-based behaviour interpolation for semi-supervised human activity recognition0
Show:102550
← PrevPage 5 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified