SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 151200 of 1706 papers

TitleStatusHype
See it, Think it, Sorted: Large Multimodal Models are Few-shot Time Series Anomaly Analyzers0
Explainable cognitive decline detection in free dialogues with a Machine Learning approach based on pre-trained Large Language Models0
Enhancing Glucose Level Prediction of ICU Patients through Hierarchical Modeling of Irregular Time-SeriesCode0
Enriching Tabular Data with Contextual LLM Embeddings: A Comprehensive Ablation Study for Ensemble Classifiers0
Machine Learning Framework for Audio-Based Content Evaluation using MFCC, Chroma, Spectral Contrast, and Temporal Feature Engineering0
Can Models Help Us Create Better Models? Evaluating LLMs as Data ScientistsCode1
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science CompetitionsCode3
Predicting 30-Day Hospital Readmission in Medicare Patients: Insights from an LSTM Deep Learning Model0
Large Language Models Engineer Too Many Simple Features For Tabular DataCode0
AdaptoML-UX: An Adaptive User-centered GUI-based AutoML Toolkit for Non-AI Experts and HCI ResearchersCode0
Molecular Topological Profile (MOLTOP) - Simple and Strong Baseline for Molecular Graph ClassificationCode0
Reproducible Machine Learning-based Voice Pathology Detection: Introducing the Pitch Difference FeatureCode0
Statistical Test for Auto Feature Engineering by Selective InferenceCode0
ELF-Gym: Evaluating Large Language Models Generated Features for Tabular PredictionCode0
Sui Generis: Large Language Models for Authorship Attribution and Verification in Latin0
Towards Trustworthy Web Attack Detection: An Uncertainty-Aware Ensemble Deep Kernel Learning Model0
Principal Orthogonal Latent Components Analysis (POLCA Net)Code0
Neural-Bayesian Program Learning for Few-shot Dialogue Intent Parsing0
Learning to Solve Abstract Reasoning Problems with Neurosymbolic Program Synthesis and Task Generation0
Self-eXplainable AI for Medical Image Analysis: A Survey and New Outlooks0
Semantic-Guided RL for Interpretable Feature Engineering0
Enhancing End Stage Renal Disease Outcome Prediction: A Multi-Sourced Data-Driven Approach0
Automatic deductive coding in discourse analysis: an application of large language models in learning analyticsCode0
LML-DAP: Language Model Learning a Dataset for Data-Augmented PredictionCode1
Enhanced Convolution Neural Network with Optimized Pooling and Hyperparameter Tuning for Network Intrusion DetectionCode0
Reinforcement Feature Transformation for Polymer Property Performance Prediction0
A Feature Engineering Approach for Literary and Colloquial Tamil Speech Classification using 1D-CNN0
Investigation of Time-Frequency Feature Combinations with Histogram Layer Time Delay Neural Networks0
Machine Learning for Public Good: Predicting Urban Crime Patterns to Enhance Community Safety0
Leveraging Open-Source Large Language Models for Native Language Identification0
MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous DrivingCode2
Learn2Aggregate: Supervised Generation of Chvátal-Gomory Cuts Using Graph Neural Networks0
HybridFC: A Hybrid Fact-Checking Approach for Knowledge GraphsCode0
Machine Learning-Based Prediction of Key Genes Correlated to the Subretinal Lesion Severity in a Mouse Model of Age-Related Macular Degeneration0
IIFE: Interaction Information Based Automated Feature EngineeringCode0
Large Margin Prototypical Network for Few-shot Relation Classification with Fine-grained Features0
Leveraging Large Language Models through Natural Language Processing to provide interpretable Machine Learning predictions of mental deterioration in real time0
Application Research On Real-Time Perception Of Device Performance Status0
Towards Autonomous Cybersecurity: An Intelligent AutoML Framework for Autonomous Intrusion DetectionCode1
Hybridization of Persistent Homology with Neural Networks for Time-Series Prediction: A Case Study in Wave Height0
PoliPrompt: A High-Performance Cost-Effective LLM-Based Text Classification Framework for Political Science0
LSTM Recurrent Neural Networks for Cybersecurity Named Entity Recognition0
Enhancing Customer Churn Prediction in Telecommunications: An Adaptive Ensemble Learning Approach0
Android Malware Detection Based on RGB Images and Multi-feature Fusion0
gWaveNet: Classification of Gravity Waves from Noisy Satellite Data using Custom Kernel Integrated Deep Learning MethodCode0
Obfuscated Memory Malware Detection0
Improving Radiography Machine Learning Workflows via Metadata Management for Training Data Selection0
Graph Classification via Reference Distribution Learning: Theory and Practice0
Transfer Learning and the Early Estimation of Single-Photon Source Quality using Machine Learning MethodsCode0
Improved Differential Evolution based Feature Selection through Quantum, Chaos, and Lasso0
Show:102550
← PrevPage 4 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified