SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 151175 of 1706 papers

TitleStatusHype
See it, Think it, Sorted: Large Multimodal Models are Few-shot Time Series Anomaly Analyzers0
Explainable cognitive decline detection in free dialogues with a Machine Learning approach based on pre-trained Large Language Models0
Enhancing Glucose Level Prediction of ICU Patients through Hierarchical Modeling of Irregular Time-SeriesCode0
Enriching Tabular Data with Contextual LLM Embeddings: A Comprehensive Ablation Study for Ensemble Classifiers0
Machine Learning Framework for Audio-Based Content Evaluation using MFCC, Chroma, Spectral Contrast, and Temporal Feature Engineering0
Can Models Help Us Create Better Models? Evaluating LLMs as Data ScientistsCode1
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science CompetitionsCode3
Predicting 30-Day Hospital Readmission in Medicare Patients: Insights from an LSTM Deep Learning Model0
Large Language Models Engineer Too Many Simple Features For Tabular DataCode0
AdaptoML-UX: An Adaptive User-centered GUI-based AutoML Toolkit for Non-AI Experts and HCI ResearchersCode0
Molecular Topological Profile (MOLTOP) - Simple and Strong Baseline for Molecular Graph ClassificationCode0
Reproducible Machine Learning-based Voice Pathology Detection: Introducing the Pitch Difference FeatureCode0
Statistical Test for Auto Feature Engineering by Selective InferenceCode0
ELF-Gym: Evaluating Large Language Models Generated Features for Tabular PredictionCode0
Sui Generis: Large Language Models for Authorship Attribution and Verification in Latin0
Towards Trustworthy Web Attack Detection: An Uncertainty-Aware Ensemble Deep Kernel Learning Model0
Principal Orthogonal Latent Components Analysis (POLCA Net)Code0
Neural-Bayesian Program Learning for Few-shot Dialogue Intent Parsing0
Learning to Solve Abstract Reasoning Problems with Neurosymbolic Program Synthesis and Task Generation0
Self-eXplainable AI for Medical Image Analysis: A Survey and New Outlooks0
Semantic-Guided RL for Interpretable Feature Engineering0
Enhancing End Stage Renal Disease Outcome Prediction: A Multi-Sourced Data-Driven Approach0
Automatic deductive coding in discourse analysis: an application of large language models in learning analyticsCode0
LML-DAP: Language Model Learning a Dataset for Data-Augmented PredictionCode1
Enhanced Convolution Neural Network with Optimized Pooling and Hyperparameter Tuning for Network Intrusion DetectionCode0
Show:102550
← PrevPage 7 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified