SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 101150 of 1706 papers

TitleStatusHype
Retrieve, Merge, Predict: Augmenting Tables with Data LakesCode1
Online learning techniques for prediction of temporal tabular datasets with regime changesCode1
Short-term Renewable Energy Forecasting in Greece using Prophet Decomposition and Tree-based EnsemblesCode1
Simplified DOM Trees for Transferable Attribute Extraction from the WebCode1
SkillGPT: a RESTful API service for skill extraction and standardization using a Large Language ModelCode1
SMUTF: Schema Matching Using Generative Tags and Hybrid FeaturesCode1
Binary Black-box Evasion Attacks Against Deep Learning-based Static Malware Detectors with Adversarial Byte-Level Language ModelCode1
Supervised Learning on Relational Databases with Graph Neural NetworksCode1
SYNC: A Copula based Framework for Generating Synthetic Data from Aggregated SourcesCode1
SynC: A Unified Framework for Generating Synthetic Population with Gaussian CopulaCode1
Bayesian Optimization of Catalysis With In-Context LearningCode1
Blending gradient boosted trees and neural networks for point and probabilistic forecasting of hierarchical time seriesCode1
Towards Ground Truth Explainability on Tabular DataCode1
Transfer Learning for Motor Imagery Based Brain-Computer Interfaces: A Complete PipelineCode1
GenHPF: General Healthcare Predictive Framework with Multi-task Multi-source LearningCode1
Automated Website Fingerprinting through Deep LearningCode1
VCR-Graphormer: A Mini-batch Graph Transformer via Virtual ConnectionsCode1
VEST: Automatic Feature Engineering for ForecastingCode1
XCrossNet: Feature Structure-Oriented Learning for Click-Through Rate PredictionCode1
Yelp Review Rating Prediction: Machine Learning and Deep Learning ModelsCode1
A Survey of Information Cascade Analysis: Models, Predictions, and Recent AdvancesCode1
DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability DetectionCode1
Discovering Neural WiringsCode1
Attention-Based Deep Learning Framework for Human Activity Recognition with User AdaptationCode1
An End-to-End Reinforcement Learning Approach for Job-Shop Scheduling Problems Based on Constraint ProgrammingCode1
Compatible deep neural network framework with financial time series data, including data preprocessor, neural network model and trading strategyCode1
AutoML: A Survey of the State-of-the-ArtCode1
BP-Net: Efficient Deep Learning for Continuous Arterial Blood Pressure Estimation using PhotoplethysmogramCode1
AutoGL: A Library for Automated Graph LearningCode1
DeepFM: A Factorization-Machine based Neural Network for CTR PredictionCode1
AutoSmart: An Efficient and Automatic Machine Learning framework for Temporal Relational DataCode1
OMASGAN: Out-of-Distribution Minimum Anomaly Score GAN for Sample Generation on the BoundaryCode1
Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson's Disease Severity in Walking SequencesCode1
Benchmarks and Custom Package for Energy ForecastingCode1
Evaluation Toolkit For Robustness Testing Of Automatic Essay Scoring SystemsCode1
Analyzing Multispectral Satellite Imagery of South American Wildfires Using Deep Learning0
Analysis of Rhythmic Phrasing: Feature Engineering vs. Representation Learning for Classifying Readout Poetry0
Advancements in Tactile Hand Gesture Recognition for Enhanced Human-Machine Interaction0
Machine Learning for Wireless Link Quality Estimation: A Survey0
A multi-task learning model for malware classification with useful file access pattern from API call sequence0
Advanced fraud detection using machine learning models: enhancing financial transaction security0
Deep learning approach to control of prosthetic hands with electromyography signals0
A Multitask Deep Learning Approach for User Depression Detection on Sina Weibo0
A Multi-task Approach to Predict Likability of Books0
A Dual-Layer Semantic Role Labeling System0
A multi-model-based deep learning framework for short text multiclass classification with the imbalanced and extremely small data set0
A Multi-Attention based Neural Network with External Knowledge for Story Ending Predicting Task0
ADSAGE: Anomaly Detection in Sequences of Attributed Graph Edges applied to insider threat detection at fine-grained level0
A Brief Survey of Machine Learning Methods for Emotion Prediction using Physiological Data0
Physics-informed machine learning for composition-process-property alloy design: shape memory alloy demonstration0
Show:102550
← PrevPage 3 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified