SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables, called features, that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table, with one row per observation and one column per feature.

The traditional approach to feature engineering is to build features one at a time using domain knowledge. This process, known as manual feature engineering, is tedious, time-consuming, and error-prone. The code for manual feature engineering is problem-dependent and must be rewritten for each new dataset.
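As a minimal sketch of what manual feature engineering can look like, the example below gathers two small tables into a single feature table with pandas. The table names (`customers`, `transactions`), the column names, and the helper `build_features` are all hypothetical and chosen only for illustration; a real problem would need its own hand-written aggregates.

```python
import pandas as pd

# Hypothetical parent table: one row per observation (here, a customer).
customers = pd.DataFrame({
    "customer_id": [1, 2, 3],
    "signup_year": [2019, 2020, 2021],
})

# Hypothetical child table: zero or more rows per customer.
transactions = pd.DataFrame({
    "customer_id": [1, 1, 2, 2, 2],
    "amount": [10.0, 30.0, 5.0, 15.0, 40.0],
})


def build_features(customers: pd.DataFrame, transactions: pd.DataFrame) -> pd.DataFrame:
    """Return one row per customer with hand-written transaction features."""
    # Each named aggregate below is one manually designed feature.
    per_customer = (
        transactions.groupby("customer_id")["amount"]
        .agg(txn_count="count", txn_total="sum", txn_mean="mean")
        .reset_index()
    )
    # A left join keeps customers that have no transactions at all.
    features = customers.merge(per_customer, on="customer_id", how="left")
    # No transactions means a count and total of zero; the mean stays missing.
    features[["txn_count", "txn_total"]] = features[["txn_count", "txn_total"]].fillna(0)
    features["txn_count"] = features["txn_count"].astype(int)
    return features


features = build_features(customers, transactions)
print(features)
```

Every feature here is tied to the column names of this particular dataset, which illustrates why such code is problem-dependent and has to be rewritten for each new one.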

Papers

Showing 76-100 of 1706 papers

Title | Status | Hype
Short-term Renewable Energy Forecasting in Greece using Prophet Decomposition and Tree-based Ensembles | Code | 1
Enhancing the Analysis of Software Failures in Cloud Computing Systems with Deep Learning | Code | 1
Predicting crop yields with little ground truth: A simple statistical model for in-season forecasting | Code | 1
Mill.jl and JsonGrinder.jl: automated differentiable feature extraction for learning from raw JSON data | Code | 1
Itsy Bitsy SpiderNet: Fully Connected Residual Network for Fraud Detection | Code | 1
Anomaly Detection for Solder Joints Using β-VAE | Code | 1
XCrossNet: Feature Structure-Oriented Learning for Click-Through Rate Prediction | Code | 1
AutoGL: A Library for Automated Graph Learning | Code | 1
Self-supervised learning for tool wear monitoring with a disentangled-variational-autoencoder | Code | 1
Memory-based Deep Reinforcement Learning for POMDPs | Code | 1
Symbolic regression for scientific discovery: an application to wind speed forecasting | Code | 1
MalNet: A Large-Scale Image Database of Malicious Software | Code | 1
Knowledge-Preserving Incremental Social Event Detection via Heterogeneous GNNs | Code | 1
The Challenges of Persian User-generated Textual Content: A Machine Learning-Based Approach | Code | 1
Summaformers @ LaySumm 20, LongSumm 20 | Code | 1
Simplified DOM Trees for Transferable Attribute Extraction from the Web | Code | 1
Statistical learning for accurate and interpretable battery lifetime prediction | Code | 1
Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global Inference | Code | 1
Enabling Collaborative Data Science Development with the Ballet Framework | Code | 1
Binary Black-box Evasion Attacks Against Deep Learning-based Static Malware Detectors with Adversarial Byte-Level Language Model | Code | 1
Yelp Review Rating Prediction: Machine Learning and Deep Learning Models | Code | 1
CodeCMR: Cross-Modal Retrieval For Function-Level Binary Source Code Matching | Code | 1
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? | Code | 1
Short-Term Load Forecasting using Bi-directional Sequential Models and Feature Engineering for Small Datasets | Code | 1
Classification of Periodic Variable Stars with Novel Cyclic-Permutation Invariant Neural Networks | Code | 1
Page 4 of 69

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CNN | 14 gestures accuracy | 0.98 | | Unverified