Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 26–50 of 1706 papers

Title	Date	Tasks	Status
GSDFuse: Capturing Cognitive Inconsistencies from Multi-Dimensional Weak Signals in Social Media Steganalysis	May 20, 2025	Data AugmentationFeature Engineering	CodeCode Available
Text embedding models can be great data engineers	May 20, 2025	Feature EngineeringTime Series	—Unverified
Time to Embed: Unlocking Foundation Models for Time Series with Channel Descriptions	May 20, 2025	Feature EngineeringRepresentation Learning	—Unverified
Deep Learning-Based Forecasting of Boarding Patient Counts to Address ED Overcrowding	May 20, 2025	Feature Engineering	—Unverified
Enhancing Abstractive Summarization of Scientific Papers Using Structure Information	May 20, 2025	Abstractive Text SummarizationFeature Engineering	CodeCode Available
Machine Learning-Based Prediction of Mortality in Geriatric Traumatic Brain Injury Patients	May 19, 2025	Decision MakingFeature Engineering	—Unverified
A Hybrid Quantum Classical Pipeline for X Ray Based Fracture Diagnosis	May 19, 2025	Dimensionality ReductionFeature Engineering	—Unverified
Lightweight Spatio-Temporal Attention Network with Graph Embedding and Rotational Position Encoding for Traffic Forecasting	May 17, 2025	Feature EngineeringGraph Embedding	—Unverified
IISE PG&E Energy Analytics Challenge 2025: Hourly-Binned Regression Models Beat Transformers in Load Forecasting	May 16, 2025	Computational EfficiencyDeep Learning	—Unverified
NeurIPS 2024 Ariel Data Challenge: Characterisation of Exoplanetary Atmospheres Using a Data-Centric Approach	May 13, 2025	Feature Engineering	—Unverified
Benchmarking Graph Neural Networks for Document Layout Analysis in Public Affairs	May 12, 2025	BenchmarkingDocument Layout Analysis	—Unverified
Machine Learning-Based Detection of DDoS Attacks in VANETs for Emergency Vehicle Communication	May 12, 2025	Feature EngineeringFeature Importance	—Unverified
QoSBERT: An Uncertainty-Aware Approach based on Pre-trained Language Models for Service Quality Prediction	May 9, 2025	Feature EngineeringPrediction	—Unverified
Latte: Transfering LLMs` Latent-level Knowledge for Few-shot Tabular Learning	May 8, 2025	Feature EngineeringGeneral Knowledge	—Unverified
Rethinking Multimodal Sentiment Analysis: A High-Accuracy, Simplified Fusion Architecture	May 5, 2025	Emotion ClassificationFeature Engineering	—Unverified
Wide & Deep Learning for Node Classification	May 4, 2025	ClassificationDeep Learning	CodeCode Available
MPEC: Manifold-Preserved EEG Classification via an Ensemble of Clustering-Based Classifiers	Apr 30, 2025	ClassificationClustering	—Unverified
LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection	Apr 25, 2025	Feature EngineeringRAG	—Unverified
FLARE: Feature-based Lightweight Aggregation for Robust Evaluation of IoT Intrusion Detection	Apr 21, 2025	Feature EngineeringIntrusion Detection	—Unverified
Word Embedding Techniques for Classification of Star Ratings	Apr 18, 2025	ClassificationDimensionality Reduction	—Unverified
HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection	Apr 18, 2025	DecoderFeature Engineering	—Unverified
Morphing-based Compression for Data-centric ML Pipelines	Apr 15, 2025	Feature Engineering	—Unverified
Beyond Glucose-Only Assessment: Advancing Nocturnal Hypoglycemia Prediction in Children with Type 1 Diabetes	Apr 12, 2025	Decision MakingFeature Engineering	—Unverified
Bringing Structure to Naturalness: On the Naturalness of ASTs	Apr 11, 2025	Feature EngineeringLanguage Modelling	—Unverified
Boosting Relational Deep Learning with Pretrained Tabular Models	Apr 7, 2025	Deep LearningFeature Engineering	CodeCode Available

Show:10 25 50

← PrevPage 2 of 69Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CNN	14 gestures accuracy	0.98	—	Unverified