Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–250 of 1706 papers

Title	Date	Tasks	Status	Hype
Augmenting train maintenance technicians with automated incident diagnostic suggestions	Aug 19, 2024	DiagnosticFeature Engineering	—Unverified	0
Understanding Generative AI Content with Embedding Models	Aug 19, 2024	Feature Engineering	—Unverified	0
EEG Right & Left Voluntary Hand Movement-based Virtual Brain-Computer Interfacing Keyboard Using Hybrid Deep Learning Approach	Aug 18, 2024	Brain Computer InterfaceDeep Learning	—Unverified	0
Detecting Unsuccessful Students in Cybersecurity Exercises in Two Different Learning Environments	Aug 16, 2024	Feature Engineering	CodeCode Available	0
Improving VTE Identification through Language Models from Radiology Reports: A Comparative Study of Mamba, Phi-3 Mini, and BERT	Aug 16, 2024	Feature EngineeringLanguage Modelling	—Unverified	0
LOLgorithm: Integrating Semantic,Syntactic and Contextual Elements for Humor Classification	Aug 12, 2024	Feature EngineeringHumor Detection	—Unverified	0
Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach	Aug 7, 2024	Depth EstimationFeature Engineering	—Unverified	0
Classification of Raw MEG/EEG Data with Detach-Rocket Ensemble: An Improved ROCKET Algorithm for Multivariate Time Series Analysis	Aug 5, 2024	ClassificationEEG	CodeCode Available	1
IBB Traffic Graph Data: Benchmarking and Road Traffic Prediction Model	Aug 2, 2024	BenchmarkingFeature Engineering	—Unverified	0
Improving Machine Learning Based Sepsis Diagnosis Using Heart Rate Variability	Aug 1, 2024	Decision MakingFeature Engineering	—Unverified	0
AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models	Aug 1, 2024	AutoMLCode Generation	CodeCode Available	0
RelBench: A Benchmark for Deep Learning on Relational Databases	Jul 29, 2024	Deep LearningFeature Engineering	CodeCode Available	3
Stochastic Parrots or ICU Experts? Large Language Models in Critical Care Medicine: A Scoping Review	Jul 27, 2024	ArticlesEthics	—Unverified	0
An Efficient and Flexible Deep Learning Method for Signal Delineation via Keypoints Estimation	Jul 24, 2024	Feature EngineeringKeypoint Estimation	—Unverified	0
Self-Reasoning Assistant Learning for non-Abelian Gauge Fields Design	Jul 23, 2024	Feature Engineering	—Unverified	0
Fever Detection with Infrared Thermography: Enhancing Accuracy through Machine Learning Techniques	Jul 22, 2024	DiagnosticFeature Engineering	—Unverified	0
Temperature Distribution Prediction in Laser Powder Bed Fusion using Transferable and Scalable Graph Neural Networks	Jul 18, 2024	Bayesian OptimizationFeature Engineering	—Unverified	0
Evaluating Large Language Models for Anxiety and Depression Classification using Counseling and Psychotherapy Transcripts	Jul 18, 2024	Feature Engineering	CodeCode Available	0
GraphGuard: Contrastive Self-Supervised Learning for Credit-Card Fraud Detection in Multi-Relational Dynamic Graphs	Jul 17, 2024	Feature EngineeringFraud Detection	—Unverified	0
Molecular Topological Profile (MOLTOP) -- Simple and Strong Baseline for Molecular Graph Classification	Jul 16, 2024	Feature EngineeringGraph Classification	CodeCode Available	0
GPT Assisted Annotation of Rhetorical and Linguistic Features for Interpretable Propaganda Technique Detection in News Text	Jul 16, 2024	Feature EngineeringLanguage Modelling	—Unverified	0
Deep-Graph-Sprints: Accelerated Representation Learning in Continuous-Time Dynamic Graphs	Jul 10, 2024	Deep LearningFeature Engineering	—Unverified	0
Advancing Automated Deception Detection: A Multimodal Approach to Feature Extraction and Analysis	Jul 8, 2024	Deception DetectionFeature Engineering	CodeCode Available	0
MERGE -- A Bimodal Audio-Lyrics Dataset for Static Music Emotion Recognition	Jul 8, 2024	BenchmarkingDeep Learning	—Unverified	0
Automating Venture Capital: Founder assessment using LLM-powered segmentation, feature engineering and automated labeling techniques	Jul 5, 2024	Decision MakingFeature Engineering	—Unverified	0
GraphCNNpred: A stock market indices prediction using a Graph based deep learning system	Jul 4, 2024	Feature EngineeringGraph Neural Network	—Unverified	0
OSPC: Artificial VLM Features for Hateful Meme Detection	Jul 3, 2024	Computational EfficiencyFeature Engineering	—Unverified	0
A Data-Centric Perspective on Evaluating Machine Learning Models for Tabular Data	Jul 2, 2024	Feature EngineeringHyperparameter Optimization	CodeCode Available	1
The Remarkable Robustness of LLMs: Stages of Inference?	Jun 27, 2024	Feature EngineeringPrediction	CodeCode Available	1
TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks	Jun 27, 2024	Feature EngineeringModel Selection	CodeCode Available	4
PharmaGPT: Domain-Specific Large Language Models for Bio-Pharmaceutical and Chemistry	Jun 26, 2024	Feature EngineeringLanguage Modeling	—Unverified	0
Comparing fingers and gestures for bci control using an optimized classical machine learning decoder	Jun 25, 2024	DecoderFeature Engineering	—Unverified	0
Horseshoe-type Priors for Independent Component Estimation	Jun 24, 2024	Feature Engineering	—Unverified	0
LightGBM robust optimization algorithm based on topological data analysis	Jun 19, 2024	ClassificationFeature Engineering	—Unverified	0
PathoLM: Identifying pathogenicity from the DNA sequence through the Genome Foundation Model	Jun 19, 2024	Feature EngineeringLanguage Modelling	CodeCode Available	0
Retrieval-Augmented Feature Generation for Domain-Specific Classification	Jun 17, 2024	Classificationdomain classification	—Unverified	0
Deep Learning Domain Adaptation to Understand Physico-Chemical Processes from Fluorescence Spectroscopy Small Datasets: Application to Ageing of Olive Oil	Jun 14, 2024	Domain AdaptationFeature Engineering	—Unverified	0
Explainable AI for Comparative Analysis of Intrusion Detection Models	Jun 14, 2024	Explainable artificial intelligenceExplainable Artificial Intelligence (XAI)	CodeCode Available	0
Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning	Jun 12, 2024	Automated Feature EngineeringFeature Engineering	CodeCode Available	1
Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy	Jun 11, 2024	Decision MakingFeature Engineering	—Unverified	0
Learned Feature Importance Scores for Automated Feature Engineering	Jun 6, 2024	Automated Feature EngineeringFeature Engineering	—Unverified	0
Dynamic and Adaptive Feature Generation with LLM	Jun 4, 2024	Automated Feature EngineeringFeature Engineering	—Unverified	0
DeepMol: An Automated Machine and Deep Learning Framework for Computational Chemistr	Jun 1, 2024	Activity PredictionAutoML	CodeCode Available	2
Iterative Feature Boosting for Explainable Speech Emotion Recognition	May 30, 2024	Emotion RecognitionFeature Engineering	CodeCode Available	0
Network Analytics for Anti-Money Laundering -- A Systematic Literature Review and Experimental Evaluation	May 29, 2024	Feature EngineeringFraud Detection	CodeCode Available	1
Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson's Disease Severity in Walking Sequences	May 28, 2024	BenchmarkingFeature Engineering	CodeCode Available	1
Advancements in Tactile Hand Gesture Recognition for Enhanced Human-Machine Interaction	May 27, 2024	Feature EngineeringGesture Recognition	—Unverified	0
Transitional Uncertainty with Layered Intermediate Predictions	May 25, 2024	Feature Engineering	—Unverified	0
Maintaining and Managing Road Quality:Using MLP and DNN	May 25, 2024	Feature Engineering	—Unverified	0
Wearable-based behaviour interpolation for semi-supervised human activity recognition	May 24, 2024	Activity RecognitionDeep Learning	—Unverified	0

Show:10 25 50

← PrevPage 5 of 35Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CNN	14 gestures accuracy	0.98	—	Unverified