Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables, called features, that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with observations in the rows and features in the columns.
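As a minimal sketch of gathering several tables into one, the example below aggregates a child table and joins the result onto a parent table, so each row is one observation and each column one feature. The `clients` and `loans` tables, their column names, and the `build_feature_table` helper are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical parent table: one row per client (the observations).
clients = [
    {"client_id": 1, "income": 50000},
    {"client_id": 2, "income": 72000},
]

# Hypothetical child table: zero or more loans per client.
loans = [
    {"client_id": 1, "amount": 1000},
    {"client_id": 1, "amount": 3000},
    {"client_id": 2, "amount": 5000},
]


def build_feature_table(clients, loans):
    """Aggregate loans per client and join the aggregates onto the client rows."""
    # Group loan amounts by the key shared between the two tables.
    amounts = defaultdict(list)
    for loan in loans:
        amounts[loan["client_id"]].append(loan["amount"])

    table = []
    for client in clients:
        client_amounts = amounts.get(client["client_id"], [])
        table.append({
            **client,  # keep the parent table's own columns
            "loan_count": len(client_amounts),
            "loan_total": sum(client_amounts),
            "loan_mean": mean(client_amounts) if client_amounts else 0.0,
        })
    return table


feature_table = build_feature_table(clients, loans)
for row in feature_table:
    print(row)
```

Each dictionary in `feature_table` is a single training row. In practice a dataframe library would typically do the grouping and joining, but the shape of the result is the same.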

The traditional approach, known as manual feature engineering, is to build features one at a time using domain knowledge, which is tedious, time-consuming, and error-prone. The code for manual feature engineering is problem-dependent and must be rewritten for each new dataset.
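To illustrate why manual feature code is tied to one problem, here is a small sketch of hand-written features for a hypothetical loan-application record. The field names (`debt`, `income`, `opened`, `applied`) and the `manual_features` function are assumptions made for this example. Every line encodes a piece of domain knowledge about that specific dataset, so none of it carries over to a dataset with different columns.

```python
from datetime import date


def manual_features(row):
    """Hand-written features for a hypothetical loan-application row."""
    return {
        # Domain knowledge: debt relative to income matters more than raw debt.
        "debt_to_income": row["debt"] / row["income"] if row["income"] else 0.0,
        # Domain knowledge: weekend applications may behave differently.
        "applied_on_weekend": row["applied"].weekday() >= 5,
        # Domain knowledge: older accounts are assumed to be lower risk.
        "account_age_days": (row["applied"] - row["opened"]).days,
    }


row = {
    "debt": 12000,
    "income": 48000,
    "opened": date(2020, 1, 1),
    "applied": date(2024, 1, 6),
}
features = manual_features(row)
print(features)
```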

Papers


Showing 151–200 of 1706 papers

Title	Date	Tasks	Status	Hype
See it, Think it, Sorted: Large Multimodal Models are Few-shot Time Series Anomaly Analyzers	Nov 4, 2024	Anomaly Detection, Feature Engineering	Unverified	0
Explainable cognitive decline detection in free dialogues with a Machine Learning approach based on pre-trained Large Language Models	Nov 4, 2024	Feature Engineering, Prompt Engineering	Unverified	0
Enhancing Glucose Level Prediction of ICU Patients through Hierarchical Modeling of Irregular Time-Series	Nov 3, 2024	Data Integration, Feature Engineering	Code Available	0
Enriching Tabular Data with Contextual LLM Embeddings: A Comprehensive Ablation Study for Ensemble Classifiers	Nov 3, 2024	Ensemble Learning, Feature Engineering	Unverified	0
Machine Learning Framework for Audio-Based Content Evaluation using MFCC, Chroma, Spectral Contrast, and Temporal Feature Engineering	Oct 31, 2024	Feature Engineering	Unverified	0
Can Models Help Us Create Better Models? Evaluating LLMs as Data Scientists	Oct 30, 2024	Feature Engineering	Code Available	1
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions	Oct 27, 2024	Feature Engineering	Code Available	3
Predicting 30-Day Hospital Readmission in Medicare Patients: Insights from an LSTM Deep Learning Model	Oct 23, 2024	Feature Engineering, Readmission Prediction	Unverified	0
Large Language Models Engineer Too Many Simple Features For Tabular Data	Oct 23, 2024	Feature Engineering, Text Generation	Code Available	0
AdaptoML-UX: An Adaptive User-centered GUI-based AutoML Toolkit for Non-AI Experts and HCI Researchers	Oct 22, 2024	Automated Feature Engineering, AutoML	Code Available	0
Molecular Topological Profile (MOLTOP) - Simple and Strong Baseline for Molecular Graph Classification	Oct 17, 2024	Feature Engineering, Graph Classification	Code Available	0
Reproducible Machine Learning-based Voice Pathology Detection: Introducing the Pitch Difference Feature	Oct 14, 2024	Feature Engineering, Voice pathology detection	Code Available	0
Statistical Test for Auto Feature Engineering by Selective Inference	Oct 13, 2024	Feature Engineering	Code Available	0
ELF-Gym: Evaluating Large Language Models Generated Features for Tabular Prediction	Oct 13, 2024	Feature Engineering	Code Available	0
Sui Generis: Large Language Models for Authorship Attribution and Verification in Latin	Oct 11, 2024	Authorship Attribution, Authorship Verification	Unverified	0
Towards Trustworthy Web Attack Detection: An Uncertainty-Aware Ensemble Deep Kernel Learning Model	Oct 10, 2024	Ensemble Learning, Feature Engineering	Unverified	0
Principal Orthogonal Latent Components Analysis (POLCA Net)	Oct 9, 2024	Dimensionality Reduction, Feature Correlation	Code Available	0
Neural-Bayesian Program Learning for Few-shot Dialogue Intent Parsing	Oct 8, 2024	Feature Engineering, Few-Shot Learning	Unverified	0
Learning to Solve Abstract Reasoning Problems with Neurosymbolic Program Synthesis and Task Generation	Oct 6, 2024	Feature Engineering, Program Synthesis	Unverified	0
Self-eXplainable AI for Medical Image Analysis: A Survey and New Outlooks	Oct 3, 2024	counterfactual, Counterfactual Explanation	Unverified	0
Semantic-Guided RL for Interpretable Feature Engineering	Oct 3, 2024	Automated Feature Engineering, Deep Reinforcement Learning	Unverified	0
Enhancing End Stage Renal Disease Outcome Prediction: A Multi-Sourced Data-Driven Approach	Oct 2, 2024	Data Integration, Feature Engineering	Unverified	0
Automatic deductive coding in discourse analysis: an application of large language models in learning analytics	Oct 2, 2024	Feature Engineering, Language Modeling	Code Available	0
LML-DAP: Language Model Learning a Dataset for Data-Augmented Prediction	Sep 27, 2024	Classification, Feature Engineering	Code Available	1
Enhanced Convolution Neural Network with Optimized Pooling and Hyperparameter Tuning for Network Intrusion Detection	Sep 27, 2024	Attention Score Prediction, Feature Engineering	Code Available	0
Reinforcement Feature Transformation for Polymer Property Performance Prediction	Sep 23, 2024	Feature Engineering, Prediction	Unverified	0
A Feature Engineering Approach for Literary and Colloquial Tamil Speech Classification using 1D-CNN	Sep 22, 2024	Feature Engineering, Form	Unverified	0
Investigation of Time-Frequency Feature Combinations with Histogram Layer Time Delay Neural Networks	Sep 20, 2024	Feature Engineering	Unverified	0
Machine Learning for Public Good: Predicting Urban Crime Patterns to Enhance Community Safety	Sep 17, 2024	Feature Engineering	Unverified	0
Leveraging Open-Source Large Language Models for Native Language Identification	Sep 15, 2024	Feature Engineering, Language Acquisition	Unverified	0
MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving	Sep 11, 2024	Autonomous Driving, Feature Engineering	Code Available	2
Learn2Aggregate: Supervised Generation of Chvátal-Gomory Cuts Using Graph Neural Networks	Sep 10, 2024	Feature Engineering, Graph Neural Network	Unverified	0
HybridFC: A Hybrid Fact-Checking Approach for Knowledge Graphs	Sep 10, 2024	Diversity, Ensemble Learning	Code Available	0
Machine Learning-Based Prediction of Key Genes Correlated to the Subretinal Lesion Severity in a Mouse Model of Age-Related Macular Degeneration	Sep 8, 2024	Dimensionality Reduction, Drug Discovery	Unverified	0
IIFE: Interaction Information Based Automated Feature Engineering	Sep 7, 2024	Automated Feature Engineering, Feature Engineering	Code Available	0
Large Margin Prototypical Network for Few-shot Relation Classification with Fine-grained Features	Sep 6, 2024	Feature Engineering, Few-Shot Learning	Unverified	0
Leveraging Large Language Models through Natural Language Processing to provide interpretable Machine Learning predictions of mental deterioration in real time	Sep 5, 2024	Chatbot, Diagnostic	Unverified	0
Application Research On Real-Time Perception Of Device Performance Status	Sep 5, 2024	Descriptive, Dimensionality Reduction	Unverified	0
Towards Autonomous Cybersecurity: An Intelligent AutoML Framework for Autonomous Intrusion Detection	Sep 5, 2024	AutoML, Bayesian Optimization	Code Available	1
Hybridization of Persistent Homology with Neural Networks for Time-Series Prediction: A Case Study in Wave Height	Sep 3, 2024	Feature Engineering, Time Series	Unverified	0
PoliPrompt: A High-Performance Cost-Effective LLM-Based Text Classification Framework for Political Science	Sep 2, 2024	Classification, Feature Engineering	Unverified	0
LSTM Recurrent Neural Networks for Cybersecurity Named Entity Recognition	Aug 30, 2024	Articles, Feature Engineering	Unverified	0
Enhancing Customer Churn Prediction in Telecommunications: An Adaptive Ensemble Learning Approach	Aug 29, 2024	Ensemble Learning, Feature Engineering	Unverified	0
Android Malware Detection Based on RGB Images and Multi-feature Fusion	Aug 29, 2024	Android Malware Detection, Edge Detection	Unverified	0
gWaveNet: Classification of Gravity Waves from Noisy Satellite Data using Custom Kernel Integrated Deep Learning Method	Aug 26, 2024	Feature Engineering	Code Available	0
Obfuscated Memory Malware Detection	Aug 23, 2024	Binary Classification, Feature Engineering	Unverified	0
Improving Radiography Machine Learning Workflows via Metadata Management for Training Data Selection	Aug 22, 2024	Feature Engineering, Management	Unverified	0
Graph Classification via Reference Distribution Learning: Theory and Practice	Aug 21, 2024	Feature Engineering, Graph Classification	Unverified	0
Transfer Learning and the Early Estimation of Single-Photon Source Quality using Machine Learning Methods	Aug 21, 2024	Data Augmentation, Feature Engineering	Code Available	0
Improved Differential Evolution based Feature Selection through Quantum, Chaos, and Lasso	Aug 20, 2024	Feature Engineering, feature selection	Unverified	0


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CNN	14 gestures accuracy	0.98	—	Unverified