Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–150 of 1706 papers

Title	Date	Tasks	Status	Hype
Risk Analysis of Flowlines in the Oil and Gas Sector: A GIS and Machine Learning Approach	Jan 20, 2025	Dimensionality ReductionFeature Engineering	CodeCode Available	0
Algorithmic Derivation of Human Spatial Navigation Indices From Eye Movement Data	Jan 18, 2025	Feature EngineeringLandmark Recognition	—Unverified	0
Challenges and recommendations for Electronic Health Records data extraction and preparation for dynamic prediction modelling in hospitalized patients -- a practical guide	Jan 17, 2025	Feature Engineering	—Unverified	0
Dataset-Agnostic Recommender Systems	Jan 13, 2025	Feature Engineeringfeature selection	—Unverified	0
Text to Band Gap: Pre-trained Language Models as Encoders for Semiconductor Band Gap Prediction	Jan 7, 2025	Band GapFeature Engineering	CodeCode Available	0
The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features	Jan 6, 2025	Feature EngineeringTime Series	CodeCode Available	3
Predicting Vulnerability to Malware Using Machine Learning Models: A Study on Microsoft Windows Machines	Jan 5, 2025	Feature EngineeringMalware Detection	—Unverified	0
Classification of Operational Records in Aviation Using Deep Learning Approaches	Jan 2, 2025	ClassificationData Augmentation	—Unverified	0
Multi-Modal Video Feature Extraction for Popularity Prediction	Jan 2, 2025	Feature EngineeringPrediction	—Unverified	0
Dynamic Adaptation in Data Storage: Real-Time Machine Learning for Enhanced Prefetching	Dec 29, 2024	Computational EfficiencyFeature Engineering	—Unverified	0
Assets Forecasting with Feature Engineering and Transformation Methods for LightGBM	Dec 27, 2024	Feature EngineeringFeature Importance	—Unverified	0
Context-Aware Deep Learning for Multi Modal Depression Detection	Dec 26, 2024	Data AugmentationDeep Learning	CodeCode Available	1
Three-Class Text Sentiment Analysis Based on LSTM	Dec 23, 2024	Feature EngineeringSentiment Analysis	—Unverified	0
STAHGNet: Modeling Hybrid-grained Heterogenous Dependency Efficiently for Traffic Prediction	Dec 23, 2024	Feature EngineeringGraph Attention	—Unverified	0
Intelligent Approaches to Predictive Analytics in Occupational Health and Safety in India	Dec 20, 2024	Anomaly DetectionDecision Making	—Unverified	0
Risk-Adjusted Performance of Random Forest Models in High-Frequency Trading	Dec 19, 2024	Algorithmic TradingFeature Engineering	—Unverified	0
PCA-Featured Transformer for Jamming Detection in 5G UAV Networks	Dec 19, 2024	ChunkingFeature Engineering	—Unverified	0
Hunting Tomorrow's Leaders: Using Machine Learning to Forecast S&P 500 Additions & Removal	Dec 17, 2024	Decision MakingFeature Engineering	—Unverified	0
GLARE: Google Apps Arabic Reviews Dataset	Dec 16, 2024	Feature Engineering	CodeCode Available	0
F-RBA: A Federated Learning-based Framework for Risk-based Authentication	Dec 16, 2024	Anomaly DetectionFeature Engineering	—Unverified	0
S&P 500 Trend Prediction	Dec 16, 2024	Feature EngineeringFeature Importance	—Unverified	0
A Progressive Transformer for Unifying Binary Code Embedding and Knowledge Transfer	Dec 15, 2024	Feature EngineeringLanguage Modeling	—Unverified	0
Feature engineering vs. deep learning for paper section identification: Toward applications in Chinese medical literature	Dec 15, 2024	Deep LearningFeature Engineering	—Unverified	0
Deep Learning-Based Noninvasive Screening of Type 2 Diabetes with Chest X-ray Images and Electronic Health Records	Dec 14, 2024	DiagnosticFeature Engineering	CodeCode Available	0
Modeling Story Expectations to Understand Engagement: A Generative Framework Using LLMs	Dec 13, 2024	Feature EngineeringMarketing	—Unverified	0
Vision Transformers for Efficient Indoor Pathloss Radio Map Prediction	Dec 12, 2024	Data AugmentationFeature Engineering	—Unverified	0
Image-Based Malware Classification Using QR and Aztec Codes	Dec 11, 2024	Feature EngineeringMalware Classification	—Unverified	0
Robust Feature Engineering Techniques for Designing Efficient Motor Imagery-Based BCI-Systems	Dec 10, 2024	Brain Computer InterfaceEEG	—Unverified	0
RUL forecasting for wind turbine predictive maintenance based on deep learning	Dec 9, 2024	Feature EngineeringScheduling	—Unverified	0
PRECISE: Pre-training Sequential Recommenders with Collaborative and Semantic Information	Dec 9, 2024	Feature EngineeringRecommendation Systems	—Unverified	0
Parkinson's Disease Diagnosis Through Deep Learning: A Novel LSTM-Based Approach for Freezing of Gait Detection	Dec 9, 2024	Feature EngineeringL2 Regularization	—Unverified	0
Federated Automated Feature Engineering	Dec 5, 2024	Automated Feature EngineeringFeature Engineering	—Unverified	0
Deep Learning in Single-Cell and Spatial Transcriptomics Data Analysis: Advances and Challenges from a Data Science Perspective	Dec 4, 2024	Feature Engineering	—Unverified	0
Comparative Performance of Machine Learning Algorithms for Early Genetic Disorder and Subclass Classification	Dec 3, 2024	Feature Engineering	—Unverified	0
Intelligent Spark Agents: A Modular LangGraph Framework for Scalable, Visualized, and Enhanced Big Data Machine Learning Workflows	Dec 2, 2024	Decision MakingDistributed Computing	—Unverified	0
HiCat: A Semi-Supervised Approach for Cell Type Annotation	Nov 25, 2024	Dimensionality ReductionFeature Engineering	—Unverified	0
An AutoML-based approach for Network Intrusion Detection	Nov 24, 2024	AutoMLFeature Engineering	—Unverified	0
Understanding LLM Embeddings for Regression	Nov 22, 2024	Feature Engineeringregression	—Unverified	0
Enhancing Molecular Design through Graph-based Topological Reinforcement Learning	Nov 22, 2024	Drug DesignDrug Discovery	—Unverified	0
Advancing Heatwave Forecasting via Distribution Informed-Graph Neural Networks (DI-GNNs): Integrating Extreme Value Theory with GNNs	Nov 20, 2024	Feature EngineeringGraph Neural Network	—Unverified	0
Graph Neural Networks for Quantifying Compatibility Mechanisms in Traditional Chinese Medicine	Nov 18, 2024	Drug DiscoveryFeature Engineering	CodeCode Available	1
Is Precise Recovery Necessary? A Task-Oriented Imputation Approach for Time Series Forecasting on Variable Subset	Nov 15, 2024	Feature EngineeringImputation	—Unverified	0
What makes a good BIM design: quantitative linking between design behavior and quality	Nov 14, 2024	Feature Engineering	—Unverified	0
GPTree: Towards Explainable Decision-Making via LLM-powered Decision Trees	Nov 13, 2024	Decision MakingFeature Engineering	—Unverified	0
Large Language Models for Constructing and Optimizing Machine Learning Workflows: A Survey	Nov 11, 2024	AutoMLFeature Engineering	CodeCode Available	0
Classification of residential and non-residential buildings based on satellite data using deep learning	Nov 11, 2024	ClassificationComputational Efficiency	—Unverified	0
RAGulator: Lightweight Out-of-Context Detectors for Grounded Text Generation	Nov 6, 2024	Feature EngineeringRAG	—Unverified	0
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level	Nov 5, 2024	Bayesian OptimisationBenchmarking	—Unverified	0
Correlation of Object Detection Performance with Visual Saliency and Depth Estimation	Nov 5, 2024	Depth EstimationDepth Prediction	CodeCode Available	0
Exploring Feature Importance and Explainability Towards Enhanced ML-Based DoS Detection in AI Systems	Nov 4, 2024	Feature EngineeringFeature Importance	—Unverified	0

Show:10 25 50

← PrevPage 3 of 35Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CNN	14 gestures accuracy	0.98	—	Unverified