Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–125 of 1706 papers

Title	Date	Tasks	Status	Hype
Large Language Models for Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering	May 5, 2023	Automated Feature EngineeringAutoML	CodeCode Available	1
Attention-Based Deep Learning Framework for Human Activity Recognition with User Adaptation	Jun 6, 2020	Activity RecognitionDeep Learning	CodeCode Available	1
Anomaly Detection for Solder Joints Using β-VAE	Apr 24, 2021	Anomaly DetectionFeature Engineering	CodeCode Available	1
Discovering Neural Wirings	Jun 3, 2019	Feature EngineeringNetwork Pruning	CodeCode Available	1
A Survey of Information Cascade Analysis: Models, Predictions, and Recent Advances	May 22, 2020	Feature EngineeringMarketing	CodeCode Available	1
Network Analytics for Anti-Money Laundering -- A Systematic Literature Review and Experimental Evaluation	May 29, 2024	Feature EngineeringFraud Detection	CodeCode Available	1
Automated Website Fingerprinting through Deep Learning	Aug 21, 2017	Deep LearningFeature Engineering	CodeCode Available	1
Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning	Jun 12, 2024	Automated Feature EngineeringFeature Engineering	CodeCode Available	1
Predicting crop yields with little ground truth: A simple statistical model for in-season forecasting	Jun 16, 2021	Crop Yield PredictionFeature Engineering	CodeCode Available	1
PTRAIL -- A python package for parallel trajectory data preprocessing	Aug 26, 2021	Feature EngineeringPosition	CodeCode Available	1
An End-to-End Reinforcement Learning Approach for Job-Shop Scheduling Problems Based on Constraint Programming	Jun 9, 2023	Combinatorial OptimizationFeature Engineering	CodeCode Available	1
Representation learning of writing style	Nov 1, 2020	ArticlesAuthorship Attribution	CodeCode Available	1
Online learning techniques for prediction of temporal tabular datasets with regime changes	Dec 30, 2022	Feature EngineeringModel Selection	CodeCode Available	1
Self-supervised learning for tool wear monitoring with a disentangled-variational-autoencoder	Mar 31, 2021	Anomaly DetectionFeature Engineering	CodeCode Available	1
Short-term Renewable Energy Forecasting in Greece using Prophet Decomposition and Tree-based Ensembles	Jul 8, 2021	Feature EngineeringTime Series	CodeCode Available	1
Simplified DOM Trees for Transferable Attribute Extraction from the Web	Jan 7, 2021	AttributeAttribute Extraction	CodeCode Available	1
SkillGPT: a RESTful API service for skill extraction and standardization using a Large Language Model	Apr 17, 2023	Feature EngineeringLanguage Modeling	CodeCode Available	1
SMUTF: Schema Matching Using Generative Tags and Hybrid Features	Jan 22, 2024	Feature EngineeringHumanitarian	CodeCode Available	1
Supervised Learning on Relational Databases with Graph Neural Networks	Feb 6, 2020	BIG-bench Machine LearningFeature Engineering	CodeCode Available	1
Symbolic regression for scientific discovery: an application to wind speed forecasting	Feb 21, 2021	Feature Engineeringregression	CodeCode Available	1
Blending gradient boosted trees and neural networks for point and probabilistic forecasting of hierarchical time series	Oct 19, 2023	DiversityFeature Engineering	CodeCode Available	1
Synerise at RecSys 2021: Twitter user engagement prediction with a fast neural model	Sep 23, 2021	CPUFeature Engineering	CodeCode Available	1
The Remarkable Robustness of LLMs: Stages of Inference?	Jun 27, 2024	Feature EngineeringPrediction	CodeCode Available	1
Deep Dive into Hunting for LotLs Using Machine Learning and Feature Engineering.	Apr 21, 2023	Feature Engineering	CodeCode Available	1
General-Purpose User Embeddings based on Mobile App Usage	May 27, 2020	Feature Engineering	CodeCode Available	1

Show:10 25 50

← PrevPage 5 of 69Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CNN	14 gestures accuracy	0.98	—	Unverified