Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 76–100 of 1706 papers

Title	Date	Tasks	Status	Hype	Score
DriveML: An R Package for Driverless Machine Learning	May 1, 2020	AutoMLBIG-bench Machine Learning	CodeCode Available	1	5
A Data-Centric Perspective on Evaluating Machine Learning Models for Tabular Data	Jul 2, 2024	Feature EngineeringHyperparameter Optimization	CodeCode Available	1	5
Efficient End-to-End AutoML via Scalable Search Space Decomposition	Jun 19, 2022	AutoMLFeature Engineering	CodeCode Available	1	5
Context-Aware Deep Learning for Multi Modal Depression Detection	Dec 26, 2024	Data AugmentationDeep Learning	CodeCode Available	1	5
Deep & Cross Network for Ad Click Predictions	Aug 17, 2017	Click-Through Rate PredictionFeature Engineering	CodeCode Available	1	5
CodeCMR: Cross-Modal Retrieval For Function-Level Binary Source Code Matching	Dec 1, 2020	Computer SecurityCross-Modal Retrieval	CodeCode Available	1	5
A Survey of Information Cascade Analysis: Models, Predictions, and Recent Advances	May 22, 2020	Feature EngineeringMarketing	CodeCode Available	1	5
Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global Inference	Dec 16, 2020	Feature EngineeringMedical Question Answering	CodeCode Available	1	5
Attention-Based Deep Learning Framework for Human Activity Recognition with User Adaptation	Jun 6, 2020	Activity RecognitionDeep Learning	CodeCode Available	1	5
Fine-Tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring	Sep 19, 2023	Feature EngineeringPhone-level pronunciation scoring	CodeCode Available	1	5
fseval: A Benchmarking Framework for Feature Selection and Feature Ranking Algorithms	Nov 23, 2022	Automated Feature EngineeringBenchmarking	CodeCode Available	1	5
Cognitive Evolutionary Search to Select Feature Interactions for Click-Through Rate Prediction	Aug 1, 2023	Click-Through Rate PredictionEvolutionary Algorithms	CodeCode Available	1	5
AutoGL: A Library for Automated Graph Learning	Apr 11, 2021	AutoMLBIG-bench Machine Learning	CodeCode Available	1	5
Anomaly Detection for Solder Joints Using β-VAE	Apr 24, 2021	Anomaly DetectionFeature Engineering	CodeCode Available	1	5
Understanding the Dynamics of DNNs Using Graph Modularity	Nov 24, 2021	Feature Engineering	CodeCode Available	1	5
AutoML: A Survey of the State-of-the-Art	Aug 2, 2019	AutoMLFeature Engineering	CodeCode Available	1	5
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?	Sep 26, 2019	Feature EngineeringQ-Learning	CodeCode Available	1	5
Itsy Bitsy SpiderNet: Fully Connected Residual Network for Fraud Detection	May 17, 2021	Feature EngineeringFraud Detection	CodeCode Available	1	5
A Hybrid Rule-Based and Neural Coreference Resolution System with an Evaluation on Dutch Literature	Nov 1, 2021	coreference-resolutionCoreference Resolution	CodeCode Available	1	5
Discovering Neural Wirings	Jun 3, 2019	Feature EngineeringNetwork Pruning	CodeCode Available	1	5
Blending gradient boosted trees and neural networks for point and probabilistic forecasting of hierarchical time series	Oct 19, 2023	DiversityFeature Engineering	CodeCode Available	1	5
AutoSmart: An Efficient and Automatic Machine Learning framework for Temporal Relational Data	Sep 9, 2021	AutoMLBIG-bench Machine Learning	CodeCode Available	1	5
Bayesian Optimization of Catalysis With In-Context Learning	Apr 11, 2023	Bayesian OptimizationFeature Engineering	CodeCode Available	1	5
Binary Black-box Evasion Attacks Against Deep Learning-based Static Malware Detectors with Adversarial Byte-Level Language Model	Dec 14, 2020	Deep LearningFeature Engineering	CodeCode Available	1	5
Compatible deep neural network framework with financial time series data, including data preprocessor, neural network model and trading strategy	May 11, 2022	Binary ClassificationFeature Engineering	CodeCode Available	1	5

Show:10 25 50

← PrevPage 4 of 69Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CNN	14 gestures accuracy	0.98	—	Unverified