Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1576–1600 of 1706 papers

Title	Date	Tasks	Status
RELand: Risk Estimation of Landmines via Interpretable Invariant Risk Minimization	Nov 6, 2023	Feature EngineeringHumanitarian	CodeCode Available
Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision	Oct 31, 2016	Feature EngineeringStructured Prediction	CodeCode Available
Neural Vector Spaces for Unsupervised Information Retrieval	Aug 9, 2017	Document RankingFeature Engineering	CodeCode Available
Large Language Models Engineer Too Many Simple Features For Tabular Data	Oct 23, 2024	Feature EngineeringText Generation	CodeCode Available
Boosting Relational Deep Learning with Pretrained Tabular Models	Apr 7, 2025	Deep LearningFeature Engineering	CodeCode Available
Large Language Models for Constructing and Optimizing Machine Learning Workflows: A Survey	Nov 11, 2024	AutoMLFeature Engineering	CodeCode Available
Neural Word Segmentation Learning for Chinese	Jun 14, 2016	Chinese Word SegmentationFeature Engineering	CodeCode Available
Active DOP: A constituency treebank annotation tool with online learning	Aug 1, 2018	Active LearningFeature Engineering	CodeCode Available
Attention-based Neural Text Segmentation	Aug 29, 2018	Feature EngineeringSegmentation	CodeCode Available
Relation Classification via Recurrent Neural Network	Aug 5, 2015	ClassificationFeature Engineering	CodeCode Available
xDeepInt: a hybrid architecture for modeling the vector-wise and bit-wise feature interactions	Jan 3, 2023	Click-Through Rate PredictionFeature Engineering	CodeCode Available
Extracting Parallel Sentences with Bidirectional Recurrent Neural Networks to Improve Machine Translation	Jun 13, 2018	ArticlesFeature Engineering	CodeCode Available
Small Language Models for Tabular Data	Nov 5, 2022	Feature EngineeringRepresentation Learning	CodeCode Available
Extracting Relational Facts by an End-to-End Neural Model with Copy Mechanism	Jul 1, 2018	DecoderFeature Engineering	CodeCode Available
Large-Scale Multi-Domain Recommendation: an Automatic Domain Feature Extraction and Personalized Integration Framework	Apr 12, 2024	Feature EngineeringTask 2	CodeCode Available
Attention-Based Convolutional Neural Network for Semantic Relation Extraction	Dec 1, 2016	Feature EngineeringGeneral Classification	CodeCode Available
Extreme Learning Machine for the Characterization of Anomalous Diffusion from Single Trajectories	May 6, 2021	Feature Engineering	CodeCode Available
FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes	Jun 3, 2025	BenchmarkingFeature Engineering	CodeCode Available
Fair multilingual vandalism detection system for Wikipedia	Jun 2, 2023	Feature EngineeringLanguage Modeling	CodeCode Available
Cross-lingual Knowledge Graph Alignment via Graph Convolutional Networks	Oct 1, 2018	AttributeEntity Alignment	CodeCode Available
A Generalized Flow for B2B Sales Predictive Modeling: An Azure Machine Learning Approach	Feb 4, 2020	BIG-bench Machine LearningDecision Making	CodeCode Available
ATM: A distributed, collaborative, scalable system for automated machine learning	Dec 11, 2017	AutoMLBIG-bench Machine Learning	CodeCode Available
False Information on Web and Social Media: A Survey	Apr 23, 2018	Feature EngineeringGraph Mining	CodeCode Available
SMILES2Vec: An Interpretable General-Purpose Deep Neural Network for Predicting Chemical Properties	Dec 6, 2017	Bayesian OptimizationFeature Engineering	CodeCode Available
Correlation of Object Detection Performance with Visual Saliency and Depth Estimation	Nov 5, 2024	Depth EstimationDepth Prediction	CodeCode Available

Show:10 25 50

← PrevPage 64 of 69Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CNN	14 gestures accuracy	0.98	—	Unverified