Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 651–700 of 1706 papers

Title	Date	Tasks	Status	Hype
Minimal-Configuration Anomaly Detection for IIoT Sensors	Oct 8, 2021	Anomaly DetectionFeature Engineering	—Unverified	0
Learning post-processing for QRS detection using Recurrent Neural Network	Oct 7, 2021	Deep LearningFeature Engineering	—Unverified	0
Post-hoc Models for Performance Estimation of Machine Learning Inference	Oct 6, 2021	BIG-bench Machine LearningFeature Engineering	—Unverified	0
GenTAL: Generative Denoising Skip-gram Transformer for Unsupervised Binary Code Similarity Detection	Sep 29, 2021	Contrastive LearningDenoising	—Unverified	0
Automated Mobile Attention KPConv Networks via A Wide & Deep Predictor	Sep 29, 2021	3D Point Cloud ClassificationFeature Engineering	—Unverified	0
Deep Learning-Based Detection of the Acute Respiratory Distress Syndrome: What Are the Models Learning?	Sep 25, 2021	Feature EngineeringRespiratory Failure	—Unverified	0
Synerise at RecSys 2021: Twitter user engagement prediction with a fast neural model	Sep 23, 2021	CPUFeature Engineering	CodeCode Available	1
SFFDD: Deep Neural Network with Enriched Features for Failure Prediction with Its Application to Computer Disk Driver	Sep 20, 2021	Feature EngineeringTime Series	—Unverified	0
Unsupervised Continual Learning in Streaming Environments	Sep 20, 2021	ClusteringContinual Learning	—Unverified	0
Feature Engineering for US State Legislative Hearings: Stance, Affiliation, Engagement and Absentees	Sep 18, 2021	Feature Engineering	—Unverified	0
Generative Pre-Training from Molecules	Sep 16, 2021	Feature EngineeringGeneral Knowledge	CodeCode Available	1
Comparing Feature-Engineering and Feature-Learning Approaches for Multilingual Translationese Classification	Sep 15, 2021	Feature EngineeringFeature Importance	—Unverified	0
Beyond Glass-Box Features: Uncertainty Quantification Enhanced Quality Estimation for Neural Machine Translation	Sep 15, 2021	Feature EngineeringLanguage Modeling	—Unverified	0
A comparative study of six model complexity metrics to search for parsimonious models with GAparsimony R Package	Sep 10, 2021	Feature Engineeringfeature selection	—Unverified	0
AutoSmart: An Efficient and Automatic Machine Learning framework for Temporal Relational Data	Sep 9, 2021	AutoMLBIG-bench Machine Learning	CodeCode Available	1
Detecting Attacks on IoT Devices using Featureless 1D-CNN	Sep 9, 2021	Anomaly DetectionBIG-bench Machine Learning	—Unverified	0
Data Science Kitchen at GermEval 2021: A Fine Selection of Hand-Picked Features, Delivered Fresh from the Oven	Sep 6, 2021	Fact CheckingFeature Engineering	CodeCode Available	0
RF-LighGBM: A probabilistic ensemble way to predict customer repurchase behaviour in community e-commerce	Sep 2, 2021	Feature EngineeringHyperparameter Optimization	—Unverified	0
Sequence-to-Sequence Learning with Latent Neural Grammars	Sep 2, 2021	DiagnosticFeature Engineering	CodeCode Available	1
Personality Trait Identification Using the Russian Feature Extraction Toolkit	Sep 1, 2021	Feature EngineeringSentence	—Unverified	0
Precog-LTRC-IIITH at GermEval 2021: Ensembling Pre-Trained Language Models with Feature Engineering	Sep 1, 2021	Data AugmentationFeature Engineering	CodeCode Available	0
Time Series Prediction using Deep Learning Methods in Healthcare	Aug 30, 2021	Deep LearningFeature Engineering	—Unverified	0
Growing Cosine Unit: A Novel Oscillatory Activation Function That Can Speedup Training and Reduce Parameters in Convolutional Neural Networks	Aug 30, 2021	Feature Engineering	—Unverified	0
End-To-End Anomaly Detection for Identifying Malicious Cyber Behavior through NLP-Based Log Embeddings	Aug 27, 2021	Anomaly DetectionFeature Engineering	—Unverified	0
PTRAIL -- A python package for parallel trajectory data preprocessing	Aug 26, 2021	Feature EngineeringPosition	CodeCode Available	1
Towards Personalized and Human-in-the-Loop Document Summarization	Aug 21, 2021	Document SummarizationFeature Engineering	—Unverified	0
Data-driven Smart Ponzi Scheme Detection	Aug 20, 2021	Dynamic graph embeddingFeature Engineering	—Unverified	0
Graph Contrastive Learning for Anomaly Detection	Aug 17, 2021	Anomaly DetectionBinary Classification	CodeCode Available	1
Feature Engineering with Regularity Structures	Aug 12, 2021	Feature Engineering	CodeCode Available	0
Empirical Analysis on Effectiveness of NLP Methods for Predicting Code Smell	Aug 8, 2021	Feature Engineering	—Unverified	0
Deep Learning Chromatic and Clique Numbers of Graphs	Aug 4, 2021	Combinatorial OptimizationDeep Learning	CodeCode Available	0
Effective Model Integration Algorithm for Improving Link and Sign Prediction in Complex Networks	Aug 3, 2021	Decision MakingFeature Engineering	—Unverified	0
Classification of Electrical Impedance Tomography Data Using Machine Learning	Aug 2, 2021	BIG-bench Machine LearningClassification	—Unverified	0
Efficient Deep Feature Calibration for Cross-Modal Joint Embedding Learning	Aug 2, 2021	Feature EngineeringTriplet	—Unverified	0
Alejandro Mosquera at SemEval-2021 Task 1: Exploring Sentence and Word Features for Lexical Complexity Prediction	Aug 1, 2021	Feature EngineeringLexical Complexity Prediction	—Unverified	0
CLULEX at SemEval-2021 Task 1: A Simple System Goes a Long Way	Aug 1, 2021	Feature EngineeringLexical Complexity Prediction	—Unverified	0
A Plant Root System Algorithm Based on Swarm Intelligence for One-dimensional Biomedical Signal Feature Engineering	Jul 31, 2021	DiagnosticFeature Engineering	—Unverified	0
AutoML Meets Time Series Regression Design and Analysis of the AutoSeries Challenge	Jul 28, 2021	AutoMLFeature Engineering	CodeCode Available	0
Leaf-FM: A Learnable Feature Generation Factorization Machine for Click-Through Rate Prediction	Jul 26, 2021	Click-Through Rate PredictionFeature Engineering	—Unverified	0
Multi-Perspective Content Delivery Networks Security Framework Using Optimized Unsupervised Anomaly Detection	Jul 24, 2021	Anomaly DetectionFeature Engineering	—Unverified	0
LocalGLMnet: interpretable deep learning for tabular data	Jul 23, 2021	Deep LearningFeature Engineering	—Unverified	0
Establishing process-structure linkages using Generative Adversarial Networks	Jul 20, 2021	Conditional Image GenerationFeature Engineering	CodeCode Available	1
VolcanoML: Speeding up End-to-End AutoML via Scalable Search Space Decomposition	Jul 19, 2021	AutoMLFeature Engineering	CodeCode Available	1
Residual Attention Based Network for Automatic Classification of Phonation Modes	Jul 18, 2021	ClassificationFeature Engineering	—Unverified	0
Short-term Renewable Energy Forecasting in Greece using Prophet Decomposition and Tree-based Ensembles	Jul 8, 2021	Feature EngineeringTime Series	CodeCode Available	1
Feature Cross Search via Submodular Optimization	Jul 5, 2021	Feature Engineering	—Unverified	0
NOTE: Solution for KDD-CUP 2021 WikiKG90M-LSC	Jul 5, 2021	Feature EngineeringQuestion Answering	—Unverified	0
A Data-Driven Method for Recognizing Automated Negotiation Strategies	Jul 3, 2021	Feature EngineeringTime Series	—Unverified	0
Free-Text Keystroke Dynamics for User Authentication	Jul 1, 2021	Feature Engineering	—Unverified	0
Enhancing the Analysis of Software Failures in Cloud Computing Systems with Deep Learning	Jun 29, 2021	Anomaly DetectionCloud Computing	CodeCode Available	1

Show:10 25 50

← PrevPage 14 of 35Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CNN	14 gestures accuracy	0.98	—	Unverified