SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 651700 of 1706 papers

TitleStatusHype
Minimal-Configuration Anomaly Detection for IIoT Sensors0
Learning post-processing for QRS detection using Recurrent Neural Network0
Post-hoc Models for Performance Estimation of Machine Learning Inference0
GenTAL: Generative Denoising Skip-gram Transformer for Unsupervised Binary Code Similarity Detection0
Automated Mobile Attention KPConv Networks via A Wide & Deep Predictor0
Deep Learning-Based Detection of the Acute Respiratory Distress Syndrome: What Are the Models Learning?0
Synerise at RecSys 2021: Twitter user engagement prediction with a fast neural modelCode1
SFFDD: Deep Neural Network with Enriched Features for Failure Prediction with Its Application to Computer Disk Driver0
Unsupervised Continual Learning in Streaming Environments0
Feature Engineering for US State Legislative Hearings: Stance, Affiliation, Engagement and Absentees0
Generative Pre-Training from MoleculesCode1
Comparing Feature-Engineering and Feature-Learning Approaches for Multilingual Translationese Classification0
Beyond Glass-Box Features: Uncertainty Quantification Enhanced Quality Estimation for Neural Machine Translation0
A comparative study of six model complexity metrics to search for parsimonious models with GAparsimony R Package0
AutoSmart: An Efficient and Automatic Machine Learning framework for Temporal Relational DataCode1
Detecting Attacks on IoT Devices using Featureless 1D-CNN0
Data Science Kitchen at GermEval 2021: A Fine Selection of Hand-Picked Features, Delivered Fresh from the OvenCode0
RF-LighGBM: A probabilistic ensemble way to predict customer repurchase behaviour in community e-commerce0
Sequence-to-Sequence Learning with Latent Neural GrammarsCode1
Personality Trait Identification Using the Russian Feature Extraction Toolkit0
Precog-LTRC-IIITH at GermEval 2021: Ensembling Pre-Trained Language Models with Feature EngineeringCode0
Time Series Prediction using Deep Learning Methods in Healthcare0
Growing Cosine Unit: A Novel Oscillatory Activation Function That Can Speedup Training and Reduce Parameters in Convolutional Neural Networks0
End-To-End Anomaly Detection for Identifying Malicious Cyber Behavior through NLP-Based Log Embeddings0
PTRAIL -- A python package for parallel trajectory data preprocessingCode1
Towards Personalized and Human-in-the-Loop Document Summarization0
Data-driven Smart Ponzi Scheme Detection0
Graph Contrastive Learning for Anomaly DetectionCode1
Feature Engineering with Regularity StructuresCode0
Empirical Analysis on Effectiveness of NLP Methods for Predicting Code Smell0
Deep Learning Chromatic and Clique Numbers of GraphsCode0
Effective Model Integration Algorithm for Improving Link and Sign Prediction in Complex Networks0
Classification of Electrical Impedance Tomography Data Using Machine Learning0
Efficient Deep Feature Calibration for Cross-Modal Joint Embedding Learning0
Alejandro Mosquera at SemEval-2021 Task 1: Exploring Sentence and Word Features for Lexical Complexity Prediction0
CLULEX at SemEval-2021 Task 1: A Simple System Goes a Long Way0
A Plant Root System Algorithm Based on Swarm Intelligence for One-dimensional Biomedical Signal Feature Engineering0
AutoML Meets Time Series Regression Design and Analysis of the AutoSeries ChallengeCode0
Leaf-FM: A Learnable Feature Generation Factorization Machine for Click-Through Rate Prediction0
Multi-Perspective Content Delivery Networks Security Framework Using Optimized Unsupervised Anomaly Detection0
LocalGLMnet: interpretable deep learning for tabular data0
Establishing process-structure linkages using Generative Adversarial NetworksCode1
VolcanoML: Speeding up End-to-End AutoML via Scalable Search Space DecompositionCode1
Residual Attention Based Network for Automatic Classification of Phonation Modes0
Short-term Renewable Energy Forecasting in Greece using Prophet Decomposition and Tree-based EnsemblesCode1
Feature Cross Search via Submodular Optimization0
NOTE: Solution for KDD-CUP 2021 WikiKG90M-LSC0
A Data-Driven Method for Recognizing Automated Negotiation Strategies0
Free-Text Keystroke Dynamics for User Authentication0
Enhancing the Analysis of Software Failures in Cloud Computing Systems with Deep LearningCode1
Show:102550
← PrevPage 14 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified