SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 401425 of 1706 papers

TitleStatusHype
Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak SupervisionCode0
CLRGaze: Contrastive Learning of Representations for Eye Movement SignalsCode0
Novelty Goes Deep. A Deep Neural Solution To Document Level Novelty DetectionCode0
Numeric Encoding Options with AutomungeCode0
AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language ModelsCode0
AutoLearn - Automated Feature Generation and SelectionCode0
CNN-LSTM Hybrid Model for AI-Driven Prediction of COVID-19 Severity from Spike Sequences and Clinical DataCode0
DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment AnalysisCode0
Data Science Kitchen at GermEval 2021: A Fine Selection of Hand-Picked Features, Delivered Fresh from the OvenCode0
Malware Makeover: Breaking ML-based Static Analysis by Modifying Executable BytesCode0
DDGK: Learning Graph Representations for Deep Divergence Graph KernelsCode0
PADME: A Deep Learning-based Framework for Drug-Target Interaction PredictionCode0
PathoLM: Identifying pathogenicity from the DNA sequence through the Genome Foundation ModelCode0
AutoFITS: Automatic Feature Engineering for Irregular Time SeriesCode0
An Embedding Learning Framework for Numerical Features in CTR PredictionCode0
CyberTronics at SemEval-2020 Task 12: Multilingual Offensive Language Identification over Social MediaCode0
Probabilistic Bag-Of-Hyperlinks Model for Entity LinkingCode0
Product-based Neural Networks for User Response Prediction over Multi-field Categorical DataCode0
Auto deep learning for bioacoustic signalsCode0
Advancing Automated Deception Detection: A Multimodal Approach to Feature Extraction and AnalysisCode0
PyRATA, Python Rule-based feAture sTructure AnalysisCode0
Quantifying yeast colony morphologies with feature engineering from time-lapse photographyCode0
Cross-type Biomedical Named Entity Recognition with Deep Multi-Task LearningCode0
Recurrent neural networks with specialized word embeddings for health-domain named-entity recognitionCode0
Danish Stance Classification and Rumour ResolutionCode0
Show:102550
← PrevPage 17 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified