SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 501550 of 1706 papers

TitleStatusHype
Deep Interaction Machine: A Simple but Effective Model for High-order Feature Interactions0
ADSAGE: Anomaly Detection in Sequences of Attributed Graph Edges applied to insider threat detection at fine-grained level0
A Brief Survey of Machine Learning Methods for Emotion Prediction using Physiological Data0
Decoding and interpreting cortical signals with a compact convolutional neural network0
A Survey on Data Collection for Machine Learning: a Big Data -- AI Integration Perspective0
Deep Learning based, end-to-end metaphor detection in Greek language with Recurrent and Convolutional Neural Networks0
Decision Trees That Remember: Gradient-Based Learning of Recurrent Decision Trees with Memory0
Decision Tree Based Wrappers for Hearing Loss0
A Survey on Data-Centric AI: Tabular Learning from Reinforcement Learning and Generative AI Perspective0
Amrita_CEN at SemEval-2022 Task 6: A Machine Learning Approach for Detecting Intended Sarcasm using Oversampling0
Deceptive Review Spam Detection via Exploiting Task Relatedness and Unlabeled Data0
A Survey on Churn Analysis0
DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison0
A Survey on Arabic Named Entity Recognition: Past, Recent Advances, and Future Trends0
Amrita_CEN at SemEval-2022 Task 4: Oversampling-based Machine Learning Approach for Detecting Patronizing and Condescending Language0
Data Smashing 2.0: Sequence Likelihood (SL) Divergence For Fast Time Series Comparison0
Dataset-Agnostic Recommender Systems0
Dataiku's Solution to SPHERE's Activity Recognition Challenge0
Data-driven Smart Ponzi Scheme Detection0
Data-Driven Investigative Journalism For Connectas Dataset0
Enhancing Sindhi Word Segmentation using Subword Representation Learning and Position-aware Self-attention0
Data-driven intelligent computational design for products: Method, techniques, and applications0
Data Collection and Quality Challenges in Deep Learning: A Data-Centric AI Perspective0
A Study of Variable-Role-based Feature Enrichment in Neural Models of Code0
A Model of Coherence Based on Distributed Sentence Representation0
A Defensive Framework Against Adversarial Attacks on Machine Learning-Based Network Intrusion Detection Systems0
DAG-based Long Short-Term Memory for Neural Word Segmentation0
Customer Support Ticket Escalation Prediction using Feature Engineering0
A strong baseline for question relevancy ranking0
AMEIR: Automatic Behavior Modeling, Interaction Exploration and MLP Investigation in the Recommender System0
Customers Churn Prediction in Financial Institution Using Artificial Neural Network0
Customer Lifetime Value in Video Games Using Deep Learning and Parametric Models0
A streamable large-scale clinical EEG dataset for Deep Learning0
Cuffless Blood Pressure Estimation from Electrocardiogram and Photoplethysmogram Using Waveform Based ANN-LSTM Network0
CTSys at SemEval-2018 Task 3: Irony in Tweets0
A State-of-the-Art Mention-Pair Model for Coreference Resolution0
AMC-Net: An Effective Network for Automatic Modulation Classification0
A Deep Representation Empowered Distant Supervision Paradigm for Clinical Information Extraction0
Democratizing AI: Non-expert design of prediction tasks0
A Stacking Gated Neural Architecture for Implicit Discourse Relation Classification0
Cross-lingual Short-text Matching with Deep Learning0
AssistedDS: Benchmarking How External Domain Knowledge Assists LLMs in Automated Data Science0
A machine learning model for identifying cyclic alternating patterns in the sleeping brain0
Cross-Lingual Induction and Transfer of Verb Classes Based on Word Vector Space Specialisation0
Cross-Class Relevance Learning for Temporal Concept Localization0
Assets Forecasting with Feature Engineering and Transformation Methods for LightGBM0
Credit card fraud detection using machine learning: A survey0
Coupled IGMM-GANs for deep multimodal anomaly detection in human mobility data0
Asset price movement prediction using empirical mode decomposition and Gaussian mixture models0
A Machine Learning Approach to Digital Contact Tracing: TC4TL Challenge0
Show:102550
← PrevPage 11 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified