SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 101125 of 1706 papers

TitleStatusHype
Discovering Neural WiringsCode1
BP-Net: Efficient Deep Learning for Continuous Arterial Blood Pressure Estimation using PhotoplethysmogramCode1
Can Models Help Us Create Better Models? Evaluating LLMs as Data ScientistsCode1
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?Code1
CASPR: Customer Activity Sequence-based Prediction and RepresentationCode1
Cardea: An Open Automated Machine Learning Framework for Electronic Health RecordsCode1
Deep Dive into Hunting for LotLs Using Machine Learning and Feature Engineering.Code1
Relational Deep Learning: Graph Representation Learning on Relational DatabasesCode1
Replay and Synthetic Speech Detection with Res2net ArchitectureCode1
CheXbert: Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERTCode1
Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global InferenceCode1
Compatible deep neural network framework with financial time series data, including data preprocessor, neural network model and trading strategyCode1
CodeCMR: Cross-Modal Retrieval For Function-Level Binary Source Code MatchingCode1
Cognitive Evolutionary Search to Select Feature Interactions for Click-Through Rate PredictionCode1
Simplified DOM Trees for Transferable Attribute Extraction from the WebCode1
SimTeG: A Frustratingly Simple Approach Improves Textual Graph LearningCode1
Context-Aware Deep Learning for Multi Modal Depression DetectionCode1
SMUTF: Schema Matching Using Generative Tags and Hybrid FeaturesCode1
Supervised Learning on Relational Databases with Graph Neural NetworksCode1
Symbolic regression for scientific discovery: an application to wind speed forecastingCode1
DeepSurv: Personalized Treatment Recommender System Using A Cox Proportional Hazards Deep Neural NetworkCode1
Synerise at RecSys 2021: Twitter user engagement prediction with a fast neural modelCode1
The Remarkable Robustness of LLMs: Stages of Inference?Code1
DiviK: Divisive intelligent K-Means for hands-free unsupervised clustering in big biological dataCode1
Modelling Context with User Embeddings for Sarcasm Detection in Social MediaCode1
Show:102550
← PrevPage 5 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified