SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 251275 of 1706 papers

TitleStatusHype
F-RBA: A Federated Learning-based Framework for Risk-based Authentication0
A Progressive Transformer for Unifying Binary Code Embedding and Knowledge Transfer0
Feature engineering vs. deep learning for paper section identification: Toward applications in Chinese medical literature0
Deep Learning-Based Noninvasive Screening of Type 2 Diabetes with Chest X-ray Images and Electronic Health RecordsCode0
Modeling Story Expectations to Understand Engagement: A Generative Framework Using LLMs0
Vision Transformers for Efficient Indoor Pathloss Radio Map Prediction0
Image-Based Malware Classification Using QR and Aztec Codes0
Robust Feature Engineering Techniques for Designing Efficient Motor Imagery-Based BCI-Systems0
RUL forecasting for wind turbine predictive maintenance based on deep learning0
PRECISE: Pre-training Sequential Recommenders with Collaborative and Semantic Information0
Parkinson's Disease Diagnosis Through Deep Learning: A Novel LSTM-Based Approach for Freezing of Gait Detection0
Federated Automated Feature Engineering0
Deep Learning in Single-Cell and Spatial Transcriptomics Data Analysis: Advances and Challenges from a Data Science Perspective0
Comparative Performance of Machine Learning Algorithms for Early Genetic Disorder and Subclass Classification0
Intelligent Spark Agents: A Modular LangGraph Framework for Scalable, Visualized, and Enhanced Big Data Machine Learning Workflows0
HiCat: A Semi-Supervised Approach for Cell Type Annotation0
An AutoML-based approach for Network Intrusion Detection0
Enhancing Molecular Design through Graph-based Topological Reinforcement Learning0
Understanding LLM Embeddings for Regression0
Advancing Heatwave Forecasting via Distribution Informed-Graph Neural Networks (DI-GNNs): Integrating Extreme Value Theory with GNNs0
Is Precise Recovery Necessary? A Task-Oriented Imputation Approach for Time Series Forecasting on Variable Subset0
What makes a good BIM design: quantitative linking between design behavior and quality0
GPTree: Towards Explainable Decision-Making via LLM-powered Decision Trees0
Classification of residential and non-residential buildings based on satellite data using deep learning0
Large Language Models for Constructing and Optimizing Machine Learning Workflows: A SurveyCode0
Show:102550
← PrevPage 11 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified