SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 101125 of 1706 papers

TitleStatusHype
Risk Analysis of Flowlines in the Oil and Gas Sector: A GIS and Machine Learning ApproachCode0
Algorithmic Derivation of Human Spatial Navigation Indices From Eye Movement Data0
Challenges and recommendations for Electronic Health Records data extraction and preparation for dynamic prediction modelling in hospitalized patients -- a practical guide0
Dataset-Agnostic Recommender Systems0
Text to Band Gap: Pre-trained Language Models as Encoders for Semiconductor Band Gap PredictionCode0
The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple FeaturesCode3
Predicting Vulnerability to Malware Using Machine Learning Models: A Study on Microsoft Windows Machines0
Classification of Operational Records in Aviation Using Deep Learning Approaches0
Multi-Modal Video Feature Extraction for Popularity Prediction0
Dynamic Adaptation in Data Storage: Real-Time Machine Learning for Enhanced Prefetching0
Assets Forecasting with Feature Engineering and Transformation Methods for LightGBM0
Context-Aware Deep Learning for Multi Modal Depression DetectionCode1
Three-Class Text Sentiment Analysis Based on LSTM0
STAHGNet: Modeling Hybrid-grained Heterogenous Dependency Efficiently for Traffic Prediction0
Intelligent Approaches to Predictive Analytics in Occupational Health and Safety in India0
Risk-Adjusted Performance of Random Forest Models in High-Frequency Trading0
PCA-Featured Transformer for Jamming Detection in 5G UAV Networks0
Hunting Tomorrow's Leaders: Using Machine Learning to Forecast S&P 500 Additions & Removal0
GLARE: Google Apps Arabic Reviews DatasetCode0
F-RBA: A Federated Learning-based Framework for Risk-based Authentication0
S&P 500 Trend Prediction0
Feature engineering vs. deep learning for paper section identification: Toward applications in Chinese medical literature0
A Progressive Transformer for Unifying Binary Code Embedding and Knowledge Transfer0
Deep Learning-Based Noninvasive Screening of Type 2 Diabetes with Chest X-ray Images and Electronic Health RecordsCode0
Modeling Story Expectations to Understand Engagement: A Generative Framework Using LLMs0
Show:102550
← PrevPage 5 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified