SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 101150 of 1706 papers

TitleStatusHype
Risk Analysis of Flowlines in the Oil and Gas Sector: A GIS and Machine Learning ApproachCode0
Algorithmic Derivation of Human Spatial Navigation Indices From Eye Movement Data0
Challenges and recommendations for Electronic Health Records data extraction and preparation for dynamic prediction modelling in hospitalized patients -- a practical guide0
Dataset-Agnostic Recommender Systems0
Text to Band Gap: Pre-trained Language Models as Encoders for Semiconductor Band Gap PredictionCode0
The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple FeaturesCode3
Predicting Vulnerability to Malware Using Machine Learning Models: A Study on Microsoft Windows Machines0
Classification of Operational Records in Aviation Using Deep Learning Approaches0
Multi-Modal Video Feature Extraction for Popularity Prediction0
Dynamic Adaptation in Data Storage: Real-Time Machine Learning for Enhanced Prefetching0
Assets Forecasting with Feature Engineering and Transformation Methods for LightGBM0
Context-Aware Deep Learning for Multi Modal Depression DetectionCode1
Three-Class Text Sentiment Analysis Based on LSTM0
STAHGNet: Modeling Hybrid-grained Heterogenous Dependency Efficiently for Traffic Prediction0
Intelligent Approaches to Predictive Analytics in Occupational Health and Safety in India0
Risk-Adjusted Performance of Random Forest Models in High-Frequency Trading0
PCA-Featured Transformer for Jamming Detection in 5G UAV Networks0
Hunting Tomorrow's Leaders: Using Machine Learning to Forecast S&P 500 Additions & Removal0
GLARE: Google Apps Arabic Reviews DatasetCode0
F-RBA: A Federated Learning-based Framework for Risk-based Authentication0
S&P 500 Trend Prediction0
A Progressive Transformer for Unifying Binary Code Embedding and Knowledge Transfer0
Feature engineering vs. deep learning for paper section identification: Toward applications in Chinese medical literature0
Deep Learning-Based Noninvasive Screening of Type 2 Diabetes with Chest X-ray Images and Electronic Health RecordsCode0
Modeling Story Expectations to Understand Engagement: A Generative Framework Using LLMs0
Vision Transformers for Efficient Indoor Pathloss Radio Map Prediction0
Image-Based Malware Classification Using QR and Aztec Codes0
Robust Feature Engineering Techniques for Designing Efficient Motor Imagery-Based BCI-Systems0
RUL forecasting for wind turbine predictive maintenance based on deep learning0
PRECISE: Pre-training Sequential Recommenders with Collaborative and Semantic Information0
Parkinson's Disease Diagnosis Through Deep Learning: A Novel LSTM-Based Approach for Freezing of Gait Detection0
Federated Automated Feature Engineering0
Deep Learning in Single-Cell and Spatial Transcriptomics Data Analysis: Advances and Challenges from a Data Science Perspective0
Comparative Performance of Machine Learning Algorithms for Early Genetic Disorder and Subclass Classification0
Intelligent Spark Agents: A Modular LangGraph Framework for Scalable, Visualized, and Enhanced Big Data Machine Learning Workflows0
HiCat: A Semi-Supervised Approach for Cell Type Annotation0
An AutoML-based approach for Network Intrusion Detection0
Understanding LLM Embeddings for Regression0
Enhancing Molecular Design through Graph-based Topological Reinforcement Learning0
Advancing Heatwave Forecasting via Distribution Informed-Graph Neural Networks (DI-GNNs): Integrating Extreme Value Theory with GNNs0
Graph Neural Networks for Quantifying Compatibility Mechanisms in Traditional Chinese MedicineCode1
Is Precise Recovery Necessary? A Task-Oriented Imputation Approach for Time Series Forecasting on Variable Subset0
What makes a good BIM design: quantitative linking between design behavior and quality0
GPTree: Towards Explainable Decision-Making via LLM-powered Decision Trees0
Large Language Models for Constructing and Optimizing Machine Learning Workflows: A SurveyCode0
Classification of residential and non-residential buildings based on satellite data using deep learning0
RAGulator: Lightweight Out-of-Context Detectors for Grounded Text Generation0
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level0
Correlation of Object Detection Performance with Visual Saliency and Depth EstimationCode0
Exploring Feature Importance and Explainability Towards Enhanced ML-Based DoS Detection in AI Systems0
Show:102550
← PrevPage 3 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified