SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables, called features, that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with observations in the rows and features in the columns.
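As a minimal sketch of gathering several tables into one feature table, the Python snippet below aggregates a child table into per-observation features and joins them onto a parent table. The table names, columns, and the `build_feature_table` helper are hypothetical and chosen only for illustration; real pipelines would typically use a dataframe library and problem-specific aggregations.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical parent table: one row per observation (a customer).
customers = [
    {"customer_id": 1, "signup_year": 2021},
    {"customer_id": 2, "signup_year": 2023},
]

# Hypothetical child table: many rows per customer.
transactions = [
    {"customer_id": 1, "amount": 20.0},
    {"customer_id": 1, "amount": 40.0},
    {"customer_id": 2, "amount": 15.0},
]


def build_feature_table(customers, transactions):
    """Return one row per customer with aggregated transaction features."""
    # Group transaction amounts by the customer they belong to.
    amounts = defaultdict(list)
    for t in transactions:
        amounts[t["customer_id"]].append(t["amount"])

    rows = []
    for c in customers:
        spent = amounts.get(c["customer_id"], [])
        rows.append({
            "customer_id": c["customer_id"],
            "signup_year": c["signup_year"],
            # Features derived from the child table.
            "n_transactions": len(spent),
            "total_amount": sum(spent),
            "mean_amount": mean(spent) if spent else 0.0,
        })
    return rows


feature_table = build_feature_table(customers, transactions)
for row in feature_table:
    print(row)
```

Each output row is one observation, and each key is one feature column. Customers with no transactions get zero-valued aggregates here, which is an assumption of this sketch rather than a general rule.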

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 301-350 of 1706 papers

Title | Status | Hype
Enhanced Convolution Neural Network with Optimized Pooling and Hyperparameter Tuning for Network Intrusion Detection | Code | 0
Reinforcement Feature Transformation for Polymer Property Performance Prediction | - | 0
A Feature Engineering Approach for Literary and Colloquial Tamil Speech Classification using 1D-CNN | - | 0
Investigation of Time-Frequency Feature Combinations with Histogram Layer Time Delay Neural Networks | - | 0
Machine Learning for Public Good: Predicting Urban Crime Patterns to Enhance Community Safety | - | 0
Leveraging Open-Source Large Language Models for Native Language Identification | - | 0
Learn2Aggregate: Supervised Generation of Chvátal-Gomory Cuts Using Graph Neural Networks | - | 0
HybridFC: A Hybrid Fact-Checking Approach for Knowledge Graphs | Code | 0
Machine Learning-Based Prediction of Key Genes Correlated to the Subretinal Lesion Severity in a Mouse Model of Age-Related Macular Degeneration | - | 0
IIFE: Interaction Information Based Automated Feature Engineering | Code | 0
Large Margin Prototypical Network for Few-shot Relation Classification with Fine-grained Features | - | 0
Application Research On Real-Time Perception Of Device Performance Status | - | 0
Leveraging Large Language Models through Natural Language Processing to provide interpretable Machine Learning predictions of mental deterioration in real time | - | 0
Hybridization of Persistent Homology with Neural Networks for Time-Series Prediction: A Case Study in Wave Height | - | 0
PoliPrompt: A High-Performance Cost-Effective LLM-Based Text Classification Framework for Political Science | - | 0
LSTM Recurrent Neural Networks for Cybersecurity Named Entity Recognition | - | 0
Enhancing Customer Churn Prediction in Telecommunications: An Adaptive Ensemble Learning Approach | - | 0
Android Malware Detection Based on RGB Images and Multi-feature Fusion | - | 0
gWaveNet: Classification of Gravity Waves from Noisy Satellite Data using Custom Kernel Integrated Deep Learning Method | Code | 0
Obfuscated Memory Malware Detection | - | 0
Improving Radiography Machine Learning Workflows via Metadata Management for Training Data Selection | - | 0
Graph Classification via Reference Distribution Learning: Theory and Practice | - | 0
Transfer Learning and the Early Estimation of Single-Photon Source Quality using Machine Learning Methods | Code | 0
Improved Differential Evolution based Feature Selection through Quantum, Chaos, and Lasso | - | 0
Understanding Generative AI Content with Embedding Models | - | 0
Augmenting train maintenance technicians with automated incident diagnostic suggestions | - | 0
EEG Right & Left Voluntary Hand Movement-based Virtual Brain-Computer Interfacing Keyboard Using Hybrid Deep Learning Approach | - | 0
Detecting Unsuccessful Students in Cybersecurity Exercises in Two Different Learning Environments | Code | 0
Improving VTE Identification through Language Models from Radiology Reports: A Comparative Study of Mamba, Phi-3 Mini, and BERT | - | 0
LOLgorithm: Integrating Semantic,Syntactic and Contextual Elements for Humor Classification | - | 0
Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach | - | 0
IBB Traffic Graph Data: Benchmarking and Road Traffic Prediction Model | - | 0
Improving Machine Learning Based Sepsis Diagnosis Using Heart Rate Variability | - | 0
AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models | Code | 0
Stochastic Parrots or ICU Experts? Large Language Models in Critical Care Medicine: A Scoping Review | - | 0
An Efficient and Flexible Deep Learning Method for Signal Delineation via Keypoints Estimation | - | 0
Self-Reasoning Assistant Learning for non-Abelian Gauge Fields Design | - | 0
Fever Detection with Infrared Thermography: Enhancing Accuracy through Machine Learning Techniques | - | 0
Evaluating Large Language Models for Anxiety and Depression Classification using Counseling and Psychotherapy Transcripts | Code | 0
Temperature Distribution Prediction in Laser Powder Bed Fusion using Transferable and Scalable Graph Neural Networks | - | 0
GraphGuard: Contrastive Self-Supervised Learning for Credit-Card Fraud Detection in Multi-Relational Dynamic Graphs | - | 0
Molecular Topological Profile (MOLTOP) -- Simple and Strong Baseline for Molecular Graph Classification | Code | 0
GPT Assisted Annotation of Rhetorical and Linguistic Features for Interpretable Propaganda Technique Detection in News Text | - | 0
Deep-Graph-Sprints: Accelerated Representation Learning in Continuous-Time Dynamic Graphs | - | 0
MERGE -- A Bimodal Audio-Lyrics Dataset for Static Music Emotion Recognition | - | 0
Advancing Automated Deception Detection: A Multimodal Approach to Feature Extraction and Analysis | Code | 0
Automating Venture Capital: Founder assessment using LLM-powered segmentation, feature engineering and automated labeling techniques | - | 0
GraphCNNpred: A stock market indices prediction using a Graph based deep learning system | - | 0
OSPC: Artificial VLM Features for Hateful Meme Detection | - | 0
PharmaGPT: Domain-Specific Large Language Models for Bio-Pharmaceutical and Chemistry | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CNN | 14 gestures accuracy | 0.98 | - | Unverified