SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 751800 of 1706 papers

TitleStatusHype
Machine Learning Applications on Neuroimaging for Diagnosis and Prognosis of Epilepsy: A Review0
Global Earth Magnetic Field Modeling and Forecasting with Spherical Harmonics Decomposition0
MalNet: A Large-Scale Image Database of Malicious SoftwareCode1
Importance of feature engineering and database selection in a machine learning model: A case study on carbon crystal structures0
Machine Learning for the Detection and Identification of Internet of Things (IoT) Devices: A Survey0
Machine Learning in LiDAR 3D point clouds0
Knowledge-Preserving Incremental Social Event Detection via Heterogeneous GNNsCode1
The Challenges of Persian User-generated Textual Content: A Machine Learning-Based ApproachCode1
Intelligent Icing Detection Model of Wind Turbine Blades Based on SCADA data0
Electrocardiogram Classification and Visual Diagnosis of Atrial Fibrillation with DenseECG0
MONAH: Multi-Modal Narratives for Humans to analyze conversationsCode0
A Survey on Extraction of Causal Relations from Natural Language Text0
Condition Assessment of Stay Cables through Enhanced Time Series Classification Using a Deep Learning ApproachCode0
Summaformers @ LaySumm 20, LongSumm 20Code1
Symmetry-adapted graph neural networks for constructing molecular dynamics force fields0
Simplified DOM Trees for Transferable Attribute Extraction from the WebCode1
Statistical learning for accurate and interpretable battery lifetime predictionCode1
Improving DGA-Based Malicious Domain Classifiers for Malware Defense with Adversarial Machine Learning0
Detecting Singleton Spams in Reviews via Learning Deep Anomalous Temporal Aspect-Sentiment PatternsCode0
String Theory: Parsed Categoric Encodings with Automunge0
A Numbers Game: Numeric Encoding Options with Automunge0
Reusing Preprocessing Data as Auxiliary Supervision in Conversational Analysis0
Simple deductive reasoning tests and data sets for exposing limitation of today's deep neural networks0
Enhancing Sindhi Word Segmentation using Subword Representation Learning and Position-aware Self-attention0
Advances in deep learning methods for pavement surface crack detection and identification with visible light visual imagesCode0
Shape-based Feature Engineering for Solar Flare Prediction0
Explainable Multi-class Classification of Medical Data0
AutonoML: Towards an Integrated Framework for Autonomous Machine LearningCode0
Intelligent Vector-based Customer Segmentation in the Banking Industry0
Unboxing Engagement in YouTube Influencer Videos: An Attention-Based Approach0
Machine Learning for Detecting Data Exfiltration: A Review0
Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global InferenceCode1
An Embedding Learning Framework for Numerical Features in CTR PredictionCode0
Semantic Annotation for Tabular Data0
Binary Black-box Evasion Attacks Against Deep Learning-based Static Malware Detectors with Adversarial Byte-Level Language ModelCode1
Enabling Collaborative Data Science Development with the Ballet FrameworkCode1
Yelp Review Rating Prediction: Machine Learning and Deep Learning ModelsCode1
Repurposing recidivism models for forecasting police officer use of forceCode0
3D Bounding Box Detection in Volumetric Medical Image Data: A Systematic Literature Review0
AI-enabled Prediction of eSports Player Performance Using the Data from Heterogeneous SensorsCode0
RotNet: Fast and Scalable Estimation of Stellar Rotation Periods Using Convolutional Neural Networks0
A Novel Approach to Radiometric IdentificationCode0
Leveraging Latent Representations of Speech for Indian Language Identification0
BertAA : BERT fine-tuning for Authorship Attribution0
Neural Automated Essay Scoring Incorporating Handcrafted Features0
CyberTronics at SemEval-2020 Task 12: Multilingual Offensive Language Identification over Social MediaCode0
Classifying Malware Using Function Representations in a Static Call Graph0
CodeCMR: Cross-Modal Retrieval For Function-Level Binary Source Code MatchingCode1
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?Code1
Short-Term Load Forecasting using Bi-directional Sequential Models and Feature Engineering for Small DatasetsCode1
Show:102550
← PrevPage 16 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified