SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 12011250 of 1706 papers

TitleStatusHype
Unity in Diversity: A Unified Parsing Strategy for Major Indian Languages0
Universal Reusability in Recommender Systems: The Case for Dataset- and Task-Independent Frameworks0
Unleashing the Power of Pre-trained Encoders for Universal Adversarial Attack Detection0
Unlocking New York City Crime Insights using Relational Database Embeddings0
Unraveling Cold Start Enigmas in Predictive Analytics for OTT Media: Synergistic Meta-Insights and Multimodal Ensemble Mastery0
Unraveling the Key of Machine Learning Solutions for Android Malware Detection0
Unsupervised Abbreviation Detection in Clinical Narratives0
Unsupervised Concept-to-text Generation with Hypergraphs0
Unsupervised Continual Learning in Streaming Environments0
Unsupervised Learning of Prototypical Fillers for Implicit Semantic Role Labeling0
Unsupervised Machine Learning for Networking: Techniques, Applications and Research Challenges0
Unsupervised Multi-modal Feature Alignment for Time Series Representation Learning0
UofL at SemEval-2016 Task 4: Multi Domain word2vec for Twitter Sentiment Classification0
User-click Modelling for Predicting Purchase Intent0
Using Person Embedding to Enrich Features and Data Augmentation for Classification0
USTHB at NADI 2023 shared task: Exploring Preprocessing and Feature Engineering Strategies for Arabic Dialect Identification0
Utilizing the LightGBM Algorithm for Operator User Credit Assessment Research0
Varying Linguistic Purposes of Emoji in (Twitter) Context0
Vibration fault detection in wind turbines based on normal behaviour models without feature engineering0
Unboxing Engagement in YouTube Influencer Videos: An Attention-Based Approach0
Vision Transformers for Efficient Indoor Pathloss Radio Map Prediction0
Vital Node Identification in Complex Networks Using a Machine Learning-Based Approach0
WBI-NER: The impact of domain-specific features on the performance of identifying and classifying mentions of drugs0
Wearable-based behaviour interpolation for semi-supervised human activity recognition0
Web Content Extraction - a Meta-Analysis of its Past and Thoughts on its Future0
Weisfeiler-Lehman Embedding for Molecular Graph Neural Networks0
What can we learn from quantum convolutional neural networks?0
What makes a good BIM design: quantitative linking between design behavior and quality0
What Makes Word-level Neural Machine Translation Hard: A Case Study on English-German Translation0
When Did that Happen? --- Linking Events and Relations to Timestamps0
WHUNlp at SemEval-2016 Task DiMSUM: A Pilot Study in Detecting Minimal Semantic Units and their Meanings using Supervised Models0
Word and Document Embeddings based on Neural Network Approaches0
Word-Based Dialog State Tracking with Recurrent Neural Networks0
Word embeddings and discourse information for Quality Estimation0
Word Embedding Techniques for Classification of Star Ratings0
Word Embedding Techniques for Malware Evolution Detection0
Word Translation Prediction for Morphologically Rich Languages with Bilingual Neural Networks0
Year-over-Year Developments in Financial Fraud Detection via Deep Learning: A Systematic Literature Review0
Your Instructions Are Not Always Helpful: Assessing the Efficacy of Instruction Fine-tuning for Software Vulnerability Detection0
Zara: A Virtual Interactive Dialogue System Incorporating Emotion, Sentiment and Personality Recognition0
LSTM Shift-Reduce CCG Parsing0
Deep learning approach to control of prosthetic hands with electromyography signals0
IBB Traffic Graph Data: Benchmarking and Road Traffic Prediction Model0
IISE PG&E Energy Analytics Challenge 2025: Hourly-Binned Regression Models Beat Transformers in Load Forecasting0
360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation0
3D Bounding Box Detection in Volumetric Medical Image Data: A Systematic Literature Review0
Aalto's End-to-End DNN systems for the INTERSPEECH 2020 Computational Paralinguistics Challenge0
A bag-of-concepts model improves relation extraction in a narrow knowledge domain with limited data0
A Benchmark Dataset for Tornado Detection and Prediction using Full-Resolution Polarimetric Weather Radar Data0
A Blockchain Transaction Graph based Machine Learning Method for Bitcoin Price Prediction0
Show:102550
← PrevPage 25 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified