SOTAVerified

Feature Engineering

Feature engineering is the process of taking a dataset and constructing explanatory variables — features — that can be used to train a machine learning model for a prediction problem. Often, data is spread across multiple tables and must be gathered into a single table with rows containing the observations and features in the columns.

The traditional approach to feature engineering is to build features one at a time using domain knowledge, a tedious, time-consuming, and error-prone process known as manual feature engineering. The code for manual feature engineering is problem-dependent and must be re-written for each new dataset.

Papers

Showing 15761600 of 1706 papers

TitleStatusHype
RELand: Risk Estimation of Landmines via Interpretable Invariant Risk MinimizationCode0
Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak SupervisionCode0
Neural Vector Spaces for Unsupervised Information RetrievalCode0
Large Language Models Engineer Too Many Simple Features For Tabular DataCode0
Boosting Relational Deep Learning with Pretrained Tabular ModelsCode0
Large Language Models for Constructing and Optimizing Machine Learning Workflows: A SurveyCode0
Neural Word Segmentation Learning for ChineseCode0
Active DOP: A constituency treebank annotation tool with online learningCode0
Attention-based Neural Text SegmentationCode0
Relation Classification via Recurrent Neural NetworkCode0
xDeepInt: a hybrid architecture for modeling the vector-wise and bit-wise feature interactionsCode0
Extracting Parallel Sentences with Bidirectional Recurrent Neural Networks to Improve Machine TranslationCode0
Small Language Models for Tabular DataCode0
Extracting Relational Facts by an End-to-End Neural Model with Copy MechanismCode0
Large-Scale Multi-Domain Recommendation: an Automatic Domain Feature Extraction and Personalized Integration FrameworkCode0
Attention-Based Convolutional Neural Network for Semantic Relation ExtractionCode0
Extreme Learning Machine for the Characterization of Anomalous Diffusion from Single TrajectoriesCode0
FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure ModesCode0
Fair multilingual vandalism detection system for WikipediaCode0
Cross-lingual Knowledge Graph Alignment via Graph Convolutional NetworksCode0
A Generalized Flow for B2B Sales Predictive Modeling: An Azure Machine Learning ApproachCode0
ATM: A distributed, collaborative, scalable system for automated machine learningCode0
False Information on Web and Social Media: A SurveyCode0
SMILES2Vec: An Interpretable General-Purpose Deep Neural Network for Predicting Chemical PropertiesCode0
Correlation of Object Detection Performance with Visual Saliency and Depth EstimationCode0
Show:102550
← PrevPage 64 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CNN14 gestures accuracy0.98Unverified