EvoGP: A GPU-accelerated Framework for Tree-based Genetic Programming Jan 21, 2025 Feature Engineering GPU
Code Code Available 75 Baichuan 2: Open Large-scale Language Models Sep 19, 2023 Feature Engineering GSM8K
Code Code Available 45 TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks Jun 27, 2024 Feature Engineering Model Selection
Code Code Available 45 Fairness Implications of Encoding Protected Categorical Attributes Jan 27, 2022 Fairness Feature Engineering
Code Code Available 45 Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification Apr 16, 2024 Feature Engineering Language Modeling
Code Code Available 35 NeuralFoil: An Airfoil Aerodynamics Analysis Tool Using Physics-Informed Machine Learning Mar 20, 2025 Feature Engineering Physics-informed machine learning
Code Code Available 35 AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions Oct 27, 2024 Feature Engineering
Code Code Available 35 The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features Jan 6, 2025 Feature Engineering Time Series
Code Code Available 35 RelBench: A Benchmark for Deep Learning on Relational Databases Jul 29, 2024 Deep Learning Feature Engineering
Code Code Available 35 How Can Recommender Systems Benefit from Large Language Models: A Survey Jun 9, 2023 Ethics Feature Engineering
Code Code Available 35 Universal Time-Series Representation Learning: A Survey Jan 8, 2024 Feature Engineering Representation Learning
Code Code Available 35 DeepMol: An Automated Machine and Deep Learning Framework for Computational Chemistr Jun 1, 2024 Activity Prediction AutoML
Code Code Available 25 DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation Mar 18, 2024 Feature Engineering Image Manipulation
Code Code Available 25 TSFEL: Time Series Feature Extraction Library Mar 21, 2020 Feature Engineering Time Series
Code Code Available 25 LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers Mar 18, 2025 Automated Feature Engineering Feature Engineering
Code Code Available 25 Fraud Dataset Benchmark and Applications Aug 30, 2022 AutoML Feature Engineering
Code Code Available 25 OmniXAI: A Library for Explainable AI Jun 1, 2022 counterfactual Counterfactual Explanation
Code Code Available 25 MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving Sep 11, 2024 Autonomous Driving Feature Engineering
Code Code Available 25 DriveML: An R Package for Driverless Machine Learning May 1, 2020 AutoML BIG-bench Machine Learning
Code Code Available 15 DiviK: Divisive intelligent K-Means for hands-free unsupervised clustering in big biological data Sep 22, 2020 Clustering Feature Engineering
Code Code Available 15 Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation Dec 21, 2023 Edge Detection Feature Engineering
Code Code Available 15 Dimensionality Reduction of Longitudinal 'Omics Data using Modern Tensor Factorization Nov 28, 2021 Dimensionality Reduction Feature Engineering
Code Code Available 15 DeepFM: A Factorization-Machine based Neural Network for CTR Prediction Mar 13, 2017 Click-Through Rate Prediction Feature Engineering
Code Code Available 15 DeepSurv: Personalized Treatment Recommender System Using A Cox Proportional Hazards Deep Neural Network Jun 2, 2016 Feature Engineering Predicting Patient Outcomes
Code Code Available 15 DeltaPy: A Framework for Tabular Data Augmentation in Python May 22, 2020 BIG-bench Machine Learning Data Augmentation
Code Code Available 15 DIFER: Differentiable Automated Feature Engineering Oct 17, 2020 Automated Feature Engineering BIG-bench Machine Learning
Code Code Available 15 A Survey of Information Cascade Analysis: Models, Predictions, and Recent Advances May 22, 2020 Feature Engineering Marketing
Code Code Available 15 DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection Apr 1, 2023 Deep Learning Feature Engineering
Code Code Available 15 DoE2Vec: Deep-learning Based Features for Exploratory Landscape Analysis Mar 31, 2023 Deep Learning Feature Engineering
Code Code Available 15 Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees May 18, 2019 Feature Engineering Feature Importance
Code Code Available 15 Efficient End-to-End AutoML via Scalable Search Space Decomposition Jun 19, 2022 AutoML Feature Engineering
Code Code Available 15 CodeCMR: Cross-Modal Retrieval For Function-Level Binary Source Code Matching Dec 1, 2020 Computer Security Cross-Modal Retrieval
Code Code Available 15 CheXbert: Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT Apr 20, 2020 Feature Engineering
Code Code Available 15 Cognitive Evolutionary Search to Select Feature Interactions for Click-Through Rate Prediction Aug 1, 2023 Click-Through Rate Prediction Evolutionary Algorithms
Code Code Available 15 Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? Dec 1, 2020 Feature Engineering Q-Learning
Code Code Available 15 Classification of Raw MEG/EEG Data with Detach-Rocket Ensemble: An Improved ROCKET Algorithm for Multivariate Time Series Analysis Aug 5, 2024 Classification EEG
Code Code Available 15 Evaluation Toolkit For Robustness Testing Of Automatic Essay Scoring Systems Jul 14, 2020 Automated Essay Scoring Common Sense Reasoning
Code Code Available 15 Cardea: An Open Automated Machine Learning Framework for Electronic Health Records Oct 1, 2020 Automated Feature Engineering AutoML
Code Code Available 15 Compatible deep neural network framework with financial time series data, including data preprocessor, neural network model and trading strategy May 11, 2022 Binary Classification Feature Engineering
Code Code Available 15 BP-Net: Efficient Deep Learning for Continuous Arterial Blood Pressure Estimation using Photoplethysmogram Nov 29, 2021 Blood pressure estimation Feature Engineering
Code Code Available 15 Can Models Help Us Create Better Models? Evaluating LLMs as Data Scientists Oct 30, 2024 Feature Engineering
Code Code Available 15 An End-to-End Reinforcement Learning Approach for Job-Shop Scheduling Problems Based on Constraint Programming Jun 9, 2023 Combinatorial Optimization Feature Engineering
Code Code Available 15 Anomaly Detection for Solder Joints Using β-VAE Apr 24, 2021 Anomaly Detection Feature Engineering
Code Code Available 15 CASPR: Customer Activity Sequence-based Prediction and Representation Nov 16, 2022 Feature Engineering Prediction
Code Code Available 15 Classification of Periodic Variable Stars with Novel Cyclic-Permutation Invariant Neural Networks Nov 2, 2020 Astronomy Feature Engineering
Code Code Available 15 Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global Inference Dec 16, 2020 Feature Engineering Medical Question Answering
Code Code Available 15 Deep & Cross Network for Ad Click Predictions Aug 17, 2017 Click-Through Rate Prediction Feature Engineering
Code Code Available 15 Deep Dive into Hunting for LotLs Using Machine Learning and Feature Engineering. Apr 21, 2023 Feature Engineering
Code Code Available 15 A Data-Centric Perspective on Evaluating Machine Learning Models for Tabular Data Jul 2, 2024 Feature Engineering Hyperparameter Optimization
Code Code Available 15 A Hybrid Rule-Based and Neural Coreference Resolution System with an Evaluation on Dutch Literature Nov 1, 2021 coreference-resolution Coreference Resolution
Code Code Available 15