| Cable Tree Wiring -- Benchmarking Solvers on a Real-World Scheduling Problem with a Variety of Precedence Constraints | Nov 25, 2020 | BenchmarkingScheduling | CodeCode Available | 0 | 5 |
| Inverse Contextual Bandits: Learning How Behavior Evolves over Time | Jul 13, 2021 | BenchmarkingDecision Making | CodeCode Available | 0 | 5 |
| Introducing SLAMBench, a performance and accuracy benchmarking methodology for SLAM | Oct 8, 2014 | Benchmarking | CodeCode Available | 0 | 5 |
| InViG: Benchmarking Interactive Visual Grounding with 500K Human-Robot Interactions | Oct 18, 2023 | BenchmarkingVisual Grounding | CodeCode Available | 0 | 5 |
| B-XAIC Dataset: Benchmarking Explainable AI for Graph Neural Networks Using Chemical Data | May 28, 2025 | BenchmarkingDrug Discovery | CodeCode Available | 0 | 5 |
| INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion Recognition | Jun 10, 2024 | BenchmarkingEmotion Recognition | CodeCode Available | 0 | 5 |
| Analysis | OPEN | Published: 17 June 2019 Multitask learning and benchmarking with clinical time series data | Jun 17, 2019 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 0 | 5 |
| Building Conformal Prediction Intervals with Approximate Message Passing | Oct 21, 2024 | BenchmarkingConformal Prediction | CodeCode Available | 0 | 5 |
| Building and benchmarking an Arabic Speech Commands dataset for small-footprint keyword spotting | May 7, 2021 | BenchmarkingDeep Learning | CodeCode Available | 0 | 5 |
| Adaptive Visual Scene Understanding: Incremental Scene Graph Generation | Oct 2, 2023 | BenchmarkingContinual Learning | CodeCode Available | 0 | 5 |
| Integrating Expert Knowledge into Logical Programs via LLMs | Feb 17, 2025 | BenchmarkingLogical Reasoning | CodeCode Available | 0 | 5 |
| Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The Benchmark | May 9, 2016 | BenchmarkingEmotion Recognition | CodeCode Available | 0 | 5 |
| ColorGrid: A Multi-Agent Non-Stationary Environment for Goal Inference and Assistance | Jan 17, 2025 | BenchmarkingMulti-agent Reinforcement Learning | CodeCode Available | 0 | 5 |
| Integration of nested cross-validation, automated hyperparameter optimization, high-performance computing to reduce and quantify the variance of test performance estimation of deep learning models | Mar 11, 2025 | BenchmarkingHyperparameter Optimization | CodeCode Available | 0 | 5 |
| Bugs in the Data: How ImageNet Misrepresents Biodiversity | Aug 24, 2022 | BenchmarkingObject Detection | CodeCode Available | 0 | 5 |
| CleanPatrick: A Benchmark for Image Data Cleaning | May 16, 2025 | BenchmarkingLabel Error Detection | CodeCode Available | 0 | 5 |
| BubGAN: Bubble Generative Adversarial Networks for Synthesizing Realistic Bubbly Flow Images | Sep 7, 2018 | Benchmarking | CodeCode Available | 0 | 5 |
| InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition | Dec 23, 2021 | BenchmarkingDeep Learning | CodeCode Available | 0 | 5 |
| bsnsing: A decision tree induction method based on recursive optimal boolean rule composition | May 30, 2022 | Benchmarking | CodeCode Available | 0 | 5 |
| BSBench: will your LLM find the largest prime number? | Jun 5, 2025 | Benchmarking | CodeCode Available | 0 | 5 |
| Adaptive Shrinkage Estimation For Personalized Deep Kernel Regression In Modeling Brain Trajectories | Apr 10, 2025 | Additive modelsBenchmarking | CodeCode Available | 0 | 5 |
| inMOTIFin: a lightweight end-to-end simulation software for regulatory sequences | Jun 25, 2025 | Benchmarking | CodeCode Available | 0 | 5 |
| Towards Learning Universal, Regional, and Local Hydrological Behaviors via Machine-Learning Applied to Large-Sample Datasets | Jul 19, 2019 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 0 | 5 |
| Bridging the Generalisation Gap: Synthetic Data Generation for Multi-Site Clinical Model Validation | Apr 29, 2025 | BenchmarkingFairness | CodeCode Available | 0 | 5 |
| Adaptive Power System Emergency Control using Deep Reinforcement Learning | Mar 9, 2019 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |