| Mol-MoE: Training Preference-Guided Routers for Molecule Generation | Feb 8, 2025 | BenchmarkingDrug Design | CodeCode Available | 0 |
| Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks | Jul 17, 2024 | Adversarial RobustnessBenchmarking | CodeCode Available | 0 |
| Fine-grained Hand Gesture Recognition in Multi-viewpoint Hand Hygiene | Sep 7, 2021 | BenchmarkingFine-Grained Image Recognition | CodeCode Available | 0 |
| Moment Matching for Multi-Source Domain Adaptation | Dec 4, 2018 | BenchmarkingDomain Adaptation | CodeCode Available | 0 |
| Benchmarking Robustness to Text-Guided Corruptions | Apr 6, 2023 | BenchmarkingData Augmentation | CodeCode Available | 0 |
| Fine-grained Entity Recognition with Reduced False Negatives and Large Type Coverage | Apr 30, 2019 | Benchmarking | CodeCode Available | 0 |
| Finding the Perfect Fit: Applying Regression Models to ClimateBench v1.0 | Aug 23, 2023 | Benchmarkingregression | CodeCode Available | 0 |
| Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted Data | Sep 24, 2024 | BenchmarkingDepth Estimation | CodeCode Available | 0 |
| Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving | Mar 20, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 0 |
| Scission: Performance-driven and Context-aware Cloud-Edge Distribution of Deep Neural Networks | Aug 8, 2020 | BenchmarkingDecision Making | CodeCode Available | 0 |
| ALDI++: Automatic and parameter-less discord and outlier detection for building energy load profiles | Mar 13, 2022 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 0 |
| Benchmarking Robustness in Object Detection: Autonomous Driving when Winter is Coming | Jul 17, 2019 | Autonomous DrivingBenchmarking | CodeCode Available | 0 |
| Motley: Benchmarking Heterogeneity and Personalization in Federated Learning | Jun 18, 2022 | BenchmarkingFairness | CodeCode Available | 0 |
| ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning | May 30, 2023 | BenchmarkingIn-Context Learning | CodeCode Available | 0 |
| Benchmarking Retinal Blood Vessel Segmentation Models for Cross-Dataset and Cross-Disease Generalization | Jun 21, 2024 | BenchmarkingSegmentation | CodeCode Available | 0 |
| The Role of Model Architecture and Scale in Predicting Molecular Properties: Insights from Fine-Tuning RoBERTa, BART, and LLaMA | May 2, 2024 | BenchmarkingDrug Discovery | CodeCode Available | 0 |
| AutoJudger: An Agent-Driven Framework for Efficient Benchmarking of MLLMs | May 27, 2025 | BenchmarkingQuestion Selection | CodeCode Available | 0 |
| Benchmarking Representation Learning for Natural World Image Collections | Mar 30, 2021 | BenchmarkingBinary Classification | CodeCode Available | 0 |
| Benchmarking Reinforcement Learning Algorithms on Real-World Robots | Sep 20, 2018 | Benchmarkingcontinuous-control | CodeCode Available | 0 |
| Benchmarking Quantum Reinforcement Learning | Jan 27, 2025 | Benchmarkingreinforcement-learning | CodeCode Available | 0 |
| MSAMSum: Towards Benchmarking Multi-lingual Dialogue Summarization | May 1, 2022 | Benchmarkingdialogue summary | CodeCode Available | 0 |
| Alchemy: A Quantum Chemistry Dataset for Benchmarking AI Models | Jun 22, 2019 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 0 |
| FHBench: Towards Efficient and Personalized Federated Learning for Multimodal Healthcare | Apr 15, 2025 | BenchmarkingDiagnostic | CodeCode Available | 0 |
| Benchmarking quantum machine learning kernel training for classification tasks | Aug 17, 2024 | BenchmarkingQuantum Machine Learning | CodeCode Available | 0 |
| The Saudi Privacy Policy Dataset | Apr 5, 2023 | Benchmarking | CodeCode Available | 0 |