| Benchmarking Omni-Vision Representation through the Lens of Visual Realms | Jul 14, 2022 | BenchmarkingContrastive Learning | CodeCode Available | 1 | 5 |
| Neural Multi-Hop Reasoning With Logical Rules on Biomedical Knowledge Graphs | Mar 18, 2021 | BenchmarkingKnowledge Graphs | CodeCode Available | 1 | 5 |
| EXPObench: Benchmarking Surrogate-based Optimisation Algorithms on Expensive Black-box Functions | Jun 8, 2021 | Bayesian OptimisationBenchmarking | CodeCode Available | 1 | 5 |
| DLBacktrace: A Model Agnostic Explainability for any Deep Learning Models | Nov 19, 2024 | BenchmarkingDeep Learning | CodeCode Available | 1 | 5 |
| dMelodies: A Music Dataset for Disentanglement Learning | Jul 29, 2020 | BenchmarkingDisentanglement | CodeCode Available | 1 | 5 |
| Benchmarking Offline Reinforcement Learning on Real-Robot Hardware | Jul 28, 2023 | Benchmarkingreinforcement-learning | CodeCode Available | 1 | 5 |
| Benchmarking Object Detectors with COCO: A New Path Forward | Mar 27, 2024 | BenchmarkingObject | CodeCode Available | 1 | 5 |
| Benchmarking Generated Poses: How Rational is Structure-based Drug Design with Generative Models? | Aug 14, 2023 | BenchmarkingDrug Design | CodeCode Available | 1 | 5 |
| Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization | Nov 15, 2023 | BenchmarkingInstruction Following | CodeCode Available | 1 | 5 |
| Benchmarking: Past, Present and Future | Aug 1, 2021 | BenchmarkingReading Comprehension | CodeCode Available | 1 | 5 |