| AbdomenAtlas: A Large-Scale, Detailed-Annotated, & Multi-Center Dataset for Efficient Transfer Learning and Open Algorithmic Benchmarking | Jul 23, 2024 | BenchmarkingTransfer Learning | CodeCode Available | 3 |
| Flexible Generation of Preference Data for Recommendation Analysis | Jul 23, 2024 | BenchmarkingRecommendation Systems | CodeCode Available | 0 |
| Aggregated Attributions for Explanatory Analysis of 3D Segmentation Models | Jul 23, 2024 | BenchmarkingSegmentation | CodeCode Available | 0 |
| InLUT3D: Challenging real indoor dataset for point cloud analysis | Jul 22, 2024 | BenchmarkingScene Understanding | —Unverified | 0 |
| Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research | Jul 22, 2024 | Benchmarking | —Unverified | 0 |
| Benchmarks as Microscopes: A Call for Model Metrology | Jul 22, 2024 | Benchmarkingmodel | —Unverified | 0 |
| Cascaded two-stage feature clustering and selection via separability and consistency in fuzzy decision systems | Jul 22, 2024 | BenchmarkingClustering | —Unverified | 0 |
| LCA-on-the-Line: Benchmarking Out-of-Distribution Generalization with Class Taxonomies | Jul 22, 2024 | BenchmarkingOut-of-Distribution Generalization | CodeCode Available | 1 |
| StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation | Jul 22, 2024 | BenchmarkingText Generation | —Unverified | 0 |
| HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning | Jul 22, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |