| LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond | Oct 13, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 1 |
| RMB: Comprehensively Benchmarking Reward Models in LLM Alignment | Oct 13, 2024 | Benchmarking | Code Available | 1 |
| Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image Segmentation | Oct 11, 2024 | Benchmarking, Image Segmentation | Code Available | 1 |
| When Graph meets Multimodal: Benchmarking on Multimodal Attributed Graphs Learning | Oct 11, 2024 | Attribute, Benchmarking | Code Available | 1 |
| Towards Generalisable Time Series Understanding Across Domains | Oct 9, 2024 | Benchmarking, Time Series | Code Available | 1 |
| Entering Real Social World! Benchmarking the Social Intelligence of Large Language Models from a First-person Perspective | Oct 8, 2024 | Attribute, Benchmarking | Code Available | 1 |
| Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild | Oct 7, 2024 | Benchmarking, Mixture-of-Experts | Code Available | 1 |
| Large Scale MRI Collection and Segmentation of Cirrhotic Liver | Oct 6, 2024 | Benchmarking, Diagnostic | Code Available | 1 |
| Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic Planning | Oct 5, 2024 | Benchmarking, Drug Design | Code Available | 1 |
| EBES: Easy Benchmarking for Event Sequences | Oct 4, 2024 | Benchmarking | Code Available | 1 |