| HERMES: Holographic Equivariant neuRal network model for Mutational Effect and Stability prediction | Jul 9, 2024 | Benchmarking | CodeCode Available | 0 |
| CodeUpdateArena: Benchmarking Knowledge Editing on API Updates | Jul 8, 2024 | Benchmarkingknowledge editing | CodeCode Available | 1 |
| Simulation-based Benchmarking for Causal Structure Learning in Gene Perturbation Experiments | Jul 8, 2024 | BenchmarkingDecision Making | CodeCode Available | 0 |
| OpenCIL: Benchmarking Out-of-Distribution Detection in Class-Incremental Learning | Jul 8, 2024 | Benchmarkingclass-incremental learning | CodeCode Available | 1 |
| GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation | Jul 8, 2024 | BenchmarkingGraph Embedding | —Unverified | 0 |
| TARGO: Benchmarking Target-driven Object Grasping under Occlusions | Jul 8, 2024 | BenchmarkingObject | —Unverified | 0 |
| A Benchmark for Multi-speaker Anonymization | Jul 8, 2024 | BenchmarkingDisentanglement | —Unverified | 0 |
| MERGE -- A Bimodal Audio-Lyrics Dataset for Static Music Emotion Recognition | Jul 8, 2024 | BenchmarkingDeep Learning | —Unverified | 0 |
| Replication in Visual Diffusion Models: A Survey and Outlook | Jul 7, 2024 | BenchmarkingSurvey | CodeCode Available | 1 |
| Rethinking the Effectiveness of Graph Classification Datasets in Benchmarks for Assessing GNNs | Jul 6, 2024 | BenchmarkingDataset Generation | CodeCode Available | 0 |