| Categorization of 33 computational methods to detect spatially variable genes from spatially resolved transcriptomics data | May 29, 2024 | BenchmarkingSpecificity | —Unverified | 0 |
| MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions | May 29, 2024 | BenchmarkingDialogue Understanding | CodeCode Available | 1 |
| Benchmarking and Improving Detail Image Caption | May 29, 2024 | BenchmarkingImage Captioning | CodeCode Available | 2 |
| Quantitative Certification of Bias in Large Language Models | May 29, 2024 | Benchmarking | CodeCode Available | 1 |
| MDIW-13: a New Multi-Lingual and Multi-Script Database and Benchmark for Script Identification | May 29, 2024 | Benchmarking | —Unverified | 0 |
| Exploring Thermography Technology: A Comprehensive Facial Dataset for Face Detection, Recognition, and Emotion | May 28, 2024 | BenchmarkingEmotion Recognition | —Unverified | 0 |
| Risk-Neutral Generative Networks | May 28, 2024 | Benchmarking | —Unverified | 0 |
| DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime | May 28, 2024 | BenchmarkingReinforcement Learning (RL) | CodeCode Available | 1 |
| Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson's Disease Severity in Walking Sequences | May 28, 2024 | BenchmarkingFeature Engineering | CodeCode Available | 1 |
| LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters | May 27, 2024 | BenchmarkingGSM8K | CodeCode Available | 2 |