| ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph | Mar 20, 2025 | Benchmarking, Hallucination | Unverified | 0 |
| Empirical Analysis of Privacy-Fairness-Accuracy Trade-offs in Federated Learning: A Step Towards Responsible AI | Mar 20, 2025 | Benchmarking, Fairness | Unverified | 0 |
| FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding | Mar 19, 2025 | Benchmarking, Multiple-choice | Unverified | 0 |
| Benchmarking Open-Source Large Language Models on Healthcare Text Classification Tasks | Mar 19, 2025 | Benchmarking, Domain Adaptation | Unverified | 0 |
| Benchmarking Large Language Models for Handwritten Text Recognition | Mar 19, 2025 | Benchmarking, Handwritten Text Recognition | Unverified | 0 |
| Kolmogorov-Arnold Network for Transistor Compact Modeling | Mar 19, 2025 | Benchmarking | Unverified | 0 |
| VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning | Mar 19, 2025 | Benchmarking, Language Modeling | Code Available | 2 |
| Language-based Image Colorization: A Benchmark and Beyond | Mar 19, 2025 | Benchmarking, Colorization | Code Available | 0 |
| SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes | Mar 19, 2025 | 3D Semantic Segmentation, Benchmarking | Unverified | 0 |
| ImputeGAP: A Comprehensive Library for Time Series Imputation | Mar 19, 2025 | Benchmarking, Imputation | Unverified | 0 |