| Task-oriented Over-the-air Computation for Edge-device Co-inference with Balanced Classification Accuracy | Jul 1, 2024 | Benchmarking | —Unverified | 0 |
| ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions | Jul 1, 2024 | BenchmarkingQuestion Generation | —Unverified | 0 |
| BERGEN: A Benchmarking Library for Retrieval-Augmented Generation | Jul 1, 2024 | BenchmarkingRAG | CodeCode Available | 3 |
| Modified CMA-ES Algorithm for Multi-Modal Optimization: Incorporating Niching Strategies and Dynamic Adaptation Mechanism | Jul 1, 2024 | BenchmarkingDiversity | —Unverified | 0 |
| Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents | Jul 1, 2024 | Benchmarking | CodeCode Available | 1 |
| Reinvestigating the R2 Indicator: Achieving Pareto Compliance by Integration | Jul 1, 2024 | Benchmarking | CodeCode Available | 0 |
| MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations | Jul 1, 2024 | Benchmarkingdocument understanding | CodeCode Available | 2 |
| EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting | Jul 1, 2024 | 3D ReconstructionBenchmarking | —Unverified | 0 |
| FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models | Jul 1, 2024 | BenchmarkingFairness | CodeCode Available | 2 |
| Benchmarking Predictive Coding Networks -- Made Simple | Jul 1, 2024 | Benchmarking | CodeCode Available | 2 |