| Benchmarking of GPU-optimized Quantum-Inspired Evolutionary Optimization Algorithm using Functional Analysis | Dec 12, 2024 | BenchmarkingGPU | —Unverified | 0 |
| JuStRank: Benchmarking LLM Judges for System Ranking | Dec 12, 2024 | Benchmarking | —Unverified | 0 |
| Neptune: The Long Orbit to Benchmarking Long Video Understanding | Dec 12, 2024 | BenchmarkingMultimodal Reasoning | CodeCode Available | 2 |
| Benchmarking LLMs for Mimicking Child-Caregiver Language in Interaction | Dec 12, 2024 | BenchmarkingDiversity | —Unverified | 0 |
| Benchmarking Federated Learning for Semantic Datasets: Federated Scene Graph Generation | Dec 11, 2024 | BenchmarkingFederated Learning | CodeCode Available | 0 |
| Koopman Theory-Inspired Method for Learning Time Advancement Operators in Unstable Flame Front Evolution | Dec 11, 2024 | Benchmarking | —Unverified | 0 |
| Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions | Dec 11, 2024 | BenchmarkingQuestion Answering | CodeCode Available | 0 |
| Learn How to Query from Unlabeled Data Streams in Federated Learning | Dec 11, 2024 | BenchmarkingDecision Making | CodeCode Available | 0 |
| Benchmarking learned algorithms for computed tomography image reconstruction tasks | Dec 11, 2024 | BenchmarkingComputed Tomography (CT) | —Unverified | 0 |
| Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning | Dec 11, 2024 | AttributeBenchmarking | CodeCode Available | 1 |
| LCFO: Long Context and Long Form Output Dataset and Benchmarking | Dec 11, 2024 | BenchmarkingForm | —Unverified | 0 |
| A quantum-classical reinforcement learning model to play Atari games | Dec 11, 2024 | Atari GamesBenchmarking | CodeCode Available | 0 |
| Light Field Image Quality Assessment With Auxiliary Learning Based on Depthwise and Anglewise Separable Convolutions | Dec 10, 2024 | Auxiliary LearningBenchmarking | —Unverified | 0 |
| MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems | Dec 10, 2024 | BenchmarkingMixture-of-Experts | —Unverified | 0 |
| Towards Graph Foundation Models: A Study on the Generalization of Positional and Structural Encodings | Dec 10, 2024 | BenchmarkingGraph Learning | —Unverified | 0 |
| OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations | Dec 10, 2024 | AttributeBenchmarking | CodeCode Available | 5 |
| Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments | Dec 10, 2024 | Benchmarkingobject-detection | —Unverified | 0 |
| MO-IOHinspector: Anytime Benchmarking of Multi-Objective Algorithms using IOHprofiler | Dec 10, 2024 | BenchmarkingExperimental Design | —Unverified | 0 |
| Bilingual BSARD: Extending Statutory Article Retrieval to Dutch | Dec 10, 2024 | ArticlesBenchmarking | CodeCode Available | 0 |
| Graph Neural Networks Are More Than Filters: Revisiting and Benchmarking from A Spectral Perspective | Dec 10, 2024 | Benchmarking | CodeCode Available | 0 |
| Multi-Behavior Recommendation with Personalized Directed Acyclic Behavior Graphs | Dec 9, 2024 | BenchmarkingComputational Efficiency | CodeCode Available | 1 |
| PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models | Dec 9, 2024 | BenchmarkingInstruction Following | CodeCode Available | 0 |
| PowerMamba: A Deep State Space Model and Comprehensive Benchmark for Time Series Prediction in Electric Power Systems | Dec 9, 2024 | BenchmarkingPrediction | CodeCode Available | 1 |
| ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities | Dec 9, 2024 | AllBenchmarking | —Unverified | 0 |
| On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events | Dec 9, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |