| PEDANTS: Cheap but Effective and Interpretable Answer Equivalence | Feb 17, 2024 | Benchmarking, Form | Code Available | 2 |
| Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for Positional Discourse Coherence | Feb 15, 2024 | Articles, Coherence Evaluation | Code Available | 0 |
| Closed-form Filtering for Non-linear Systems | Feb 15, 2024 | Computational Efficiency, Form | Unverified | 0 |
| Long-form evaluation of model editing | Feb 14, 2024 | Form, model | Code Available | 0 |
| Comment-aided Video-Language Alignment via Contrastive Pre-training for Short-form Video Humor Detection | Feb 14, 2024 | Form, Humor Detection | Code Available | 0 |
| Perturbative partial moment matching and gradient-flow adaptive importance sampling transformations for Bayesian leave one out cross-validation | Feb 13, 2024 | Form | Code Available | 0 |
| Game Agent Driven by Free-Form Text Command: Using LLM-based Code Generation and Behavior Branch | Feb 12, 2024 | Code Generation, Form | Unverified | 0 |
| KVQ: Kwai Video Quality Assessment for Short-form Videos | Feb 11, 2024 | Form, Video Quality Assessment | Code Available | 2 |
| Closed-form solutions for generic N-token AMM arbitrage | Feb 9, 2024 | Form | Unverified | 0 |
| Outage performance of the α-Beaulieu-Xie Shadowed Fading Channel Model | Feb 9, 2024 | Form | Unverified | 0 |