| Qwen2.5-Omni Technical Report | Mar 26, 2025 | Automatic Speech Recognition (ASR)GSM8K | CodeCode Available | 7 |
| A Multilingual, Culture-First Approach to Addressing Misgendering in LLM Applications | Mar 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation | Mar 26, 2025 | Large Language ModelScheduling | CodeCode Available | 1 |
| Exploring the Effect of Robotic Embodiment and Empathetic Tone of LLMs on Empathy Elicitation | Mar 26, 2025 | ChatbotLanguage Modeling | —Unverified | 0 |
| Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OAEI-LLM-T: A TBox Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PHEONA: An Evaluation Framework for Large Language Model-based Approaches to Computational Phenotyping | Mar 25, 2025 | Computational PhenotypingLanguage Modeling | —Unverified | 0 |
| Iterative Hypothesis Generation for Scientific Discovery with Monte Carlo Nash Equilibrium Self-Refining Trees | Mar 25, 2025 | Large Language Modelscientific discovery | —Unverified | 0 |
| Optimizing Photonic Structures with Large Language Model Driven Algorithm Discovery | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SemEval-2025 Task 9: The Food Hazard Detection Challenge | Mar 25, 2025 | DecoderLanguage Modeling | —Unverified | 0 |