| EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions | May 22, 2025 | Claim VerificationFact Checking | CodeCode Available | 0 |
| Continually Self-Improving Language Models for Bariatric Surgery Question--Answering | May 22, 2025 | Large Language ModelMisinformation | —Unverified | 0 |
| Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering | May 22, 2025 | Global FactsLanguage Modeling | CodeCode Available | 0 |
| SD-MAD: Sign-Driven Few-shot Multi-Anomaly Detection in Medical Images | May 22, 2025 | Anomaly DetectionFew-Shot Learning | —Unverified | 0 |
| LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation Quality | May 22, 2025 | Abstractive Text SummarizationInformativeness | CodeCode Available | 0 |
| Large Language Model-Empowered Interactive Load Forecasting | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PMPO: Probabilistic Metric Prompt Optimization for Small and Large Language Models | May 22, 2025 | GSM8KLarge Language Model | —Unverified | 0 |