| Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM | May 16, 2025 | Language Modeling | Code Available | 0 |
| On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating | May 16, 2025 | Language Modeling | Unverified | 0 |
| Enhancing Low-Resource Minority Language Translation with LLMs and Retrieval-Augmented Generation for Cultural Nuances | May 16, 2025 | Language Modeling | Unverified | 0 |
| An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents | May 16, 2025 | Form, Language Modeling | Code Available | 0 |
| Token-Level Uncertainty Estimation for Large Language Model Reasoning | May 16, 2025 | Language Modeling | Unverified | 0 |
| Feasibility with Language Models for Open-World Compositional Zero-Shot Learning | May 16, 2025 | Attribute, Compositional Zero-Shot Learning | Unverified | 0 |
| Neural Thermodynamic Laws for Large Language Model Training | May 15, 2025 | Language Modeling | Unverified | 0 |
| VQ-Logits: Compressing the Output Bottleneck of Large Language Models via Vector Quantized Logits | May 15, 2025 | Language Modeling | Unverified | 0 |
| Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors | May 15, 2025 | Language Modeling | Unverified | 0 |
| CRPE: Expanding The Reasoning Capability of Large Language Model for Code Generation | May 15, 2025 | Code Generation, Language Modeling | Unverified | 0 |