| Rectified Sparse Attention | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Statistical Physics of Language Model Reasoning | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products | Jun 4, 2025 | image-classificationImage Classification | —Unverified | 0 |
| MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale | Jun 4, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| LaF-GRPO: In-Situ Navigation Instruction Generation for the Visually Impaired via GRPO with LLM-as-Follower Reward | Jun 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| "Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions | Jun 4, 2025 | Data AugmentationDiversity | —Unverified | 0 |
| Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate | Jun 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Evaluating Apple Intelligence's Writing Tools for Privacy Against Large Language Model-Based Inference Attacks: Insights from Early Datasets | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based Systems | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching | Jun 3, 2025 | Data AugmentationInstruction Following | —Unverified | 0 |
| A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning | Jun 3, 2025 | Decision MakingDiagnostic | CodeCode Available | 3 |
| TaxAgent: How Large Language Model Designs Fiscal Policy | Jun 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Trajectory Prediction Meets Large Language Models: A Survey | Jun 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| IMPARA-GED: Grammatical Error Detection is Boosting Reference-free Grammatical Error Quality Estimator | Jun 3, 2025 | Grammatical Error CorrectionGrammatical Error Detection | —Unverified | 0 |
| EALG: Evolutionary Adversarial Generation of Language Model-Guided Generators for Combinatorial Optimization | Jun 3, 2025 | Combinatorial OptimizationLanguage Modeling | —Unverified | 0 |
| TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models | Jun 3, 2025 | DecoderKnowledge Distillation | —Unverified | 0 |
| SurgVLM: A Large Vision-Language Model and Systematic Evaluation Benchmark for Surgical Intelligence | Jun 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Text Compression: Evaluating Tokenizers Across Scales | Jun 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hybrid AI for Responsive Multi-Turn Online Conversations with Novel Dynamic Routing and Feedback Adaptation | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Investigating the Impact of Word Informativeness on Speech Emotion Recognition | Jun 2, 2025 | Emotion RecognitionInformativeness | —Unverified | 0 |
| Why Gradients Rapidly Increase Near the End of Training | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Challenging Language Model Agents | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |