| Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners | May 29, 2025 | Humanoid ControlLanguage Modeling | —Unverified | 0 |
| ATLAS: Learning to Optimally Memorize the Context at Test Time | May 29, 2025 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| SCORPIO: Serving the Right Requests at the Right Time for Heterogeneous SLOs in LLM Inference | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Diversity-Aware Policy Optimization for Large Language Model Reasoning | May 29, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Discriminative Policy Optimization for Token-Level Reward Models | May 29, 2025 | GSM8KLanguage Modeling | CodeCode Available | 0 |
| Position: Federated Foundation Language Model Post-Training Should Focus on Open-Source Models | May 29, 2025 | Federated LearningLanguage Modeling | —Unverified | 0 |
| Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition | May 29, 2025 | Handwritten Mathmatical Expression RecognitionLanguage Modeling | CodeCode Available | 1 |
| Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation | May 29, 2025 | Decision MakingHallucination | —Unverified | 0 |
| CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents | May 29, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| Learning Parametric Distributions from Samples and Preferences | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Augment or Not? A Comparative Study of Pure and Augmented Large Language Model Recommenders | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| An Empirical Study of Federated Prompt Learning for Vision Language Model | May 29, 2025 | Federated LearningLanguage Modeling | —Unverified | 0 |
| VLM-RRT: Vision Language Model Guided RRT Search for Autonomous UAV Navigation | May 29, 2025 | Disaster ResponseLanguage Modeling | —Unverified | 0 |
| Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SeG-SR: Integrating Semantic Knowledge into Remote Sensing Image Super-Resolution via Vision-Language Model | May 29, 2025 | Image Super-ResolutionLanguage Modeling | CodeCode Available | 0 |
| PhotoArtAgent: Intelligent Photo Retouching with Language Model-Based Artist Agents | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beam-Guided Knowledge Replay for Knowledge-Rich Image Captioning using Vision-Language Model | May 29, 2025 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Spoken Language Modeling with Duration-Penalized Self-Supervised Units | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| TrackVLA: Embodied Visual Tracking in the Wild | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems | May 29, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreement | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents | May 28, 2025 | ChatbotLanguage Modeling | —Unverified | 0 |
| EnsemW2S: Enhancing Weak-to-Strong Generalization with Large Language Model Ensembles | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatCFD: an End-to-End CFD Agent with Domain-specific Structured Thinking | May 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Speech as a Multimodal Digital Phenotype for Multi-Task LLM-based Mental Health Prediction | May 28, 2025 | Depression DetectionLanguage Modeling | —Unverified | 0 |
| ICH-Qwen: A Large Language Model Towards Chinese Intangible Cultural Heritage | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CLUE: Neural Networks Calibration via Learning Uncertainty-Error alignment | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cross-modal RAG: Sub-dimensional Retrieval-Augmented Text-to-Image Generation | May 28, 2025 | Image GenerationLanguage Modeling | CodeCode Available | 0 |
| Improving Brain-to-Image Reconstruction via Fine-Grained Text Bridging | May 28, 2025 | Image ReconstructionLanguage Modeling | —Unverified | 0 |
| Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems | May 28, 2025 | Automated Essay ScoringLanguage Modeling | —Unverified | 0 |
| GateNLP at SemEval-2025 Task 10: Hierarchical Three-Step Prompting for Multilingual Narrative Classification | May 28, 2025 | ArticlesLanguage Modeling | CodeCode Available | 0 |
| LLM-ODDR: A Large Language Model Framework for Joint Order Dispatching and Driver Repositioning | May 28, 2025 | Combinatorial OptimizationFairness | —Unverified | 0 |
| VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Conversational Alignment with Artificial Intelligence in Context | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BugWhisperer: Fine-Tuning LLMs for SoC Hardware Vulnerability Detection | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL | May 28, 2025 | Bayesian OptimizationHyperparameter Optimization | —Unverified | 0 |
| NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding | May 28, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Tool for Generating Exceptional Behavior Tests With Large Language Models | May 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CFP-Gen: Combinatorial Functional Protein Generation via Diffusion Language Models | May 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems | May 28, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Operationalizing CaMeL: Strengthening LLM Defenses for Enterprise Deployment | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Zero-Shot Vision Encoder Grafting via LLM Surrogates | May 28, 2025 | DecoderLanguage Modeling | CodeCode Available | 2 |
| Incorporating LLMs for Large-Scale Urban Complex Mobility Simulation | May 28, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| The Multilingual Divide and Its Impact on Global AI Safety | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChemHAS: Hierarchical Agent Stacking for Enhancing Chemistry Tools | May 27, 2025 | AI AgentLanguage Modeling | —Unverified | 0 |
| Rethinking Information Synthesis in Multimodal Question Answering A Multi-Agent Perspective | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Pretraining Language Models to Ponder in Continuous Space | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization Algorithms | May 27, 2025 | Bayesian OptimizationBenchmarking | CodeCode Available | 2 |