| ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way | Jul 11, 2025 | Depth EstimationHallucination | —Unverified | 0 |
| Repairing Language Model Pipelines by Meta Self-Refining Competing Constraints at Runtime | Jul 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models | Jul 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model | Jul 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evolution without Large Models: Training Language Model with Task Principles | Jul 8, 2025 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| PrefixAgent: An LLM-Powered Design Framework for Efficient Prefix Adder Optimization | Jul 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LeAD: The LLM Enhanced Planning System Converged with End-to-end Autonomous Driving | Jul 8, 2025 | Autonomous DrivingImitation Learning | —Unverified | 0 |
| MusiScene: Leveraging MU-LLaMA for Scene Imagination and Enhanced Video Background Music Generation | Jul 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Systematic Analysis of Hybrid Linear Attention | Jul 8, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing | Jul 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TalkFashion: Intelligent Virtual Try-On Assistant Based on Multimodal Large Language Model | Jul 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Inaugural MOASEI Competition at AAMAS'2025: A Technical Report | Jul 7, 2025 | BenchmarkingDecision Making | —Unverified | 0 |
| PRIME: Large Language Model Personalization with Cognitive Memory and Thought Processes | Jul 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model | Jul 7, 2025 | Image RetrievalLanguage Modeling | —Unverified | 0 |
| Transforming Calabi-Yau Constructions: Generating New Calabi-Yau Manifolds with Transformers | Jul 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OpenTable-R1: A Reinforcement Learning Augmented Tool Agent for Open-Domain Table Question Answering | Jul 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Flexible Language Modeling in Continuous Space with Transformer-based Autoregressive Flows | Jul 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning | Jun 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Large Language Model-Empowered Agent for Reliable and Robust Structural Analysis | Jun 27, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Large Language Model Agent for Modular Task Execution in Drug Discovery | Jun 26, 2025 | Drug DiscoveryLanguage Modeling | —Unverified | 0 |
| Beyond Reactive Safety: Risk-Aware LLM Alignment via Long-Horizon Simulation | Jun 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Data Efficacy for Language Model Training | Jun 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Prompt-Guided Turn-Taking Prediction | Jun 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Detecting Referring Expressions in Visually Grounded Dialogue with Autoregressive Language Models | Jun 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| V2X-REALM: Vision-Language Model-Based Robust End-to-End Cooperative Autonomous Driving with Adaptive Long-Tail Modeling | Jun 26, 2025 | Autonomous DrivingContrastive Learning | —Unverified | 0 |