| ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way | Jul 11, 2025 | Depth EstimationHallucination | —Unverified | 0 |
| Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models | Jul 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model | Jul 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery | Jul 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| A Systematic Analysis of Hybrid Linear Attention | Jul 8, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Evaluating Morphological Alignment of Tokenizers in 70 Languages | Jul 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing | Jul 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LeAD: The LLM Enhanced Planning System Converged with End-to-end Autonomous Driving | Jul 8, 2025 | Autonomous DrivingImitation Learning | —Unverified | 0 |
| Evolution without Large Models: Training Language Model with Task Principles | Jul 8, 2025 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| TalkFashion: Intelligent Virtual Try-On Assistant Based on Multimodal Large Language Model | Jul 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |