| Shushing! Let's Imagine an Authentic Speech from the Silent Video | Mar 19, 2025 | cross-modal alignmentLanguage Modeling | —Unverified | 0 |
| Enabling Inclusive Systematic Reviews: Incorporating Preprint Articles with Large Language Model-Driven Evaluations | Mar 18, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| RWKV-7 "Goose" with Expressive Dynamic State Evolution | Mar 18, 2025 | In-Context LearningLanguage Modeling | CodeCode Available | 9 |
| Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation | Mar 18, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Towards a Barrier-free GeoQA Portal: Natural Language Interaction with Geospatial Data Using Multi-Agent LLMs and Semantic Search | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Layer-wise Adaptive Gradient Norm Penalizing Method for Efficient and Accurate Deep Learning | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MoK-RAG: Mixture of Knowledge Paths Enhanced Retrieval-Augmented Generation for Embodied AI Environments | Mar 18, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels | Mar 18, 2025 | GPULanguage Modeling | CodeCode Available | 2 |
| ChatBEV: A Visual Language Model that Understands BEV Maps | Mar 18, 2025 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |