| GraspCorrect: Robotic Grasp Correction via Vision-Language Model-Guided Feedback | Mar 19, 2025 | Language Modeling | Unverified | 0 |
| Layer-wise Adaptive Gradient Norm Penalizing Method for Efficient and Accurate Deep Learning | Mar 18, 2025 | Language Modeling | Unverified | 0 |
| Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation | Mar 18, 2025 | Articles, Language Modeling | Unverified | 0 |
| MoK-RAG: Mixture of Knowledge Paths Enhanced Retrieval-Augmented Generation for Embodied AI Environments | Mar 18, 2025 | Decision Making, Language Modeling | Unverified | 0 |
| Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels | Mar 18, 2025 | GPU, Language Modeling | Code Available | 2 |
| Enabling Inclusive Systematic Reviews: Incorporating Preprint Articles with Large Language Model-Driven Evaluations | Mar 18, 2025 | Articles, Language Modeling | Unverified | 0 |
| Towards a Barrier-free GeoQA Portal: Natural Language Interaction with Geospatial Data Using Multi-Agent LLMs and Semantic Search | Mar 18, 2025 | Language Modeling | Unverified | 0 |
| RWKV-7 "Goose" with Expressive Dynamic State Evolution | Mar 18, 2025 | In-Context Learning, Language Modeling | Code Available | 9 |
| ChatBEV: A Visual Language Model that Understands BEV Maps | Mar 18, 2025 | Autonomous Driving, Language Modeling | Unverified | 0 |
| SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability | Mar 18, 2025 | Language Modeling | Unverified | 0 |
| VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences | Mar 18, 2025 | Continuous Control | Unverified | 0 |
| The Empty Chair: Using LLMs to Raise Missing Perspectives in Policy Deliberations | Mar 18, 2025 | Language Modeling | Unverified | 0 |
| KVShare: An LLM Service System with Efficient and Effective Multi-Tenant KV Cache Reuse | Mar 17, 2025 | Diversity, Language Modeling | Unverified | 0 |
| HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model | Mar 17, 2025 | Language Modeling | Unverified | 0 |
| Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model | Mar 17, 2025 | Continual Learning, Language Modeling | Unverified | 0 |
| PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing | Mar 17, 2025 | Denoising, Language Modeling | Unverified | 0 |
| Agents Play Thousands of 3D Video Games | Mar 17, 2025 | FPS Games, Language Modeling | Unverified | 0 |
| HybridGen: VLM-Guided Hybrid Planning for Scalable Data Generation of Imitation Learning | Mar 17, 2025 | Imitation Learning, Language Modeling | Unverified | 0 |
| High-entropy Advantage in Neural Networks' Generalizability | Mar 17, 2025 | Language Modeling | Unverified | 0 |
| MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling | Mar 17, 2025 | GPU, Language Modeling | Code Available | 2 |
| Valid Text-to-SQL Generation with Unification-based DeepStochLog | Mar 17, 2025 | Language Modeling | Code Available | 0 |
| GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing | Mar 16, 2025 | Change Detection, Image Captioning | Unverified | 0 |
| Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma? | Mar 16, 2025 | Language Modeling | Code Available | 1 |
| State Fourier Diffusion Language Model (SFDLM): A Scalable, Novel Iterative Approach to Language Modeling | Mar 16, 2025 | Denoising, Language Modeling | Unverified | 0 |
| SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression | Mar 16, 2025 | Language Modeling | Code Available | 3 |