| When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning | Mar 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Towards Fine-Grained Video Question Answering | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation | Mar 10, 2025 | DecoderImage Generation | CodeCode Available | 1 |
| Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CtrlRAG: Black-box Adversarial Attacks Based on Masked Language Models in Retrieval-Augmented Language Generation | Mar 10, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Building English ASR model with regional language support | Mar 10, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Contextual Cues in Machine Translation: Investigating the Potential of Multi-Source Input Strategies in LLMs and NMT Systems | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| When Trust Collides: Decoding Human-LLM Cooperation Dynamics through the Prisoner's Dilemma | Mar 10, 2025 | AI AgentLanguage Modeling | —Unverified | 0 |
| Effect of Selection Format on LLM Performance | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DiffCLIP: Differential Attention Meets CLIP | Mar 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |