| Mellow: a small audio language model for reasoning | Mar 11, 2025 | Audio captioningLanguage Modeling | CodeCode Available | 2 |
| EditLord: Learning Code Transformation Rules for Code Editing | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Time Series Multitask Framework Integrating a Large Language Model, Pre-Trained Time Series Model, and Knowledge Graph | Mar 10, 2025 | Anomaly DetectionDecoder | —Unverified | 0 |
| MapQA: Open-domain Geospatial Question Answering on Map Data | Mar 10, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| EAZY: Eliminating Hallucinations in LVLMs by Zeroing out Hallucinatory Image Tokens | Mar 10, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| LLMIdxAdvis: Resource-Efficient Index Advisor Utilizing Large Language Model | Mar 10, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Evaluating LLaMA 3.2 for Software Vulnerability Detection | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation | Mar 10, 2025 | DecoderImage Generation | CodeCode Available | 1 |
| CAPT: Class-Aware Prompt Tuning for Federated Long-Tailed Learning with Vision-Language Model | Mar 10, 2025 | Federated LearningLanguage Modeling | —Unverified | 0 |
| When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning | Mar 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GRITHopper: Decomposition-Free Multi-Hop Dense Retrieval | Mar 10, 2025 | Causal Language ModelingLanguage Modeling | CodeCode Available | 1 |
| Towards Fine-Grained Video Question Answering | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Building English ASR model with regional language support | Mar 10, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| CtrlRAG: Black-box Adversarial Attacks Based on Masked Language Models in Retrieval-Augmented Language Generation | Mar 10, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Contextual Cues in Machine Translation: Investigating the Potential of Multi-Source Input Strategies in LLMs and NMT Systems | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| When Trust Collides: Decoding Human-LLM Cooperation Dynamics through the Prisoner's Dilemma | Mar 10, 2025 | AI AgentLanguage Modeling | —Unverified | 0 |
| Effect of Selection Format on LLM Performance | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DiffCLIP: Differential Attention Meets CLIP | Mar 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Gender Encoding Patterns in Pretrained Language Model Representations | Mar 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Multimodal Programming in Computer Science with Interactive Assistance Powered by Large Language Model | Mar 9, 2025 | ChatbotLanguage Modeling | —Unverified | 0 |
| Seesaw: High-throughput LLM Inference via Model Re-sharding | Mar 9, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| CalliReader: Contextualizing Chinese Calligraphy via an Embedding-Aligned Vision-Language Model | Mar 9, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| AI-Facilitated Episodic Future Thinking For Adults with Obesity | Mar 8, 2025 | ChatbotLanguage Modeling | —Unverified | 0 |