| Mellow: a small audio language model for reasoning | Mar 11, 2025 | Audio captioningLanguage Modeling | CodeCode Available | 2 |
| EditLord: Learning Code Transformation Rules for Code Editing | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Time Series Multitask Framework Integrating a Large Language Model, Pre-Trained Time Series Model, and Knowledge Graph | Mar 10, 2025 | Anomaly DetectionDecoder | —Unverified | 0 |
| MapQA: Open-domain Geospatial Question Answering on Map Data | Mar 10, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| EAZY: Eliminating Hallucinations in LVLMs by Zeroing out Hallucinatory Image Tokens | Mar 10, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| LLMIdxAdvis: Resource-Efficient Index Advisor Utilizing Large Language Model | Mar 10, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Evaluating LLaMA 3.2 for Software Vulnerability Detection | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning | Mar 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation | Mar 10, 2025 | DecoderImage Generation | CodeCode Available | 1 |
| Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Fine-Grained Video Question Answering | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GRITHopper: Decomposition-Free Multi-Hop Dense Retrieval | Mar 10, 2025 | Causal Language ModelingLanguage Modeling | CodeCode Available | 1 |
| CAPT: Class-Aware Prompt Tuning for Federated Long-Tailed Learning with Vision-Language Model | Mar 10, 2025 | Federated LearningLanguage Modeling | —Unverified | 0 |
| CtrlRAG: Black-box Adversarial Attacks Based on Masked Language Models in Retrieval-Augmented Language Generation | Mar 10, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Building English ASR model with regional language support | Mar 10, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Contextual Cues in Machine Translation: Investigating the Potential of Multi-Source Input Strategies in LLMs and NMT Systems | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Effect of Selection Format on LLM Performance | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| When Trust Collides: Decoding Human-LLM Cooperation Dynamics through the Prisoner's Dilemma | Mar 10, 2025 | AI AgentLanguage Modeling | —Unverified | 0 |
| DiffCLIP: Differential Attention Meets CLIP | Mar 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Gender Encoding Patterns in Pretrained Language Model Representations | Mar 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Seesaw: High-throughput LLM Inference via Model Re-sharding | Mar 9, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| CalliReader: Contextualizing Chinese Calligraphy via an Embedding-Aligned Vision-Language Model | Mar 9, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Multimodal Programming in Computer Science with Interactive Assistance Powered by Large Language Model | Mar 9, 2025 | ChatbotLanguage Modeling | —Unverified | 0 |
| AI-Facilitated Episodic Future Thinking For Adults with Obesity | Mar 8, 2025 | ChatbotLanguage Modeling | —Unverified | 0 |
| Evaluation of the Automated Labeling Method for Taxonomic Nomenclature Through Prompt-Optimized Large Language Model | Mar 8, 2025 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| From Captions to Rewards (CAREVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models | Mar 8, 2025 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Image is All You Need: Towards Efficient and Effective Large Language Model-Based Recommender Systems | Mar 8, 2025 | AllAttribute | —Unverified | 0 |
| Language Model Personalization via Reward Factorization | Mar 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices | Mar 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion | Mar 8, 2025 | 3D Semantic Scene CompletionAutonomous Driving | CodeCode Available | 1 |
| Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding | Mar 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model | Mar 8, 2025 | Image Quality AssessmentLanguage Modeling | CodeCode Available | 2 |
| Phraselette: A Poet's Procedural Palette | Mar 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SpecServe: Efficient and SLO-Aware Large Language Model Serving with Adaptive Speculative Decoding | Mar 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance | Mar 7, 2025 | ARCLanguage Modeling | —Unverified | 0 |
| Frequency Autoregressive Image Generation with Continuous Tokens | Mar 7, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Mar 7, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 2 |
| R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement Learning | Mar 7, 2025 | Emotion RecognitionLanguage Modeling | CodeCode Available | 5 |
| Is Your Video Language Model a Reliable Judge? | Mar 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLM-based Iterative Approach to Metamodeling in Automotive | Mar 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DETQUS: Decomposition-Enhanced Transformers for QUery-focused Summarization | Mar 7, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| IDEA Prune: An Integrated Enlarge-and-Prune Pipeline in Generative Language Model Pretraining | Mar 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging Approximate Caching for Faster Retrieval-Augmented Generation | Mar 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PromptPex: Automatic Test Generation for Language Model Prompts | Mar 7, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Unveiling Biases in AI: ChatGPT's Political Economy Perspectives and Human Comparisons | Mar 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| QG-SMS: Enhancing Test Item Analysis via Student Modeling and Simulation | Mar 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Look Before You Leap: Using Serialized State Machine for Language Conditioned Robotic Manipulation | Mar 7, 2025 | Imitation LearningLanguage Modeling | —Unverified | 0 |
| Generalized Interpolating Discrete Diffusion | Mar 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Wanda++: Pruning Large Language Models via Regional Gradients | Mar 6, 2025 | DecoderGPU | CodeCode Available | 0 |