| Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval | Mar 29, 2025 | All, Language Modeling | Code Available | 1 |
| OpenHuEval: Evaluating Large Language Model on Hungarian Specifics | Mar 27, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| InternVL-X: Advancing and Accelerating InternVL Series with Efficient Visual Token Compression | Mar 27, 2025 | Computational Efficiency, Large Language Model | Code Available | 1 |
| Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation | Mar 26, 2025 | Large Language Model, Scheduling | Code Available | 1 |
| CoLLM: A Large Language Model for Composed Image Retrieval | Mar 25, 2025 | Image Retrieval, Language Modeling | Code Available | 1 |
| LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Mar 25, 2025 | Cross-Modal Retrieval, Hallucination | Code Available | 1 |
| LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation | Mar 25, 2025 | Code Completion, Language Modeling | Code Available | 1 |
| Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training | Mar 24, 2025 | Diversity, Large Language Model | Code Available | 1 |
| Sun-Shine: A Large Language Model for Tibetan Culture | Mar 24, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| MEPNet: Medical Entity-balanced Prompting Network for Brain CT Report Generation | Mar 22, 2025 | Anatomy, Large Language Model | Code Available | 1 |
| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Mar 20, 2025 | 2D Object Detection, Distributed Computing | Code Available | 1 |
| The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination | Mar 20, 2025 | Benchmarking, Large Language Model | Code Available | 1 |
| MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens | Mar 14, 2025 | Audio-Visual Speech Recognition, Computational Efficiency | Code Available | 1 |
| Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space | Mar 14, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| CoLLMLight: Cooperative Large Language Model Agents for Network-Wide Traffic Signal Control | Mar 14, 2025 | Computational Efficiency, Language Modeling | Code Available | 1 |
| V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation | Mar 10, 2025 | Decoder, Image Generation | Code Available | 1 |
| Lshan-1.0 Technical Report | Mar 10, 2025 | Large Language Model | Code Available | 1 |
| Dynamic Updates for Language Adaptation in Visual-Language Tracking | Mar 9, 2025 | Large Language Model | Code Available | 1 |
| Multimodal AI predicts clinical outcomes of drug combinations from preclinical data | Mar 4, 2025 | Large Language Model | Code Available | 1 |
| InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model | Mar 4, 2025 | es-en, Language Modeling | Code Available | 1 |
| Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning | Mar 2, 2025 | Large Language Model, Multi-Instance Retrieval | Code Available | 1 |
| SGC-Net: Stratified Granular Comparison Network for Open-Vocabulary HOI Detection | Mar 1, 2025 | Human-Object Interaction Detection, Large Language Model | Code Available | 1 |
| Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable | Mar 1, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Towards General Visual-Linguistic Face Forgery Detection(V2) | Feb 28, 2025 | Hallucination, Language Modeling | Code Available | 1 |
| UDora: A Unified Red Teaming Framework against LLM Agents by Dynamically Hijacking Their Own Reasoning | Feb 28, 2025 | Large Language Model, Red Teaming | Code Available | 1 |
| SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model | Feb 27, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents | Feb 27, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Playing Pokémon Red via Deep Reinforcement Learning | Feb 27, 2025 | Deep Reinforcement Learning, Language Modeling | Code Available | 1 |
| Inverse Materials Design by Large Language Model-Assisted Generative Framework | Feb 25, 2025 | Diversity, Language Modeling | Code Available | 1 |
| Are Sparse Autoencoders Useful? A Case Study in Sparse Probing | Feb 23, 2025 | Inductive Bias, Large Language Model | Code Available | 1 |
| Weakly Supervised Video Scene Graph Generation via Natural Language Supervision | Feb 21, 2025 | Graph Generation, Image Captioning | Code Available | 1 |
| ARS: Automatic Routing Solver with Large Language Models | Feb 21, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models | Feb 20, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search | Feb 20, 2025 | AutoML, Code Generation | Code Available | 1 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | Blocking, Language Modeling | Code Available | 1 |
| STeCa: Step-level Trajectory Calibration for LLM Agent Learning | Feb 20, 2025 | Decision Making, Language Modeling | Code Available | 1 |
| Collaborative Retrieval for Large Language Model-based Conversational Recommender Systems | Feb 19, 2025 | Collaborative Filtering, Conversational Recommendation | Code Available | 1 |
| MSE-Adapter: A Lightweight Plugin Endowing LLMs with the Capability to Perform Multimodal Sentiment Analysis and Emotion Recognition | Feb 18, 2025 | Emotion Recognition, Large Language Model | Code Available | 1 |
| G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation | Feb 18, 2025 | Collaborative Filtering, Explainable Recommendation | Code Available | 1 |
| Towards Text-Image Interleaved Retrieval | Feb 18, 2025 | Information Retrieval, Language Modeling | Code Available | 1 |
| APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs | Feb 17, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis | Feb 17, 2025 | Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA) | Code Available | 1 |
| SMART: Self-Aware Agent for Tool Overuse Mitigation | Feb 17, 2025 | GSM8K, Large Language Model | Code Available | 1 |
| Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation? | Feb 17, 2025 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model | Feb 17, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| BASE-SQL: A powerful open source Text-To-SQL baseline approach | Feb 15, 2025 | In-Context Learning, Large Language Model | Code Available | 1 |
| Can Large Language Model Agents Balance Energy Systems? | Feb 14, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata | Feb 11, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning | Feb 10, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE | Feb 10, 2025 | Diversity, Language Modeling | Code Available | 1 |