| Automated Journalistic Questions: A New Method for Extracting 5W1H in French | May 20, 2025 | Articles, Language Modeling | Unverified | 0 |
| Too Long, Didn't Model: Decomposing LLM Long-Context Understanding With Novels | May 20, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation | May 20, 2025 | Conditional Text Generation, Language Modeling | Unverified | 0 |
| MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations | May 20, 2025 | Fact Checking, Hallucination | Code Available | 0 |
| FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation | May 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Exploring Graph Representations of Logical Forms for Language Modeling | May 20, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Improve Language Model and Brain Alignment via Associative Memory | May 20, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Speculative Decoding Reimagined for Multimodal Large Language Models | May 20, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| TRATES: Trait-Specific Rubric-Assisted Cross-Prompt Essay Scoring | May 20, 2025 | Automated Essay Scoring, Language Modeling | Unverified | 0 |
| UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation | May 20, 2025 | Image Generation, Language Modeling | Unverified | 0 |
| CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code Generation | May 20, 2025 | Code Generation, Language Modeling | Code Available | 2 |
| HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing | May 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Studying the Role of Input-Neighbor Overlap in Retrieval-Augmented Language Models Training Efficiency | May 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives | May 20, 2025 | Caption Generation, Contrastive Learning | Unverified | 0 |
| Rank-K: Test-Time Reasoning for Listwise Reranking | May 20, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring | May 20, 2025 | Automated Essay Scoring, Diversity | Unverified | 0 |
| Structured Agent Distillation for Large Language Model | May 20, 2025 | Decision Making, Imitation Learning | Unverified | 0 |
| MAS-KCL: Knowledge component graph structure learning with large language model-based agentic workflow | May 20, 2025 | Graph structure learning, Language Modeling | Unverified | 0 |
| U-SAM: An audio language Model for Unified Speech, Audio, and Music Understanding | May 20, 2025 | cross-modal alignment, Language Modeling | Code Available | 1 |
| Improving Noise Robustness of LLM-based Zero-shot TTS via Discrete Acoustic Token Denoising | May 20, 2025 | Decoder, Denoising | Unverified | 0 |
| sudoLLM : On Multi-role Alignment of Language Models | May 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| A*-Decoding: Token-Efficient Inference Scaling | May 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| ReSW-VL: Representation Learning for Surgical Workflow Analysis Using Vision-Language Model | May 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Combining the Best of Both Worlds: A Method for Hybrid NMT and LLM Translation | May 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Krikri: Advancing Open Large Language Models for Greek | May 19, 2025 | Code Generation, Language Modeling | Unverified | 0 |
| VocalAgent: Large Language Models for Vocal Health Diagnostics with Safety-Aware Evaluation | May 19, 2025 | Diagnostic, Language Modeling | Unverified | 0 |
| Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space | May 19, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Sat2Sound: A Unified Framework for Zero-Shot Soundscape Mapping | May 19, 2025 | Contrastive Learning, Cross-Modal Retrieval | Unverified | 0 |
| Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding | May 19, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| The Traitors: Deception and Trust in Multi-Agent Language Model Simulations | May 19, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| On the Thinking-Language Modeling Gap in Large Language Models | May 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| R1dacted: Investigating Local Censorship in DeepSeek's R1 Language Model | May 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| TinyAlign: Boosting Lightweight Vision-Language Models by Mitigating Modal Alignment Bottlenecks | May 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Why Knowledge Distillation Works in Generative Models: A Minimal Working Explanation | May 19, 2025 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling | May 19, 2025 | Graph Generation, Knowledge Distillation | Unverified | 0 |
| SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models | May 19, 2025 | Causal Inference, Decision Making | Unverified | 0 |
| VLC Fusion: Vision-Language Conditioned Sensor Fusion for Robust Object Detection | May 19, 2025 | Autonomous Driving, Language Modeling | Unverified | 0 |
| IDEAL: Data Equilibrium Adaptation for Multi-Capability Language Model Alignment | May 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| SpatialLLM: From Multi-modality Data to Urban Spatial Intelligence | May 19, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| 3D Visual Illusion Depth Estimation | May 19, 2025 | Common Sense Reasoning, Depth Estimation | Code Available | 1 |
| G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning | May 19, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO | May 19, 2025 | Decoder, Image Generation | Code Available | 0 |
| Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice | May 19, 2025 | All, Hallucination | Unverified | 0 |
| Structure-Aware Corpus Construction and User-Perception-Aligned Metrics for Large-Language-Model Code Completion | May 19, 2025 | Code Completion, Language Modeling | Unverified | 0 |
| R3: Robust Rubric-Agnostic Reward Models | May 19, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| CIE: Controlling Language Model Text Generations Using Continuous Signals | May 19, 2025 | continuous-control, Continuous Control | Code Available | 0 |
| A Physics-Inspired Optimizer: Velocity Regularized Adam | May 19, 2025 | image-classification, Image Classification | Unverified | 0 |
| CMLFormer: A Dual Decoder Transformer with Switching Point Learning for Code-Mixed Language Modeling | May 19, 2025 | Decoder, Language Modeling | Unverified | 0 |
| LLM-Based User Simulation for Low-Knowledge Shilling Attacks on Recommender Systems | May 18, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| DS-ProGen: A Dual-Structure Deep Language Model for Functional Protein Design | May 18, 2025 | Language Modeling, Language Modelling | Unverified | 0 |