| PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration | May 21, 2025 | Large Language Modelscientific discovery | CodeCode Available | 1 |
| Bridging Sign and Spoken Languages: Pseudo Gloss Generation for Sign Language Translation | May 21, 2025 | In-Context LearningLarge Language Model | —Unverified | 0 |
| Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ClickSight: Interpreting Student Clickstreams to Reveal Insights on Learning Strategies via LLMs | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory | May 21, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Privacy-Preserving Conformal Prediction Under Local Differential Privacy | May 21, 2025 | Conformal PredictionLarge Language Model | CodeCode Available | 0 |
| lmgame-Bench: How Good are LLMs at Playing Games? | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FlowBERT: Prompt-tuned BERT for variable flow field prediction | May 20, 2025 | Dimensionality ReductionFew-Shot Learning | —Unverified | 0 |
| Modality-Balancing Preference Optimization of Large Multimodal Models by Adversarial Negative Mining | May 20, 2025 | Large Language Model | —Unverified | 0 |
| Semi-Clairvoyant Scheduling of Speculative Decoding Requests to Minimize LLM Inference Latency | May 20, 2025 | Large Language ModelScheduling | —Unverified | 0 |
| Large Language Model-Driven Distributed Integrated Multimodal Sensing and Semantic Communications | May 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reliable Decision Support with LLMs: A Framework for Evaluating Consistency in Binary Text Classification Applications | May 20, 2025 | ArticlesBinary text classification | —Unverified | 0 |
| Automated Journalistic Questions: A New Method for Extracting 5W1H in French | May 20, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Guarded Query Routing for Large Language Models | May 20, 2025 | Large Language Modeltext-classification | CodeCode Available | 0 |
| DSMentor: Enhancing Data Science Agents with Curriculum Learning and Online Knowledge Accumulation | May 20, 2025 | In-Context LearningInference Optimization | —Unverified | 0 |
| Polar Sparsity: High Throughput Batched LLM Inferencing with Scalable Contextual Sparsity | May 20, 2025 | GPULarge Language Model | CodeCode Available | 0 |
| LLM-based Evaluation Policy Extraction for Ecological Modeling | May 20, 2025 | BenchmarkingLarge Language Model | —Unverified | 0 |
| UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning | May 20, 2025 | Large Language ModelMultimodal Large Language Model | —Unverified | 0 |
| CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring | May 20, 2025 | Automated Essay ScoringDiversity | —Unverified | 0 |
| TRATES: Trait-Specific Rubric-Assisted Cross-Prompt Essay Scoring | May 20, 2025 | Automated Essay ScoringLanguage Modeling | —Unverified | 0 |
| Memory-Centric Embodied Question Answer | May 20, 2025 | Embodied Question AnsweringLarge Language Model | —Unverified | 0 |
| Structured Agent Distillation for Large Language Model | May 20, 2025 | Decision MakingImitation Learning | —Unverified | 0 |
| FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation | May 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BAR: A Backward Reasoning based Agent for Complex Minecraft Tasks | May 20, 2025 | Large Language ModelMinecraft | CodeCode Available | 0 |
| Beyond Words: Multimodal LLM Knows When to Speak | May 20, 2025 | Large Language Model | —Unverified | 0 |
| Can Pruning Improve Reasoning? Revisiting Long-CoT Compression with Capability in Mind for Better Reasoning | May 20, 2025 | Large Language ModelMathematical Reasoning | —Unverified | 0 |
| UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation | May 20, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| MAS-KCL: Knowledge component graph structure learning with large language model-based agentic workflow | May 20, 2025 | Graph structure learningLanguage Modeling | —Unverified | 0 |
| U-SAM: An audio language Model for Unified Speech, Audio, and Music Understanding | May 20, 2025 | cross-modal alignmentLanguage Modeling | CodeCode Available | 1 |
| sudoLLM : On Multi-role Alignment of Language Models | May 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ServerlessLoRA: Minimizing Latency and Cost in Serverless Inference for LoRA-Based LLMs | May 20, 2025 | GPULarge Language Model | —Unverified | 0 |
| Improving Noise Robustness of LLM-based Zero-shot TTS via Discrete Acoustic Token Denoising | May 20, 2025 | DecoderDenoising | —Unverified | 0 |
| Combining the Best of Both Worlds: A Method for Hybrid NMT and LLM Translation | May 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Krikri: Advancing Open Large Language Models for Greek | May 19, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| LLM-Based Compact Reranking with Document Features for Scientific Retrieval | May 19, 2025 | Large Language ModelReranking | —Unverified | 0 |
| VocalAgent: Large Language Models for Vocal Health Diagnostics with Safety-Aware Evaluation | May 19, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Optimizing Retrieval Augmented Generation for Object Constraint Language | May 19, 2025 | Large Language ModelObject | —Unverified | 0 |
| HeteroSpec: Leveraging Contextual Heterogeneity for Efficient Speculative Decoding | May 19, 2025 | Large Language Model | —Unverified | 0 |
| R1dacted: Investigating Local Censorship in DeepSeek's R1 Language Model | May 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming | May 19, 2025 | FairnessLarge Language Model | CodeCode Available | 2 |
| ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling | May 19, 2025 | Graph GenerationKnowledge Distillation | —Unverified | 0 |
| The Traitors: Deception and Trust in Multi-Agent Language Model Simulations | May 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding | May 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation | May 19, 2025 | Binary ClassificationDeepFake Detection | CodeCode Available | 1 |
| Structure-Aware Corpus Construction and User-Perception-Aligned Metrics for Large-Language-Model Code Completion | May 19, 2025 | Code CompletionLanguage Modeling | —Unverified | 0 |
| MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO | May 19, 2025 | DecoderImage Generation | CodeCode Available | 0 |
| Geography-Aware Large Language Models for Next POI Recommendation | May 18, 2025 | Large Language Model | —Unverified | 0 |
| LLM-Based User Simulation for Low-Knowledge Shilling Attacks on Recommender Systems | May 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |