| SALE: Low-bit Estimation for Efficient Sparse Attention in Long-context LLM Prefilling | May 30, 2025 | Large Language Model | Code Available | 0 |
| From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning | May 30, 2025 | Diversity, Language Modeling | Unverified | 0 |
| HardTests: Synthesizing High-Quality Test Cases for LLM Coding | May 30, 2025 | Code Generation, Language Modeling | Unverified | 0 |
| Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering | May 30, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis | May 30, 2025 | Diversity, Language Modeling | Code Available | 0 |
| S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation | May 30, 2025 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation | May 30, 2025 | Diagnostic, Language Model Evaluation | Code Available | 0 |
| Evaluating the Efficacy of LLM-Based Reasoning for Multiobjective HPC Job Scheduling | May 29, 2025 | Computational Efficiency, Fairness | Unverified | 0 |
| Large Language Model Meets Constraint Propagation | May 29, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model-Based Agents for Automated Research Reproducibility: An Exploratory Study in Alzheimer's Disease | May 29, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model Compression | May 29, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation | May 29, 2025 | Large Language Model | Code Available | 11 |
| LLM Agents Should Employ Security Principles | May 29, 2025 | Large Language Model | Unverified | 0 |
| Deep Retrieval at CheckThat! 2025: Identifying Scientific Papers from Implicit Social Media Mentions via Hybrid Retrieval and Re-Ranking | May 29, 2025 | Large Language Model, Re-Ranking | Unverified | 0 |
| SCORPIO: Serving the Right Requests at the Right Time for Heterogeneous SLOs in LLM Inference | May 29, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents | May 29, 2025 | Decision Making, Language Modeling | Code Available | 0 |
| Augment or Not? A Comparative Study of Pure and Augmented Large Language Model Recommenders | May 29, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Diversity-Aware Policy Optimization for Large Language Model Reasoning | May 29, 2025 | Diversity, Language Modeling | Unverified | 0 |
| Dataset Cartography for Large Language Model Alignment: Mapping and Diagnosing Preference Data | May 29, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering | May 29, 2025 | Large Language Model, Prompt Engineering | Code Available | 2 |
| Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness | May 29, 2025 | Diversity, Large Language Model | Unverified | 0 |
| On-Policy RL with Optimal Reward Baseline | May 29, 2025 | Large Language Model, Mathematical Reasoning | Unverified | 0 |
| Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation | May 29, 2025 | Decision Making, Hallucination | Unverified | 0 |
| BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | May 29, 2025 | Large Language Model, scientific discovery | Code Available | 3 |
| SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents | May 29, 2025 | Adversarial Attack, Large Language Model | Code Available | 1 |