| HADA: Human-AI Agent Decision Alignment Architecture | Jun 1, 2025 | AI AgentEthics | —Unverified | 0 |
| Bridging Subjective and Objective QoE: Operator-Level Aggregation Using LLM-Based Comment Analysis and Network MOS Comparison | Jun 1, 2025 | Large Language ModelTime Series Analysis | —Unverified | 0 |
| OG-VLA: 3D-Aware Vision Language Action Model via Orthographic Image Generation | Jun 1, 2025 | Image GenerationLarge Language Model | —Unverified | 0 |
| EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG | Jun 1, 2025 | Contrastive LearningDecoder | —Unverified | 0 |
| Mamba Drafters for Speculative Decoding | Jun 1, 2025 | Large Language ModelMamba | —Unverified | 0 |
| A Large Language Model-Supported Threat Modeling Framework for Transportation Cyber-Physical Systems | Jun 1, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Goal-Aware Identification and Rectification of Misinformation in Multi-Agent Systems | May 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Organizational Adaptation to Generative AI in Cybersecurity: A Systematic Review | May 31, 2025 | Large Language Model | —Unverified | 0 |
| DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments | May 31, 2025 | Large Language Model | CodeCode Available | 1 |
| Translate With Care: Addressing Gender Bias, Neutrality, and Reasoning in Large Language Model Translations | May 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Red Teaming Roadmap Towards System-Level Safety | May 30, 2025 | Large Language ModelRed Teaming | —Unverified | 0 |
| Hierarchical Level-Wise News Article Clustering via Multilingual Matryoshka Embeddings | May 30, 2025 | ArticlesClustering | —Unverified | 0 |
| Artificial Empathy: AI based Mental Health | May 30, 2025 | Large Language Model | —Unverified | 0 |
| RoboMoRe: LLM-based Robot Co-design via Joint Optimization of Morphology and Reward | May 30, 2025 | Large Language Model | —Unverified | 0 |
| un^2CLIP: Improving CLIP's Visual Detail Capturing Ability via Inverting unCLIP | May 30, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| When GPT Spills the Tea: Comprehensive Assessment of Knowledge File Leakage in GPTs | May 30, 2025 | Large Language Model | —Unverified | 0 |
| MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform | May 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Beyond Exponential Decay: Rethinking Error Accumulation in Large Language Models | May 30, 2025 | Large Language Model | —Unverified | 0 |
| A Reward-driven Automated Webshell Malicious-code Generator for Red-teaming | May 30, 2025 | Code GenerationDiversity | —Unverified | 0 |
| SentinelAgent: Graph-based Anomaly Detection in Multi-Agent Systems | May 30, 2025 | Anomaly DetectionLarge Language Model | —Unverified | 0 |
| CREFT: Sequential Multi-Agent LLM for Character Relation Extraction | May 30, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Intuitionistic Fuzzy Sets for Large Language Model Data Annotation: A Novel Approach to Side-by-Side Preference Labeling | May 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | May 30, 2025 | ClassificationDisaster Response | CodeCode Available | 2 |
| Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors | May 30, 2025 | 3D geometryLarge Language Model | CodeCode Available | 0 |
| SALE : Low-bit Estimation for Efficient Sparse Attention in Long-context LLM Prefilling | May 30, 2025 | Large Language Model | CodeCode Available | 0 |
| From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning | May 30, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| HardTests: Synthesizing High-Quality Test Cases for LLM Coding | May 30, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering | May 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis | May 30, 2025 | DiversityLanguage Modeling | CodeCode Available | 0 |
| S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Modelwith Spatio-Temporal Visual Representation | May 30, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation | May 30, 2025 | DiagnosticLanguage Model Evaluation | CodeCode Available | 0 |
| Evaluating the Efficacy of LLM-Based Reasoning for Multiobjective HPC Job Scheduling | May 29, 2025 | Computational EfficiencyFairness | —Unverified | 0 |
| Large Language Model Meets Constraint Propagation | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model-Based Agents for Automated Research Reproducibility: An Exploratory Study in Alzheimer's Disease | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model Compression | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation | May 29, 2025 | Large Language Model | CodeCode Available | 11 |
| LLM Agents Should Employ Security Principles | May 29, 2025 | Large Language Model | —Unverified | 0 |
| Deep Retrieval at CheckThat! 2025: Identifying Scientific Papers from Implicit Social Media Mentions via Hybrid Retrieval and Re-Ranking | May 29, 2025 | Large Language ModelRe-Ranking | —Unverified | 0 |
| SCORPIO: Serving the Right Requests at the Right Time for Heterogeneous SLOs in LLM Inference | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents | May 29, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| Augment or Not? A Comparative Study of Pure and Augmented Large Language Model Recommenders | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Diversity-Aware Policy Optimization for Large Language Model Reasoning | May 29, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Dataset Cartography for Large Language Model Alignment: Mapping and Diagnosing Preference Data | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering | May 29, 2025 | Large Language ModelPrompt Engineering | CodeCode Available | 2 |
| Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness | May 29, 2025 | DiversityLarge Language Model | —Unverified | 0 |
| On-Policy RL with Optimal Reward Baseline | May 29, 2025 | Large Language ModelMathematical Reasoning | —Unverified | 0 |
| Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation | May 29, 2025 | Decision MakingHallucination | —Unverified | 0 |
| BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | May 29, 2025 | Large Language Modelscientific discovery | CodeCode Available | 3 |
| SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents | May 29, 2025 | Adversarial AttackLarge Language Model | CodeCode Available | 1 |