| Automated Skill Discovery for Language Agents through Exploration and Iterative Feedback | Jun 4, 2025 | Large Language Model | Unverified | 0 |
| Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based Systems | Jun 4, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| CogniPair: From LLM Chatbots to Conscious AI Agents -- GNWT-Based Multi-Agent Digital Twins for Social Pairing -- Dating & Hiring Applications | Jun 4, 2025 | Large Language Model | Unverified | 0 |
| MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale | Jun 4, 2025 | Benchmarking, Language Modeling | Unverified | 0 |
| Evaluating Large Language Model Capabilities in Assessing Spatial Econometrics Research | Jun 4, 2025 | counterfactual, Econometrics | Unverified | 0 |
| Evaluating Apple Intelligence's Writing Tools for Privacy Against Large Language Model-Based Inference Attacks: Insights from Early Datasets | Jun 4, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models | Jun 3, 2025 | Decoder, Knowledge Distillation | Unverified | 0 |
| TestAgent: An Adaptive and Intelligent Expert for Human Assessment | Jun 3, 2025 | Large Language Model, Question Selection | Unverified | 0 |
| Adaptive Graph Pruning for Multi-Agent Communication | Jun 3, 2025 | Code Generation, Large Language Model | Code Available | 0 |
| TaxAgent: How Large Language Model Designs Fiscal Policy | Jun 3, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching | Jun 3, 2025 | Data Augmentation, Instruction Following | Unverified | 0 |
| Hybrid AI for Responsive Multi-Turn Online Conversations with Novel Dynamic Routing and Feedback Adaptation | Jun 2, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws | Jun 2, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks | Jun 2, 2025 | Large Language Model, Mathematical Reasoning | Unverified | 0 |
| KDRL: Post-Training Reasoning LLMs via Unified Knowledge Distillation and Reinforcement Learning | Jun 2, 2025 | Knowledge Distillation, Large Language Model | Unverified | 0 |
| PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization | Jun 2, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| From Street Views to Urban Science: Discovering Road Safety Factors with Multimodal Large Language Models | Jun 2, 2025 | Large Language Model, Multimodal Large Language Model | Unverified | 0 |
| MLorc: Momentum Low-rank Compression for Large Language Model Adaptation | Jun 2, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Why Gradients Rapidly Increase Near the End of Training | Jun 2, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Image Generation from Contextually-Contradictory Prompts | Jun 2, 2025 | Denoising, Image Generation | Unverified | 0 |
| LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback | Jun 2, 2025 | Large Language Model | Unverified | 0 |
| COALESCE: Economic and Security Dynamics of Skill-Based Task Outsourcing Among Team of Autonomous LLM Agents | Jun 2, 2025 | GPU, Large Language Model | Unverified | 0 |
| PointT2I: LLM-based text-to-image generation via keypoints | Jun 2, 2025 | Image Generation, Large Language Model | Unverified | 0 |
| EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG | Jun 1, 2025 | Contrastive Learning, Decoder | Unverified | 0 |
| Bridging Subjective and Objective QoE: Operator-Level Aggregation Using LLM-Based Comment Analysis and Network MOS Comparison | Jun 1, 2025 | Large Language Model, Time Series Analysis | Unverified | 0 |
| Mamba Drafters for Speculative Decoding | Jun 1, 2025 | Large Language Model, Mamba | Unverified | 0 |
| OG-VLA: 3D-Aware Vision Language Action Model via Orthographic Image Generation | Jun 1, 2025 | Image Generation, Large Language Model | Unverified | 0 |
| A Large Language Model-Supported Threat Modeling Framework for Transportation Cyber-Physical Systems | Jun 1, 2025 | In-Context Learning, Language Modeling | Unverified | 0 |
| HADA: Human-AI Agent Decision Alignment Architecture | Jun 1, 2025 | AI Agent, Ethics | Unverified | 0 |
| Translate With Care: Addressing Gender Bias, Neutrality, and Reasoning in Large Language Model Translations | May 31, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Goal-Aware Identification and Rectification of Misinformation in Multi-Agent Systems | May 31, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Organizational Adaptation to Generative AI in Cybersecurity: A Systematic Review | May 31, 2025 | Large Language Model | Unverified | 0 |
| SALE : Low-bit Estimation for Efficient Sparse Attention in Long-context LLM Prefilling | May 30, 2025 | Large Language Model | Code Available | 0 |
| Artificial Empathy: AI based Mental Health | May 30, 2025 | Large Language Model | Unverified | 0 |
| From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning | May 30, 2025 | Diversity, Language Modeling | Unverified | 0 |
| A Reward-driven Automated Webshell Malicious-code Generator for Red-teaming | May 30, 2025 | Code Generation, Diversity | Unverified | 0 |
| Hierarchical Level-Wise News Article Clustering via Multilingual Matryoshka Embeddings | May 30, 2025 | Articles, Clustering | Unverified | 0 |
| Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering | May 30, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| A Red Teaming Roadmap Towards System-Level Safety | May 30, 2025 | Large Language Model, Red Teaming | Unverified | 0 |
| SentinelAgent: Graph-based Anomaly Detection in Multi-Agent Systems | May 30, 2025 | Anomaly Detection, Large Language Model | Unverified | 0 |
| FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation | May 30, 2025 | Diagnostic, Language Model Evaluation | Code Available | 0 |
| HardTests: Synthesizing High-Quality Test Cases for LLM Coding | May 30, 2025 | Code Generation, Language Modeling | Unverified | 0 |
| S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation | May 30, 2025 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| RoboMoRe: LLM-based Robot Co-design via Joint Optimization of Morphology and Reward | May 30, 2025 | Large Language Model | Unverified | 0 |
| Beyond Exponential Decay: Rethinking Error Accumulation in Large Language Models | May 30, 2025 | Large Language Model | Unverified | 0 |
| Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors | May 30, 2025 | 3D geometry, Large Language Model | Code Available | 0 |
| CREFT: Sequential Multi-Agent LLM for Character Relation Extraction | May 30, 2025 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| Intuitionistic Fuzzy Sets for Large Language Model Data Annotation: A Novel Approach to Side-by-Side Preference Labeling | May 30, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| When GPT Spills the Tea: Comprehensive Assessment of Knowledge File Leakage in GPTs | May 30, 2025 | Large Language Model | Unverified | 0 |
| MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform | May 30, 2025 | Language Modeling, Language Modelling | Unverified | 0 |