| The Cost of Dynamic Reasoning: Demystifying AI Agents and Test-Time Scaling from an AI Infrastructure Perspective | Jun 4, 2025 | AI AgentLarge Language Model | —Unverified | 0 |
| MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching | Jun 3, 2025 | Data AugmentationInstruction Following | —Unverified | 0 |
| A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning | Jun 3, 2025 | Decision MakingDiagnostic | CodeCode Available | 3 |
| TaxAgent: How Large Language Model Designs Fiscal Policy | Jun 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Adaptive Graph Pruning for Multi-Agent Communication | Jun 3, 2025 | Code GenerationLarge Language Model | CodeCode Available | 0 |
| TestAgent: An Adaptive and Intelligent Expert for Human Assessment | Jun 3, 2025 | Large Language ModelQuestion Selection | —Unverified | 0 |
| CyberGym: Evaluating AI Agents' Cybersecurity Capabilities with Real-World Vulnerabilities at Scale | Jun 3, 2025 | Large Language Model | CodeCode Available | 2 |
| TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models | Jun 3, 2025 | DecoderKnowledge Distillation | —Unverified | 0 |
| LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback | Jun 2, 2025 | Large Language Model | —Unverified | 0 |
| PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hybrid AI for Responsive Multi-Turn Online Conversations with Novel Dynamic Routing and Feedback Adaptation | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Why Gradients Rapidly Increase Near the End of Training | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KDRL: Post-Training Reasoning LLMs via Unified Knowledge Distillation and Reinforcement Learning | Jun 2, 2025 | Knowledge DistillationLarge Language Model | —Unverified | 0 |
| PointT2I: LLM-based text-to-image generation via keypoints | Jun 2, 2025 | Image GenerationLarge Language Model | —Unverified | 0 |
| WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks | Jun 2, 2025 | Large Language ModelMathematical Reasoning | —Unverified | 0 |
| ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding | Jun 2, 2025 | 3D GenerationLarge Language Model | CodeCode Available | 4 |
| MLorc: Momentum Low-rank Compression for Large Language Model Adaptation | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning | Jun 2, 2025 | AI AgentDiversity | CodeCode Available | 5 |
| Compiler Optimization via LLM Reasoning for Efficient Model Serving | Jun 2, 2025 | Compiler OptimizationLarge Language Model | CodeCode Available | 2 |
| Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning | Jun 2, 2025 | Fact VerificationLanguage Modeling | CodeCode Available | 2 |
| COALESCE: Economic and Security Dynamics of Skill-Based Task Outsourcing Among Team of Autonomous LLM Agents | Jun 2, 2025 | GPULarge Language Model | —Unverified | 0 |
| Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws | Jun 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| From Street Views to Urban Science: Discovering Road Safety Factors with Multimodal Large Language Models | Jun 2, 2025 | Large Language ModelMultimodal Large Language Model | —Unverified | 0 |
| Image Generation from Contextually-Contradictory Prompts | Jun 2, 2025 | DenoisingImage Generation | —Unverified | 0 |
| OG-VLA: 3D-Aware Vision Language Action Model via Orthographic Image Generation | Jun 1, 2025 | Image GenerationLarge Language Model | —Unverified | 0 |