| Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion | Apr 29, 2025 | Action Generation, FAD | Unverified | 0 |
| SPECI: Skill Prompts based Hierarchical Continual Imitation Learning for Robot Manipulation | Apr 22, 2025 | Action Generation, Imitation Learning | Unverified | 0 |
| Modality Selection and Skill Segmentation via Cross-Modality Attention | Apr 20, 2025 | Action Generation, Contact-rich Manipulation | Unverified | 0 |
| InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners | Apr 19, 2025 | Action Generation, Logical Reasoning | Code Available | 2 |
| Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models | Apr 14, 2025 | Action Generation, Denoising | Code Available | 2 |
| A Survey on (M)LLM-Based GUI Agents | Mar 27, 2025 | Action Generation, Information Retrieval | Unverified | 0 |
| LLM Agents That Act Like Us: Accurate Human Behavior Simulation with Real-World Data | Mar 26, 2025 | Action Generation | Unverified | 0 |
| ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation | Mar 25, 2025 | Action Generation, Autonomous Driving | Unverified | 0 |
| Mind with Eyes: from Language Reasoning to Multimodal Reasoning | Mar 23, 2025 | Action Generation, Multimodal Reasoning | Unverified | 0 |
| PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning | Mar 21, 2025 | Action Generation, Motion Generation | Unverified | 0 |
| Diffuse-CLoC: Guided Diffusion for Physics-based Character Look-ahead Control | Mar 14, 2025 | Action Generation, Motion Generation | Unverified | 0 |
| TLA: Tactile-Language-Action Model for Contact-Rich Manipulation | Mar 11, 2025 | Action Generation, Contact-rich Manipulation | Unverified | 0 |
| Agent models: Internalizing Chain-of-Action Generation into Reasoning models | Mar 9, 2025 | Action Generation, Reinforcement Learning (RL) | Code Available | 2 |
| LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications | Mar 4, 2025 | Action Generation | Code Available | 2 |
| FRMD: Fast Robot Motion Diffusion with Consistency-Distilled Movement Primitives for Smooth Action Generation | Mar 3, 2025 | Action Generation, Denoising | Unverified | 0 |
| What Makes a Good Diffusion Planner for Decision Making? | Mar 1, 2025 | Action Generation, Decision Making | Code Available | 2 |
| Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis | Feb 27, 2025 | Action Generation, AI Agent | Unverified | 0 |
| Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success | Feb 27, 2025 | Action Generation, Chunking | Code Available | 5 |
| VDT-Auto: End-to-end Autonomous Driving with VLM-Guided Diffusion Transformers | Feb 27, 2025 | Action Generation, Autonomous Driving | Unverified | 0 |
| Evolution 6.0: Evolving Robotic Capabilities Through Generative Design | Feb 24, 2025 | Action Generation, Text to 3D | Unverified | 0 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action Generation, Decision Making | Code Available | 0 |
| SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning | Feb 21, 2025 | Action Generation, Decoder | Unverified | 0 |
| IMLE Policy: Fast and Sample Efficient Visuomotor Policy Learning via Implicit Maximum Likelihood Estimation | Feb 17, 2025 | Action Generation, Imitation Learning | Unverified | 0 |
| Large Language Models for Multi-Robot Systems: A Survey | Feb 6, 2025 | Action Generation, Benchmarking | Code Available | 1 |
| Flow Q-Learning | Feb 4, 2025 | Action Generation, D4RL | Code Available | 3 |