| A Survey on (M)LLM-Based GUI Agents | Mar 27, 2025 | Action GenerationInformation Retrieval | —Unverified | 0 |
| LLM Agents That Act Like Us: Accurate Human Behavior Simulation with Real-World Data | Mar 26, 2025 | Action Generation | —Unverified | 0 |
| ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation | Mar 25, 2025 | Action GenerationAutonomous Driving | —Unverified | 0 |
| Mind with Eyes: from Language Reasoning to Multimodal Reasoning | Mar 23, 2025 | Action GenerationMultimodal Reasoning | —Unverified | 0 |
| PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning | Mar 21, 2025 | Action GenerationMotion Generation | —Unverified | 0 |
| Diffuse-CLoC: Guided Diffusion for Physics-based Character Look-ahead Control | Mar 14, 2025 | Action GenerationMotion Generation | —Unverified | 0 |
| TLA: Tactile-Language-Action Model for Contact-Rich Manipulation | Mar 11, 2025 | Action GenerationContact-rich Manipulation | —Unverified | 0 |
| Agent models: Internalizing Chain-of-Action Generation into Reasoning models | Mar 9, 2025 | Action GenerationReinforcement Learning (RL) | CodeCode Available | 2 |
| LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications | Mar 4, 2025 | Action Generation | CodeCode Available | 2 |
| FRMD: Fast Robot Motion Diffusion with Consistency-Distilled Movement Primitives for Smooth Action Generation | Mar 3, 2025 | Action GenerationDenoising | —Unverified | 0 |