| SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics | Jun 2, 2025 | Action GenerationGPU | CodeCode Available | 11 |
| Large Action Models: From Inception to Implementation | Dec 13, 2024 | Action Generation | CodeCode Available | 9 |
| Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success | Feb 27, 2025 | Action GenerationChunking | CodeCode Available | 5 |
| WorldVLA: Towards Autoregressive Action World Model | Jun 26, 2025 | Action Generationmodel | CodeCode Available | 4 |
| PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models | Feb 2, 2024 | Action GenerationDecision Making | CodeCode Available | 3 |
| Distilling LLM Agent into Small Models with Retrieval and Code Tools | May 23, 2025 | Action GenerationDomain Generalization | CodeCode Available | 3 |
| AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning | Jun 16, 2025 | Action GenerationAutonomous Driving | CodeCode Available | 3 |
| Affordance-based Robot Manipulation with Flow Matching | Sep 2, 2024 | Action GenerationRobot Manipulation | CodeCode Available | 3 |
| Flow Q-Learning | Feb 4, 2025 | Action GenerationD4RL | CodeCode Available | 3 |
| AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation | Apr 19, 2024 | Action Generation | CodeCode Available | 3 |
| Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and Trends | Jun 26, 2025 | Action GenerationVision-Language-Action | CodeCode Available | 2 |
| What Makes a Good Diffusion Planner for Decision Making? | Mar 1, 2025 | Action GenerationDecision Making | CodeCode Available | 2 |
| Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving | Oct 3, 2023 | Action GenerationAutonomous Driving | CodeCode Available | 2 |
| Agent models: Internalizing Chain-of-Action Generation into Reasoning models | Mar 9, 2025 | Action GenerationReinforcement Learning (RL) | CodeCode Available | 2 |
| Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models | Apr 14, 2025 | Action GenerationDenoising | CodeCode Available | 2 |
| Learning Physically Realizable Skills for Online Packing of General 3D Shapes | Dec 5, 2022 | 3D geometryAction Generation | CodeCode Available | 2 |
| LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications | Mar 4, 2025 | Action Generation | CodeCode Available | 2 |
| InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners | Apr 19, 2025 | Action GenerationLogical Reasoning | CodeCode Available | 2 |
| AICL: Action In-Context Learning for Video Diffusion Model | Mar 18, 2024 | Action GenerationIn-Context Learning | CodeCode Available | 1 |
| Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games | Jun 5, 2025 | Action GenerationAsynchronous Group Communication | CodeCode Available | 1 |
| Human Action Generation with Generative Adversarial Networks | May 26, 2018 | Action GenerationGenerative Adversarial Network | CodeCode Available | 1 |
| COMMA: Modeling Relationship among Motivations, Emotions and Actions in Language-based Human Activities | Sep 14, 2022 | Action Generation | CodeCode Available | 1 |
| Mini Diffuser: Fast Multi-task Diffusion Policy Training Using Two-level Mini-batches | May 14, 2025 | Action GenerationImage Generation | CodeCode Available | 1 |
| Wonderful Team: Zero-Shot Physical Task Planning with Visual LLMs | Jul 26, 2024 | Action GenerationLarge Language Model | CodeCode Available | 1 |
| Structure-Aware Human-Action Generation | Jul 4, 2020 | Action Generationgraph construction | CodeCode Available | 1 |
| Graph Constrained Reinforcement Learning for Natural Language Action Spaces | Jan 23, 2020 | Action GenerationNatural Language Understanding | CodeCode Available | 1 |
| Generative Adversarial Graph Convolutional Networks for Human Action Synthesis | Oct 21, 2021 | Action GenerationDisentanglement | CodeCode Available | 1 |
| OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis | Jun 4, 2025 | Action GenerationDecision Making | CodeCode Available | 1 |
| Large Language Models for Multi-Robot Systems: A Survey | Feb 6, 2025 | Action GenerationBenchmarking | CodeCode Available | 1 |
| LLM-Explorer: Towards Efficient and Affordable LLM-based Exploration for Mobile Apps | May 15, 2025 | Action Generation | CodeCode Available | 1 |
| Benchmarking Vision, Language, & Action Models on Robotic Learning Tasks | Nov 4, 2024 | Action GenerationBenchmarking | CodeCode Available | 1 |
| Keep CALM and Explore: Language Models for Action Generation in Text-based Games | Oct 6, 2020 | Action GenerationLanguage Modeling | CodeCode Available | 1 |
| EPO: Hierarchical LLM Agents with Environment Preference Optimization | Aug 28, 2024 | Action GenerationDecision Making | CodeCode Available | 1 |
| ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning | Sep 12, 2023 | Action Generation | CodeCode Available | 1 |
| MUGL: Large Scale Multi Person Conditional Action Generation with Locomotion | Oct 21, 2021 | Action GenerationDiversity | CodeCode Available | 1 |
| Action2Motion: Conditioned Generation of 3D Human Motions | Jul 30, 2020 | Action GenerationHuman action generation | CodeCode Available | 1 |
| Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion | Apr 29, 2025 | Action GenerationFAD | —Unverified | 0 |
| A Survey on (M)LLM-Based GUI Agents | Mar 27, 2025 | Action GenerationInformation Retrieval | —Unverified | 0 |
| Active Generation Network of Human Skeleton for Action Recognition | Jan 30, 2024 | Action GenerationAction Recognition | —Unverified | 0 |
| Actions Generation from Captions | Feb 14, 2019 | Action GenerationGenerative Adversarial Network | —Unverified | 0 |
| Distilled Thompson Sampling: Practical and Efficient Thompson Sampling via Imitation Learning | Nov 29, 2020 | Action GenerationDecision Making | —Unverified | 0 |
| A Survey on GUI Agents with Foundation Models Enhanced by Reinforcement Learning | Apr 29, 2025 | Action GenerationPrompt Engineering | —Unverified | 0 |
| Diffuse-CLoC: Guided Diffusion for Physics-based Character Look-ahead Control | Mar 14, 2025 | Action GenerationMotion Generation | —Unverified | 0 |
| Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning | Feb 28, 2024 | Action GenerationMulti-agent Reinforcement Learning | —Unverified | 0 |
| Context-aware taxi dispatching at city-scale using deep reinforcement learning | May 26, 2021 | Action GenerationDeep Reinforcement Learning | —Unverified | 0 |
| AssistGUI: Task-Oriented PC Graphical User Interface Automation | Jan 1, 2024 | Action GenerationLanguage Modeling | —Unverified | 0 |
| ITCMA: A Generative Agent Based on a Computational Consciousness Structure | Mar 29, 2024 | Action GenerationCommon Sense Reasoning | —Unverified | 0 |
| H^3DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning | May 12, 2025 | Action Generation | —Unverified | 0 |
| IMLE Policy: Fast and Sample Efficient Visuomotor Policy Learning via Implicit Maximum Likelihood Estimation | Feb 17, 2025 | Action GenerationImitation Learning | —Unverified | 0 |
| Hierarchical Instruction-aware Embodied Visual Tracking | May 27, 2025 | Action GenerationPosition | —Unverified | 0 |