| SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics | Jun 2, 2025 | Action Generation, GPU | Code Available | 11 | 5 |
| Large Action Models: From Inception to Implementation | Dec 13, 2024 | Action Generation | Code Available | 9 | 5 |
| Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success | Feb 27, 2025 | Action Generation, Chunking | Code Available | 5 | 5 |
| WorldVLA: Towards Autoregressive Action World Model | Jun 26, 2025 | Action Generation, model | Code Available | 4 | 5 |
| PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models | Feb 2, 2024 | Action Generation, Decision Making | Code Available | 3 | 5 |
| Flow Q-Learning | Feb 4, 2025 | Action Generation, D4RL | Code Available | 3 | 5 |
| AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation | Apr 19, 2024 | Action Generation | Code Available | 3 | 5 |
| Affordance-based Robot Manipulation with Flow Matching | Sep 2, 2024 | Action Generation, Robot Manipulation | Code Available | 3 | 5 |
| Distilling LLM Agent into Small Models with Retrieval and Code Tools | May 23, 2025 | Action Generation, Domain Generalization | Code Available | 3 | 5 |
| AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning | Jun 16, 2025 | Action Generation, Autonomous Driving | Code Available | 3 | 5 |
| Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving | Oct 3, 2023 | Action Generation, Autonomous Driving | Code Available | 2 | 5 |
| Learning Physically Realizable Skills for Online Packing of General 3D Shapes | Dec 5, 2022 | 3D geometry, Action Generation | Code Available | 2 | 5 |
| What Makes a Good Diffusion Planner for Decision Making? | Mar 1, 2025 | Action Generation, Decision Making | Code Available | 2 | 5 |
| Agent models: Internalizing Chain-of-Action Generation into Reasoning models | Mar 9, 2025 | Action Generation, Reinforcement Learning (RL) | Code Available | 2 | 5 |
| Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models | Apr 14, 2025 | Action Generation, Denoising | Code Available | 2 | 5 |
| InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners | Apr 19, 2025 | Action Generation, Logical Reasoning | Code Available | 2 | 5 |
| Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and Trends | Jun 26, 2025 | Action Generation, Vision-Language-Action | Code Available | 2 | 5 |
| LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications | Mar 4, 2025 | Action Generation | Code Available | 2 | 5 |
| AICL: Action In-Context Learning for Video Diffusion Model | Mar 18, 2024 | Action Generation, In-Context Learning | Code Available | 1 | 5 |
| Mini Diffuser: Fast Multi-task Diffusion Policy Training Using Two-level Mini-batches | May 14, 2025 | Action Generation, Image Generation | Code Available | 1 | 5 |
| Structure-Aware Human-Action Generation | Jul 4, 2020 | Action Generation, graph construction | Code Available | 1 | 5 |
| Wonderful Team: Zero-Shot Physical Task Planning with Visual LLMs | Jul 26, 2024 | Action Generation, Large Language Model | Code Available | 1 | 5 |
| Human Action Generation with Generative Adversarial Networks | May 26, 2018 | Action Generation, Generative Adversarial Network | Code Available | 1 | 5 |
| Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games | Jun 5, 2025 | Action Generation, Asynchronous Group Communication | Code Available | 1 | 5 |
| COMMA: Modeling Relationship among Motivations, Emotions and Actions in Language-based Human Activities | Sep 14, 2022 | Action Generation | Code Available | 1 | 5 |
| MUGL: Large Scale Multi Person Conditional Action Generation with Locomotion | Oct 21, 2021 | Action Generation, Diversity | Code Available | 1 | 5 |
| OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis | Jun 4, 2025 | Action Generation, Decision Making | Code Available | 1 | 5 |
| Graph Constrained Reinforcement Learning for Natural Language Action Spaces | Jan 23, 2020 | Action Generation, Natural Language Understanding | Code Available | 1 | 5 |
| Generative Adversarial Graph Convolutional Networks for Human Action Synthesis | Oct 21, 2021 | Action Generation, Disentanglement | Code Available | 1 | 5 |
| LLM-Explorer: Towards Efficient and Affordable LLM-based Exploration for Mobile Apps | May 15, 2025 | Action Generation | Code Available | 1 | 5 |
| Action2Motion: Conditioned Generation of 3D Human Motions | Jul 30, 2020 | Action Generation, Human action generation | Code Available | 1 | 5 |
| Benchmarking Vision, Language, & Action Models on Robotic Learning Tasks | Nov 4, 2024 | Action Generation, Benchmarking | Code Available | 1 | 5 |
| Keep CALM and Explore: Language Models for Action Generation in Text-based Games | Oct 6, 2020 | Action Generation, Language Modeling | Code Available | 1 | 5 |
| EPO: Hierarchical LLM Agents with Environment Preference Optimization | Aug 28, 2024 | Action Generation, Decision Making | Code Available | 1 | 5 |
| ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning | Sep 12, 2023 | Action Generation | Code Available | 1 | 5 |
| Large Language Models for Multi-Robot Systems: A Survey | Feb 6, 2025 | Action Generation, Benchmarking | Code Available | 1 | 5 |
| Translation-based Supervision for Policy Generation in Simultaneous Neural Machine Translation | Nov 1, 2021 | Action Generation, Machine Translation | Code Available | 0 | 5 |
| Dynamic Compositional Graph Convolutional Network for Efficient Composite Human Motion Prediction | Nov 23, 2023 | Action Generation, Human motion prediction | Code Available | 0 | 5 |
| Text Editing as Imitation Game | Oct 21, 2022 | Action Generation, Grammatical Error Correction | Code Available | 0 | 5 |
| Seg2Act: Global Context-aware Action Generation for Document Logical Structuring | Oct 9, 2024 | Action Generation, Transfer Learning | Code Available | 0 | 5 |
| STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization | Jun 4, 2025 | Action Generation, Quantization | Code Available | 0 | 5 |
| Mapping Instructions to Actions in 3D Environments with Visual Goal Prediction | Sep 4, 2018 | Action Generation, Conditional Image Generation | Code Available | 0 | 5 |
| CogIntAc: Modeling the Relationships between Intention, Emotion and Action in Interactive Process from Cognitive Perspective | May 7, 2022 | Action Generation | Code Available | 0 | 5 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action Generation, Decision Making | Code Available | 0 | 5 |
| Learning Diverse Stochastic Human-Action Generators by Learning Smooth Latent Transitions | Dec 21, 2019 | Action Generation, Decoder | Code Available | 0 | 5 |
| Efficient Motion Planning for Automated Lane Change based on Imitation Learning and Mixed-Integer Optimization | Apr 18, 2019 | Action Generation, Autonomous Driving | Code Available | 0 | 5 |
| Language-free Compositional Action Generation via Decoupling Refinement | Jul 7, 2023 | Action Generation | Code Available | 0 | 5 |
| FLAG3D: A 3D Fitness Activity Dataset with Language Instruction | Dec 9, 2022 | Action Generation, Action Recognition | Code Available | 0 | 5 |
| JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialog Policy Learning | Sep 1, 2023 | Action Generation, Diversity | Code Available | 0 | 5 |
| Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion | Apr 29, 2025 | Action Generation, FAD | Unverified | 0 | 0 |