| R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner | Jan 1, 2025 | Action Generation, Game of Chess | Unverified | 0 |
| Large Action Models: From Inception to Implementation | Dec 13, 2024 | Action Generation | Code Available | 9 |
| CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction | Dec 9, 2024 | Action Generation, Denoising | Unverified | 0 |
| Towards Next-Generation Medical Agent: How o1 is Reshaping Decision-Making in Medical Scenarios | Nov 16, 2024 | Action Generation, AI Agent | Unverified | 0 |
| Grounding Video Models to Actions through Goal Conditioned Exploration | Nov 11, 2024 | Action Generation, Visual Navigation | Unverified | 0 |
| Benchmarking Vision, Language, & Action Models on Robotic Learning Tasks | Nov 4, 2024 | Action Generation, Benchmarking | Code Available | 1 |
| Seg2Act: Global Context-aware Action Generation for Document Logical Structuring | Oct 9, 2024 | Action Generation, Transfer Learning | Code Available | 0 |
| ActionFlow: Equivariant, Accurate, and Efficient Policies with Spatially Symmetric Flow Matching | Sep 6, 2024 | Action Generation, Spatial Reasoning | Unverified | 0 |
| Affordance-based Robot Manipulation with Flow Matching | Sep 2, 2024 | Action Generation, Robot Manipulation | Code Available | 3 |
| EPO: Hierarchical LLM Agents with Environment Preference Optimization | Aug 28, 2024 | Action Generation, Decision Making | Code Available | 1 |
| General-purpose Clothes Manipulation with Semantic Keypoints | Aug 15, 2024 | Action Generation, Language Modeling | Unverified | 0 |
| Wonderful Team: Zero-Shot Physical Task Planning with Visual LLMs | Jul 26, 2024 | Action Generation, Large Language Model | Code Available | 1 |
| Robots Can Multitask Too: Integrating a Memory Architecture and LLMs for Enhanced Cross-Task Robot Action Generation | Jul 18, 2024 | Action Generation, Common Sense Reasoning | Unverified | 0 |
| Retrieval-Augmented Code Generation for Situated Action Generation: A Case Study on Minecraft | Jun 25, 2024 | Action Generation, Code Generation | Unverified | 0 |
| Introducing Brain-like Concepts to Embodied Hand-crafted Dialog Management System | Jun 13, 2024 | Action Generation, Chatbot | Unverified | 0 |
| T2LM: Long-Term 3D Human Motion Generation from Multiple Sentences | Jun 2, 2024 | Action Generation, Decoder | Unverified | 0 |
| AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation | Apr 19, 2024 | Action Generation | Code Available | 3 |
| ITCMA: A Generative Agent Based on a Computational Consciousness Structure | Mar 29, 2024 | Action Generation, Common Sense Reasoning | Unverified | 0 |
| AICL: Action In-Context Learning for Video Diffusion Model | Mar 18, 2024 | Action Generation, In-Context Learning | Code Available | 1 |
| Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning | Feb 28, 2024 | Action Generation, Multi-agent Reinforcement Learning | Unverified | 0 |
| Return-Aligned Decision Transformer | Feb 6, 2024 | Action Generation | Unverified | 0 |
| PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models | Feb 2, 2024 | Action Generation, Decision Making | Code Available | 3 |
| Active Generation Network of Human Skeleton for Action Recognition | Jan 30, 2024 | Action Generation, Action Recognition | Unverified | 0 |
| AssistGUI: Task-Oriented PC Graphical User Interface Automation | Jan 1, 2024 | Action Generation, Language Modeling | Unverified | 0 |
| Large Language Models Empowered Agent-based Modeling and Simulation: A Survey and Perspectives | Dec 19, 2023 | Action Generation, Language Modeling | Unverified | 0 |