| MMRo: Are Multimodal LLMs Eligible as the Brain for In-Home Robotics? | Jun 28, 2024 | Task PlanningVisual Reasoning | —Unverified | 0 |
| DKPROMPT: Domain Knowledge Prompting Vision-Language Models for Open-World Planning | Jun 25, 2024 | Robot Task PlanningTask Planning | —Unverified | 0 |
| Automating Transfer of Robot Task Plans using Functorial Data Migrations | Jun 22, 2024 | Task Planning | —Unverified | 0 |
| Diffusion-Based Failure Sampling for Evaluating Safety-Critical Autonomous Systems | Jun 20, 2024 | DenoisingTask Planning | CodeCode Available | 0 |
| Embodied Instruction Following in Unknown Environments | Jun 17, 2024 | Instruction FollowingTask Planning | —Unverified | 0 |
| Details Make a Difference: Object State-Sensitive Neurorobotic Task Planning | Jun 14, 2024 | Dense CaptioningObject | CodeCode Available | 0 |
| DAG-Plan: Generating Directed Acyclic Dependency Graphs for Dual-Arm Cooperative Planning | Jun 14, 2024 | Task Planning | —Unverified | 0 |
| RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent | Jun 11, 2024 | AI AgentDescriptive | CodeCode Available | 2 |
| NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security | Jun 8, 2024 | Task PlanningVulnerability Detection | CodeCode Available | 11 |
| Tool-Planner: Task Planning with Clusters across Multiple Tools | Jun 6, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 |