| On the Performance of Multimodal Language Models | Oct 4, 2023 | BenchmarkingBinary Classification | —Unverified | 0 |
| On the Use of Linguistic Features for the Evaluation of Generative Dialogue Systems | Apr 13, 2021 | Task-Oriented Dialogue SystemsZero-shot Generalization | —Unverified | 0 |
| On the Zero-shot Adversarial Robustness of Vision-Language Models: A Truly Zero-shot and Training-free Approach | Jan 1, 2025 | Adversarial RobustnessZero-shot Generalization | —Unverified | 0 |
| On the Zero-Shot Generalization of Machine-Generated Text Detectors | Oct 8, 2023 | Zero-shot Generalization | —Unverified | 0 |
| OpenSU3D: Open World 3D Scene Understanding using Foundation Models | Jul 19, 2024 | Scene UnderstandingSpatial Reasoning | —Unverified | 0 |
| ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling | May 19, 2025 | Graph GenerationKnowledge Distillation | —Unverified | 0 |
| OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction | Mar 5, 2025 | Vision-Language-ActionZero-shot Generalization | —Unverified | 0 |
| Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation | Aug 7, 2024 | Adversarial RobustnessImage Segmentation | —Unverified | 0 |
| PhD Thesis: Exploring the role of (self-)attention in cognitive and computer vision architecture | Jun 26, 2023 | Visual ReasoningZero-shot Generalization | —Unverified | 0 |
| PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment | Dec 18, 2022 | Data AugmentationDialogue Evaluation | —Unverified | 0 |
| PoseLess: Depth-Free Vision-to-Joint Control via Direct Image Mapping with VLM | Mar 10, 2025 | DecoderPose Estimation | —Unverified | 0 |
| From Pixels to Predicates: Learning Symbolic World Models via Pretrained Vision-Language Models | Dec 31, 2024 | Decision MakingZero-shot Generalization | —Unverified | 0 |
| Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization | May 8, 2025 | Object LocalizationWeakly-Supervised Object Localization | —Unverified | 0 |
| Program Guided Agent | May 1, 2020 | MinecraftZero-shot Generalization | —Unverified | 0 |
| Prompt-based Visual Alignment for Zero-shot Policy Transfer | Jun 5, 2024 | Autonomous DrivingLanguage Modelling | —Unverified | 0 |
| PromptSync: Bridging Domain Gaps in Vision-Language Models through Class-Aware Prototype Alignment and Discrimination | Apr 11, 2024 | Contrastive LearningDomain Generalization | —Unverified | 0 |
| RACA: Relation-Aware Credit Assignment for Ad-Hoc Cooperation in Multi-Agent Deep Reinforcement Learning | Jun 2, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| RAILGUN: A Unified Convolutional Policy for Multi-Agent Path Finding Across Different Environments and Tasks | Mar 4, 2025 | Multi-Agent Path FindingZero-shot Generalization | —Unverified | 0 |
| RD-GAN: Few/Zero-Shot Chinese Character Style Transfer via Radical Decomposition and Rendering | Aug 1, 2020 | Style TransferZero-shot Generalization | —Unverified | 0 |
| Real-Time Anomaly Detection and Reactive Planning with Large Language Models | Jul 11, 2024 | Anomaly DetectionAutonomous Vehicles | —Unverified | 0 |
| Reinforcement Learning of Implicit and Explicit Control Flow in Instructions | Feb 25, 2021 | Minecraftreinforcement-learning | —Unverified | 0 |
| Revisiting the Robust Generalization of Adversarial Prompt Tuning | May 18, 2024 | Adversarial RobustnessPrompt Learning | —Unverified | 0 |
| RNG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering | Nov 16, 2021 | Entity LinkingKnowledge Base Question Answering | —Unverified | 0 |
| Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation | Jan 8, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Robot Skill Generalization via Keypoint Integrated Soft Actor-Critic Gaussian Mixture Models | Oct 23, 2023 | Skill GeneralizationZero-shot Generalization | —Unverified | 0 |