| SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies | Jun 17, 2021 | Autonomous DrivingImage Augmentation | CodeCode Available | 1 |
| What Can I Do Here? Learning New Skills by Imagining Visual Affordances | Jun 1, 2021 | Zero-shot Generalization | CodeCode Available | 1 |
| Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team Composition | May 18, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| ZePHyR: Zero-shot Pose Hypothesis Rating | Apr 28, 2021 | Motion PlanningPose Estimation | CodeCode Available | 1 |
| NaturalProofs: Mathematical Theorem Proving in Natural Language | Mar 24, 2021 | Automated Theorem ProvingDomain Generalization | CodeCode Available | 1 |
| Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning | Jan 19, 2021 | reinforcement-learningReinforcement Learning (RL) | CodeCode Available | 1 |
| Generalization to New Actions in Reinforcement Learning | Nov 3, 2020 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning | Oct 26, 2020 | ClusteringModel-based Reinforcement Learning | CodeCode Available | 1 |
| STAR: A Schema-Guided Dialog Dataset for Transfer Learning | Oct 22, 2020 | Transfer LearningZero-shot Generalization | CodeCode Available | 1 |
| Learning Quadrupedal Locomotion over Challenging Terrain | Oct 21, 2020 | Zero-shot Generalization | CodeCode Available | 1 |
| An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels | Oct 4, 2020 | Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION | CodeCode Available | 1 |
| Symbolic Relational Deep Reinforcement Learning based on Graph Neural Networks and Autoregressive Policy Decomposition | Sep 25, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| The Scattering Compositional Learner: Discovering Objects, Attributes, Relationships in Analogical Reasoning | Jul 8, 2020 | Zero-shot Generalization | CodeCode Available | 1 |
| Cross-Lingual Disaster-related Multi-label Tweet Classification with Manifold Mixup | Jul 1, 2020 | General ClassificationManagement | CodeCode Available | 1 |
| Learning the Travelling Salesperson Problem Requires Rethinking Generalization | Jun 12, 2020 | Combinatorial OptimizationTransfer Learning | CodeCode Available | 1 |
| Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas | Jun 1, 2020 | MinecraftMulti-Task Learning | CodeCode Available | 1 |
| Schema-Guided Dialogue State Tracking Task at DSTC8 | Feb 2, 2020 | Data AugmentationDialogue State Tracking | CodeCode Available | 1 |
| Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset | Sep 12, 2019 | 16kDialogue State Tracking | CodeCode Available | 1 |
| Generalization without systematicity: On the compositional skills of sequence-to-sequence recurrent networks | Oct 31, 2017 | Machine TranslationTranslation | CodeCode Available | 1 |
| Zero-Shot Relation Extraction via Reading Comprehension | Jun 13, 2017 | Reading ComprehensionRelation | CodeCode Available | 1 |
| SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation | Jul 16, 2025 | Boundary DetectionPseudo Label | —Unverified | 0 |
| Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation | Jul 15, 2025 | 3D ReconstructionAutonomous Driving | —Unverified | 0 |
| PoseLLM: Enhancing Language-Guided Human Pose Estimation with MLP Alignment | Jul 12, 2025 | Large Language ModelPose Estimation | CodeCode Available | 0 |
| Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data | Jul 9, 2025 | Motion GenerationZero-shot Generalization | CodeCode Available | 0 |
| Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models | Jul 8, 2025 | Future predictionLarge Language Model | —Unverified | 0 |
| Helping CLIP See Both the Forest and the Trees: A Decomposition and Description Approach | Jul 4, 2025 | AttributeContrastive Learning | —Unverified | 0 |
| RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather | Jul 2, 2025 | DenoisingDepth Estimation | —Unverified | 0 |
| TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design | Jun 24, 2025 | Deep Reinforcement LearningZero-shot Generalization | CodeCode Available | 0 |
| VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy | Jun 17, 2025 | Decision MakingSemantic Segmentation | —Unverified | 0 |
| LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction | Jun 16, 2025 | Instruction FollowingVision-Language-Action | —Unverified | 0 |
| Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling | Jun 13, 2025 | DecoderImage Segmentation | CodeCode Available | 0 |
| DEAL: Disentangling Transformer Head Activations for LLM Steering | Jun 10, 2025 | Binary ClassificationZero-shot Generalization | —Unverified | 0 |
| ZeroVO: Visual Odometry with Minimal Assumptions | Jun 9, 2025 | Autonomous DrivingCamera Calibration | —Unverified | 0 |
| Deep Equivariant Multi-Agent Control Barrier Functions | Jun 9, 2025 | Robot NavigationZero-shot Generalization | —Unverified | 0 |
| CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray | Jun 9, 2025 | ClassificationDiagnostic | —Unverified | 0 |
| Latent Diffusion Model Based Denoising Receiver for 6G Semantic Communication: From Stochastic Differential Theory to Application | Jun 6, 2025 | DenoisingSemantic Communication | —Unverified | 0 |
| Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation | Jun 5, 2025 | Zero-shot Generalization | —Unverified | 0 |
| Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer | Jun 5, 2025 | 3DGSDataset Generation | —Unverified | 0 |
| Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation | Jun 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers | May 26, 2025 | cross-modal alignmentPosition | —Unverified | 0 |
| WHISTRESS: Enriching Transcriptions with Sentence Stress Detection | May 25, 2025 | SentenceZero-shot Generalization | —Unverified | 0 |
| G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning | May 24, 2025 | Link PredictionNode Classification | —Unverified | 0 |
| Anchored Diffusion Language Model | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning | May 22, 2025 | Zero-shot Generalization | —Unverified | 0 |
| EasyInsert: A Data-Efficient and Generalizable Insertion Policy | May 22, 2025 | Pose PredictionZero-shot Generalization | —Unverified | 0 |
| Prompt Tuning Vision Language Models with Margin Regularizer for Few-Shot Learning under Distribution Shifts | May 21, 2025 | Few-Shot LearningTask 2 | CodeCode Available | 0 |
| AnyBody: A Benchmark Suite for Cross-Embodiment Manipulation | May 21, 2025 | Zero-shot Generalization | —Unverified | 0 |
| gen2seg: Generative Models Enable Generalizable Instance Segmentation | May 21, 2025 | DecoderInstance Segmentation | —Unverified | 0 |
| EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy | May 21, 2025 | Motion PlanningVision-Language-Action | —Unverified | 0 |
| ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling | May 19, 2025 | Graph GenerationKnowledge Distillation | —Unverified | 0 |