| Learning to navigate by distilling visual information and natural language instructions | Jan 1, 2018 | Navigate, Zero-shot Generalization | —Unverified | 0 |
| Learning to Represent State with Perceptual Schemata | Jun 13, 2021 | Zero-shot Generalization | —Unverified | 0 |
| Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains | Feb 24, 2023 | reinforcement-learning, Reinforcement Learning | —Unverified | 0 |
| LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction | Jun 16, 2025 | Instruction Following, Vision-Language-Action | —Unverified | 0 |
| Light Field Diffusion for Single-View Novel View Synthesis | Sep 20, 2023 | Denoising, Novel View Synthesis | —Unverified | 0 |
| LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias | Oct 22, 2024 | 3DGS, Decoder | —Unverified | 0 |
| Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion | Dec 18, 2024 | Denoising, Depth Completion | —Unverified | 0 |
| MASP: Scalable GNN-based Planning for Multi-Agent Navigation | Dec 5, 2023 | Reinforcement Learning (RL), Zero-shot Generalization | —Unverified | 0 |
| Matching options to tasks using Option-Indexed Hierarchical Reinforcement Learning | Jun 12, 2022 | Continual Learning, Hierarchical Reinforcement Learning | —Unverified | 0 |
| F^2Depth: Self-supervised Indoor Monocular Depth Estimation via Optical Flow Consistency and Feature Map Synthesis | Mar 27, 2024 | Depth Estimation, Indoor Monocular Depth Estimation | —Unverified | 0 |
| SAM^Med: A medical image annotation framework based on large vision model | Jul 11, 2023 | Image Segmentation, Liver Segmentation | —Unverified | 0 |
| Mechanistic Understandings of Representation Vulnerabilities and Engineering Robust Vision Transformers | Feb 7, 2025 | Zero-shot Generalization | —Unverified | 0 |
| MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching | Jan 20, 2025 | Keypoint Detection, Zero-shot Generalization | —Unverified | 0 |
| Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning | Dec 19, 2023 | Diversity, Instruction Following | —Unverified | 0 |
| MmAP : Multi-modal Alignment Prompt for Cross-domain Multi-task Learning | Dec 14, 2023 | Decoder, Language Modelling | —Unverified | 0 |
| Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching | Nov 14, 2024 | Depth Estimation, Knowledge Distillation | —Unverified | 0 |
| Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio | Dec 23, 2024 | Contrastive Learning, Prompt Learning | —Unverified | 0 |
| Multi-View Unsupervised Image Generation with Cross Attention Guidance | Dec 7, 2023 | Hard Attention, Image Generation | —Unverified | 0 |
| Neural Attention Memory | Feb 18, 2023 | Few-Shot Learning, Zero-shot Generalization | —Unverified | 0 |
| Neural Field Dynamics Model for Granular Object Piles Manipulation | Nov 1, 2023 | Object, Zero-shot Generalization | —Unverified | 0 |
| NeuralSCF: Neural network self-consistent fields for density functional theory | Jun 22, 2024 | Zero-shot Generalization | —Unverified | 0 |
| NVSPolicy: Adaptive Novel-View Synthesis for Generalizable Language-Conditioned Policy Learning | May 15, 2025 | Novel View Synthesis, Robot Manipulation | —Unverified | 0 |
| On the Evaluation of Generative Robotic Simulations | Oct 10, 2024 | Diversity, text similarity | —Unverified | 0 |
| On the Out-Of-Distribution Generalization of Multimodal Large Language Models | Feb 9, 2024 | In-Context Learning, Out-of-Distribution Generalization | —Unverified | 0 |
| On the Out-Of-Distribution Generalization of Large Multimodal Models | Jan 1, 2025 | In-Context Learning, Out-of-Distribution Generalization | —Unverified | 0 |