SOTAVerified

Zero-shot Generalization

Papers

Showing 401-450 of 572 papers

| Title | Status | Hype |
| --- | --- | --- |
| Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer | | 0 |
| Generative Negative Text Replay for Continual Vision-Language Pretraining | | 0 |
| GenM^3: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation | | 0 |
| GHIL-Glue: Hierarchical Control with Filtered Subgoal Images | | 0 |
| Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better | | 0 |
| Gradient Projection For Continual Parameter-Efficient Tuning | | 0 |
| Grounding Language to Entities for Generalization in Reinforcement Learning | | 0 |
| Group Equivariant Conditional Neural Processes | | 0 |
| Habitat Synthetic Scenes Dataset (HSSD-200): An Analysis of 3D Scene Scale and Realism Tradeoffs for ObjectGoal Navigation | | 0 |
| Hamiltonian Graph Networks with ODE Integrators | | 0 |
| HDL-GPT: High-Quality HDL is All You Need | | 0 |
| HEIGHT: Heterogeneous Interaction Graph Transformer for Robot Navigation in Crowded and Constrained Environments | | 0 |
| Helping CLIP See Both the Forest and the Trees: A Decomposition and Description Approach | | 0 |
| Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas | | 0 |
| Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning | | 0 |
| Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions | | 0 |
| IMRL: Integrating Visual, Physical, Temporal, and Geometric Representations for Enhanced Food Acquisition | | 0 |
| In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model | | 0 |
| InCoRo: In-Context Learning for Robotics Control with Feedback Loops | | 0 |
| Inference of Affordances and Active Motor Control in Simulated Agents | | 0 |
| InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining | | 0 |
| Interaction Modeling with Multiplex Attention | | 0 |
| In the Era of Prompt Learning with Vision-Language Models | | 0 |
| I-PHYRE: Interactive Physical Reasoning | | 0 |
| ISCUTE: Instance Segmentation of Cables Using Text Embedding | | 0 |
| Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation | | 0 |
| JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking | | 0 |
| Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation | | 0 |
| Language Models are General-Purpose Interfaces | | 0 |
| Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment | | 0 |
| Large Model Based Referring Camouflaged Object Detection | | 0 |
| Latent Diffusion Model Based Denoising Receiver for 6G Semantic Communication: From Stochastic Differential Theory to Application | | 0 |
| Learning Diverse Bimanual Dexterous Manipulation Skills from Human Demonstrations | | 0 |
| Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models | | 0 |
| Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer | | 0 |
| Learning Symbolic Physics with Graph Networks | | 0 |
| Learning to Compose Hierarchical Object-Centric Controllers for Robotic Manipulation | | 0 |
| Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels | | 0 |
| Learning to navigate by distilling visual information and natural language instructions | | 0 |
| Learning to Represent State with Perceptual Schemata | | 0 |
| Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains | | 0 |
| LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction | | 0 |
| Light Field Diffusion for Single-View Novel View Synthesis | | 0 |
| LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias | | 0 |
| Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion | | 0 |
| MASP: Scalable GNN-based Planning for Multi-Agent Navigation | | 0 |
| Matching options to tasks using Option-Indexed Hierarchical Reinforcement Learning | | 0 |
| F^2Depth: Self-supervised Indoor Monocular Depth Estimation via Optical Flow Consistency and Feature Map Synthesis | | 0 |
| SAM^Med: A medical image annotation framework based on large vision model | | 0 |
| Mechanistic Understandings of Representation Vulnerabilities and Engineering Robust Vision Transformers | | 0 |

Benchmark Results

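| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GR-MG | Avg. sequence length | 4.04 | | Unverified |
| 2 | MoDE | Avg. sequence length | 4.01 | | Unverified |
| 3 | RoboUniView | Avg. sequence length | 3.65 | | Unverified |
| 4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | | Unverified |
| 5 | GR-1 | Avg. sequence length | 3.06 | | Unverified |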