SOTAVerified

Zero-shot Generalization

Papers

Showing 301-325 of 572 papers

Title | Status | Hype
Zero-Shot Monocular Scene Flow Estimation in the Wild | - | 0
StereoGen: High-quality Stereo Image Generation from a Single Image | - | 0
Capability-Aware Shared Hypernetworks for Flexible Heterogeneous Multi-Robot Coordination | Code | 0
Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation | - | 0
MADation: Face Morphing Attack Detection with Foundation Models | Code | 0
Spot Risks Before Speaking! Unraveling Safety Attention Heads in Large Vision-Language Models | Code | 0
On the Zero-shot Adversarial Robustness of Vision-Language Models: A Truly Zero-shot and Training-free Approach | - | 0
On the Out-Of-Distribution Generalization of Large Multimodal Models | - | 0
From Pixels to Predicates: Learning Symbolic World Models via Pretrained Vision-Language Models | - | 0
EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation | - | 0
Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio | - | 0
Towards Graph Foundation Models: Learning Generalities Across Graphs via Task-Trees | Code | 0
Zero-Shot Generalization for Blockage Localization in mmWave Communication | - | 0
Efficient Fine-Tuning of Single-Cell Foundation Models Enables Zero-Shot Molecular Perturbation Prediction | - | 0
Memorizing SAM: 3D Medical Segment Anything Model with Memorizing Transformer | Code | 0
Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion | - | 0
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | - | 0
WiFo: Wireless Foundation Model for Channel Prediction | - | 0
Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion | Code | 0
Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models | - | 0
ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning | Code | 0
S^3: Synonymous Semantic Space for Improving Zero-Shot Generalization of Vision-Language Models | - | 0
CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance | - | 0
UTSD: Unified Time Series Diffusion Model | - | 0
The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GR-MG | Avg. sequence length | 4.04 | - | Unverified
2 | MoDE | Avg. sequence length | 4.01 | - | Unverified
3 | RoboUniView | Avg. sequence length | 3.65 | - | Unverified
4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | - | Unverified
5 | GR-1 | Avg. sequence length | 3.06 | - | Unverified