SOTAVerified

Zero-shot Generalization

Papers

Showing 151–160 of 572 papers

| Title | Status | Hype |
|---|---|---|
| S^3: Synonymous Semantic Space for Improving Zero-Shot Generalization of Vision-Language Models | | 0 |
| Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail | Code | 3 |
| CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance | | 0 |
| UTSD: Unified Time Series Diffusion Model | | 0 |
| The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control | | 0 |
| COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detection | Code | 1 |
| Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient | Code | 2 |
| Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis | | 0 |
| vesselFM: A Foundation Model for Universal 3D Blood Vessel Segmentation | Code | 2 |
| Style-Pro: Style-Guided Prompt Learning for Generalizable Vision-Language Models | | 0 |

Benchmark Results

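| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GR-MG | Avg. sequence length | 4.04 | | Unverified |
| 2 | MoDE | Avg. sequence length | 4.01 | | Unverified |
| 3 | RoboUniView | Avg. sequence length | 3.65 | | Unverified |
| 4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | | Unverified |
| 5 | GR-1 | Avg. sequence length | 3.06 | | Unverified |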