SOTAVerified

Zero-shot Generalization

Papers

Showing 61-70 of 572 papers

| Title | Status | Hype |
|---|---|---|
| RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation | Code | 2 |
| GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models | Code | 2 |
| Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models | Code | 2 |
| On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? | Code | 2 |
| GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis | Code | 2 |
| No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance | Code | 2 |
| Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning | Code | 2 |
| Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model | Code | 2 |
| RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model | Code | 2 |
| Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV | Code | 2 |

Benchmark Results

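| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GR-MG | Avg. sequence length | 4.04 | | Unverified |
| 2 | MoDE | Avg. sequence length | 4.01 | | Unverified |
| 3 | RoboUniView | Avg. sequence length | 3.65 | | Unverified |
| 4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | | Unverified |
| 5 | GR-1 | Avg. sequence length | 3.06 | | Unverified |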