SOTAVerified

Zero-shot Generalization

Papers

Showing 2130 of 572 papers

TitleStatusHype
Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation0
Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer0
OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data SynthesisCode1
Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation0
DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?Code1
Beyond the LUMIR challenge: The pathway to foundational registration modelsCode1
Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache CompressionCode2
ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers0
ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous DrivingCode1
WHISTRESS: Enriching Transcriptions with Sentence Stress Detection0
Show:102550
← PrevPage 3 of 58Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified