SOTAVerified

Zero-shot Generalization

Papers

Showing 4150 of 572 papers

TitleStatusHype
Exploring the Limits of Vision-Language-Action Manipulations in Cross-task GeneralizationCode2
SAM2MOT: A Novel Paradigm of Multi-Object Tracking by SegmentationCode2
Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite ImageryCode2
Q-Insight: Understanding Image Quality via Visual Reinforcement LearningCode2
Bokehlicious: Photorealistic Bokeh Rendering with Controllable AperturesCode2
Autoregressive Image Generation with Randomized Parallel DecodingCode2
Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in ClutterCode2
Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language ModelCode2
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask LearningCode2
Collaborative Decoding Makes Visual Auto-Regressive Modeling EfficientCode2
Show:102550
← PrevPage 5 of 58Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified