SOTAVerified

Zero-shot Generalization

Papers

Showing 3140 of 572 papers

TitleStatusHype
General Object Foundation Model for Images and Videos at ScaleCode3
RobustSAM: Segment Anything Robustly on Degraded ImagesCode3
Expanding Language-Image Pretrained Models for General Video RecognitionCode3
MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-ExpertsCode3
DEFOM-Stereo: Depth Foundation Model Based Stereo MatchingCode3
Objaverse-XL: A Universe of 10M+ 3D ObjectsCode3
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal AlignmentCode2
Detecting Everything in the Open World: Towards Universal Object DetectionCode2
LLM+P: Empowering Large Language Models with Optimal Planning ProficiencyCode2
Collaborative Decoding Makes Visual Auto-Regressive Modeling EfficientCode2
Show:102550
← PrevPage 4 of 58Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified