SOTAVerified

Zero-shot Generalization

Papers

Showing 41-50 of 572 papers

| Title | Status | Hype |
| --- | --- | --- |
| Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning | Code | 2 |
| Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model | Code | 2 |
| Detecting Everything in the Open World: Towards Universal Object Detection | Code | 2 |
| Multitask Prompted Training Enables Zero-Shot Task Generalization | Code | 2 |
| BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities | Code | 2 |
| DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment | Code | 2 |
| BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing | Code | 2 |
| No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance | Code | 2 |
| Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery | Code | 2 |
| LLM+P: Empowering Large Language Models with Optimal Planning Proficiency | Code | 2 |

Benchmark Results

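| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GR-MG | Avg. sequence length | 4.04 | | Unverified |
| 2 | MoDE | Avg. sequence length | 4.01 | | Unverified |
| 3 | RoboUniView | Avg. sequence length | 3.65 | | Unverified |
| 4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | | Unverified |
| 5 | GR-1 | Avg. sequence length | 3.06 | | Unverified |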