SOTAVerified

Zero-shot Generalization

Papers

Showing 26–50 of 572 papers

Title | Status | Hype
RobustSAM: Segment Anything Robustly on Degraded Images | Code | 3
SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction | Code | 3
MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts | Code | 3
IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus | Code | 3
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Code | 3
General Object Foundation Model for Images and Videos at Scale | Code | 3
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting | Code | 3
Separate Anything You Describe | Code | 3
Objaverse-XL: A Universe of 10M+ 3D Objects | Code | 3
What Language Model to Train if You Have One Million GPU Hours? | Code | 3
Expanding Language-Image Pretrained Models for General Video Recognition | Code | 3
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment | Code | 2
WAFT: Warping-Alone Field Transforms for Optical Flow | Code | 2
RecGPT: A Foundation Model for Sequential Recommendation | Code | 2
Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression | Code | 2
Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization | Code | 2
SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation | Code | 2
Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery | Code | 2
Q-Insight: Understanding Image Quality via Visual Reinforcement Learning | Code | 2
Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures | Code | 2
Autoregressive Image Generation with Randomized Parallel Decoding | Code | 2
Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter | Code | 2
Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model | Code | 2
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning | Code | 2
Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GR-MG | Avg. sequence length | 4.04 | | Unverified
2 | MoDE | Avg. sequence length | 4.01 | | Unverified
3 | RoboUniView | Avg. sequence length | 3.65 | | Unverified
4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | | Unverified
5 | GR-1 | Avg. sequence length | 3.06 | | Unverified