SOTAVerified

Zero-shot Generalization

Papers

Showing 126150 of 572 papers

TitleStatusHype
Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action EnvironmentsCode1
Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over QuantityCode1
FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few ImagesCode1
μLO: Compute-Efficient Meta-Generalization of Learned OptimizersCode1
MediViSTA: Medical Video Segmentation via Temporal Fusion SAM Adaptation for EchocardiographyCode1
MatSAM: Efficient Extraction of Microstructures of Materials via Visual Large ModelCode1
MAgNet: Mesh Agnostic Neural PDE SolverCode1
M^2PT: Multimodal Prompt Tuning for Zero-shot Instruction LearningCode1
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot LabelsCode1
Cross-Lingual Disaster-related Multi-label Tweet Classification with Manifold MixupCode1
Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense EncodersCode1
LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation ModelsCode1
Crane: Context-Guided Prompt Learning and Attention Refinement for Zero-Shot Anomaly DetectionsCode1
FRESA:Feedforward Reconstruction of Personalized Skinned Avatars from Few ImagesCode1
Learning Quadrupedal Locomotion over Challenging TerrainCode1
Learning the Travelling Salesperson Problem Requires Rethinking GeneralizationCode1
M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and GenerationCode1
Model Generalization on Text Attribute Graphs: Principles with Large Language ModelsCode1
A Universal Discriminator for Zero-Shot GeneralizationCode1
Contextualize Me -- The Case for Context in Reinforcement LearningCode1
Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion GuidanceCode1
Label Agnostic Pre-training for Zero-shot Text ClassificationCode1
Kick Back & Relax: Learning to Reconstruct the World by Watching SlowTVCode1
A Multi-Task BERT Model for Schema-Guided Dialogue State TrackingCode1
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language ModelsCode1
Show:102550
← PrevPage 6 of 23Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified