SOTAVerified

Zero-shot Generalization

Papers

Showing 101150 of 572 papers

TitleStatusHype
One-Prompt to Segment All Medical ImagesCode1
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon PredictionCode1
OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data SynthesisCode1
DePT: Decoupled Prompt TuningCode1
Boosting Segment Anything Model Towards Open-Vocabulary LearningCode1
Label Agnostic Pre-training for Zero-shot Text ClassificationCode1
Delving into Out-of-Distribution Detection with Medical Vision-Language ModelsCode1
Large Language Models are Good Prompt Learners for Low-Shot Image ClassificationCode1
OW-OVD: Unified Open World and Open Vocabulary Object DetectionCode1
Kick Back & Relax: Learning to Reconstruct the World by Watching SlowTVCode1
Deeply Coupled Cross-Modal Prompt LearningCode1
IRanker: Towards Ranking Foundation ModelCode1
Visual Grounding for Object-Level Generalization in Reinforcement LearningCode1
PartDistillation: Learning Parts From Instance SegmentationCode1
Beyond the LUMIR challenge: The pathway to foundational registration modelsCode1
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language ModelsCode1
Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image SegmentationCode1
Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial RobustnessCode1
RnG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question AnsweringCode1
Neural Disparity Refinement for Arbitrary Resolution StereoCode1
Neural-Logic Human-Object Interaction DetectionCode1
Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action EnvironmentsCode1
Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot GeneralizationCode1
DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?Code1
LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation ModelsCode1
M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and GenerationCode1
Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over QuantityCode1
Improving Zero-Shot Object-Level Change Detection by Incorporating Visual CorrespondenceCode1
Nature-Inspired Population-Based Evolution of Large Language ModelsCode1
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction FollowingCode1
Improving Zero-Shot Generalization for CLIP with Synthesized PromptsCode1
NaturalProofs: Mathematical Theorem Proving in Natural LanguageCode1
Improving Diffusion Models for Scene Text Editing with Dual EncodersCode1
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot LabelsCode1
Cross-Lingual Disaster-related Multi-label Tweet Classification with Manifold MixupCode1
Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense EncodersCode1
MuRF: Multi-Baseline Radiance FieldsCode1
Crane: Context-Guided Prompt Learning and Attention Refinement for Zero-Shot Anomaly DetectionsCode1
Improving Zero-shot Generalization and Robustness of Multi-modal ModelsCode1
Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility PredictionCode1
Multi-Task Learning for Routing Problem with Cross-Problem Zero-Shot GeneralizationCode1
A Universal Discriminator for Zero-Shot GeneralizationCode1
Contextualize Me -- The Case for Context in Reinforcement LearningCode1
Grounding Language to Entities and Dynamics for Generalization in Reinforcement LearningCode1
Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion GuidanceCode1
Exploring the Best Practices of Query Expansion with Large Language ModelsCode1
M^2PT: Multimodal Prompt Tuning for Zero-shot Instruction LearningCode1
A Multi-Task BERT Model for Schema-Guided Dialogue State TrackingCode1
How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary InvestigationCode1
Model Generalization on Text Attribute Graphs: Principles with Large Language ModelsCode1
Show:102550
← PrevPage 3 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified