SOTAVerified

Zero-shot Generalization

Papers

Showing 151200 of 572 papers

TitleStatusHype
Crane: Context-Guided Prompt Learning and Attention Refinement for Zero-Shot Anomaly DetectionsCode1
FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few ImagesCode1
Large Language Models are Good Prompt Learners for Low-Shot Image ClassificationCode1
FRESA:Feedforward Reconstruction of Personalized Skinned Avatars from Few ImagesCode1
Foundation Models Knowledge Distillation For Battery Capacity Degradation ForecastCode1
SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth EstimationCode1
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot GeneralizationCode1
FluoroSAM: A Language-aligned Foundation Model for X-ray Image SegmentationCode1
A Universal Discriminator for Zero-Shot GeneralizationCode1
Kick Back & Relax: Learning to Reconstruct the World by Watching SlowTVCode1
Contextualize Me -- The Case for Context in Reinforcement LearningCode1
Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image SegmentationCode1
ScaleFlow++: Robust and Accurate Estimation of 3D Motion from VideoCode1
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language ModelsCode1
FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical ImagesCode1
Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion GuidanceCode1
IRanker: Towards Ranking Foundation ModelCode1
SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp SegmentationCode1
ScaleFlow++: Robust and Accurate Estimation of 3D Motion from VideoCode1
A Multi-Task BERT Model for Schema-Guided Dialogue State TrackingCode1
Digital Twin-Enhanced Wireless Indoor Navigation: Achieving Efficient Environment Sensing with Zero-Shot Reinforcement LearningCode1
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot LabelsCode1
Generalization to New Actions in Reinforcement LearningCode1
Generalization without systematicity: On the compositional skills of sequence-to-sequence recurrent networksCode1
Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility PredictionCode1
COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detectionCode1
RnG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question AnsweringCode1
Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-InCode1
LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation ModelsCode1
Equivariant Image ModelingCode1
M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and GenerationCode1
Label Agnostic Pre-training for Zero-shot Text ClassificationCode1
SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain AdaptationCode1
Schema-Guided Dialogue State Tracking Task at DSTC8Code1
Improving Zero-Shot Object-Level Change Detection by Incorporating Visual CorrespondenceCode1
Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action EnvironmentsCode1
MediViSTA: Medical Video Segmentation via Temporal Fusion SAM Adaptation for EchocardiographyCode1
Gradient Ascent Post-training Enhances Language Model GeneralizationCode1
Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment AnythingCode1
A Two-stage Reinforcement Learning-based Approach for Multi-entity Task AllocationCode1
Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulasCode1
Improving Zero-Shot Generalization for CLIP with Synthesized PromptsCode1
Visual Grounding for Object-Level Generalization in Reinforcement LearningCode1
ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous DrivingCode1
One-Prompt to Segment All Medical ImagesCode1
Prompting Language-Informed Distribution for Compositional Zero-Shot LearningCode1
Improving Zero-shot Generalization and Robustness of Multi-modal ModelsCode1
Multimodal Instruction Tuning with Conditional Mixture of LoRACode1
Improving Diffusion Models for Scene Text Editing with Dual EncodersCode1
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction FollowingCode1
Show:102550
← PrevPage 4 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified