SOTAVerified

Zero-shot Generalization

Papers

Showing 201250 of 572 papers

TitleStatusHype
How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary InvestigationCode1
Learning Modular Simulations for Homogeneous SystemsCode1
Multimodal Knowledge Alignment with Reinforcement LearningCode1
Model Generalization on Text Attribute Graphs: Principles with Large Language ModelsCode1
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement LearningCode1
Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text TransformersCode1
Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team CompositionCode1
M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and GenerationCode1
Improving Zero-shot Generalization and Robustness of Multi-modal ModelsCode1
Improving Zero-Shot Generalization for CLIP with Synthesized PromptsCode1
Learning Quadrupedal Locomotion over Challenging TerrainCode1
Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive TasksCode1
Improving Zero-Shot Object-Level Change Detection by Incorporating Visual CorrespondenceCode1
DePT: Decoupled Prompt TuningCode1
Learning the Travelling Salesperson Problem Requires Rethinking GeneralizationCode1
M^2PT: Multimodal Prompt Tuning for Zero-shot Instruction LearningCode1
Exploring the Best Practices of Query Expansion with Large Language ModelsCode1
Where to Move Next: Zero-shot Generalization of LLMs for Next POI RecommendationCode1
ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous DrivingCode1
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data OnlyCode1
CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance0
DynaPrompt: Dynamic Test-Time Prompt Tuning0
A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs0
Latent Diffusion Model Based Denoising Receiver for 6G Semantic Communication: From Stochastic Differential Theory to Application0
Large Model Based Referring Camouflaged Object Detection0
Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment0
Do We Need to Create Big Datasets to Learn a Task?0
Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation0
Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization0
Do Transformers know symbolic rules, and would we know if they did?0
A Coach-Player Framework for Dynamic Team Composition0
MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching0
Cheap Lunch for Medical Image Segmentation by Fine-tuning SAM on Few Exemplars0
Disentangling Representations through Multi-task Learning0
A Review of 3D Object Detection with Vision-Language Models0
JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking0
Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models0
Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation0
ISCUTE: Instance Segmentation of Cables Using Text Embedding0
DiffuVolume: Diffusion Model for Volume based Stereo Matching0
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective0
DiffCJK: Conditional Diffusion Model for High-Quality and Wide-coverage CJK Character Generation0
I-PHYRE: Interactive Physical Reasoning0
In the Era of Prompt Learning with Vision-Language Models0
Categorical Traffic Transformer: Interpretable and Diverse Behavior Prediction with Tokenized Latent0
A Recipe for Improving Remote Sensing VLM Zero Shot Generalization0
Interaction Modeling with Multiplex Attention0
DEUX: Active Exploration for Learning Unsupervised Depth Perception0
Aether: Geometric-Aware Unified World Modeling0
Temporal-spatial Adaptation of Promptable SAM Enhance Accuracy and Generalizability of cine CMR Segmentation0
Show:102550
← PrevPage 5 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified