SOTAVerified

Zero-shot Generalization

Papers

Showing 326350 of 572 papers

TitleStatusHype
Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis0
Style-Pro: Style-Guided Prompt Learning for Generalizable Vision-Language Models0
Generating Out-Of-Distribution Scenarios Using Language Models0
Context-Aware Multimodal Pretraining0
SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical SegmentationCode0
HEIGHT: Heterogeneous Interaction Graph Transformer for Robot Navigation in Crowded and Constrained Environments0
Scalable Autoregressive Monocular Depth Estimation0
MLAN: Language-Based Instruction Tuning Improves Zero-Shot Generalization of Multimodal Large Language ModelsCode0
Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos0
Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching0
In the Era of Prompt Learning with Vision-Language Models0
Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting DiversityCode0
Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuliCode0
Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning0
JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking0
GHIL-Glue: Hierarchical Control with Filtered Subgoal Images0
Adversarial Environment Design via Regret-Guided Diffusion Models0
Random Policy Enables In-Context Reinforcement Learning within Trust Horizons0
BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning0
LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias0
DEL-Ranking: Ranking-Correction Denoising Framework for Elucidating Molecular Affinities in DNA-Encoded Libraries0
MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge TransferCode0
On the Evaluation of Generative Robotic Simulations0
Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels0
Zero-Shot Generalization of Vision-Based RL Without Data Augmentation0
Show:102550
← PrevPage 14 of 23Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified