SOTAVerified

Zero-shot Generalization

Papers

Showing 251–300 of 572 papers

Title | Status | Hype
A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs | Code | 0
AoP-SAM: Automation of Prompts for Efficient Segmentation | - | 0
RVTBench: A Benchmark for Visual Reasoning Tasks | Code | 0
GenKnowSub: Improving Modularity and Reusability of LLMs through General Knowledge Subtraction | Code | 0
NVSPolicy: Adaptive Novel-View Synthesis for Generalizable Language-Conditioned Policy Learning | - | 0
Depth Anything with Any Prior | - | 0
Denoising and Alignment: Rethinking Domain Generalization for Multimodal Face Anti-Spoofing | - | 0
Visual Image Reconstruction from Brain Activity via Latent Representation | - | 0
Towards Artificial General or Personalized Intelligence? A Survey on Foundation Models for Personalized Federated Intelligence | - | 0
Learning Graph Representation of Agent Diffusers | Code | 0
Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization | - | 0
TeDA: Boosting Vision-Lanuage Models for Zero-Shot 3D Object Retrieval via Testing-time Distribution Alignment | Code | 0
A Review of 3D Object Detection with Vision-Language Models | - | 0
Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision | - | 0
Dysarthria Normalization via Local Lie Group Transformations for Robust ASR | Code | 0
Evolutionary Prompt Optimization Discovers Emergent Multimodal Reasoning Strategies in Vision-Language Models | - | 0
Zero-shot Domain Generalization of Foundational Models for 3D Medical Image Segmentation: An Experimental Study | - | 0
Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models | - | 0
Thinking agents for zero-shot generalization to qualitatively novel tasks | - | 0
Aether: Geometric-Aware Unified World Modeling | - | 0
Enhancing Zero-Shot Image Recognition in Vision-Language Models through Human-like Concept Guidance | - | 0
Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation | - | 0
GenM^3: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation | - | 0
Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better | - | 0
Learning with Expert Abstractions for Efficient Multi-Task Continuous Control | Code | 0
Foundation Feature-Driven Online End-Effector Pose Estimation: A Marker-Free and Learning-Free Approach | - | 0
Compound Expression Recognition via Large Vision-Language Models | - | 0
Prompt-OT: An Optimal Transport Regularization Paradigm for Knowledge Preservation in Vision-Language Model Adaptation | Code | 0
A Recipe for Improving Remote Sensing VLM Zero Shot Generalization | - | 0
PoseLess: Depth-Free Vision-to-Joint Control via Direct Image Mapping with VLM | - | 0
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction | - | 0
RAILGUN: A Unified Convolutional Policy for Multi-Agent Path Finding Across Different Environments and Tasks | - | 0
Re-Imagining Multimodal Instruction Tuning: A Representation View | Code | 0
Contrastive Learning of English Language and Crystal Graphs for Multimodal Representation of Materials Knowledge | - | 0
Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models | - | 0
GeLLMO: Generalizing Large Language Models for Multi-property Molecule Optimization | Code | 0
WRT-SAM: Foundation Model-Driven Segmentation for Generalized Weld Radiographic Testing | - | 0
Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement Learning | - | 0
Mechanistic Understandings of Representation Vulnerabilities and Engineering Robust Vision Transformers | - | 0
SimSort: A Data-Driven Framework for Spike Sorting by Large-Scale Electrophysiology Simulation | - | 0
Toward Task Generalization via Memory Augmentation in Meta-Reinforcement Learning | - | 0
FlexiCrackNet: A Flexible Pipeline for Enhanced Crack Segmentation with General Features Transfered from SAM | - | 0
Test-time Loss Landscape Adaptation for Zero-Shot Generalization in Vision-Language Models | - | 0
A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation | - | 0
DynaPrompt: Dynamic Test-Time Prompt Tuning | - | 0
Zero-Shot Trajectory Planning for Signal Temporal Logic Tasks | - | 0
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models | - | 0
Survey on Monocular Metric Depth Estimation | - | 0
MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching | - | 0
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GR-MG | Avg. sequence length | 4.04 | - | Unverified
2 | MoDE | Avg. sequence length | 4.01 | - | Unverified
3 | RoboUniView | Avg. sequence length | 3.65 | - | Unverified
4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | - | Unverified
5 | GR-1 | Avg. sequence length | 3.06 | - | Unverified