SOTAVerified

Zero-shot Generalization

Papers

Showing 251275 of 572 papers

TitleStatusHype
ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling0
AoP-SAM: Automation of Prompts for Efficient Segmentation0
RVTBench: A Benchmark for Visual Reasoning TasksCode0
GenKnowSub: Improving Modularity and Reusability of LLMs through General Knowledge SubtractionCode0
Depth Anything with Any Prior0
NVSPolicy: Adaptive Novel-View Synthesis for Generalizable Language-Conditioned Policy Learning0
Denoising and Alignment: Rethinking Domain Generalization for Multimodal Face Anti-Spoofing0
Visual Image Reconstruction from Brain Activity via Latent Representation0
Towards Artificial General or Personalized Intelligence? A Survey on Foundation Models for Personalized Federated Intelligence0
Learning Graph Representation of Agent DiffusersCode0
Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization0
TeDA: Boosting Vision-Lanuage Models for Zero-Shot 3D Object Retrieval via Testing-time Distribution AlignmentCode0
A Review of 3D Object Detection with Vision-Language Models0
Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision0
Dysarthria Normalization via Local Lie Group Transformations for Robust ASRCode0
Evolutionary Prompt Optimization Discovers Emergent Multimodal Reasoning Strategies in Vision-Language Models0
Zero-shot Domain Generalization of Foundational Models for 3D Medical Image Segmentation: An Experimental Study0
Thinking agents for zero-shot generalization to qualitatively novel tasks0
Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models0
Aether: Geometric-Aware Unified World Modeling0
Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation0
Enhancing Zero-Shot Image Recognition in Vision-Language Models through Human-like Concept Guidance0
Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better0
GenM^3: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation0
Learning with Expert Abstractions for Efficient Multi-Task Continuous ControlCode0
Show:102550
← PrevPage 11 of 23Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified