SOTAVerified

Zero-shot Generalization

Papers

Showing 101–150 of 572 papers

| Title | Status | Hype |
|---|---|---|
| WRT-SAM: Foundation Model-Driven Segmentation for Generalized Weld Radiographic Testing | | 0 |
| Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement Learning | | 0 |
| Mechanistic Understandings of Representation Vulnerabilities and Engineering Robust Vision Transformers | | 0 |
| LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation Models | Code | 1 |
| SimSort: A Data-Driven Framework for Spike Sorting by Large-Scale Electrophysiology Simulation | | 0 |
| Toward Task Generalization via Memory Augmentation in Meta-Reinforcement Learning | | 0 |
| FlexiCrackNet: A Flexible Pipeline for Enhanced Crack Segmentation with General Features Transfered from SAM | | 0 |
| Test-time Loss Landscape Adaptation for Zero-Shot Generalization in Vision-Language Models | | 0 |
| A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation | | 0 |
| DynaPrompt: Dynamic Test-Time Prompt Tuning | | 0 |
| Zero-Shot Trajectory Planning for Signal Temporal Logic Tasks | | 0 |
| State Combinatorial Generalization In Decision Making With Conditional Diffusion Models | | 0 |
| Survey on Monocular Metric Depth Estimation | | 0 |
| MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching | | 0 |
| Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective | | 0 |
| Zero-Shot Monocular Scene Flow Estimation in the Wild | | 0 |
| FoundationStereo: Zero-Shot Stereo Matching | Code | 7 |
| DEFOM-Stereo: Depth Foundation Model Based Stereo Matching | Code | 3 |
| MonSter: Marry Monodepth to Stereo Unleashes Power | Code | 4 |
| StereoGen: High-quality Stereo Image Generation from a Single Image | | 0 |
| Capability-Aware Shared Hypernetworks for Flexible Heterogeneous Multi-Robot Coordination | Code | 0 |
| Improving Zero-Shot Object-Level Change Detection by Incorporating Visual Correspondence | Code | 1 |
| Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation | | 0 |
| MADation: Face Morphing Attack Detection with Foundation Models | Code | 0 |
| Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera | Code | 3 |
| Spot Risks Before Speaking! Unraveling Safety Attention Heads in Large Vision-Language Models | Code | 0 |
| On the Zero-shot Adversarial Robustness of Vision-Language Models: A Truly Zero-shot and Training-free Approach | | 0 |
| OW-OVD: Unified Open World and Open Vocabulary Object Detection | Code | 1 |
| On the Out-Of-Distribution Generalization of Large Multimodal Models | | 0 |
| FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images | Code | 1 |
| From Pixels to Predicates: Learning Symbolic World Models via Pretrained Vision-Language Models | | 0 |
| EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation | | 0 |
| Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization | Code | 1 |
| Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio | | 0 |
| Towards Graph Foundation Models: Learning Generalities Across Graphs via Task-Trees | Code | 0 |
| CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up | Code | 3 |
| Zero-Shot Generalization for Blockage Localization in mmWave Communication | | 0 |
| Memorizing SAM: 3D Medical Segment Anything Model with Memorizing Transformer | Code | 0 |
| Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion | | 0 |
| Efficient Fine-Tuning of Single-Cell Foundation Models Enables Zero-Shot Molecular Perturbation Prediction | | 0 |
| Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning | Code | 2 |
| WiFo: Wireless Foundation Model for Channel Prediction | | 0 |
| Towards Open-Vocabulary Video Semantic Segmentation | Code | 1 |
| EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | | 0 |
| Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models | | 0 |
| Large Concept Models: Language Modeling in a Sentence Representation Space | Code | 7 |
| SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation | Code | 1 |
| Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion | Code | 0 |
| ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning | Code | 0 |
| Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation | Code | 1 |

Benchmark Results

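| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GR-MG | Avg. sequence length | 4.04 | | Unverified |
| 2 | MoDE | Avg. sequence length | 4.01 | | Unverified |
| 3 | RoboUniView | Avg. sequence length | 3.65 | | Unverified |
| 4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | | Unverified |
| 5 | GR-1 | Avg. sequence length | 3.06 | | Unverified |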