SOTAVerified

Zero-shot Generalization

Papers

Showing 126-150 of 572 papers

| Title | Status | Hype |
|---|---|---|
| Spot Risks Before Speaking! Unraveling Safety Attention Heads in Large Vision-Language Models | Code | 0 |
| On the Zero-shot Adversarial Robustness of Vision-Language Models: A Truly Zero-shot and Training-free Approach | | 0 |
| OW-OVD: Unified Open World and Open Vocabulary Object Detection | Code | 1 |
| On the Out-Of-Distribution Generalization of Large Multimodal Models | | 0 |
| FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images | Code | 1 |
| From Pixels to Predicates: Learning Symbolic World Models via Pretrained Vision-Language Models | | 0 |
| EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation | | 0 |
| Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization | Code | 1 |
| Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio | | 0 |
| Towards Graph Foundation Models: Learning Generalities Across Graphs via Task-Trees | Code | 0 |
| CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up | Code | 3 |
| Zero-Shot Generalization for Blockage Localization in mmWave Communication | | 0 |
| Memorizing SAM: 3D Medical Segment Anything Model with Memorizing Transformer | Code | 0 |
| Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion | | 0 |
| Efficient Fine-Tuning of Single-Cell Foundation Models Enables Zero-Shot Molecular Perturbation Prediction | | 0 |
| Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning | Code | 2 |
| WiFo: Wireless Foundation Model for Channel Prediction | | 0 |
| Towards Open-Vocabulary Video Semantic Segmentation | Code | 1 |
| EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | | 0 |
| Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models | | 0 |
| Large Concept Models: Language Modeling in a Sentence Representation Space | Code | 7 |
| SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation | Code | 1 |
| Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion | Code | 0 |
| ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning | Code | 0 |
| Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation | Code | 1 |

Benchmark Results

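| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GR-MG | Avg. sequence length | 4.04 | | Unverified |
| 2 | MoDE | Avg. sequence length | 4.01 | | Unverified |
| 3 | RoboUniView | Avg. sequence length | 3.65 | | Unverified |
| 4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | | Unverified |
| 5 | GR-1 | Avg. sequence length | 3.06 | | Unverified |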