SOTAVerified

Zero-shot Generalization

Papers

Showing 5175 of 572 papers

TitleStatusHype
Visual Image Reconstruction from Brain Activity via Latent Representation0
Towards Artificial General or Personalized Intelligence? A Survey on Foundation Models for Personalized Federated Intelligence0
Learning Graph Representation of Agent DiffusersCode0
Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action EnvironmentsCode1
Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization0
TeDA: Boosting Vision-Lanuage Models for Zero-Shot 3D Object Retrieval via Testing-time Distribution AlignmentCode0
Towards Ball Spin and Trajectory Analysis in Table Tennis Broadcast Videos via Physically Grounded Synthetic-to-Real TransferCode1
A Review of 3D Object Detection with Vision-Language Models0
Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision0
Dysarthria Normalization via Local Lie Group Transformations for Robust ASRCode0
Crane: Context-Guided Prompt Learning and Attention Refinement for Zero-Shot Anomaly DetectionsCode1
Zero-Shot Whole-Body Humanoid Control via Behavioral Foundation ModelsCode4
Detect Anything 3D in the WildCode3
SAM2MOT: A Novel Paradigm of Multi-Object Tracking by SegmentationCode2
Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite ImageryCode2
PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose EstimationCode1
Evolutionary Prompt Optimization Discovers Emergent Multimodal Reasoning Strategies in Vision-Language Models0
Zero-shot Domain Generalization of Foundational Models for 3D Medical Image Segmentation: An Experimental Study0
Q-Insight: Understanding Image Quality via Visual Reinforcement LearningCode2
Thinking agents for zero-shot generalization to qualitatively novel tasks0
Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models0
FRESA:Feedforward Reconstruction of Personalized Skinned Avatars from Few ImagesCode1
Aether: Geometric-Aware Unified World Modeling0
Equivariant Image ModelingCode1
Enhancing Zero-Shot Image Recognition in Vision-Language Models through Human-like Concept Guidance0
Show:102550
← PrevPage 3 of 23Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified