SOTAVerified

Zero-shot Generalization

Papers

Showing 101–125 of 572 papers

Title | Status | Hype
Equivariant Image Modeling | Code | 1
STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding | Code | 1
Nature-Inspired Population-Based Evolution of Large Language Models | Code | 1
Delving into Out-of-Distribution Detection with Medical Vision-Language Models | Code | 1
Model Generalization on Text Attribute Graphs: Principles with Large Language Models | Code | 1
LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation Models | Code | 1
Improving Zero-Shot Object-Level Change Detection by Incorporating Visual Correspondence | Code | 1
FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images | Code | 1
OW-OVD: Unified Open World and Open Vocabulary Object Detection | Code | 1
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization | Code | 1
Towards Open-Vocabulary Video Semantic Segmentation | Code | 1
SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation | Code | 1
Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation | Code | 1
COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detection | Code | 1
Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility Prediction | Code | 1
M^2PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning | Code | 1
ScaleFlow++: Robust and Accurate Estimation of 3D Motion from Video | Code | 1
Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance | Code | 1
Generalizable Facial Expression Recognition | Code | 1
Visual Grounding for Object-Level Generalization in Reinforcement Learning | Code | 1
Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Vision-Language Models | Code | 1
ScaleFlow++: Robust and Accurate Estimation of 3D Motion from Video | Code | 1
Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation | Code | 1
A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation | Code | 1
GOMAA-Geo: GOal Modality Agnostic Active Geo-localization | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GR-MG | Avg. sequence length | 4.04 |  | Unverified
2 | MoDE | Avg. sequence length | 4.01 |  | Unverified
3 | RoboUniView | Avg. sequence length | 3.65 |  | Unverified
4 | 3D Diffuser Actor | Avg. sequence length | 3.27 |  | Unverified
5 | GR-1 | Avg. sequence length | 3.06 |  | Unverified