SOTAVerified

Zero-shot Generalization

Papers

Showing 276-300 of 572 papers

Title | Status | Hype
F^2Depth: Self-supervised Indoor Monocular Depth Estimation via Optical Flow Consistency and Feature Map Synthesis | - | 0
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation | Code | 7
Federated reinforcement learning for robot motion planning with zero-shot generalization | - | 0
Quantifying uncertainty in lung cancer segmentation with foundation models applied to mixed-domain datasets | - | 0
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models | Code | 1
Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity | Code | 1
Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model | Code | 2
Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot Generalization | Code | 1
Temporal-spatial Adaptation of Promptable SAM Enhance Accuracy and Generalizability of cine CMR Segmentation | - | 0
FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images | Code | 1
Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models | - | 0
SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration | - | 0
Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything | Code | 1
FluoroSAM: A Language-aligned Foundation Model for X-ray Image Segmentation | Code | 1
RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model | Code | 2
In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model | - | 0
SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising | Code | 0
Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection | Code | 1
Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV | Code | 2
Segment anything model for head and neck tumor segmentation with CT, PET and MRI multi-modality images | Code | 0
Multimodal Instruction Tuning with Conditional Mixture of LoRA | Code | 1
Multi-Task Learning for Routing Problem with Cross-Problem Zero-Shot Generalization | Code | 1
IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus | Code | 3
ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling | Code | 0
Zero-shot generalization across architectures for visual classification | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GR-MG | Avg. sequence length | 4.04 | - | Unverified
2 | MoDE | Avg. sequence length | 4.01 | - | Unverified
3 | RoboUniView | Avg. sequence length | 3.65 | - | Unverified
4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | - | Unverified
5 | GR-1 | Avg. sequence length | 3.06 | - | Unverified