SOTAVerified

Zero-shot Generalization

Papers

Showing 251300 of 572 papers

TitleStatusHype
Temporal-spatial Adaptation of Promptable SAM Enhance Accuracy and Generalizability of cine CMR Segmentation0
Categorical Traffic Transformer: Interpretable and Diverse Behavior Prediction with Tokenized Latent0
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective0
Cheap Lunch for Medical Image Segmentation by Fine-tuning SAM on Few Exemplars0
CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance0
CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning0
Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning0
Compositional generalization through abstract representations in human and artificial neural networks0
Compound Expression Recognition via Large Vision-Language Models0
Concept-modulated model-based offline reinforcement learning for rapid generalization0
Context-Aware Multimodal Pretraining0
Contrastive Learning of English Language and Crystal Graphs for Multimodal Representation of Materials Knowledge0
Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model0
Cross-Embodiment Dexterous Grasping with Reinforcement Learning0
Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval0
Cross-Modal Retrieval Meets Inference:Improving Zero-Shot Classification with Cross-Modal Retrieval0
C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing0
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation0
CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray0
DEAL: Disentangling Transformer Head Activations for LLM Steering0
Decision Transformer as a Foundation Model for Partially Observable Continuous Control0
Deep Equivariant Multi-Agent Control Barrier Functions0
Deep Generative Adversarial Network for Occlusion Removal from a Single Image0
Deep Learning based Optical Image Super-Resolution via Generative Diffusion Models for Layerwise in-situ LPBF Monitoring0
Deep learning universal crater detection using Segment Anything Model (SAM)0
DEL-Ranking: Ranking-Correction Denoising Framework for Elucidating Molecular Affinities in DNA-Encoded Libraries0
Denoising and Alignment: Rethinking Domain Generalization for Multimodal Face Anti-Spoofing0
Depth Anything with Any Prior0
DEUX: Active Exploration for Learning Unsupervised Depth Perception0
DiffCJK: Conditional Diffusion Model for High-Quality and Wide-coverage CJK Character Generation0
DiffuVolume: Diffusion Model for Volume based Stereo Matching0
Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models0
Disentangling Representations through Multi-task Learning0
Do Transformers know symbolic rules, and would we know if they did?0
Do We Need to Create Big Datasets to Learn a Task?0
DynaPrompt: Dynamic Test-Time Prompt Tuning0
EasyInsert: A Data-Efficient and Generalizable Insertion Policy0
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM0
EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation0
Efficient Fine-Tuning of Single-Cell Foundation Models Enables Zero-Shot Molecular Perturbation Prediction0
E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance0
Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning0
Encoding Explanatory Knowledge for Zero-shot Science Question Answering0
EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy0
Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Self-Regularization0
Enhancing Zero-Shot Image Recognition in Vision-Language Models through Human-like Concept Guidance0
Error-Aware Policy Learning: Zero-Shot Generalization in Partially Observable Dynamic Environments0
Evolutionary Prompt Optimization Discovers Emergent Multimodal Reasoning Strategies in Vision-Language Models0
Fairness-Aware Online Meta-learning0
Federated reinforcement learning for robot motion planning with zero-shot generalization0
Show:102550
← PrevPage 6 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified