SOTAVerified

Zero-shot Generalization

Papers

Showing 351-400 of 572 papers

| Title | Status | Hype |
| --- | --- | --- |
| Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model | | 0 |
| Cross-Embodiment Dexterous Grasping with Reinforcement Learning | | 0 |
| Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval | | 0 |
| Cross-Modal Retrieval Meets Inference: Improving Zero-Shot Classification with Cross-Modal Retrieval | | 0 |
| C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing | | 0 |
| Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation | | 0 |
| CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray | | 0 |
| DEAL: Disentangling Transformer Head Activations for LLM Steering | | 0 |
| Decision Transformer as a Foundation Model for Partially Observable Continuous Control | | 0 |
| Deep Equivariant Multi-Agent Control Barrier Functions | | 0 |
| Deep Generative Adversarial Network for Occlusion Removal from a Single Image | | 0 |
| Deep Learning based Optical Image Super-Resolution via Generative Diffusion Models for Layerwise in-situ LPBF Monitoring | | 0 |
| Deep learning universal crater detection using Segment Anything Model (SAM) | | 0 |
| DEL-Ranking: Ranking-Correction Denoising Framework for Elucidating Molecular Affinities in DNA-Encoded Libraries | | 0 |
| Denoising and Alignment: Rethinking Domain Generalization for Multimodal Face Anti-Spoofing | | 0 |
| Depth Anything with Any Prior | | 0 |
| DEUX: Active Exploration for Learning Unsupervised Depth Perception | | 0 |
| DiffCJK: Conditional Diffusion Model for High-Quality and Wide-coverage CJK Character Generation | | 0 |
| DiffuVolume: Diffusion Model for Volume based Stereo Matching | | 0 |
| Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models | | 0 |
| Disentangling Representations through Multi-task Learning | | 0 |
| Do Transformers know symbolic rules, and would we know if they did? | | 0 |
| Do We Need to Create Big Datasets to Learn a Task? | | 0 |
| DynaPrompt: Dynamic Test-Time Prompt Tuning | | 0 |
| EasyInsert: A Data-Efficient and Generalizable Insertion Policy | | 0 |
| EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | | 0 |
| EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation | | 0 |
| Efficient Fine-Tuning of Single-Cell Foundation Models Enables Zero-Shot Molecular Perturbation Prediction | | 0 |
| E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance | | 0 |
| Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning | | 0 |
| Encoding Explanatory Knowledge for Zero-shot Science Question Answering | | 0 |
| EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy | | 0 |
| Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Self-Regularization | | 0 |
| Enhancing Zero-Shot Image Recognition in Vision-Language Models through Human-like Concept Guidance | | 0 |
| Error-Aware Policy Learning: Zero-Shot Generalization in Partially Observable Dynamic Environments | | 0 |
| Evolutionary Prompt Optimization Discovers Emergent Multimodal Reasoning Strategies in Vision-Language Models | | 0 |
| Fairness-Aware Online Meta-learning | | 0 |
| Federated reinforcement learning for robot motion planning with zero-shot generalization | | 0 |
| Learning to Specialize: Joint Gating-Expert Training for Adaptive MoEs in Decentralized Settings | | 0 |
| Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models | | 0 |
| FlexiCrackNet: A Flexible Pipeline for Enhanced Crack Segmentation with General Features Transfered from SAM | | 0 |
| Foundation Feature-Driven Online End-Effector Pose Estimation: A Marker-Free and Learning-Free Approach | | 0 |
| From Images to Textual Prompts: Zero-Shot Visual Question Answering With Frozen Large Language Models | | 0 |
| From Real World to Logic and Back: Learning Generalizable Relational Concepts For Long Horizon Robot Planning | | 0 |
| G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning | | 0 |
| gen2seg: Generative Models Enable Generalizable Instance Segmentation | | 0 |
| Generalization Through Hand-Eye Coordination: An Action Space for Learning Spatially-Invariant Visuomotor Control | | 0 |
| Enhancing Vision-Language Models Generalization via Diversity-Driven Novel Feature Synthesis | | 0 |
| Generalizing Reinforcement Learning to Unseen Actions | | 0 |
| Generating Out-Of-Distribution Scenarios Using Language Models | | 0 |

Benchmark Results

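| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GR-MG | Avg. sequence length | 4.04 | | Unverified |
| 2 | MoDE | Avg. sequence length | 4.01 | | Unverified |
| 3 | RoboUniView | Avg. sequence length | 3.65 | | Unverified |
| 4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | | Unverified |
| 5 | GR-1 | Avg. sequence length | 3.06 | | Unverified |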