SOTAVerified

Zero-shot Generalization

Papers

Showing 201250 of 572 papers

TitleStatusHype
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual PoliciesCode1
What Can I Do Here? Learning New Skills by Imagining Visual AffordancesCode1
Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team CompositionCode1
ZePHyR: Zero-shot Pose Hypothesis RatingCode1
NaturalProofs: Mathematical Theorem Proving in Natural LanguageCode1
Grounding Language to Entities and Dynamics for Generalization in Reinforcement LearningCode1
Generalization to New Actions in Reinforcement LearningCode1
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement LearningCode1
STAR: A Schema-Guided Dialog Dataset for Transfer LearningCode1
Learning Quadrupedal Locomotion over Challenging TerrainCode1
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot LabelsCode1
Symbolic Relational Deep Reinforcement Learning based on Graph Neural Networks and Autoregressive Policy DecompositionCode1
The Scattering Compositional Learner: Discovering Objects, Attributes, Relationships in Analogical ReasoningCode1
Cross-Lingual Disaster-related Multi-label Tweet Classification with Manifold MixupCode1
Learning the Travelling Salesperson Problem Requires Rethinking GeneralizationCode1
Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulasCode1
Schema-Guided Dialogue State Tracking Task at DSTC8Code1
Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue DatasetCode1
Generalization without systematicity: On the compositional skills of sequence-to-sequence recurrent networksCode1
Zero-Shot Relation Extraction via Reading ComprehensionCode1
SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation0
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation0
PoseLLM: Enhancing Language-Guided Human Pose Estimation with MLP AlignmentCode0
Go to Zero: Towards Zero-shot Motion Generation with Million-scale DataCode0
Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models0
Helping CLIP See Both the Forest and the Trees: A Decomposition and Description Approach0
RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather0
TRACED: Transition-aware Regret Approximation with Co-learnability for Environment DesignCode0
VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy0
LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction0
Prohibited Items Segmentation via Occlusion-aware Bilayer ModelingCode0
DEAL: Disentangling Transformer Head Activations for LLM Steering0
ZeroVO: Visual Odometry with Minimal Assumptions0
Deep Equivariant Multi-Agent Control Barrier Functions0
CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray0
Latent Diffusion Model Based Denoising Receiver for 6G Semantic Communication: From Stochastic Differential Theory to Application0
Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation0
Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer0
Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation0
ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers0
WHISTRESS: Enriching Transcriptions with Sentence Stress Detection0
G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning0
Anchored Diffusion Language Model0
CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning0
EasyInsert: A Data-Efficient and Generalizable Insertion Policy0
Prompt Tuning Vision Language Models with Margin Regularizer for Few-Shot Learning under Distribution ShiftsCode0
AnyBody: A Benchmark Suite for Cross-Embodiment Manipulation0
gen2seg: Generative Models Enable Generalizable Instance Segmentation0
EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy0
ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling0
Show:102550
← PrevPage 5 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified