SOTAVerified

Zero-shot Generalization

Papers

Showing 501550 of 572 papers

TitleStatusHype
Scoring-Aggregating-Planning: Learning task-agnostic priors from interactions and sparse rewards for zero-shot generalization0
Segment Anything Model for Grain Characterization in Hard Drive Design0
Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models0
Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition0
Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs0
Sequence-Based Plan Feasibility Prediction for Efficient Task and Motion Planning0
Show, Don’t Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue0
Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue0
SimSort: A Data-Driven Framework for Spike Sorting by Large-Scale Electrophysiology Simulation0
Solving Continual Offline Reinforcement Learning with Decision Transformer0
Solving the Same-Different Task with Convolutional Neural Networks0
SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning0
SSTD: Stripe-Like Space Target Detection Using Single-Point Weak Supervision0
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models0
StereoGen: High-quality Stereo Image Generation from a Single Image0
Still not systematic after all these years: On the compositional skills of sequence-to-sequence recurrent networks0
Style-Pro: Style-Guided Prompt Learning for Generalizable Vision-Language Models0
StyLIP: Multi-Scale Style-Conditioned Prompt Learning for CLIP-based Domain Generalization0
Survey on Monocular Metric Depth Estimation0
TanDepth: Leveraging Global DEMs for Metric Monocular Depth Estimation in UAVs0
A Dual Curriculum Learning Framework for Multi-UAV Pursuit-Evasion in Diverse Environments0
ConfusionPrompt: Practical Private Inference for Online Large Language Models0
Test-time Loss Landscape Adaptation for Zero-Shot Generalization in Vision-Language Models0
Text2Model: Text-based Model Induction for Zero-shot Image Classification0
Text-only Synthesis for Image Captioning0
Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision0
The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control0
The Third Monocular Depth Estimation Challenge0
Thinking agents for zero-shot generalization to qualitatively novel tasks0
TIMA: Text-Image Mutual Awareness for Balancing Zero-Shot Adversarial Robustness and Generalization Ability0
TimeGraphs: Graph-based Temporal Reasoning0
Towards Artificial General or Personalized Intelligence? A Survey on Foundation Models for Personalized Federated Intelligence0
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation0
Towards Generalist Biomedical AI0
Towards the Unification of Generative and Discriminative Visual Foundation Model: A Survey0
Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation0
Toward Task Generalization via Memory Augmentation in Meta-Reinforcement Learning0
Transductive CLIP with Class-Conditional Contrastive Learning0
Transferable and Distributed User Association Policies for 5G and Beyond Networks0
Quantifying uncertainty in lung cancer segmentation with foundation models applied to mixed-domain datasets0
Turbocharging Solution Concepts: Solving NEs, CEs and CCEs with Neural Equilibrium Solvers0
Unifying Few- and Zero-Shot Egocentric Action Recognition0
Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos0
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers0
Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models0
Unsupervised Discovery of Object-Centric Neural Fields0
Unsupervised Prompt Tuning for Text-Driven Object Detection0
UTSD: Unified Time Series Diffusion Model0
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning0
Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models0
Show:102550
← PrevPage 11 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified