SOTAVerified

Zero-shot Generalization

Papers

Showing 401–450 of 572 papers

| Title | Status | Hype |
| --- | --- | --- |
| Decision Transformer as a Foundation Model for Partially Observable Continuous Control | | 0 |
| F^2Depth: Self-supervised Indoor Monocular Depth Estimation via Optical Flow Consistency and Feature Map Synthesis | | 0 |
| Federated reinforcement learning for robot motion planning with zero-shot generalization | | 0 |
| Quantifying uncertainty in lung cancer segmentation with foundation models applied to mixed-domain datasets | | 0 |
| Temporal-spatial Adaptation of Promptable SAM Enhance Accuracy and Generalizability of cine CMR Segmentation | | 0 |
| SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration | | 0 |
| Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models | | 0 |
| In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model | | 0 |
| SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising | Code | 0 |
| Segment anything model for head and neck tumor segmentation with CT, PET and MRI multi-modality images | Code | 0 |
| ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling | Code | 0 |
| Zero-shot generalization across architectures for visual classification | Code | 0 |
| ISCUTE: Instance Segmentation of Cables Using Text Embedding | | 0 |
| From Real World to Logic and Back: Learning Generalizable Relational Concepts For Long Horizon Robot Planning | | 0 |
| Unsupervised Discovery of Object-Centric Neural Fields | | 0 |
| On the Out-Of-Distribution Generalization of Multimodal Large Language Models | | 0 |
| Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control | Code | 0 |
| InCoRo: In-Context Learning for Robotics Control with Feedback Loops | | 0 |
| Image-Caption Encoding for Improving Zero-Shot Generalization | Code | 0 |
| Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model | | 0 |
| Data-Free Generalized Zero-Shot Learning | Code | 0 |
| Solving Continual Offline Reinforcement Learning with Decision Transformer | | 0 |
| An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models | | 0 |
| TimeGraphs: Graph-based Temporal Reasoning | | 0 |
| ConfusionPrompt: Practical Private Inference for Online Large Language Models | | 0 |
| A Dual Curriculum Learning Framework for Multi-UAV Pursuit-Evasion in Diverse Environments | | 0 |
| Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning | | 0 |
| Towards the Unification of Generative and Discriminative Visual Foundation Model: A Survey | | 0 |
| MmAP : Multi-modal Alignment Prompt for Cross-domain Multi-task Learning | | 0 |
| Adaptive Human Trajectory Prediction via Latent Corridors | | 0 |
| Multi-View Unsupervised Image Generation with Cross Attention Guidance | | 0 |
| MASP: Scalable GNN-based Planning for Multi-Agent Navigation | | 0 |
| I-PHYRE: Interactive Physical Reasoning | | 0 |
| Categorical Traffic Transformer: Interpretable and Diverse Behavior Prediction with Tokenized Latent | | 0 |
| Large Model Based Referring Camouflaged Object Detection | | 0 |
| UniIR: Training and Benchmarking Universal Multimodal Information Retrievers | | 0 |
| C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing | | 0 |
| A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs | | 0 |
| Towards Generalizable SER: Soft Labeling and Data Augmentation for Modeling Temporal Emotion Shifts in Large-Scale Multilingual Speech | Code | 0 |
| Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts | Code | 0 |
| Adaptive recurrent vision performs zero-shot computation scaling to unseen difficulty levels | | 0 |
| A Simple yet Efficient Ensemble Approach for AI-generated Text Detection | | 0 |
| Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE | Code | 0 |
| Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization | | 0 |
| Neural Field Dynamics Model for Granular Object Piles Manipulation | | 0 |
| ZGUL: Zero-shot Generalization to Unseen Languages using Multi-source Ensembling of Language Adapters | Code | 0 |
| Robot Skill Generalization via Keypoint Integrated Soft Actor-Critic Gaussian Mixture Models | | 0 |
| InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining | | 0 |
| What Matters to You? Towards Visual Representation Alignment for Robot Learning | | 0 |
| From Supervised to Generative: A Novel Paradigm for Tabular Deep Learning with Large Language Models | Code | 0 |

Benchmark Results

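| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GR-MG | Avg. sequence length | 4.04 | | Unverified |
| 2 | MoDE | Avg. sequence length | 4.01 | | Unverified |
| 3 | RoboUniView | Avg. sequence length | 3.65 | | Unverified |
| 4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | | Unverified |
| 5 | GR-1 | Avg. sequence length | 3.06 | | Unverified |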