SOTAVerified

Zero-shot Generalization

Papers

Showing 301–350 of 572 papers

| Title | Status | Hype |
|---|---|---|
| ISCUTE: Instance Segmentation of Cables Using Text Embedding | | 0 |
| Triple-Encoders: Representations That Fire Together, Wire Together | Code | 1 |
| From Real World to Logic and Back: Learning Generalizable Relational Concepts For Long Horizon Robot Planning | | 0 |
| 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Code | 3 |
| Unsupervised Discovery of Object-Centric Neural Fields | | 0 |
| On the Out-Of-Distribution Generalization of Multimodal Large Language Models | | 0 |
| Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control | Code | 0 |
| Learning to Route Among Specialized Experts for Zero-Shot Generalization | Code | 2 |
| InCoRo: In-Context Learning for Robotics Control with Feedback Loops | | 0 |
| Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains | Code | 1 |
| Image-Caption Encoding for Improving Zero-Shot Generalization | Code | 0 |
| Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning | Code | 2 |
| Symbol: Generating Flexible Black-Box Optimizers through Symbolic Equation Learning | Code | 1 |
| Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model | | 0 |
| Data-Free Generalized Zero-Shot Learning | Code | 0 |
| InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions | Code | 2 |
| Solving Continual Offline Reinforcement Learning with Decision Transformer | | 0 |
| Exploring the Best Practices of Query Expansion with Large Language Models | Code | 1 |
| An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models | | 0 |
| MatSAM: Efficient Extraction of Microstructures of Materials via Visual Large Model | Code | 1 |
| Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness | Code | 1 |
| TimeGraphs: Graph-based Temporal Reasoning | | 0 |
| ConfusionPrompt: Practical Private Inference for Online Large Language Models | | 0 |
| Semantic Guidance Tuning for Text-To-Image Diffusion Models | Code | 2 |
| Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation | Code | 2 |
| Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning | | 0 |
| A Dual Curriculum Learning Framework for Multi-UAV Pursuit-Evasion in Diverse Environments | | 0 |
| Towards the Unification of Generative and Discriminative Visual Foundation Model: A Survey | | 0 |
| General Object Foundation Model for Images and Videos at Scale | Code | 3 |
| MmAP: Multi-modal Alignment Prompt for Cross-domain Multi-task Learning | | 0 |
| How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation | Code | 1 |
| Adaptive Human Trajectory Prediction via Latent Corridors | | 0 |
| Multi-View Unsupervised Image Generation with Cross Attention Guidance | | 0 |
| MuRF: Multi-Baseline Radiance Fields | Code | 1 |
| Large Language Models are Good Prompt Learners for Low-Shot Image Classification | Code | 1 |
| Boosting Segment Anything Model Towards Open-Vocabulary Learning | Code | 1 |
| MASP: Scalable GNN-based Planning for Multi-Agent Navigation | | 0 |
| I-PHYRE: Interactive Physical Reasoning | | 0 |
| Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation | Code | 4 |
| Categorical Traffic Transformer: Interpretable and Diverse Behavior Prediction with Tokenized Latent | | 0 |
| Large Model Based Referring Camouflaged Object Detection | | 0 |
| UniIR: Training and Benchmarking Universal Multimodal Information Retrievers | | 0 |
| C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing | | 0 |
| VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning | Code | 1 |
| A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs | | 0 |
| Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders | Code | 1 |
| Neural-Logic Human-Object Interaction Detection | Code | 1 |
| Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts | Code | 0 |
| Towards Generalizable SER: Soft Labeling and Data Augmentation for Modeling Temporal Emotion Shifts in Large-Scale Multilingual Speech | Code | 0 |
| Adaptive recurrent vision performs zero-shot computation scaling to unseen difficulty levels | | 0 |

Benchmark Results

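| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GR-MG | Avg. sequence length | 4.04 | | Unverified |
| 2 | MoDE | Avg. sequence length | 4.01 | | Unverified |
| 3 | RoboUniView | Avg. sequence length | 3.65 | | Unverified |
| 4 | 3D Diffuser Actor | Avg. sequence length | 3.27 | | Unverified |
| 5 | GR-1 | Avg. sequence length | 3.06 | | Unverified |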