SOTAVerified

Zero-shot Generalization

Papers

Showing 501550 of 572 papers

TitleStatusHype
Improving existing segmentators performance with zero-shot segmentatorsCode0
SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt DenoisingCode0
Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction TuningCode0
Data-Free Generalized Zero-Shot LearningCode0
Re-Imagining Multimodal Instruction Tuning: A Representation ViewCode0
Prompt Tuning Vision Language Models with Margin Regularizer for Few-Shot Learning under Distribution ShiftsCode0
Prompt-OT: An Optimal Transport Regularization Paradigm for Knowledge Preservation in Vision-Language Model AdaptationCode0
Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive PhysicsCode0
Zero-shot Policy Learning with Spatial Temporal RewardDecomposition on Contingency-aware ObservationCode0
Cross-Trajectory Representation Learning for Zero-Shot Generalization in RLCode0
TRACED: Transition-aware Regret Approximation with Co-learnability for Environment DesignCode0
Prompt Learning for Generalized Vehicle RoutingCode0
Prohibited Items Segmentation via Occlusion-aware Bilayer ModelingCode0
PoseLLM: Enhancing Language-Guided Human Pose Estimation with MLP AlignmentCode0
Image-Caption Encoding for Improving Zero-Shot GeneralizationCode0
Segment anything model for head and neck tumor segmentation with CT, PET and MRI multi-modality imagesCode0
One Shot is Enough for Sequential Infrared Small Target SegmentationCode0
Zero-shot generalization across architectures for visual classificationCode0
OLIVE: Object Level In-Context Visual EmbeddingsCode0
Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and GranularityCode0
Hierarchical Reinforcement Learning for Zero-shot Generalization with Subtask DependenciesCode0
Octavius: Mitigating Task Interference in MLLMs via LoRA-MoECode0
SemSup: Semantic Supervision for Simple and Scalable Zero-shot GeneralizationCode0
Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuliCode0
Convolutional Conditional Neural ProcessesCode0
Simulator Predictive Control: Using Learned Task Representations and MPC for Zero-Shot Generalization and SequencingCode0
Go to Zero: Towards Zero-shot Motion Generation with Million-scale DataCode0
GenKnowSub: Improving Modularity and Reusability of LLMs through General Knowledge SubtractionCode0
MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge TransferCode0
MMG-Ego4D: Multimodal Generalization in Egocentric Action RecognitionCode0
Consistency by Agreement in Zero-shot Neural Machine TranslationCode0
A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMsCode0
Spot Risks Before Speaking! Unraveling Safety Attention Heads in Large Vision-Language ModelsCode0
From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language ModelsCode0
MMG-Ego4D: Multi-Modal Generalization in Egocentric Action RecognitionCode0
Model Synthesis for Zero-Shot Model AttributionCode0
MLAN: Language-Based Instruction Tuning Improves Zero-Shot Generalization of Multimodal Large Language ModelsCode0
ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement LearningCode0
Compositional Generalization with Tree Stack Memory UnitsCode0
Zero-Shot Generalization using Intrinsically Motivated Compositional Emergent ProtocolsCode0
Compositional Learning of Visually-Grounded Concepts Using ReinforcementCode0
Memorizing SAM: 3D Medical Segment Anything Model with Memorizing TransformerCode0
CLUTR: Curriculum Learning via Unsupervised Task Representation LearningCode0
CABACE: Injecting Character Sequence Information and Domain Knowledge for Enhanced Acronym and Long-Form ExtractionCode0
MedViLaM: A multimodal large language model with advanced generalizability and explainability for medical data understanding and generationCode0
GeLLMO: Generalizing Large Language Models for Multi-property Molecule OptimizationCode0
Factored World Models for Zero-Shot Generalization in Robotic ManipulationCode0
Swiss DINO: Efficient and Versatile Vision Framework for On-device Personal Object SearchCode0
GAMR: A Guided Attention Model for (visual) ReasoningCode0
MADation: Face Morphing Attack Detection with Foundation ModelsCode0
Show:102550
← PrevPage 11 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GR-MGAvg. sequence length4.04Unverified
2MoDEAvg. sequence length4.01Unverified
3RoboUniViewAvg. sequence length3.65Unverified
43D Diffuser ActorAvg. sequence length3.27Unverified
5GR-1Avg. sequence length3.06Unverified