| How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation | Dec 12, 2023 | Anomaly DetectionAutonomous Driving | CodeCode Available | 1 |
| Learning Modular Simulations for Homogeneous Systems | Oct 28, 2022 | Zero-shot Generalization | CodeCode Available | 1 |
| Multimodal Knowledge Alignment with Reinforcement Learning | May 25, 2022 | Audio captioningLanguage Modeling | CodeCode Available | 1 |
| Model Generalization on Text Attribute Graphs: Principles with Large Language Models | Feb 17, 2025 | AttributeGraph Learning | CodeCode Available | 1 |
| Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning | Sep 30, 2023 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 1 |
| Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers | May 21, 2023 | MMLUZero-shot Generalization | CodeCode Available | 1 |
| Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team Composition | May 18, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation | May 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Improving Zero-shot Generalization and Robustness of Multi-modal Models | Dec 4, 2022 | image-classificationImage Classification | CodeCode Available | 1 |
| Improving Zero-Shot Generalization for CLIP with Synthesized Prompts | Jul 14, 2023 | Generalized Zero-Shot LearningTransfer Learning | CodeCode Available | 1 |
| Learning Quadrupedal Locomotion over Challenging Terrain | Oct 21, 2020 | Zero-shot Generalization | CodeCode Available | 1 |
| Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks | Nov 1, 2023 | InformativenessOut-of-Distribution Generalization | CodeCode Available | 1 |
| Improving Zero-Shot Object-Level Change Detection by Incorporating Visual Correspondence | Jan 9, 2025 | Change DetectionZero-shot Generalization | CodeCode Available | 1 |
| DePT: Decoupled Prompt Tuning | Sep 14, 2023 | Prompt EngineeringZero-shot Generalization | CodeCode Available | 1 |
| Learning the Travelling Salesperson Problem Requires Rethinking Generalization | Jun 12, 2020 | Combinatorial OptimizationTransfer Learning | CodeCode Available | 1 |
| M^2PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning | Sep 24, 2024 | Zero-shot Generalization | CodeCode Available | 1 |
| Exploring the Best Practices of Query Expansion with Large Language Models | Jan 12, 2024 | Information RetrievalRe-Ranking | CodeCode Available | 1 |
| Where to Move Next: Zero-shot Generalization of LLMs for Next POI Recommendation | Apr 2, 2024 | Zero-shot Generalization | CodeCode Available | 1 |
| ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving | May 26, 2025 | Autonomous DrivingBench2Drive | CodeCode Available | 1 |
| The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only | Jun 1, 2023 | Zero-shot Generalization | CodeCode Available | 1 |
| CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance | Dec 5, 2024 | Contrastive Learningcross-modal alignment | —Unverified | 0 |
| DynaPrompt: Dynamic Test-Time Prompt Tuning | Jan 27, 2025 | Zero-shot Generalization | —Unverified | 0 |
| A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs | Nov 21, 2023 | object-detectionObject Detection | —Unverified | 0 |
| Latent Diffusion Model Based Denoising Receiver for 6G Semantic Communication: From Stochastic Differential Theory to Application | Jun 6, 2025 | DenoisingSemantic Communication | —Unverified | 0 |
| Large Model Based Referring Camouflaged Object Detection | Nov 28, 2023 | modelObject | —Unverified | 0 |
| Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment | Aug 22, 2024 | Multi-Task LearningRetrieval | —Unverified | 0 |
| Do We Need to Create Big Datasets to Learn a Task? | Nov 1, 2020 | Zero-shot Generalization | —Unverified | 0 |
| Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation | Jun 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization | Nov 2, 2023 | Domain GeneralizationPrompt Learning | —Unverified | 0 |
| Do Transformers know symbolic rules, and would we know if they did? | Feb 19, 2022 | Zero-shot Generalization | —Unverified | 0 |
| A Coach-Player Framework for Dynamic Team Composition | Jan 1, 2021 | Zero-shot Generalization | —Unverified | 0 |
| MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching | Jan 20, 2025 | Keypoint DetectionZero-shot Generalization | —Unverified | 0 |
| Cheap Lunch for Medical Image Segmentation by Fine-tuning SAM on Few Exemplars | Aug 27, 2023 | Brain Tumor SegmentationImage Segmentation | —Unverified | 0 |
| Disentangling Representations through Multi-task Learning | Jul 15, 2024 | Decision MakingMulti-Task Learning | —Unverified | 0 |
| A Review of 3D Object Detection with Vision-Language Models | Apr 25, 2025 | 3D Object DetectionObject | —Unverified | 0 |
| JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking | Oct 31, 2024 | Code CompletionOpen-Domain Question Answering | —Unverified | 0 |
| Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models | Dec 11, 2024 | DisentanglementPosition | —Unverified | 0 |
| Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation | Mar 20, 2025 | Depth EstimationImage Reconstruction | —Unverified | 0 |
| ISCUTE: Instance Segmentation of Cables Using Text Embedding | Feb 19, 2024 | Instance SegmentationObject Recognition | —Unverified | 0 |
| DiffuVolume: Diffusion Model for Volume based Stereo Matching | Aug 30, 2023 | modelStereo Matching | —Unverified | 0 |
| Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective | Jan 19, 2025 | Automated Theorem ProvingMath | —Unverified | 0 |
| DiffCJK: Conditional Diffusion Model for High-Quality and Wide-coverage CJK Character Generation | Apr 8, 2024 | Zero-shot Generalization | —Unverified | 0 |
| I-PHYRE: Interactive Physical Reasoning | Dec 4, 2023 | Zero-shot Generalization | —Unverified | 0 |
| In the Era of Prompt Learning with Vision-Language Models | Nov 7, 2024 | Domain AdaptationDomain Generalization | —Unverified | 0 |
| Categorical Traffic Transformer: Interpretable and Diverse Behavior Prediction with Tokenized Latent | Nov 30, 2023 | Autonomous VehiclesCommon Sense Reasoning | —Unverified | 0 |
| A Recipe for Improving Remote Sensing VLM Zero Shot Generalization | Mar 10, 2025 | Cross-Modal RetrievalZero-Shot Cross-Modal Retrieval | —Unverified | 0 |
| Interaction Modeling with Multiplex Attention | Aug 23, 2022 | Social NavigationTrajectory Forecasting | —Unverified | 0 |
| DEUX: Active Exploration for Learning Unsupervised Depth Perception | Sep 16, 2023 | Depth CompletionDepth Estimation | —Unverified | 0 |
| Aether: Geometric-Aware Unified World Modeling | Mar 24, 2025 | Dynamic ReconstructionPrediction | —Unverified | 0 |
| Temporal-spatial Adaptation of Promptable SAM Enhance Accuracy and Generalizability of cine CMR Segmentation | Mar 15, 2024 | Myocardium SegmentationSegmentation | —Unverified | 0 |