| SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising | Mar 7, 2024 | DenoisingInstance Segmentation | CodeCode Available | 0 | 5 |
| Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data | Jul 9, 2025 | Motion GenerationZero-shot Generalization | CodeCode Available | 0 | 5 |
| Long Range Language Modeling via Gated State Spaces | Jun 27, 2022 | ArticlesLanguage Modeling | CodeCode Available | 0 | 5 |
| Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning | Jun 13, 2024 | Zero-shot Generalization | CodeCode Available | 0 | 5 |
| CLUTR: Curriculum Learning via Unsupervised Task Representation Learning | Oct 19, 2022 | Reinforcement Learning (RL)Representation Learning | CodeCode Available | 0 | 5 |
| SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation | Nov 19, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 0 | 5 |
| One Shot is Enough for Sequential Infrared Small Target Segmentation | Aug 9, 2024 | One-Shot SegmentationSegmentation | CodeCode Available | 0 | 5 |
| RVTBench: A Benchmark for Visual Reasoning Tasks | May 17, 2025 | Reasoning SegmentationVisual Question Answering (VQA) | CodeCode Available | 0 | 5 |
| Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation | Aug 7, 2024 | Adversarial RobustnessImage Segmentation | —Unverified | 0 | 0 |
| A Coach-Player Framework for Dynamic Team Composition | Jan 1, 2021 | Zero-shot Generalization | —Unverified | 0 | 0 |
| Adaptive Human Trajectory Prediction via Latent Corridors | Dec 11, 2023 | PredictionTrajectory Prediction | —Unverified | 0 | 0 |
| Adaptive Prediction Ensemble: Improving Out-of-Distribution Generalization of Motion Forecasting | Jul 12, 2024 | Autonomous DrivingDeep Learning | —Unverified | 0 | 0 |
| Adaptive recurrent vision performs zero-shot computation scaling to unseen difficulty levels | Nov 12, 2023 | PathfinderVisual Reasoning | —Unverified | 0 | 0 |
| Adversarial Environment Design via Regret-Guided Diffusion Models | Oct 25, 2024 | Deep Reinforcement LearningDiversity | —Unverified | 0 | 0 |
| Aether: Geometric-Aware Unified World Modeling | Mar 24, 2025 | Dynamic ReconstructionPrediction | —Unverified | 0 | 0 |
| Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization | Nov 2, 2023 | Domain GeneralizationPrompt Learning | —Unverified | 0 | 0 |
| A Minimalist Prompt for Zero-Shot Policy Learning | May 9, 2024 | Zero-shot Generalization | —Unverified | 0 | 0 |
| Amortized Active Causal Induction with Deep Reinforcement Learning | May 26, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Anchored Diffusion Language Model | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models | Jan 12, 2024 | Active LearningDiversity | —Unverified | 0 | 0 |
| Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering | May 2, 2022 | DecoderImage Captioning | —Unverified | 0 | 0 |
| AnyBody: A Benchmark Suite for Cross-Embodiment Manipulation | May 21, 2025 | Zero-shot Generalization | —Unverified | 0 | 0 |
| AnyMorph: Learning Transferable Polices By Inferring Agent Morphology | Jun 17, 2022 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| AnySkin: Plug-and-play Skin Sensing for Robotic Touch | Sep 12, 2024 | Zero-shot Generalization | —Unverified | 0 | 0 |
| AoP-SAM: Automation of Prompts for Efficient Segmentation | May 17, 2025 | Image SegmentationPrompt Engineering | —Unverified | 0 | 0 |
| A Recipe for Improving Remote Sensing VLM Zero Shot Generalization | Mar 10, 2025 | Cross-Modal RetrievalZero-Shot Cross-Modal Retrieval | —Unverified | 0 | 0 |
| A Review of 3D Object Detection with Vision-Language Models | Apr 25, 2025 | 3D Object DetectionObject | —Unverified | 0 | 0 |
| A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs | Nov 21, 2023 | object-detectionObject Detection | —Unverified | 0 | 0 |
| A Simple yet Efficient Ensemble Approach for AI-generated Text Detection | Nov 6, 2023 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment | Apr 12, 2021 | Zero-shot Generalization | —Unverified | 0 | 0 |
| Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories | Feb 7, 2023 | RetrievalZero-shot Generalization | —Unverified | 0 | 0 |
| A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation | Jan 31, 2025 | Sequential RecommendationTransfer Learning | —Unverified | 0 | 0 |
| BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models | Jul 18, 2024 | HallucinationLanguage Modelling | —Unverified | 0 | 0 |
| BELT:Bootstrapping Electroencephalography-to-Language Decoding and Zero-Shot Sentiment Classification by Natural Language Supervision | Sep 21, 2023 | Brain DecodingContrastive Learning | —Unverified | 0 | 0 |
| Benchmarking General-Purpose In-Context Learning | May 27, 2024 | BenchmarkingDecision Making | —Unverified | 0 | 0 |
| Benchmarking VLMs' Reasoning About Persuasive Atypical Images | Sep 16, 2024 | BenchmarkingObject Recognition | —Unverified | 0 | 0 |
| BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning | Oct 24, 2024 | Instruction FollowingNatural Language Understanding | —Unverified | 0 | 0 |
| From Goal-Conditioned to Language-Conditioned Agents via Vision-Language Models | Sep 24, 2024 | Reinforcement Learning (RL)Zero-shot Generalization | —Unverified | 0 | 0 |
| Temporal-spatial Adaptation of Promptable SAM Enhance Accuracy and Generalizability of cine CMR Segmentation | Mar 15, 2024 | Myocardium SegmentationSegmentation | —Unverified | 0 | 0 |
| Categorical Traffic Transformer: Interpretable and Diverse Behavior Prediction with Tokenized Latent | Nov 30, 2023 | Autonomous VehiclesCommon Sense Reasoning | —Unverified | 0 | 0 |
| Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective | Jan 19, 2025 | Automated Theorem ProvingMath | —Unverified | 0 | 0 |
| Cheap Lunch for Medical Image Segmentation by Fine-tuning SAM on Few Exemplars | Aug 27, 2023 | Brain Tumor SegmentationImage Segmentation | —Unverified | 0 | 0 |
| CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance | Dec 5, 2024 | Contrastive Learningcross-modal alignment | —Unverified | 0 | 0 |
| CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning | May 22, 2025 | Zero-shot Generalization | —Unverified | 0 | 0 |
| Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning | Oct 31, 2024 | Graph Neural Networkreinforcement-learning | —Unverified | 0 | 0 |
| Compositional generalization through abstract representations in human and artificial neural networks | Sep 15, 2022 | Zero-shot Generalization | —Unverified | 0 | 0 |
| Compound Expression Recognition via Large Vision-Language Models | Mar 14, 2025 | Emotion RecognitionZero-shot Generalization | —Unverified | 0 | 0 |
| Concept-modulated model-based offline reinforcement learning for rapid generalization | Sep 7, 2022 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Context-Aware Multimodal Pretraining | Nov 22, 2024 | Contrastive LearningRepresentation Learning | —Unverified | 0 | 0 |
| Contrastive Learning of English Language and Crystal Graphs for Multimodal Representation of Materials Knowledge | Feb 23, 2025 | Contrastive LearningZero-shot Generalization | —Unverified | 0 | 0 |