GLAD: Generalizable Tuning for Vision-Language Models Jul 17, 2025 Domain Generalization Few-Shot Learning
— Unverified 0DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation Jul 14, 2025 Decoder GPU
Code Code Available 0Zero-Shot Learning for Obsolescence Risk Forecasting Jun 26, 2025 Prediction Zero-Shot Learning
— Unverified 0EVA: Mixture-of-Experts Semantic Variant Alignment for Compositional Zero-Shot Learning Jun 26, 2025 Compositional Zero-Shot Learning Mixture-of-Experts
— Unverified 0SEZ-HARN: Self-Explainable Zero-shot Human Activity Recognition Network Jun 25, 2025 Activity Recognition Human Activity Recognition
Code Code Available 0A Multi-Scale Spatial Attention-Based Zero-Shot Learning Framework for Low-Light Image Enhancement Jun 23, 2025 Autonomous Navigation Computational Efficiency
— Unverified 0AnyTraverse: An off-road traversability framework with VLM and human operator in the loop Jun 20, 2025 Autonomous Navigation Zero-Shot Learning
— Unverified 0Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation Jun 20, 2025 Multi-agent Reinforcement Learning SMAC
Code Code Available 0OTFusion: Bridging Vision-only and Vision-Language Models via Optimal Transport for Transductive Zero-Shot Learning Jun 16, 2025 Zero-Shot Learning
— Unverified 0Comparison of ConvNeXt and Vision-Language Models for Breast Density Assessment in Screening Mammography Jun 16, 2025 breast density classification Classification
— Unverified 0An Interdisciplinary Review of Commonsense Reasoning and Intent Detection Jun 16, 2025 Intent Detection Natural Language Understanding
— Unverified 0Harmonizing and Merging Source Models for CLIP-based Domain Generalization Jun 11, 2025 Domain Generalization zero-shot-classification
— Unverified 0Low-Rank Augmented Implicit Neural Representation for Unsupervised High-Dimensional Quantitative MRI Reconstruction Jun 10, 2025 Image Reconstruction MRI Reconstruction
— Unverified 0Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models Jun 10, 2025 Contrastive Learning Image-text matching
Code Code Available 1Hyperbolic Dual Feature Augmentation for Open-Environment Jun 10, 2025 class-incremental learning Class Incremental Learning
— Unverified 0CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray Jun 9, 2025 Classification Diagnostic
— Unverified 0MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories Jun 5, 2025 Benchmarking Optical Character Recognition
Code Code Available 2Large Language Models for EEG: A Comprehensive Survey and Taxonomy Jun 2, 2025 Diagnostic EEG
— Unverified 0A Brain Graph Foundation Model: Pre-Training and Prompt-Tuning for Any Atlas and Disorder May 31, 2025 Contrastive Learning Meta-Learning
Code Code Available 1GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models May 30, 2025 Classification Disaster Response
Code Code Available 2Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning May 26, 2025 Zero-Shot Learning
— Unverified 0Distill CLIP (DCLIP): Enhancing Image-Text Retrieval via Cross-Modal Transformer Distillation May 25, 2025 Contrastive Learning Image-text Retrieval
— Unverified 0AmorLIP: Efficient Language-Image Pretraining via Amortization May 25, 2025 Contrastive Learning Representation Learning
Code Code Available 0Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning May 23, 2025 Decoder Image Captioning
Code Code Available 4Monocular Marker-free Patient-to-Image Intraoperative Registration for Cochlear Implant Surgery May 23, 2025 Zero-Shot Learning
— Unverified 0Zero-Shot Anomaly Detection in Battery Thermal Images Using Visual Question Answering with Prior Knowledge May 22, 2025 Anomaly Detection Question Answering
— Unverified 0Beginning with You: Perceptual-Initialization Improves Vision-Language Representation and Alignment May 20, 2025 Representation Learning Retrieval
— Unverified 0From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection May 19, 2025 feature selection Out-of-Distribution Generalization
Code Code Available 1Uniformity First: Uniformity-aware Test-time Adaptation of Vision-language Models against Image Corruption May 19, 2025 Knowledge Distillation Test-time Adaptation
Code Code Available 0StarFT: Robust Fine-tuning of Zero-shot Models via Spuriosity Alignment May 19, 2025 zero-shot-classification Zero-Shot Learning
Code Code Available 0GenZSL: Generative Zero-Shot Learning Via Inductive Variational Autoencoder May 17, 2025 Diversity Zero-Shot Learning
Code Code Available 0CrypticBio: A Large Multimodal Dataset for Visually Confusing Biodiversity May 16, 2025 Zero-Shot Learning
Code Code Available 0Feasibility with Language Models for Open-World Compositional Zero-Shot Learning May 16, 2025 Attribute Compositional Zero-Shot Learning
— Unverified 0SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision May 16, 2025 Depth Estimation Instance Segmentation
— Unverified 0Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner May 16, 2025 Cross-Modal Retrieval Diagnostic
Code Code Available 2ZEUS: Zero-shot Embeddings for Unsupervised Separation of Tabular Data May 15, 2025 Clustering Deep Learning
Code Code Available 0MSCI: Addressing CLIP's Inherent Limitations for Compositional Zero-Shot Learning May 15, 2025 Compositional Zero-Shot Learning cross-modal alignment
Code Code Available 1Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors May 15, 2025 Language Modeling Language Modelling
— Unverified 0Human-like Cognitive Generalization for Large Models via Brain-in-the-loop Supervision May 14, 2025 Natural Language Understanding Zero-Shot Learning
— Unverified 0Beyond CLIP Generalization: Against Forward&Backward Forgetting Adapter for Continual Learning of Vision-Language Models May 12, 2025 Continual Learning Few-Shot Learning
— Unverified 0TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining May 12, 2025 Audio captioning Audio Generation
— Unverified 0Implementing Long Text Style Transfer with LLMs through Dual-Layered Sentence and Paragraph Structure Extraction and Mapping May 11, 2025 Sentence Style Transfer
— Unverified 0Image Classification Using a Diffusion Model as a Pre-Training Model May 11, 2025 Contrastive Learning image-classification
— Unverified 0MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks May 9, 2025 Diagnostic Instruction Following
Code Code Available 1scDrugMap: Benchmarking Large Foundation Models for Drug Response Prediction May 8, 2025 Benchmarking Drug Discovery
Code Code Available 1The Pitfalls of Growing Group Complexity: LLMs and Social Choice-Based Aggregation for Group Recommendations May 8, 2025 In-Context Learning Recommendation Systems
— Unverified 0FG-CLIP: Fine-Grained Visual and Textual Alignment May 8, 2025 Image-text Retrieval object-detection
Code Code Available 4Exploring Zero-Shot App Review Classification with ChatGPT: Challenges and Potential May 7, 2025 Zero-Shot Learning
— Unverified 0Interpretable Zero-shot Learning with Infinite Class Concepts May 6, 2025 Hallucination Zero-Shot Learning
— Unverified 0CXR-AD: Component X-ray Image Dataset for Industrial Anomaly Detection May 6, 2025 Anomaly Detection Defect Detection
— Unverified 0