Compound Expression Recognition via Multi Model Ensemble for the ABAW7 Challenge | Jul 17, 2024 | Ensemble Learning, Zero-Shot Learning | Unverified 0
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains | Jul 16, 2024 | Decision Making, Language Modeling | Code Available 1
Codebook LLMs: Evaluating LLMs as Measurement Tools for Political Science Concepts | Jul 15, 2024 | Zero-Shot Learning | Unverified 0
Anticipating Future Object Compositions without Forgetting | Jul 15, 2024 | Attribute, Compositional Zero-Shot Learning | Unverified 0
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding | Jul 13, 2024 | Scene Understanding, Zero-Shot Learning | Unverified 0
PFPs: Prompt-guided Flexible Pathological Segmentation for Diverse Potential Outcomes Using Large Vision and Language Models | Jul 13, 2024 | Language Modeling, Language Modelling | Unverified 0
STD-PLM: Understanding Both Spatial and Temporal Properties of Spatial-Temporal Data with PLM | Jul 12, 2024 | Few-Shot Learning, Imputation | Code Available 1
Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning | Jul 11, 2024 | Temporal Sequences, Zero-Shot Learning | Unverified 0
CosmoCLIP: Generalizing Large Vision-Language Models for Astronomical Imaging | Jul 10, 2024 | Contrastive Learning, Image-text Retrieval | Unverified 0
DuInNet: Dual-Modality Feature Interaction for Point Cloud Completion | Jul 10, 2024 | Denoising, Point Cloud Completion | Unverified 0
Malicious Path Manipulations via Exploitation of Representation Vulnerabilities of Vision-Language Navigation Systems | Jul 10, 2024 | Language Modeling, Language Modelling | Unverified 0
Towards a text-based quantitative and explainable histopathology image analysis | Jul 10, 2024 | image-classification, Image Classification | Code Available 0
Pseudo-triplet Guided Few-shot Composed Image Retrieval | Jul 8, 2024 | Active Learning, Image Retrieval | Unverified 0
FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models | Jul 1, 2024 | Benchmarking, Fairness | Code Available 2
Semantic Compositions Enhance Vision-Language Contrastive Learning | Jul 1, 2024 | Classification, Contrastive Learning | Unverified 0
Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Jun 25, 2024 | cross-modal alignment, Image Classification | Code Available 2
BioTrove: A Large Curated Image Dataset Enabling AI for Biodiversity | Jun 25, 2024 | Zero-Shot Learning | Code Available 1
At First Sight: Zero-Shot Classification of Astronomical Images with Large Multimodal Models | Jun 24, 2024 | Astronomy, Classification | Unverified 0
Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings | Jun 24, 2024 | Conditional Text Generation, Language Modelling | Code Available 0
Review of Zero-Shot and Few-Shot AI Algorithms in The Medical Domain | Jun 23, 2024 | Few-Shot Learning, object-detection | Unverified 0
Serial Position Effects of Large Language Models | Jun 23, 2024 | Position, Zero-Shot Learning | Unverified 0
A Simple Framework for Open-Vocabulary Zero-Shot Segmentation | Jun 23, 2024 | Representation Learning, zero-shot-classification | Unverified 0
Contextual Interaction via Primitive-based Adversarial Training For Compositional Zero-shot Learning | Jun 21, 2024 | Attribute, Compositional Zero-Shot Learning | Code Available 0
CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation | Jun 21, 2024 | Classification, Decoder | Code Available 0
Factual Dialogue Summarization via Learning from Large Language Models | Jun 20, 2024 | Contrastive Learning, Data Augmentation | Unverified 0
A Data-Driven Guided Decoding Mechanism for Diagnostic Captioning | Jun 20, 2024 | Diagnostic, Image to text | Code Available 0
Using Multimodal Large Language Models for Automated Detection of Traffic Safety Critical Events | Jun 19, 2024 | Few-Shot Learning, Zero-Shot Learning | Unverified 0
Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognition | Jun 19, 2024 | Action Recognition, Skeleton Based Action Recognition | Code Available 1
FuseGen: PLM Fusion for Data-generation based Zero-shot Learning | Jun 18, 2024 | Zero-Shot Learning | Code Available 0
MAC: A Benchmark for Multiple Attributes Compositional Zero-Shot Learning | Jun 18, 2024 | Attribute, Compositional Zero-Shot Learning | Unverified 0
BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity | Jun 18, 2024 | Contrastive Learning, Language Modelling | Code Available 1
BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM | Jun 17, 2024 | Continual Pretraining, zero-shot-classification | Unverified 0
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments | Jun 17, 2024 | Fairness, Language Modeling | Code Available 1
Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition | Jun 13, 2024 | Retrieval, zero-shot-classification | Code Available 1
Zero-Shot Learning Over Large Output Spaces : Utilizing Indirect Knowledge Extraction from Large Language Models | Jun 13, 2024 | Language Modelling, Large Language Model | Unverified 0
RWKV-CLIP: A Robust Vision-Language Representation Learner | Jun 11, 2024 | Image-text Retrieval, Representation Learning | Code Available 2
Understanding Visual Concepts Across Models | Jun 11, 2024 | Image Generation, object-detection | Code Available 0
BAMO at SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense | Jun 7, 2024 | Common Sense Reasoning, Sentence | Code Available 0
CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment | Jun 7, 2024 | Contrastive Learning, Zero-Shot Learning | Code Available 1
CountCLIP -- [Re] Teaching CLIP to Count to Ten | Jun 5, 2024 | zero-shot-classification, Zero-Shot Counting | Code Available 1
Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning | Jun 5, 2024 | Attribute, Domain Generalization | Unverified 0
Exploring Data Efficiency in Zero-Shot Learning with Diffusion Models | Jun 5, 2024 | Generalized Zero-Shot Learning, Zero-Shot Learning | Unverified 0
Description Boosting for Zero-Shot Entity and Relation Classification | Jun 4, 2024 | Relation, Relation Classification | Code Available 3
SLANT: Spurious Logo ANalysis Toolkit | Jun 3, 2024 | zero-shot-classification, Zero-Shot Learning | Unverified 0
Multi-Modal Generative Embedding Model | May 29, 2024 | Caption Generation, Cross-Modal Retrieval | Unverified 0
It's Not a Modality Gap: Characterizing and Addressing the Contrastive Gap | May 28, 2024 | image-classification, Image Classification | Unverified 0
MM-Mixing: Multi-Modal Mixing Alignment for 3D Understanding | May 28, 2024 | 3D Classification, 3D Object Recognition | Unverified 0
CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale | May 27, 2024 | Contrastive Learning, Zero-Shot Learning | Code Available 1
Listenable Maps for Zero-Shot Audio Classifiers | May 27, 2024 | Decoder, zero-shot-classification | Unverified 0
TEII: Think, Explain, Interact and Iterate with Large Language Models to Solve Cross-lingual Emotion Detection | May 27, 2024 | Few-Shot Learning, Language Modeling | Code Available 0