MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training Nov 28, 2023 Image Captioning Transfer Learning
— Unverified 0MVBench: A Comprehensive Multi-modal Video Understanding Benchmark Nov 28, 2023 3D Question Answering (3D-QA) Diagnostic
Code Code Available 2IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers Nov 27, 2023 Caption Generation Image-text Retrieval
— Unverified 0GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition? Nov 27, 2023 Zero-Shot Learning
Code Code Available 1Italian Crossword Generator: Enhancing Education through Interactive Word Puzzles Nov 27, 2023 Few-Shot Learning Zero-Shot Learning
— Unverified 0MEDITRON-70B: Scaling Medical Pretraining for Large Language Models Nov 27, 2023 Articles Conditional Text Generation
Code Code Available 4ViT-Lens: Towards Omni-modal Representations Nov 27, 2023 EEG Image Generation
Code Code Available 1Effective Backdoor Mitigation in Vision-Language Models Depends on the Pre-training Objective Nov 25, 2023 zero-shot-classification Zero-Shot Learning
— Unverified 0tinyCLAP: Distilling Constrastive Language-Audio Pretrained Models Nov 24, 2023 Audio Generation Event Detection
— Unverified 0Deep Learning and NLP in Cryptocurrency Forecasting: Integrating Financial, Blockchain, and Social Media Data Nov 23, 2023 Data Integration Sentiment Analysis
— Unverified 0Compositional Zero-shot Learning via Progressive Language-based Observations Nov 23, 2023 Compositional Zero-Shot Learning Zero-Shot Learning
— Unverified 0Attribute-Aware Representation Rectification for Generalized Zero-Shot Learning Nov 23, 2023 Attribute Generalized Zero-Shot Learning
Code Code Available 0HOMOE: A Memory-Based and Composition-Aware Framework for Zero-Shot Learning with Hopfield Network and Soft Mixture of Experts Nov 23, 2023 Compositional Zero-Shot Learning Mixture-of-Experts
— Unverified 0Understanding the Vulnerability of CLIP to Image Compression Nov 23, 2023 Image Compression Language Modeling
Code Code Available 0Self-guided Few-shot Semantic Segmentation for Remote Sensing Imagery Based on Large Vision Models Nov 22, 2023 Few-Shot Semantic Segmentation Prompt Learning
— Unverified 0Enhancing Visual Grounding and Generalization: A Multi-Task Cycle Training Approach for Vision-Language Models Nov 21, 2023 Image Segmentation Language Modelling
Code Code Available 0Boosting Audio-visual Zero-shot Learning with Large Language Models Nov 21, 2023 audio-visual learning Descriptive
Code Code Available 0GeoSAM: Fine-tuning SAM with Multi-Modal Prompts for Mobility Infrastructure Segmentation Nov 19, 2023 Image Segmentation Large Language Model
Code Code Available 1MAFALDA: A Benchmark and Comprehensive Study of Fallacy Detection and Classification Nov 16, 2023 Zero-Shot Learning
Code Code Available 0Investigating the Emergent Audio Classification Ability of ASR Foundation Models Nov 15, 2023 Audio Classification Decoder
Code Code Available 0Zero-Shot Segmentation of Eye Features Using the Segment Anything Model (SAM) Nov 14, 2023 Gaze Estimation Image Segmentation
Code Code Available 0Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions Nov 13, 2023 Classification Language Modeling
— Unverified 0CLAMP: A Contrastive Language And Molecule Pre-training Network Nov 12, 2023 Graph Neural Network zero-shot-classification
Code Code Available 0Unified machine learning tasks and datasets for enhancing renewable energy Nov 12, 2023 Zero-Shot Learning
— Unverified 0Automatic Report Generation for Histopathology images using pre-trained Vision Transformers Nov 10, 2023 Decoder Image Segmentation
Code Code Available 0GIPCOL: Graph-Injected Soft Prompting for Compositional Zero-Shot Learning Nov 9, 2023 Attribute Compositional Zero-Shot Learning
Code Code Available 0Analysis and Applications of Deep Learning with Finite Samples in Full Life-Cycle Intelligence of Nuclear Power Generation Nov 7, 2023 Few-Shot Learning Open Set Learning
— Unverified 0Generalized zero-shot audio-to-intent classification Nov 4, 2023 Classification Goal-Oriented Dialog
— Unverified 0Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation Nov 3, 2023 3D Semantic Segmentation Point Cloud Segmentation
— Unverified 0CLIP-AD: A Language-Guided Staged Dual-Path Model for Zero-shot Anomaly Detection Nov 1, 2023 Anomaly Detection Language Modeling
— Unverified 0Re-Scoring Using Image-Language Similarity for Few-Shot Object Detection Nov 1, 2023 Classification Few-Shot Object Detection
Code Code Available 1Class Incremental Learning with Pre-trained Vision-Language Models Oct 31, 2023 class-incremental learning Class Incremental Learning
— Unverified 0AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection Oct 29, 2023 Anomaly Detection Prompt Learning
Code Code Available 2Using Large Language Models to Support Thematic Analysis in Empirical Legal Studies Oct 28, 2023 Language Modelling Large Language Model
— Unverified 0ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models Oct 27, 2023 Column Type Annotation Table annotation
Code Code Available 1Is ChatGPT a Good Multi-Party Conversation Solver? Oct 25, 2023 Zero-Shot Learning
Code Code Available 0Zephyr: Direct Distillation of LM Alignment Oct 25, 2023 2D Cyclist Detection Few-Shot Learning
Code Code Available 5XFEVER: Exploring Fact Verification across Languages Oct 25, 2023 Benchmarking Fact Verification
Code Code Available 0EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition Oct 25, 2023 Facial Expression Recognition Facial Expression Recognition (FER)
Code Code Available 1Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization Oct 23, 2023 Multi-agent Reinforcement Learning Multi-Armed Bandits
— Unverified 0Linear Representations of Sentiment in Large Language Models Oct 23, 2023 zero-shot-classification Zero-Shot Learning
Code Code Available 0Customising General Large Language Models for Specialised Emotion Recognition Tasks Oct 22, 2023 Emotion Recognition Language Modeling
— Unverified 0NERetrieve: Dataset for Next Generation Named Entity Recognition and Retrieval Oct 22, 2023 named-entity-recognition Named Entity Recognition
Code Code Available 1Zero-shot Learning of Individualized Task Contrast Prediction from Resting-state Functional Connectomes Oct 21, 2023 Zero-Shot Learning
— Unverified 0CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement Oct 21, 2023 Depth Estimation image-classification
— Unverified 0SILC: Improving Vision Language Pretraining with Self-Distillation Oct 20, 2023 Classification Contrastive Learning
— Unverified 0Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation Oct 20, 2023 Image Segmentation Semantic Segmentation
Code Code Available 1Weakly-Supervised Semantic Segmentation with Image-Level Labels: from Traditional Models to Foundation Models Oct 19, 2023 Segmentation Semantic Segmentation
Code Code Available 0GraphGPT: Graph Instruction Tuning for Large Language Models Oct 19, 2023 Data Augmentation Graph Learning
Code Code Available 2Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning Oct 19, 2023 Combinatorial Optimization Zero-Shot Learning
Code Code Available 1