SOTAVerified

zero-shot-classification

Papers

Showing 351400 of 422 papers

TitleStatusHype
Learning Portrait Style RepresentationsCode0
Zero-Shot Classification by Logical Reasoning on Natural Language ExplanationsCode0
Design of the topology for contrastive visual-textual alignmentCode0
Multi-level Cross-modal Feature Alignment via Contrastive Learning towards Zero-shot Classification of Remote Sensing Image ScenesCode0
Automatic Report Generation for Histopathology images using pre-trained Vision TransformersCode0
Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep LearningCode0
Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention NetworksCode0
Learning Deep Representations of Fine-grained Visual DescriptionsCode0
A Unified Debiasing Approach for Vision-Language Models across Modalities and TasksCode0
ZSCRGAN: A GAN-based Expectation Maximization Model for Zero-Shot Retrieval of Images from Textual DescriptionsCode0
What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language ModelsCode0
NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion ModelsCode0
Can Graph Neural Networks Learn Language with Extremely Weak Text Supervision?Code0
Boosting Visual-Language Models by Exploiting Hard SamplesCode0
Non-Contrastive Learning Meets Language-Image Pre-TrainingCode0
Large Language Models versus Classical Machine Learning: Performance in COVID-19 Mortality Prediction Using High-Dimensional Tabular DataCode0
OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-AdjustmentCode0
Online Zero-Shot Classification with CLIPCode0
On the effectiveness of Large Language Models in the mechanical design domainCode0
On the use of Silver Standard Data for Zero-shot Classification Tasks in Information ExtractionCode0
LAION-5B: An open large-scale dataset for training next generation image-text modelsCode0
Source-Free Domain Adaptation Guided by Vision and Vision-Language Pre-TrainingCode0
KPL: Training-Free Medical Knowledge Mining of Vision-Language ModelsCode0
Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding AlignmentCode0
M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient PretrainingCode0
OverPrompt: Enhancing ChatGPT through Efficient In-Context LearningCode0
Stacked Semantics-Guided Attention Model for Fine-Grained Zero-Shot LearningCode0
Investigating the Emergent Audio Classification Ability of ASR Foundation ModelsCode0
Improving Zero-Shot Detection of Low Prevalence Chest Pathologies using Domain Pre-trained Language ModelsCode0
StarFT: Robust Fine-tuning of Zero-shot Models via Spuriosity AlignmentCode0
Perturb and Recover: Fine-tuning for Effective Backdoor Removal from CLIPCode0
AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate DiagnosisCode0
Understanding Visual Concepts Across ModelsCode0
Describe me an Aucklet: Generating Grounded Perceptual Category DescriptionsCode0
DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic SegmentationCode0
I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognitionCode0
Data-Free Generalized Zero-Shot LearningCode0
Better Safe than Sorry: Pre-training CLIP against Targeted Data Poisoning and Backdoor AttacksCode0
Uniformity First: Uniformity-aware Test-time Adaptation of Vision-language Models against Image CorruptionCode0
WYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization RetrievalCode0
Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual RepresentationsCode0
Gradient Matching Generative Networks for Zero-Shot LearningCode0
Robustifying Point Cloud Networks by RefocusingCode0
Task-Driven Modular Networks for Zero-Shot Compositional LearningCode0
Connecting NeRFs, Images, and TextCode0
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language ModelsCode0
Real-Time Cell Sorting with Scalable In Situ FPGA-Accelerated Deep LearningCode0
Geodesic Multi-Modal Mixup for Robust Fine-TuningCode0
Telling Stories for Common Sense Zero-Shot Action RecognitionCode0
Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism DetectionCode0
Show:102550
← PrevPage 8 of 9Next →

No leaderboard results yet.