SOTAVerified

zero-shot-classification

Papers

Showing 101150 of 422 papers

TitleStatusHype
ProtoCLIP: Prototypical Contrastive Language Image PretrainingCode1
CyCLIP: Cyclic Contrastive Language-Image PretrainingCode1
Self-Guided Noise-Free Data Generation for Efficient Zero-Shot LearningCode1
No Token Left Behind: Explainability-Aided Image Classification and GenerationCode1
Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation LearningCode1
Decoupling Zero-Shot Semantic SegmentationCode1
CLIP-Lite: Information Efficient Visual Representation Learning with Language SupervisionCode1
Florence: A New Foundation Model for Computer VisionCode1
Wav2CLIP: Learning Robust Audio Representations From CLIPCode1
Zero-Shot Out-of-Distribution Detection Based on the Pre-trained Model CLIPCode1
Discriminative Region-based Multi-Label Zero-Shot LearningCode1
Contrastive Language-Image Pre-training for the Italian LanguageCode1
Toward Zero-Shot Unsupervised Image-to-Image TranslationCode1
Discovering Human Interactions With Novel Objects via Zero-Shot LearningCode1
Deep Learning Models for Multilingual Hate Speech DetectionCode1
Latent Embedding Feedback and Discriminative Features for Zero-Shot ClassificationCode1
Parameter Space Factorization for Zero-Shot Learning across Tasks and LanguagesCode1
Episode-based Prototype Generating Network for Zero-Shot LearningCode1
Zero-Shot Semantic SegmentationCode1
DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic SegmentationCode0
Comparison of ConvNeXt and Vision-Language Models for Breast Density Assessment in Screening Mammography0
Harmonizing and Merging Source Models for CLIP-based Domain Generalization0
AmorLIP: Efficient Language-Image Pretraining via AmortizationCode0
Distill CLIP (DCLIP): Enhancing Image-Text Retrieval via Cross-Modal Transformer Distillation0
Beginning with You: Perceptual-Initialization Improves Vision-Language Representation and Alignment0
Uniformity First: Uniformity-aware Test-time Adaptation of Vision-language Models against Image CorruptionCode0
StarFT: Robust Fine-tuning of Zero-shot Models via Spuriosity AlignmentCode0
Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors0
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining0
Image Classification Using a Diffusion Model as a Pre-Training Model0
Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning0
Helping Large Language Models Protect Themselves: An Enhanced Filtering and Summarization System0
On the effectiveness of Large Language Models in the mechanical design domainCode0
Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism DetectionCode0
RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability0
Refining CLIP's Spatial Awareness: A Visual-Centric Perspective0
CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization0
ViLAaD: Enhancing "Attracting and Dispersing'' Source-Free Domain Adaptation with Vision-and-Language Model0
Enhancing Small Language Models for Cross-Lingual Generalized Zero-Shot Classification with Soft Prompt Tuning0
Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection0
Bayesian Modeling of Zero-Shot Classifications for Urban Flood DetectionCode0
Real-Time Cell Sorting with Scalable In Situ FPGA-Accelerated Deep LearningCode0
TLAC: Two-stage LMM Augmented CLIP for Zero-Shot ClassificationCode0
Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images0
OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-AdjustmentCode0
A Zero-Shot Learning Approach for Ephemeral Gully Detection from Remote Sensing using Vision Language Models0
Analyzing CLIP's Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study0
SuPreME: A Supervised Pre-training Framework for Multimodal ECG Representation Learning0
Progressive Local Alignment for Medical Multimodal Pre-training0
Knowledge-enhanced Multimodal ECG Representation Learning with Arbitrary-Lead Inputs0
Show:102550
← PrevPage 3 of 9Next →

No leaderboard results yet.