SOTAVerified

zero-shot-classification

Papers

Showing 126150 of 422 papers

TitleStatusHype
Uniformity First: Uniformity-aware Test-time Adaptation of Vision-language Models against Image CorruptionCode0
StarFT: Robust Fine-tuning of Zero-shot Models via Spuriosity AlignmentCode0
Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors0
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining0
Image Classification Using a Diffusion Model as a Pre-Training Model0
Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning0
Helping Large Language Models Protect Themselves: An Enhanced Filtering and Summarization System0
On the effectiveness of Large Language Models in the mechanical design domainCode0
Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism DetectionCode0
RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability0
Refining CLIP's Spatial Awareness: A Visual-Centric Perspective0
CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization0
ViLAaD: Enhancing "Attracting and Dispersing'' Source-Free Domain Adaptation with Vision-and-Language Model0
Enhancing Small Language Models for Cross-Lingual Generalized Zero-Shot Classification with Soft Prompt Tuning0
Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection0
Bayesian Modeling of Zero-Shot Classifications for Urban Flood DetectionCode0
Real-Time Cell Sorting with Scalable In Situ FPGA-Accelerated Deep LearningCode0
TLAC: Two-stage LMM Augmented CLIP for Zero-Shot ClassificationCode0
Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images0
OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-AdjustmentCode0
A Zero-Shot Learning Approach for Ephemeral Gully Detection from Remote Sensing using Vision Language Models0
Analyzing CLIP's Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study0
SuPreME: A Supervised Pre-training Framework for Multimodal ECG Representation Learning0
Progressive Local Alignment for Medical Multimodal Pre-training0
Knowledge-enhanced Multimodal ECG Representation Learning with Arbitrary-Lead Inputs0
Show:102550
← PrevPage 6 of 17Next →

No leaderboard results yet.