SOTAVerified

Descriptive

Papers

Showing 76100 of 1477 papers

TitleStatusHype
What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable InsightsCode1
GOAL: Global-local Object Alignment LearningCode1
IDAS: Intent Discovery with Abstractive SummarizationCode1
FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font ApplicationsCode1
Can Knowledge Graphs Simplify Text?Code1
From Artificially Real to Real: Leveraging Pseudo Data from Large Language Models for Low-Resource Molecule DiscoveryCode1
First Steps of an Approach to the ARC Challenge based on Descriptive Grid Models and the Minimum Description Length PrincipleCode1
Finetune like you pretrain: Improved finetuning of zero-shot vision modelsCode1
From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-AnsweringCode1
A Visual Analytics Framework for Explaining and Diagnosing Transfer Learning ProcessesCode1
A Variational Algorithm for Quantum Neural NetworksCode1
Zero-Shot Compositional Policy Learning via Language GroundingCode1
Automatic Generation of Topic LabelsCode1
Beyond Co-occurrence: Multi-modal Session-based RecommendationCode1
Bias Loss for Mobile Neural NetworksCode1
Natural scene reconstruction from fMRI signals using generative latent diffusionCode1
A Sparse and Locally Coherent Morphable Face Model for Dense Semantic Correspondence Across Heterogeneous 3D FacesCode1
FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language ModelsCode1
FlexConv: Continuous Kernel Convolutions with Differentiable Kernel SizesCode1
FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis AssistantCode1
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as PromptsCode1
Comprehensive Information Integration Modeling Framework for Video TitlingCode1
CENet: Toward Concise and Efficient LiDAR Semantic Segmentation for Autonomous DrivingCode1
Generating images from caption and vice versa via CLIP-Guided Generative Latent Space SearchCode1
Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language UnderstandingCode1
Show:102550
← PrevPage 4 of 60Next →

No leaderboard results yet.