SOTAVerified

Caption Generation

Papers

Showing 101125 of 310 papers

TitleStatusHype
Bi-directional Contextual Attention for 3D Dense Captioning0
Dual-path Collaborative Generation Network for Emotional Video CaptioningCode0
XMeCap: Meme Caption Generation with Sub-Image Adaptability0
Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing ImagesCode0
Explainable Image Captioning using CNN- CNN architecture and Hierarchical Attention0
Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models?0
Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target TokensCode0
Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon CaptioningCode0
DS@BioMed at ImageCLEFmedical Caption 2024: Enhanced Attention Mechanisms in Medical Caption Generation through Concept Detection Integration0
Multi-Modal Generative Embedding Model0
Less for More: Enhanced Feedback-aligned Mixed LLMs for Molecule Caption Generation and Fine-Grained NLI Evaluation0
MICap: A Unified Model for Identity-aware Movie Descriptions0
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation0
The Solution for the ICCV 2023 1st Scientific Figure Captioning Challenge0
LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival0
PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning0
Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback0
LLMs in Political Science: Heralding a New Era of Visual Analysis0
Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation0
Social Media Ready Caption Generation for Brands0
BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving0
Set Prediction Guided by Semantic Concepts for Diverse Video Captioning0
Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERTCode0
Enhancing Image Captioning with Neural Models0
IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers0
Show:102550
← PrevPage 5 of 13Next →

No leaderboard results yet.