
Caption Generation

Papers


Showing 1–10 of 310 papers

Title	Date	Tasks	Status	Hype
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models	Jan 31, 2025	Caption Generation, Language Modeling	Code Available	4
Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training	Oct 9, 2024	Caption Generation, Contrastive Learning	Code Available	2
AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models	Nov 28, 2024	Audio captioning, Audio to Text Retrieval	Code Available	2
FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion	Jun 1, 2025	Audio captioning, Caption Generation	Code Available	2
MeaCap: Memory-Augmented Zero-shot Image Captioning	Mar 6, 2024	Caption Generation, Image Captioning	Code Available	2
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning	Aug 22, 2023	Caption Generation, Large Language Model	Code Available	2
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions	Aug 8, 2023	Caption Generation, Image Captioning	Code Available	2
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World	Jun 30, 2025	Caption Generation, Object	Code Available	2
Fine-grained Image Captioning with CLIP Reward	May 26, 2022	Caption Generation, Descriptive	Code Available	2
PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance	Nov 4, 2024	Caption Generation, Multiple-choice	Code Available	2


No leaderboard results yet.