SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
Audio-Visual Captioning
Audio-Visual Captioning
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 1–4 of 4 papers
Title
Date
Tasks
Status
Hype
VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
May 29, 2023
Audio captioning
Audio-Visual Captioning
Code
Code Available
2
VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
Apr 17, 2023
Audio captioning
Audio-Video Question Answering (AVQA)
Code
Code Available
2
LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport
Jan 16, 2025
AudioCaps
Audio captioning
Code
Code Available
1
AVCap: Leveraging Audio-Visual Features as Text Tokens for Captioning
Jul 10, 2024
Audio-Visual Captioning
Image Captioning
Code
Code Available
1
Show:
10
25
50
No leaderboard results yet.