Zero-shot Audio Captioning
Zero-shot audio captioning aims to automatically generate descriptive text captions for audio content without any task-specific training on audio captioning. Audio captioning is commonly concerned with ambient sounds or with sounds produced by a human performing an action.