AudioCaps

Papers

Showing 11-20 of 64 papers

Title | Status | Hype
LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport | Code | 1
Bridging Language Gaps in Audio-Text Retrieval | Code | 1
Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation | Code | 1
ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation | Code | 1
RECAP: Retrieval-Augmented Audio Captioning | Code | 1
Prefix tuning for automated audio captioning | Code | 1
Target Sound Extraction with Variable Cross-modality Clues | Code | 1
Is my automatic audio captioning system so bad? spider-max: a metric to consider several caption candidates | Code | 1
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention | Code | 1
Audio Retrieval with WavText5K and CLAP Training | Code | 1

No leaderboard results yet.