SOTAVerified

AudioCaps

Papers

Showing 41–50 of 64 papers

| Title | Status | Hype |
| --- | --- | --- |
| Rethinking Transfer and Auxiliary Learning for Improving Audio Captioning Transformer | | 0 |
| DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment | | 0 |
| ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities | Code | 3 |
| Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model | Code | 3 |
| Prefix tuning for automated audio captioning | Code | 1 |
| Target Sound Extraction with Variable Cross-modality Clues | Code | 1 |
| Accommodating Audio Modality in CLIP for Multimodal Processing | Code | 0 |
| AudioLDM: Text-to-Audio Generation with Latent Diffusion Models | Code | 4 |
| Is my automatic audio captioning system so bad? spider-max: a metric to consider several caption candidates | Code | 1 |
| Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention | Code | 1 |

No leaderboard results yet.