RECAP: Retrieval-Augmented Audio Captioning Sep 18, 2023 AudioCaps Audio captioning
Code Code Available 1Audio Difference Learning for Audio Captioning Sep 15, 2023 Audio captioning
— Unverified 0Training Audio Captioning Models without Audio Sep 14, 2023 Audio captioning Decoder
Code Code Available 1Parameter Efficient Audio Captioning With Faithful Guidance Using Audio-text Shared Latent Representation Sep 6, 2023 Audio captioning Data Augmentation
— Unverified 0Generating Realistic Images from In-the-wild Sounds Sep 5, 2023 Audio captioning Sentence
— Unverified 0Killing two birds with one stone: Can an audio captioning system also be used for audio-text retrieval? Aug 29, 2023 AudioCaps Audio captioning
— Unverified 0Audio Difference Captioning Utilizing Similarity-Discrepancy Disentanglement Aug 23, 2023 Audio captioning Disentanglement
Code Code Available 0Rethinking Transfer and Auxiliary Learning for Improving Audio Captioning Transformer Aug 20, 2023 AudioCaps Audio captioning
— Unverified 0Improving Audio Caption Fluency with Automatic Error Correction Jun 16, 2023 Audio captioning Sentence
— Unverified 0Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances Jun 16, 2023 Audio captioning Contrastive Learning
Code Code Available 0Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning May 30, 2023 Audio captioning Decoder
— Unverified 0VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset May 29, 2023 Audio captioning Audio-Visual Captioning
Code Code Available 2Pengi: An Audio Language Model for Audio Tasks May 19, 2023 Audio captioning Audio Question Answering
Code Code Available 2A Whisper transformer for audio captioning trained with synthetic captions and transfer learning May 15, 2023 Audio captioning Speech-to-Text
Code Code Available 1VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset Apr 17, 2023 Audio captioning Audio-Video Question Answering (AVQA)
Code Code Available 2Efficient Audio Captioning Transformer with Patchout and Text Guidance Apr 6, 2023 Audio captioning Caption Generation
— Unverified 0Prefix tuning for automated audio captioning Mar 30, 2023 AudioCaps Audio captioning
Code Code Available 1WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research Mar 30, 2023 Audio captioning Event Detection
Code Code Available 2Towards Generating Diverse Audio Captions via Adversarial Training Dec 5, 2022 Audio captioning Diversity
— Unverified 0Impact of visual assistance for automated audio captioning Nov 18, 2022 Audio captioning Event Detection
— Unverified 0Diversity and bias in audio captioning datasets Nov 15, 2022 Audio captioning Diversity
— Unverified 0Is my automatic audio captioning system so bad? spider-max: a metric to consider several caption candidates Nov 14, 2022 AudioCaps Audio captioning
Code Code Available 1Investigations in Audio Captioning: Addressing Vocabulary Imbalance and Evaluating Suitability of Language-Centric Performance Metrics Nov 12, 2022 Audio captioning Image Captioning
— Unverified 0Exploring Train and Test-Time Augmentations for Audio-Language Learning Oct 31, 2022 Audio captioning Audio to Text Retrieval
— Unverified 0Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention Oct 28, 2022 AudioCaps Audio captioning
Code Code Available 1Automated Audio Captioning via Fusion of Low- and High- Dimensional Features Oct 10, 2022 AudioCaps Audio captioning
— Unverified 0Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity Oct 3, 2022 Audio captioning Image Captioning
— Unverified 0Audio Retrieval with WavText5K and CLAP Training Sep 28, 2022 AudioCaps Audio captioning
Code Code Available 1Language-based Audio Retrieval Task in DCASE 2022 Challenge Sep 20, 2022 Audio captioning Retrieval
— Unverified 0An investigation on selecting audio pre-trained models for audio captioning Aug 12, 2022 Audio captioning
— Unverified 0Automated Audio Captioning and Language-Based Audio Retrieval Jul 8, 2022 Audio captioning Retrieval
Code Code Available 0Language-based Audio Retrieval Task in DCASE 2022 Challenge Jun 13, 2022 Audio captioning Retrieval
Code Code Available 0Automated Audio Captioning with Epochal Difficult Captions for Curriculum Learning Jun 4, 2022 Audio captioning
— Unverified 0Multimodal Knowledge Alignment with Reinforcement Learning May 25, 2022 Audio captioning Language Modeling
Code Code Available 1Automated Audio Captioning: An Overview of Recent Progress and New Challenges May 12, 2022 Audio captioning Caption Generation
— Unverified 0Caption Feature Space Regularization for Audio Captioning Apr 18, 2022 Audio captioning Contrastive Learning
Code Code Available 0Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning Mar 29, 2022 Audio captioning Contrastive Learning
— Unverified 0Leveraging Pre-trained BERT for Audio Captioning Mar 6, 2022 AudioCaps Audio captioning
— Unverified 0Joint Speech Recognition and Audio Captioning Feb 3, 2022 AudioCaps Audio captioning
— Unverified 0Automatic Audio Captioning using Attention weighted Event based Embeddings Jan 28, 2022 Audio captioning Decoder
— Unverified 0Local Information Assisted Attention-free Decoder for Audio Captioning Jan 10, 2022 Audio captioning Caption Generation
Code Code Available 0Audio Retrieval with Natural Language Queries: A Benchmark Study Dec 17, 2021 AudioCaps Audio captioning
Code Code Available 1AUTOMATED AUDIO CAPTIONING BY FINE-TUNING BART WITH AUDIOSET TAGS Nov 15, 2021 AudioCaps Audio captioning
Code Code Available 0Evaluating Off-the-Shelf Machine Listening and Natural Language Models for Automated Audio Captioning Oct 14, 2021 Audio captioning Word Embeddings
— Unverified 0Diverse Audio Captioning via Adversarial Training Oct 13, 2021 Audio captioning Diversity
— Unverified 0Can Audio Captions Be Evaluated with Image Caption Metrics? Oct 10, 2021 AudioCaps Audio captioning
Code Code Available 1Automated Audio Captioning using Transfer Learning and Reconstruction Latent Space Similarity Regularization Aug 10, 2021 Audio captioning Decoder
— Unverified 0An Encoder-Decoder Based Audio Captioning System With Transfer and Reinforcement Learning Aug 5, 2021 Audio captioning Decoder
Code Code Available 1Audio Captioning Transformer Jul 21, 2021 AudioCaps Audio captioning
Code Code Available 1CL4AC: A Contrastive Loss for Audio Captioning Jul 21, 2021 Audio captioning Decoder
Code Code Available 1