Title | Date | Tasks | Code status
AC/DC: LLM-based Audio Comprehension via Dialogue Continuation | Jun 12, 2025 | AudioCaps, Audio captioning | Unverified
Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning | Sep 20, 2023 | Audio captioning, Caption Generation | Unverified
An Attempt towards Interpretable Audio-Visual Video Captioning | Dec 7, 2018 | Audio captioning, Audio-Visual Video Captioning | Unverified
An investigation on selecting audio pre-trained models for audio captioning | Aug 12, 2022 | Audio captioning | Unverified
A Transformer-based Audio Captioning Model with Keyword Estimation | Jul 1, 2020 | Acoustic Scene Classification, Audio captioning | Unverified
AudioCaps: Generating Captions for Audios in The Wild | Jun 1, 2019 | AudioCaps, Audio captioning | Unverified
Audio Captioning using Gated Recurrent Units | Jun 5, 2020 | Audio captioning | Unverified
Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval | Dec 14, 2020 | Audio captioning, Language Modeling | Unverified
Enhancing Retrieval-Augmented Audio Captioning with Generation-Assisted Multimodal Querying and Progressive Learning | Oct 14, 2024 | AudioCaps, Audio captioning | Unverified
Audio Captioning with Composition of Acoustic and Semantic Information | May 13, 2021 | AudioCaps, Audio captioning | Unverified
Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model | Jan 13, 2025 | Audio captioning, Instruction Following | Unverified
Audio Dialogues: Dialogues dataset for audio and music understanding | Apr 11, 2024 | Audio captioning, Audio Question Answering | Unverified
Audio Difference Learning for Audio Captioning | Sep 15, 2023 | Audio captioning | Unverified
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities | Mar 6, 2025 | Audio captioning, Language Modeling | Unverified
Automated Audio Captioning: An Overview of Recent Progress and New Challenges | May 12, 2022 | Audio captioning, Caption Generation | Unverified
Automated Audio Captioning using Transfer Learning and Reconstruction Latent Space Similarity Regularization | Aug 10, 2021 | Audio captioning, Decoder | Unverified
Automated Audio Captioning via Fusion of Low- and High- Dimensional Features | Oct 10, 2022 | AudioCaps, Audio captioning | Unverified
Automated Audio Captioning with Epochal Difficult Captions for Curriculum Learning | Jun 4, 2022 | Audio captioning | Unverified
Automated Audio Captioning with Recurrent Neural Networks | Jun 30, 2017 | Audio captioning, Decoder | Unverified
Automatic Audio Captioning using Attention weighted Event based Embeddings | Jan 28, 2022 | Audio captioning, Decoder | Unverified
CLAP-ART: Automated Audio Captioning with Semantic-rich Audio Representation Tokenizer | Jun 1, 2025 | Audio captioning, Language Modeling | Unverified
Classifier-Guided Captioning Across Modalities | Jan 3, 2025 | Audio captioning, Video Captioning | Unverified
CosyAudio: Improving Audio Generation with Confidence Scores and Synthetic Captions | Jan 28, 2025 | Audio captioning, Audio Generation | Unverified
Diverse Audio Captioning via Adversarial Training | Oct 13, 2021 | Audio captioning, Diversity | Unverified
Diversity and bias in audio captioning datasets | Nov 15, 2022 | Audio captioning, Diversity | Unverified
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning | May 30, 2023 | Audio captioning, Decoder | Unverified
Effects of Word-frequency based Pre- and Post- Processings for Audio Captioning | Sep 24, 2020 | Audio captioning, Data Augmentation | Unverified
Efficient Audio Captioning Transformer with Patchout and Text Guidance | Apr 6, 2023 | Audio captioning, Caption Generation | Unverified
EmotionCaps: Enhancing Audio Captioning Through Emotion-Augmented Data Generation | Oct 15, 2024 | Audio captioning, Emotion Recognition | Unverified
Enhancing Low-Resource Language and Instruction Following Capabilities of Audio Language Models | Sep 17, 2024 | Audio captioning, Instruction Following | Unverified
Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization | Oct 9, 2024 | Audio captioning, Large Language Model | Unverified
Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders | Feb 21, 2025 | Audio captioning, Automatic Speech Recognition | Unverified
Enhancing Temporal Understanding in Audio Question Answering for Large Audio Language Models | Sep 10, 2024 | Audio captioning, Audio Question Answering | Unverified
Evaluating Off-the-Shelf Machine Listening and Natural Language Models for Automated Audio Captioning | Oct 14, 2021 | Audio captioning, Word Embeddings | Unverified
Expanding on EnCLAP with Auxiliary Retrieval Model for Automated Audio Captioning | Sep 2, 2024 | Audio captioning, Reranking | Unverified
Generating Realistic Images from In-the-wild Sounds | Sep 5, 2023 | Audio captioning, Sentence | Unverified
Impact of visual assistance for automated audio captioning | Nov 18, 2022 | Audio captioning, Event Detection | Unverified
Improved Baselines for Data-efficient Perceptual Augmentation of LLMs | Mar 20, 2024 | Audio captioning, Image Captioning | Unverified
Improving Audio Caption Fluency with Automatic Error Correction | Jun 16, 2023 | Audio captioning, Sentence | Unverified
Exploring Train and Test-Time Augmentations for Audio-Language Learning | Oct 31, 2022 | Audio captioning, Audio to Text Retrieval | Unverified
Rethinking Transfer and Auxiliary Learning for Improving Audio Captioning Transformer | Aug 20, 2023 | AudioCaps, Audio captioning | Unverified
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining | May 12, 2025 | Audio captioning, Audio Generation | Unverified
Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity | Oct 3, 2022 | Audio captioning, Image Captioning | Unverified
THE DCASE 2021 CHALLENGE TASK 6 SYSTEM: AUTOMATED AUDIO CAPTIONING WITH WEAKLY SUPERVISED PRE-TRAING AND WORD SELECTION METHODS | Jul 6, 2021 | Audio captioning, Caption Generation | Unverified
The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation | Jul 1, 2020 | Audio captioning, Caption Generation | Unverified
Towards Diverse and Efficient Audio Captioning via Diffusion Models | Sep 14, 2024 | Audio captioning, Diversity | Unverified
Towards Generating Diverse Audio Captions via Adversarial Training | Dec 5, 2022 | Audio captioning, Diversity | Unverified
Unbiased Sliced Wasserstein Kernels for High-Quality Audio Captioning | Feb 8, 2025 | AudioCaps, Audio captioning | Unverified
Learning Audio Concepts from Counterfactual Natural Language | Jan 10, 2024 | Audio captioning, Audio Classification | Code Available
DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning | Oct 12, 2024 | Audio captioning, Large Language Model | Code Available