CosyAudio: Improving Audio Generation with Confidence Scores and Synthetic Captions Jan 28, 2025 Audio captioning Audio Generation
— Unverified 0Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model Jan 13, 2025 Audio captioning Instruction Following
— Unverified 0Classifier-Guided Captioning Across Modalities Jan 3, 2025 Audio captioning Video Captioning
— Unverified 0EmotionCaps: Enhancing Audio Captioning Through Emotion-Augmented Data Generation Oct 15, 2024 Audio captioning Emotion Recognition
— Unverified 0Enhancing Retrieval-Augmented Audio Captioning with Generation-Assisted Multimodal Querying and Progressive Learning Oct 14, 2024 AudioCaps Audio captioning
— Unverified 0DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning Oct 12, 2024 Audio captioning Large Language Model
Code Code Available 0SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs Oct 12, 2024 AudioCaps Audio captioning
Code Code Available 0Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization Oct 9, 2024 Audio captioning Large Language Model
— Unverified 0An Eye for an Ear: Zero-shot Audio Description Leveraging an Image Captioner using Audiovisual Distribution Alignment Oct 8, 2024 Audio captioning Contrastive Learning
Code Code Available 0OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio Separation Sep 28, 2024 Audio captioning
Code Code Available 0CLAIR-A: Leveraging Large Language Models to Judge Audio Captions Sep 19, 2024 Audio captioning Language Modeling
Code Code Available 0Enhancing Low-Resource Language and Instruction Following Capabilities of Audio Language Models Sep 17, 2024 Audio captioning Instruction Following
— Unverified 0Towards Diverse and Efficient Audio Captioning via Diffusion Models Sep 14, 2024 Audio captioning Diversity
— Unverified 0Enhancing Temporal Understanding in Audio Question Answering for Large Audio Language Models Sep 10, 2024 Audio captioning Audio Question Answering
— Unverified 0Expanding on EnCLAP with Auxiliary Retrieval Model for Automated Audio Captioning Sep 2, 2024 Audio captioning Reranking
— Unverified 0Audio Dialogues: Dialogues dataset for audio and music understanding Apr 11, 2024 Audio captioning Audio Question Answering
— Unverified 0Improved Baselines for Data-efficient Perceptual Augmentation of LLMs Mar 20, 2024 Audio captioning Image Captioning
— Unverified 0Learning Audio Concepts from Counterfactual Natural Language Jan 10, 2024 Audio captioning Audio Classification
Code Code Available 0AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning Nov 21, 2023 Acoustic Scene Classification Audio captioning
Code Code Available 0Weakly-supervised Automated Audio Captioning via text only training Sep 21, 2023 AudioCaps Audio captioning
Code Code Available 0Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning Sep 20, 2023 Audio captioning Caption Generation
— Unverified 0Audio Difference Learning for Audio Captioning Sep 15, 2023 Audio captioning
— Unverified 0Parameter Efficient Audio Captioning With Faithful Guidance Using Audio-text Shared Latent Representation Sep 6, 2023 Audio captioning Data Augmentation
— Unverified 0Generating Realistic Images from In-the-wild Sounds Sep 5, 2023 Audio captioning Sentence
— Unverified 0Killing two birds with one stone: Can an audio captioning system also be used for audio-text retrieval? Aug 29, 2023 AudioCaps Audio captioning
— Unverified 0Audio Difference Captioning Utilizing Similarity-Discrepancy Disentanglement Aug 23, 2023 Audio captioning Disentanglement
Code Code Available 0Rethinking Transfer and Auxiliary Learning for Improving Audio Captioning Transformer Aug 20, 2023 AudioCaps Audio captioning
— Unverified 0Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances Jun 16, 2023 Audio captioning Contrastive Learning
Code Code Available 0Improving Audio Caption Fluency with Automatic Error Correction Jun 16, 2023 Audio captioning Sentence
— Unverified 0Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning May 30, 2023 Audio captioning Decoder
— Unverified 0Efficient Audio Captioning Transformer with Patchout and Text Guidance Apr 6, 2023 Audio captioning Caption Generation
— Unverified 0Towards Generating Diverse Audio Captions via Adversarial Training Dec 5, 2022 Audio captioning Diversity
— Unverified 0Impact of visual assistance for automated audio captioning Nov 18, 2022 Audio captioning Event Detection
— Unverified 0Diversity and bias in audio captioning datasets Nov 15, 2022 Audio captioning Diversity
— Unverified 0Investigations in Audio Captioning: Addressing Vocabulary Imbalance and Evaluating Suitability of Language-Centric Performance Metrics Nov 12, 2022 Audio captioning Image Captioning
— Unverified 0Exploring Train and Test-Time Augmentations for Audio-Language Learning Oct 31, 2022 Audio captioning Audio to Text Retrieval
— Unverified 0Automated Audio Captioning via Fusion of Low- and High- Dimensional Features Oct 10, 2022 AudioCaps Audio captioning
— Unverified 0Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity Oct 3, 2022 Audio captioning Image Captioning
— Unverified 0Language-based Audio Retrieval Task in DCASE 2022 Challenge Sep 20, 2022 Audio captioning Retrieval
— Unverified 0An investigation on selecting audio pre-trained models for audio captioning Aug 12, 2022 Audio captioning
— Unverified 0Automated Audio Captioning and Language-Based Audio Retrieval Jul 8, 2022 Audio captioning Retrieval
Code Code Available 0Language-based Audio Retrieval Task in DCASE 2022 Challenge Jun 13, 2022 Audio captioning Retrieval
Code Code Available 0Automated Audio Captioning with Epochal Difficult Captions for Curriculum Learning Jun 4, 2022 Audio captioning
— Unverified 0Automated Audio Captioning: An Overview of Recent Progress and New Challenges May 12, 2022 Audio captioning Caption Generation
— Unverified 0Caption Feature Space Regularization for Audio Captioning Apr 18, 2022 Audio captioning Contrastive Learning
Code Code Available 0Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning Mar 29, 2022 Audio captioning Contrastive Learning
— Unverified 0Leveraging Pre-trained BERT for Audio Captioning Mar 6, 2022 AudioCaps Audio captioning
— Unverified 0Joint Speech Recognition and Audio Captioning Feb 3, 2022 AudioCaps Audio captioning
— Unverified 0Automatic Audio Captioning using Attention weighted Event based Embeddings Jan 28, 2022 Audio captioning Decoder
— Unverified 0Local Information Assisted Attention-free Decoder for Audio Captioning Jan 10, 2022 Audio captioning Caption Generation
Code Code Available 0