| A Survey on Speech Large Language Models | Oct 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection) | May 2, 2024 | Acoustic Scene ClassificationEvent Detection | —Unverified | 0 | 0 |
| Attacks as Defenses: Designing Robust Audio CAPTCHAs Using Attacks on Automatic Speech Recognition Systems | Mar 10, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Attention-Based End-to-End Speech Recognition on Voice Search | Jul 22, 2017 | DecoderL2 Regularization | —Unverified | 0 | 0 |
| Audio Adversarial Examples: Attacks Using Vocal Masks | Feb 4, 2021 | Adversarial AttackSpeech-to-Text | —Unverified | 0 | 0 |
| Audio Interval Retrieval using Convolutional Neural Networks | Sep 21, 2021 | Audio ClassificationRetrieval | —Unverified | 0 | 0 |
| AudioPaLM: A Large Language Model That Can Speak and Listen | Jun 22, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Automated Testing of AI Models | Oct 7, 2021 | FairnessSpeech-to-Text | —Unverified | 0 | 0 |
| A Voice Controlled E-Commerce Web Application | Nov 16, 2018 | Medical Diagnosisspeech-recognition | —Unverified | 0 | 0 |
| Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM | Feb 24, 2025 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 | 0 |