| Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR | Feb 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification | Feb 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena | Feb 20, 2024 | Automatic Speech Recognitionimage-classification | —Unverified | 0 |
| Ain't Misbehavin' -- Using LLMs to Generate Expressive Robot Behavior in Conversations with the Tabletop Robot Haru | Feb 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models | Feb 14, 2024 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| An Embarrassingly Simple Approach for LLM with Strong ASR Capacity | Feb 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| The Sound of Healthcare: Improving Medical Transcription ASR Accuracy with Large Language Models | Feb 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension | Feb 12, 2024 | 2kAutomatic Speech Recognition | CodeCode Available | 2 |
| The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese | Feb 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Self-consistent context aware conformer transducer for speech recognition | Feb 9, 2024 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition | Feb 8, 2024 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 1 |
| Paralinguistics-Aware Speech-Empowered Large Language Models for Natural Conversation | Feb 8, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Progressive unsupervised domain adaptation for ASR using ensemble models and multi-stage training | Feb 7, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR | Feb 6, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Resolving Transcription Ambiguity in Spanish: A Hybrid Acoustic-Lexical System for Punctuation Restoration | Feb 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Comprehensive Study of the Current State-of-the-Art in Nepali Automatic Speech Recognition Systems | Feb 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Predicting positive transfer for improved low-resource speech recognition using acoustic pseudo-tokens | Feb 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Whispering in Norwegian: Navigating Orthographic and Dialectic Challenges | Feb 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Digits micro-model for accurate and secure transactions | Feb 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Streaming Sequence Transduction through Dynamic Compression | Feb 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| AccentFold: A Journey through African Accents for Zero-Shot ASR Adaptation to Target Accents | Feb 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition | Jan 28, 2024 | AllAutomatic Speech Recognition | —Unverified | 0 |
| Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline | Jan 26, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MF-AED-AEC: Speech Emotion Recognition by Leveraging Multimodal Fusion, Asr Error Detection, and Asr Error Correction | Jan 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Locality enhanced dynamic biasing and sampling strategies for contextual ASR | Jan 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Consistency Based Unsupervised Self-training For ASR Personalisation | Jan 22, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers | Jan 22, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Using Large Language Model for End-to-End Chinese ASR and NER | Jan 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Word-Level ASR Quality Estimation for Efficient Corpus Sampling and Post-Editing through Analyzing Attentions of a Reference-Free Metric | Jan 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Large Language Models are Efficient Learners of Noise-Robust Speech Recognition | Jan 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search | Jan 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks | Jan 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition | Jan 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition | Jan 18, 2024 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| On Speech Pre-emphasis as a Simple and Inexpensive Method to Boost Speech Enhancement | Jan 17, 2024 | Automatic Speech RecognitionSpeech Enhancement | —Unverified | 0 |
| Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization | Jan 16, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| Improving ASR Contextual Biasing with Guided Attention | Jan 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription | Jan 16, 2024 | Automatic Speech RecognitionBenchmarking | —Unverified | 0 |
| SeMaScore : a new evaluation metric for automatic speech recognition tasks | Jan 15, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Cascaded Cross-Modal Transformer for Audio-Textual Classification | Jan 15, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Promptformer: Prompted Conformer Transducer for ASR | Jan 14, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization | Jan 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Transcending Controlled Environments Assessing the Transferability of ASRRobust NLU Models to Real-World Applications | Jan 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| XLS-R Deep Learning Model for Multilingual ASR on Low- Resource Languages: Indonesian, Javanese, and Sundanese | Jan 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction | Jan 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2 | Jan 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Useful Blunders: Can Automated Speech Recognition Errors Improve Downstream Dementia Classification? | Jan 10, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Continuously Learning New Words in Automatic Speech Recognition | Jan 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| LUPET: Incorporating Hierarchical Information Path into Multilingual ASR | Jan 8, 2024 | Acoustic Unit DiscoveryAutomatic Speech Recognition | —Unverified | 0 |
| BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators | Jan 8, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |