| The OCON model: an old but gold solution for distributable supervised classification | Oct 5, 2024 | Automatic Speech RecognitionClassification | CodeCode Available | 0 |
| The OCON model: an old but green solution for distributable supervised classification for acoustic monitoring in smart cities | Oct 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Team MTS @ AutoMin 2021: An Overview of Existing Summarization Approaches and Comparison to Unsupervised Summarization Techniques | Oct 4, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Self-Powered LLM Modality Expansion for Large Speech-Text Models | Oct 4, 2024 | Automatic Speech RecognitionInstruction Following | CodeCode Available | 0 |
| Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems | Oct 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Convolutional Variational Autoencoders for Spectrogram Compression in Automatic Speech Recognition | Oct 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Spoken Grammar Assessment Using LLM | Oct 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Automatic Speech Recognition for the Ika Language | Oct 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-End Speech Recognition with Pre-trained Masked Language Model | Oct 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Alignment-Free Training for Transducer-based Multi-Talker ASR | Sep 30, 2024 | AllAutomatic Speech Recognition | —Unverified | 0 |
| Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems | Sep 30, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| AfriHuBERT: A self-supervised speech representation model for African languages | Sep 30, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Fine-Tuning Automatic Speech Recognition for People with Parkinson's: An Effective Strategy for Enhancing Speech Technology Accessibility | Sep 29, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficient Long-Form Speech Recognition for General Speech In-Context Learning | Sep 29, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A GEN AI Framework for Medical Note Generation | Sep 27, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models | Sep 27, 2024 | Automatic Speech RecognitionMamba | —Unverified | 0 |
| Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking | Sep 27, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study | Sep 26, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unveiling the Role of Pretraining in Direct Speech Translation | Sep 26, 2024 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Deep CLAS: Deep Contextual Listen, Attend and Spell | Sep 26, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events | Sep 25, 2024 | Audio TaggingAutomatic Speech Recognition | —Unverified | 0 |
| Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling | Sep 25, 2024 | Automatic Speech RecognitionEmotion Recognition | CodeCode Available | 0 |
| Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition | Sep 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not | Sep 25, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Speech Recognition Rescoring with Large Speech-Text Foundation Models | Sep 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Revisiting Acoustic Features for Robust ASR | Sep 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Spelling Correction through Rewriting of Non-Autoregressive ASR Lattices | Sep 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs | Sep 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM | Sep 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction | Sep 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder | Sep 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Multimodal Dense Retrieval Approach for Speech-Based Open-Domain Question Answering | Sep 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| LM-assisted keyword biasing with Aho-Corasick algorithm for Transducer-based ASR | Sep 20, 2024 | ARCAutomatic Speech Recognition | —Unverified | 0 |
| Large Language Model Should Understand Pinyin for Chinese ASR Error Correction | Sep 20, 2024 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper | Sep 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection | Sep 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition | Sep 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Personalized Speech Recognition for Children with Test-Time Adaptation | Sep 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Enhancing Synthetic Training Data for Speech Commands: From ASR-Based Filtering to Domain Adaptation in SSL Latent Space | Sep 19, 2024 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| ASR Benchmarking: Need for a More Representative Conversational Dataset | Sep 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR | Sep 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Chain-of-Thought Prompting for Speech Translation | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses | Sep 17, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| WER We Stand: Benchmarking Urdu ASR Models | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems | Sep 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Augmenting Automatic Speech Recognition Models with Disfluency Detection | Sep 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models | Sep 16, 2024 | Automatic Speech RecognitionPrompt Engineering | —Unverified | 0 |
| SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition | Sep 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |