| Political corpus creation through automatic speech recognition on EU debates | Apr 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Multimodal Short Video Rumor Detection System Based on Contrastive Learning | Apr 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers | Apr 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition | Apr 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Evaluation of Speaker Anonymization on Emotional Speech | Apr 15, 2023 | Automatic Speech RecognitionEmotion Recognition | —Unverified | 0 |
| Task-oriented Document-Grounded Dialog Systems by HLTPR@RWTH for DSTC9 and DSTC10 | Apr 14, 2023 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| Speech Reconstruction from Silent Tongue and Lip Articulation By Pseudo Target Generation and Domain Adversarial Training | Apr 12, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Regularizing Contrastive Predictive Coding for Speech Applications | Apr 12, 2023 | Acoustic Unit DiscoveryAutomatic Speech Recognition | —Unverified | 0 |
| Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR | Apr 11, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Scalable and Accurate Self-supervised Multimodal Representation Learning without Aligned Video and Text Data | Apr 4, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Self-Supervised Learning-Based Source Separation for Meeting Data | Apr 3, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multilingual Word Error Rate Estimation: e-WER3 | Apr 2, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Dialog act guided contextual adapter for personalized speech recognition | Mar 31, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Improving the previous state-of-the-art Frisian ASR by fine-tuning XLS-R | Mar 31, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR | Mar 31, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers | Mar 30, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR | Mar 29, 2023 | Automatic Speech RecognitionDomain Adaptation | —Unverified | 0 |
| Joint unsupervised and supervised learning for context-aware language identification | Mar 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis | Mar 27, 2023 | AllAutomatic Speech Recognition | —Unverified | 0 |
| Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition | Mar 23, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Enhancing Unsupervised Speech Recognition with Diffusion GANs | Mar 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognition | Mar 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Self-supervised Learning with Speech Modulation Dropout | Mar 22, 2023 | Automatic Speech RecognitionSelf-Supervised Learning | —Unverified | 0 |
| Transformers in Speech Processing: A Survey | Mar 21, 2023 | Automatic Speech RecognitionSpeech Enhancement | —Unverified | 0 |
| End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations | Mar 21, 2023 | Action DetectionActivity Detection | —Unverified | 0 |
| Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition | Mar 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Code-Switching Text Generation and Injection in Mandarin-English ASR | Mar 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Deep Learning System for Domain-specific Speech Recognition | Mar 18, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model | Mar 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Visual Information Matters for ASR Error Correction | Mar 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Trustera: A Live Conversation Redaction System | Mar 16, 2023 | Automatic Speech RecognitionNatural Language Understanding | —Unverified | 0 |
| HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism | Mar 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences | Mar 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Improving Accented Speech Recognition with Multi-Domain Training | Mar 14, 2023 | Accented Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative Study | Mar 12, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Improving the Intent Classification accuracy in Noisy Environment | Mar 12, 2023 | Automatic Speech RecognitionClassification | —Unverified | 0 |
| Transcription free filler word detection with Neural semi-CRFs | Mar 11, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings | Mar 10, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MIXPGD: Hybrid Adversarial Training for Speech Recognition Systems | Mar 10, 2023 | Adversarial AttackAutomatic Speech Recognition | —Unverified | 0 |
| wav2vec and its current potential to Automatic Speech Recognition in German for the usage in Digital History: A comparative assessment of available ASR-technologies for the use in cultural heritage contexts | Mar 6, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-End Speech Recognition: A Survey | Mar 3, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages | Mar 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Leveraging Large Text Corpora for End-to-End Speech Summarization | Mar 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition | Mar 1, 2023 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space | Mar 1, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Leveraging Redundancy in Multiple Audio Signals for Far-Field Speech Recognition | Mar 1, 2023 | Acoustic echo cancellationAutomatic Speech Recognition | —Unverified | 0 |
| Practice of the conformer enhanced AUDIO-VISUAL HUBERT on Mandarin and English | Feb 28, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Language-Universal Adapter Learning with Knowledge Distillation for End-to-End Multilingual Speech Recognition | Feb 28, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Diacritic Recognition Performance in Arabic ASR | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |