| Convoifilter: A case study of doing cocktail party speech recognition | Aug 22, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model | Aug 22, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition | Aug 21, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Indonesian Automatic Speech Recognition with XLSR-53 | Aug 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Bayes Risk Transducer: Transducer with Controllable Alignment Prediction | Aug 19, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals | Aug 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Accurate synthesis of Dysarthric Speech for ASR data augmentation | Aug 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving CTC-AED model with integrated-CTC and auxiliary loss regularization | Aug 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations | Aug 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Text Injection for Capitalization and Turn-Taking Prediction in Speech Models | Aug 14, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder | Aug 14, 2023 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 1 |
| Using Text Injection to Improve Recognition of Personal Identifiers in Speech | Aug 14, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations | Aug 14, 2023 | Action DetectionActivity Detection | CodeCode Available | 0 |
| Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition | Aug 12, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss | Aug 11, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Novel Self-training Approach for Low-resource Speech Recognition | Aug 10, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio | Aug 9, 2023 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Comparative Analysis of the wav2vec 2.0 Feature Extractor | Aug 8, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation | Aug 8, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism | Aug 7, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ApproBiVT: Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff Guided Early Stopping and Checkpoint Averaging | Aug 5, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Federated Representation Learning for Automatic Speech Recognition | Aug 3, 2023 | Automatic Speech RecognitionFederated Learning | —Unverified | 0 |
| Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification | Aug 2, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text | Jul 30, 2023 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus | Jul 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Learning Multi-modal Representations by Watching Hundreds of Surgical Video Lectures | Jul 27, 2023 | Automatic Speech RecognitionContrastive Learning | CodeCode Available | 1 |
| Cascaded Cross-Modal Transformer for Request and Complaint Detection | Jul 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| CIF-T: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition | Jul 26, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| On-Device Speaker Anonymization of Acoustic Embeddings for ASR based onFlexible Location Gradient Reversal Layer | Jul 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training | Jul 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Adaptation of Whisper models to child speech recognition | Jul 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization | Jul 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNN | Jul 24, 2023 | Automatic Speech RecognitionSentiment Analysis | CodeCode Available | 0 |
| Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition | Jul 24, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Boosting Punctuation Restoration with Data Generation and Reinforcement Learning | Jul 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation | Jul 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A meta learning scheme for fast accent domain expansion in Mandarin speech recognition | Jul 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Prompting Large Language Models with Speech Recognition Abilities | Jul 21, 2023 | Abstractive Text SummarizationAutomatic Speech Recognition | —Unverified | 0 |
| A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion | Jul 21, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Topic Identification For Spontaneous Speech: Enriching Audio Features With Embedded Linguistic Information | Jul 21, 2023 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC Videos | Jul 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| OxfordVGG Submission to the EGO4D AV Transcription Challenge | Jul 18, 2023 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 6 |
| Model Adaptation for ASR in low-resource Indian Languages | Jul 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices | Jul 14, 2023 | Automatic Speech RecognitionFederated Learning | —Unverified | 0 |
| Replay to Remember: Continual Layer-Specific Fine-tuning for German Speech Recognition | Jul 14, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Representation Learning With Hidden Unit Clustering For Low Resource Speech Applications | Jul 14, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study | Jul 13, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Writer adaptation for offline text recognition: An exploration of neural network-based methods | Jul 11, 2023 | Automatic Speech RecognitionHandwriting Recognition | CodeCode Available | 0 |
| Speech Diarization and ASR with GMM | Jul 11, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments | Jul 7, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |