ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus Jul 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models Jul 29, 2023 Representation Learning speech-recognition
— Unverified 0The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems Jul 28, 2023 Intent Recognition speech-recognition
— Unverified 0Learning Multi-modal Representations by Watching Hundreds of Surgical Video Lectures Jul 27, 2023 Automatic Speech Recognition Contrastive Learning
Code Code Available 1Turning Whisper into Real-Time Transcription System Jul 27, 2023 speech-recognition Speech Recognition
Code Code Available 4Cascaded Cross-Modal Transformer for Request and Complaint Detection Jul 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CIF-T: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition Jul 26, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0On-Device Speaker Anonymization of Acoustic Embeddings for ASR based onFlexible Location Gradient Reversal Layer Jul 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Adaptation of Whisper models to child speech recognition Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNN Jul 24, 2023 Automatic Speech Recognition Sentiment Analysis
Code Code Available 0Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition Jul 24, 2023 Automatic Speech Recognition Decoder
— Unverified 0Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Boosting Punctuation Restoration with Data Generation and Reinforcement Learning Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A meta learning scheme for fast accent domain expansion in Mandarin speech recognition Jul 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation Jul 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding Jul 22, 2023 speech-recognition Speech Recognition
— Unverified 0Prompting Large Language Models with Speech Recognition Abilities Jul 21, 2023 Abstractive Text Summarization Automatic Speech Recognition
— Unverified 0A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion Jul 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Topic Identification For Spontaneous Speech: Enriching Audio Features With Embedded Linguistic Information Jul 21, 2023 Automatic Speech Recognition speech-recognition
Code Code Available 0Transsion TSUP's speech recognition system for ASRU 2023 MADASR Challenge Jul 20, 2023 Decoder Language Modeling
— Unverified 0A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC Videos Jul 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding Jul 20, 2023 speech-recognition Speech Recognition
— Unverified 0Globally Normalising the Transducer for Streaming Speech Recognition Jul 20, 2023 speech-recognition Speech Recognition
— Unverified 0MASR: Multi-label Aware Speech Representation Jul 20, 2023 Emotion Recognition Language Identification
— Unverified 0Leveraging Visemes for Better Visual Speech Representation and Lip Reading Jul 19, 2023 Lip Reading Sentence
— Unverified 0Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning Jul 18, 2023 Domain Adaptation speech-recognition
Code Code Available 1OxfordVGG Submission to the EGO4D AV Transcription Challenge Jul 18, 2023 Automatic Speech Recognition speech-recognition
Code Code Available 6ivrit.ai: A Comprehensive Dataset of Hebrew Speech for AI Research and Development Jul 17, 2023 Action Detection Activity Detection
Code Code Available 1Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition Jul 17, 2023 Decoder Language Modeling
Code Code Available 8Towards Stealthy Backdoor Attacks against Speech Recognition via Elements of Sound Jul 17, 2023 Backdoor Attack speech-recognition
Code Code Available 1Model Adaptation for ASR in low-resource Indian Languages Jul 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On the Sensitivity of Deep Load Disaggregation to Adversarial Attacks Jul 14, 2023 Adversarial Attack energy management
— Unverified 0Towards Model-Size Agnostic, Compute-Free, Memorization-based Inference of Deep Learning Jul 14, 2023 Bayesian Optimization Memorization
— Unverified 0Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices Jul 14, 2023 Automatic Speech Recognition Federated Learning
— Unverified 0Representation Learning With Hidden Unit Clustering For Low Resource Speech Applications Jul 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Replay to Remember: Continual Layer-Specific Fine-tuning for German Speech Recognition Jul 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards spoken dialect identification of Irish Jul 14, 2023 Dialect Identification Language Identification
— Unverified 0Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study Jul 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Personalization for BERT-based Discriminative Speech Recognition Rescoring Jul 13, 2023 Decoder speech-recognition
— Unverified 0Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling Jul 13, 2023 intent-classification Intent Classification
— Unverified 0SummaryMixing: A Linear-Complexity Alternative to Self-Attention for Speech Recognition and Understanding Jul 12, 2023 speech-recognition Speech Recognition
Code Code Available 0Writer adaptation for offline text recognition: An exploration of neural network-based methods Jul 11, 2023 Automatic Speech Recognition Handwriting Recognition
Code Code Available 0Speech Diarization and ASR with GMM Jul 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SparseVSR: Lightweight and Noise Robust Visual Speech Recognition Jul 10, 2023 speech-recognition Speech Recognition
— Unverified 0Can Generative Large Language Models Perform ASR Error Correction? Jul 9, 2023 Decoder speech-recognition
— Unverified 0Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments Jul 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment Jul 6, 2023 Speaker Identification speech-recognition
Code Code Available 0Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture Jul 5, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Transgressing the boundaries: towards a rigorous understanding of deep learning and its (non-)robustness Jul 5, 2023 Adversarial Robustness Learning Theory
— Unverified 0