Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network Sep 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Augmenting conformers with structured state-space sequence models for online speech recognition Sep 15, 2023 speech-recognition Speech Recognition
— Unverified 0Unimodal Aggregation for CTC-based Speech Recognition Sep 15, 2023 Automatic Speech Recognition Decoder
Code Code Available 1Visual Speech Recognition for Languages with Limited Labeled Data using Automatic Labels from Whisper Sep 15, 2023 Language Identification speech-recognition
Code Code Available 1DiaCorrect: Error Correction Back-end For Speaker Diarization Sep 15, 2023 Automatic Speech Recognition Decoder
Code Code Available 1Chunked Attention-based Encoder-Decoder Model for Streaming Speech Recognition Sep 15, 2023 Decoder Form
— Unverified 0Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription Sep 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS Sep 14, 2023 Self-Supervised Learning speech-recognition
— Unverified 0Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks Sep 14, 2023 Decoder Language Modeling
— Unverified 0PromptASR for contextualized ASR with controllable style Sep 14, 2023 Automatic Speech Recognition speech-recognition
Code Code Available 2Echotune: A Modular Extractor Leveraging the Variable-Length Nature of Speech in ASR Tasks Sep 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer Sep 14, 2023 Language Modeling Language Modelling
— Unverified 0CPPF: A contextual and post-processing-free model for automatic speech recognition Sep 14, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0DiariST: Streaming Speech Translation with Speaker Diarization Sep 14, 2023 speaker-diarization Speaker Diarization
Code Code Available 1CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders Sep 14, 2023 Contrastive Learning Knowledge Distillation
— Unverified 0Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition Sep 14, 2023 speech-recognition Speech Recognition
— Unverified 0Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation Sep 14, 2023 Automatic Speech Recognition Decoder
— Unverified 0EnCodecMAE: Leveraging neural codecs for universal audio representation learning Sep 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec Sep 14, 2023 Automatic Speech Recognition speech-recognition
Code Code Available 2Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis Sep 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Open-vocabulary Keyword-spotting with Adaptive Instance Normalization Sep 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Can Whisper perform speech-based in-context learning? Sep 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Co-learning synaptic delays, weights and adaptation in spiking neural networks Sep 12, 2023 speech-recognition Speech Recognition
— Unverified 0Improving Robustness of Neural Inverse Text Normalization via Data-Augmentation, Semi-Supervised Learning, and Post-Aligning Method Sep 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults Sep 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Hybrid ASR for Resource-Constrained Robots: HMM - Deep Learning Fusion Sep 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Minuteman: Machine and Human Joining Forces in Meeting Summarization Sep 11, 2023 Meeting Summarization speech-recognition
— Unverified 0Leveraging Large Language Models for Exploiting ASR Uncertainty Sep 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0End-to-End Speech Recognition and Disfluency Removal with Acoustic Language Model Pretraining Sep 8, 2023 Language Modeling Language Modelling
Code Code Available 0Active Learning for Classifying 2D Grid-Based Level Completability Sep 8, 2023 Active Learning speech-recognition
Code Code Available 0Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR Evaluation Sep 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems Sep 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LanSER: Language-Model Supported Speech Emotion Recognition Sep 7, 2023 Automatic Speech Recognition Emotion Recognition
— Unverified 0RoDia: A New Dataset for Romanian Dialect Identification from Speech Sep 6, 2023 Dialect Identification Speaker Verification
Code Code Available 0Self-Supervised Masked Digital Elevation Models Encoding for Low-Resource Downstream Tasks Sep 6, 2023 Self-Supervised Learning speech-recognition
— Unverified 0TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models Sep 5, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition Sep 5, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning Sep 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation Sep 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SememeASR: Boosting Performance of End-to-End Speech Recognition against Domain and Long-Tailed Data Shift with Sememe Semantic Knowledge Sep 4, 2023 Domain Generalization speech-recognition
— Unverified 0Mapping AI Arguments in Journalism Studies Sep 3, 2023 Scheduling speech-recognition
— Unverified 0BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing Sep 2, 2023 speech-recognition Speech Recognition
Code Code Available 1Contextual Biasing of Named-Entities with Large Language Models Sep 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Mi-Go: Test Framework which uses YouTube as Data Source for Evaluating Speech Recognition Models like OpenAI's Whisper Sep 1, 2023 speech-recognition Speech Recognition
— Unverified 0Learning Speech Representation From Contrastive Token-Acoustic Pretraining Sep 1, 2023 Audio Classification Automatic Speech Recognition
— Unverified 0Knowledge Distillation from Non-streaming to Streaming ASR Encoder using Auxiliary Non-streaming Layer Aug 31, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ASTER: Automatic Speech Recognition System Accessibility Testing for Stutterers Aug 30, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Wikimedia: A 77 Language Multilingual Speech Dataset Aug 30, 2023 Machine Translation speech-recognition
Code Code Available 0Adapting Text-based Dialogue State Tracker for Spoken Dialogues Aug 29, 2023 Automatic Speech Recognition Data Augmentation
— Unverified 0Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition Aug 28, 2023 Active Learning Automatic Speech Recognition
— Unverified 0