Transformer Based Punctuation Restoration for Turkish Sep 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network Sep 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Chunked Attention-based Encoder-Decoder Model for Streaming Speech Recognition Sep 15, 2023 Decoder Form
— Unverified 0The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction Sep 15, 2023 Audio-Visual Speech Recognition speech-recognition
— Unverified 0Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS Sep 14, 2023 Self-Supervised Learning speech-recognition
— Unverified 0Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks Sep 14, 2023 Decoder Language Modeling
— Unverified 0Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer Sep 14, 2023 Language Modeling Language Modelling
— Unverified 0CPPF: A contextual and post-processing-free model for automatic speech recognition Sep 14, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation Sep 14, 2023 Automatic Speech Recognition Decoder
— Unverified 0Echotune: A Modular Extractor Leveraging the Variable-Length Nature of Speech in ASR Tasks Sep 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders Sep 14, 2023 Contrastive Learning Knowledge Distillation
— Unverified 0Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition Sep 14, 2023 speech-recognition Speech Recognition
— Unverified 0Open-vocabulary Keyword-spotting with Adaptive Instance Normalization Sep 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Can Whisper perform speech-based in-context learning? Sep 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis Sep 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Robustness of Neural Inverse Text Normalization via Data-Augmentation, Semi-Supervised Learning, and Post-Aligning Method Sep 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults Sep 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Co-learning synaptic delays, weights and adaptation in spiking neural networks Sep 12, 2023 speech-recognition Speech Recognition
— Unverified 0Hybrid ASR for Resource-Constrained Robots: HMM - Deep Learning Fusion Sep 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Minuteman: Machine and Human Joining Forces in Meeting Summarization Sep 11, 2023 Meeting Summarization speech-recognition
— Unverified 0Leveraging Large Language Models for Exploiting ASR Uncertainty Sep 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0End-to-End Speech Recognition and Disfluency Removal with Acoustic Language Model Pretraining Sep 8, 2023 Language Modeling Language Modelling
Code Code Available 0Active Learning for Classifying 2D Grid-Based Level Completability Sep 8, 2023 Active Learning speech-recognition
Code Code Available 0Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR Evaluation Sep 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems Sep 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LanSER: Language-Model Supported Speech Emotion Recognition Sep 7, 2023 Automatic Speech Recognition Emotion Recognition
— Unverified 0Self-Supervised Masked Digital Elevation Models Encoding for Low-Resource Downstream Tasks Sep 6, 2023 Self-Supervised Learning speech-recognition
— Unverified 0RoDia: A New Dataset for Romanian Dialect Identification from Speech Sep 6, 2023 Dialect Identification Speaker Verification
Code Code Available 0Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition Sep 5, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models Sep 5, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning Sep 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation Sep 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SememeASR: Boosting Performance of End-to-End Speech Recognition against Domain and Long-Tailed Data Shift with Sememe Semantic Knowledge Sep 4, 2023 Domain Generalization speech-recognition
— Unverified 0Mapping AI Arguments in Journalism Studies Sep 3, 2023 Scheduling speech-recognition
— Unverified 0Learning Speech Representation From Contrastive Token-Acoustic Pretraining Sep 1, 2023 Audio Classification Automatic Speech Recognition
— Unverified 0Mi-Go: Test Framework which uses YouTube as Data Source for Evaluating Speech Recognition Models like OpenAI's Whisper Sep 1, 2023 speech-recognition Speech Recognition
— Unverified 0Contextual Biasing of Named-Entities with Large Language Models Sep 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Knowledge Distillation from Non-streaming to Streaming ASR Encoder using Auxiliary Non-streaming Layer Aug 31, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Wikimedia: A 77 Language Multilingual Speech Dataset Aug 30, 2023 Machine Translation speech-recognition
Code Code Available 0ASTER: Automatic Speech Recognition System Accessibility Testing for Stutterers Aug 30, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adapting Text-based Dialogue State Tracker for Spoken Dialogues Aug 29, 2023 Automatic Speech Recognition Data Augmentation
— Unverified 0Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition Aug 28, 2023 Active Learning Automatic Speech Recognition
— Unverified 0The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge Aug 28, 2023 speaker-diarization Speaker Diarization
— Unverified 0Neural approaches to spoken content embedding Aug 28, 2023 Automatic Speech Recognition Dynamic Time Warping
— Unverified 0Effect of Attention and Self-Supervised Speech Embeddings on Non-Semantic Speech Tasks Aug 28, 2023 Speech Recognition
Code Code Available 0Decoupled Structure for Improved Adaptability of End-to-End Models Aug 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Small and Fast BERT for Chinese Medical Punctuation Restoration Aug 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion Aug 24, 2023 Audio Classification Binary Classification
— Unverified 0AdVerb: Visually Guided Audio Dereverberation Aug 23, 2023 Speaker Verification Speech Enhancement
— Unverified 0KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods Aug 23, 2023 Robust Speech Recognition speech-recognition
— Unverified 0