DQ-Whisper: Joint Distillation and Quantization for Efficient Multilingual Speech Recognition May 18, 2023 Knowledge Distillation Quantization
— Unverified 00 WhisperKit: On-device Real-time ASR with Billion-Scale Transformers Jul 14, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Whisper Turns Stronger: Augmenting Wav2Vec 2.0 for Superior ASR in Low-Resource Languages Dec 31, 2024 Automatic Speech Recognition Data Augmentation
— Unverified 00 Whispy: Adapting STT Whisper Models to Real-Time Environments May 6, 2024 Action Detection Activity Detection
— Unverified 00 Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision Jun 4, 2024 Automatic Speech Recognition speech-recognition
— Unverified 00 Whither the Priors for (Vocal) Interactivity? Mar 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Who Are We Talking About? Handling Person Names in Speech Translation Nov 16, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Who Are We Talking About? Handling Person Names in Speech Translation May 13, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Who Needs Decoders? Efficient Estimation of Sequence-level Attributes May 9, 2023 Attribute Automatic Speech Recognition
— Unverified 00 Who Needs Words? Lexicon-Free Speech Recognition Apr 9, 2019 speech-recognition Speech Recognition
— Unverified 00 Why Does Decentralized Training Outperform Synchronous Training In The Large Batch Setting? Jan 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? Apr 27, 2022 Self-Supervised Learning Speaker Recognition
— Unverified 00 WideResNet with Joint Representation Learning and Data Augmentation for Cover Song Identification Jul 18, 2022 Cover song identification Data Augmentation
— Unverified 00 Wiki-En-ASR-Adapt: Large-scale synthetic dataset for English ASR Customization Sep 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Will a Blind Model Hear Better? Advanced Audiovisual Recognition System with Brain-Like Compensating and Gating Sep 29, 2021 speech-recognition Speech Recognition
— Unverified 00 Without Further Ado: Direct and Simultaneous Speech Translation by AppTek in 2021 Aug 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition Apr 8, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Word Alignment Modeling with Context Dependent Deep Neural Network Aug 1, 2013 Speech Recognition Word Alignment
— Unverified 00 Word-Based Dialog State Tracking with Recurrent Neural Networks Jun 1, 2014 dialog state tracking Feature Engineering
— Unverified 00 Word-Embedding based Content Features for Automated Oral Proficiency Scoring Aug 1, 2018 Rhythm speech-recognition
— Unverified 00 Word-Free Spoken Language Understanding for Mandarin-Chinese Jul 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Word-level confidence estimation for RNN transducers Sep 28, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Word-Level Language Identification and Predicting Codeswitching Points in Swahili-English Language Data Nov 1, 2016 Language Identification Sentiment Analysis
— Unverified 00 Word-level Speech Recognition with a Letter to Word Encoder Jun 10, 2019 Decoder General Classification
— Unverified 00 Word Level Timestamp Generation for Automatic Speech Recognition and Translation May 21, 2025 Automatic Speech Recognition automatic-speech-translation
— Unverified 00 Word Order Does Not Matter For Speech Recognition Oct 12, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Word Recognition from Continuous Articulatory Movement Time-series Data using Symbolic Representations Aug 1, 2013 Speech Recognition Time Series
— Unverified 00 Word Segmentation of Informal Arabic with Domain Adaptation Jun 1, 2014 Domain Adaptation Machine Translation
— Unverified 00 Words Worth: Verbal Content and Hirability Impressions in YouTube Video Resumes Oct 1, 2018 Automatic Speech Recognition (ASR) Speech Recognition
— Unverified 00 Word Transduction for Addressing the OOV Problem in Machine Translation for Similar Resource-Scarce Languages Sep 1, 2017 Automatic Speech Recognition (ASR) Machine Translation
— Unverified 00 XCB: an effective contextual biasing approach to bias cross-lingual phrases in speech recognition Aug 20, 2024 speech-recognition Speech Recognition
— Unverified 00 XJSA at SemEval-2017 Task 4: A Deep System for Sentiment Classification in Twitter Aug 1, 2017 General Classification Semantic Parsing
— Unverified 00 XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception Mar 21, 2024 Audio-Visual Speech Recognition Representation Learning
— Unverified 00 XLS-R Deep Learning Model for Multilingual ASR on Low- Resource Languages: Indonesian, Javanese, and Sundanese Jan 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models Jul 5, 2024 Automatic Speech Recognition speech-recognition
— Unverified 00 XLST: Cross-lingual Self-training to Learn Multilingual Representation for Low Resource Speech Recognition Mar 15, 2021 Data Augmentation Representation Learning
— Unverified 00 XNOR-FORMER: Learning Accurate Approximations in Long Speech Transformers Oct 29, 2022 speech-recognition Speech Recognition
— Unverified 00 XTREME-S: Evaluating Cross-lingual Speech Representations Mar 21, 2022 Representation Learning Retrieval
— Unverified 00 XY Neural Networks Mar 31, 2021 speech-recognition Speech Recognition
— Unverified 00 YODAS: Youtube-Oriented Dataset for Audio and Speech Jun 2, 2024 Self-Supervised Learning speech-recognition
— Unverified 00 You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation May 14, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 You don't understand me!: Comparing ASR results for L1 and L2 speakers of Swedish May 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Your voice is your voice: Supporting Self-expression through Speech Generation and LLMs in Augmented and Alternative Communication Mar 21, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus Mar 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Zara: A Virtual Interactive Dialogue System Incorporating Emotion, Sentiment and Personality Recognition Dec 1, 2016 Emotion Recognition Feature Engineering
— Unverified 00 Zara The Supergirl: An Empathetic Personality Recognition System Jun 1, 2016 Emotion Recognition Sentiment Analysis
— Unverified 00 Zero-resource Speech Translation and Recognition with LLMs Dec 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Zero-Shot Automatic Pronunciation Assessment May 31, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition Apr 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Zero-shot Disfluency Detection for Indian Languages Oct 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00