| Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment | Mar 29, 2022 | Phoneme Recognition, Pseudo Label | Code Available | 1 |
| Attention-Based Models for Speech Recognition | Jun 24, 2015 | Machine Translation, Phoneme Recognition | Code Available | 1 |
| Fine-Tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring | Sep 19, 2023 | Feature Engineering, Phone-level pronunciation scoring | Code Available | 1 |
| Text-Aware End-to-end Mispronunciation Detection and Diagnosis | Jun 15, 2022 | Contrastive Learning, Phoneme Recognition | Code Available | 1 |
| WaveNet: A Generative Model for Raw Audio | Sep 12, 2016 | Audio Generation, model | Code Available | 1 |
| Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes | Jun 7, 2023 | Attribute, Cross-Lingual Transfer | Code Available | 1 |
| Word Error Rate Estimation Without ASR Output: e-WER2 | Aug 8, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning | Jul 1, 2022 | Knowledge Distillation, Phoneme Recognition | Code Available | 1 |
| A Comparison of Discrete Latent Variable Models for Speech Representation Learning | Oct 24, 2020 | Phoneme Recognition, Representation Learning | Unverified | 0 |
| A Comparison of Transformer, Convolutional, and Recurrent Neural Networks on Phoneme Recognition | Oct 1, 2022 | Phoneme Recognition, speech-recognition | Unverified | 0 |
| Persian Vowel recognition with MFCC and ANN on PCVC speech dataset | Dec 17, 2018 | Phoneme Recognition | Unverified | 0 |
| Ensemble knowledge distillation of self-supervised speech models | Feb 24, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Automatic recognition of suprasegmentals in speech | Aug 2, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework | Sep 17, 2024 | Phoneme Recognition, speech-recognition | Unverified | 0 |
| Phoneme Recognition through Fine Tuning of Phonetic Representations: a Case Study on Luhya Language Varieties | Apr 4, 2021 | Phoneme Recognition, speech-recognition | Unverified | 0 |
| A Deep Paradigm for Articulatory Speech Representation Learning via Neural Convolutive Sparse Matrix Factorization | Jan 16, 2022 | Phoneme Recognition, Representation Learning | Unverified | 0 |
| DeepSpeech models show Human-like Performance and Processing of Cochlear Implant Inputs | Jul 30, 2024 | EEG, Phoneme Recognition | Unverified | 0 |
| Online Sequence Training of Recurrent Neural Networks with Connectionist Temporal Classification | Nov 21, 2015 | General Classification, Phoneme Recognition | Unverified | 0 |
| Optimising The Input Window Alignment in CD-DNN Based Phoneme Recognition for Low Latency Processing | Jun 29, 2016 | Low-latency processing, Phoneme Recognition | Unverified | 0 |
| Analysis of Self-Supervised Speech Models on Children's Speech and Infant Vocalizations | Feb 10, 2024 | Phoneme Recognition, Self-Supervised Learning | Unverified | 0 |
| Detecting Mismatch between Text Script and Voice-over Using Utterance Verification Based on Phoneme Recognition Ranking | Mar 20, 2020 | Phoneme Recognition | Unverified | 0 |
| A Novel End-to-End CAPT System for L2 Children Learners | Nov 16, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| A Dual-Decoder Conformer for Multilingual Speech Recognition | Aug 22, 2021 | Decoder, Language Identification | Unverified | 0 |
| Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis | Sep 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| A Comprehensive Survey on Bengali Phoneme Recognition | Jan 27, 2017 | Automatic Phoneme Recognition, Phoneme Recognition | Unverified | 0 |
| Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks | Apr 3, 2013 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Fast frequency discrimination and phoneme recognition using a biomimetic membrane coupled to a neural network | Apr 9, 2020 | Phoneme Recognition | Unverified | 0 |
| Finding phonemes: improving machine lip-reading | Oct 3, 2017 | Lip Reading, Phoneme Recognition | Unverified | 0 |
| Completely Unsupervised Phoneme Recognition by Adversarially Learning Mapping Relationships from Audio Embeddings | Apr 1, 2018 | Generative Adversarial Network, Phoneme Recognition | Unverified | 0 |
| A Comparison of Speech Data Augmentation Methods Using S3PRL Toolkit | Feb 27, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Improving Cross-Lingual Phonetic Representation of Low-Resource Languages Through Language Similarity Analysis | Jan 12, 2025 | Phoneme Recognition, Self-Supervised Learning | Unverified | 0 |
| German Phoneme Recognition with Text-to-Phoneme Data Augmentation | Nov 24, 2022 | Data Augmentation, Phoneme Recognition | Unverified | 0 |
| Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition | May 29, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| FPGA Based Implementation of Deep Neural Networks Using On-chip Memory Only | Feb 4, 2016 | GPU, Handwritten Digit Recognition | Unverified | 0 |
| Fixed-Point Performance Analysis of Recurrent Neural Networks | Dec 4, 2015 | Language Modeling, Language Modelling | Unverified | 0 |
| Incorporating Belief Function in SVM for Phoneme Recognition | Jul 22, 2015 | Phoneme Recognition | Unverified | 0 |
| L1-aware Multilingual Mispronunciation Detection Framework | Sep 14, 2023 | Phoneme Recognition | Unverified | 0 |
| An Adapter-Based Unified Model for Multiple Spoken Language Processing Tasks | Jun 20, 2024 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| Learning Hard Alignments with Variational Inference | May 16, 2017 | Hard Attention, Image Captioning | Unverified | 0 |
| Learning linearly separable features for speech recognition using convolutional neural networks | Dec 22, 2014 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling | Jan 16, 2021 | Automatic Phoneme Recognition, Phoneme Recognition | Unverified | 0 |
| Deep Triphone Embedding Improves Phoneme Recognition | Oct 22, 2017 | Dimensionality Reduction, General Classification | Unverified | 0 |
| Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones | Oct 10, 2023 | Phoneme Recognition | Unverified | 0 |
| More than words: Advancements and challenges in speech recognition for singing | Mar 14, 2024 | Keyword Spotting, Language Identification | Unverified | 0 |
| Multilingual Speech Recognition for Low-Resource Indian Languages using Multi-Task conformer | Aug 22, 2021 | Decoder, Machine Translation | Unverified | 0 |
| Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models | Oct 13, 2022 | Language Modeling, Language Modelling | Unverified | 0 |
| Multi-task Learning with Cross Attention for Keyword Spotting | Jul 15, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition | Apr 5, 2017 | Decoder, Phoneme Recognition | Unverified | 0 |
| Nonlinear ISA with Auxiliary Variables for Learning Speech Representations | Jul 25, 2020 | Phoneme Recognition, Speaker Verification | Unverified | 0 |
| Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints | Sep 16, 2023 | Attribute, Automatic Speech Recognition | Unverified | 0 |