| Fine-Tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring | Sep 19, 2023 | Feature Engineering, Phone-level pronunciation scoring | Code Available | 1 | 5 |
| Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes | Jun 7, 2023 | Attribute, Cross-Lingual Transfer | Code Available | 1 | 5 |
| Word Error Rate Estimation Without ASR Output: e-WER2 | Aug 8, 2020 | Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Text-Aware End-to-end Mispronunciation Detection and Diagnosis | Jun 15, 2022 | Contrastive Learning, Phoneme Recognition | Code Available | 1 | 5 |
| FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning | Jul 1, 2022 | Knowledge Distillation, Phoneme Recognition | Code Available | 1 | 5 |
| Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment | Mar 29, 2022 | Phoneme Recognition, Pseudo Label | Code Available | 1 | 5 |
| Attention-Based Models for Speech Recognition | Jun 24, 2015 | Machine Translation, Phoneme Recognition | Code Available | 1 | 5 |
| WaveNet: A Generative Model for Raw Audio | Sep 12, 2016 | Audio Generation, model | Code Available | 1 | 5 |
| Benchmarking Generative Latent Variable Models for Speech | Feb 22, 2022 | Benchmarking, Image Generation | Code Available | 0 | 5 |
| LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks | Jun 13, 2024 | Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Sequence Transduction with Recurrent Neural Networks | Nov 14, 2012 | Machine Translation, Phoneme Recognition | Code Available | 0 | 5 |
| Regularizing RNNs by Stabilizing Activations | Nov 26, 2015 | Language Modeling | Code Available | 0 | 5 |
| Real-time low-resource phoneme recognition on edge devices | Mar 25, 2021 | Phoneme Recognition, speech-recognition | Code Available | 0 | 5 |
| Do Deep Nets Really Need to be Deep? | Dec 21, 2013 | Phoneme Recognition, speech-recognition | Code Available | 0 | 5 |
| Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks | Jan 10, 2017 | Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks | Dec 7, 2013 | Phoneme Recognition | Code Available | 0 | 5 |
| Singing Language Identification using a Deep Phonotactic Approach | May 31, 2021 | Classification, Language Identification | Code Available | 0 | 5 |
| Simple and Effective Zero-shot Cross-lingual Phoneme Recognition | Sep 23, 2021 | Cross-Lingual Transfer, Phoneme Recognition | Code Available | 0 | 5 |
| Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments | Feb 21, 2022 | Data Augmentation, Phoneme Recognition | Code Available | 0 | 5 |
| Application of Word2vec in Phoneme Recognition | Dec 17, 2019 | Phoneme Recognition, speech-recognition | Code Available | 0 | 5 |
| SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations | Mar 10, 2024 | Automatic Speech Recognition, Data Augmentation | Code Available | 0 | 5 |
| Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers | Sep 5, 2022 | Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models | Jun 24, 2022 | Phoneme Recognition, Self-Supervised Learning | Code Available | 0 | 5 |
| Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition | Apr 1, 2022 | Phoneme Recognition, Representation Learning | Code Available | 0 | 5 |
| Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition | Jun 20, 2018 | Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Self-supervised Semantic-driven Phoneme Discovery for Zero-resource Speech Recognition | Nov 16, 2021 | Phoneme Recognition, Representation Learning | Code Available | 0 | 5 |
| Speech Recognition with Deep Recurrent Neural Networks | Mar 22, 2013 | Handwriting Recognition, Phoneme Recognition | Code Available | 0 | 5 |
| Finding phonemes: improving machine lip-reading | Oct 3, 2017 | Lip Reading, Phoneme Recognition | Unverified | 0 | 0 |
| Fast frequency discrimination and phoneme recognition using a biomimetic membrane coupled to a neural network | Apr 9, 2020 | Phoneme Recognition | Unverified | 0 | 0 |
| Automatic recognition of suprasegmentals in speech | Aug 2, 2021 | Automatic Speech Recognition (ASR) | Unverified | 0 | 0 |
| A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework | Sep 17, 2024 | Phoneme Recognition, speech-recognition | Unverified | 0 | 0 |
| Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks | Apr 3, 2013 | Automatic Speech Recognition (ASR) | Unverified | 0 | 0 |
| Ensemble knowledge distillation of self-supervised speech models | Feb 24, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0 | 0 |
| Nonlinear ISA with Auxiliary Variables for Learning Speech Representations | Jul 25, 2020 | Phoneme Recognition, Speaker Verification | Unverified | 0 | 0 |
| Multi-task Learning with Cross Attention for Keyword Spotting | Jul 15, 2021 | Automatic Speech Recognition (ASR) | Unverified | 0 | 0 |
| Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis | Sep 13, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0 | 0 |
| A Dual-Decoder Conformer for Multilingual Speech Recognition | Aug 22, 2021 | Decoder, Language Identification | Unverified | 0 | 0 |
| A Comparison of Transformer, Convolutional, and Recurrent Neural Networks on Phoneme Recognition | Oct 1, 2022 | Phoneme Recognition, speech-recognition | Unverified | 0 | 0 |
| Multilingual Speech Recognition for Low-Resource Indian Languages using Multi-Task conformer | Aug 22, 2021 | Decoder, Machine Translation | Unverified | 0 | 0 |
| Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones | Oct 10, 2023 | Phoneme Recognition | Unverified | 0 | 0 |
| Detecting Mismatch between Text Script and Voice-over Using Utterance Verification Based on Phoneme Recognition Ranking | Mar 20, 2020 | Phoneme Recognition | Unverified | 0 | 0 |
| A Novel End-to-End CAPT System for L2 Children Learners | Nov 16, 2021 | Automatic Speech Recognition (ASR) | Unverified | 0 | 0 |
| Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling | Jan 16, 2021 | Automatic Phoneme Recognition, Phoneme Recognition | Unverified | 0 | 0 |
| More than words: Advancements and challenges in speech recognition for singing | Mar 14, 2024 | Keyword Spotting, Language Identification | Unverified | 0 | 0 |
| Deep Triphone Embedding Improves Phoneme Recognition | Oct 22, 2017 | Dimensionality Reduction, General Classification | Unverified | 0 | 0 |
| Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models | Oct 13, 2022 | Language Modeling | Unverified | 0 | 0 |
| Learning linearly separable features for speech recognition using convolutional neural networks | Dec 22, 2014 | Automatic Speech Recognition (ASR) | Unverified | 0 | 0 |
| Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition | Apr 5, 2017 | Decoder, Phoneme Recognition | Unverified | 0 | 0 |
| Learning Hard Alignments with Variational Inference | May 16, 2017 | Hard Attention, Image Captioning | Unverified | 0 | 0 |
| DeepSpeech models show Human-like Performance and Processing of Cochlear Implant Inputs | Jul 30, 2024 | EEG, Phoneme Recognition | Unverified | 0 | 0 |