SOTAVerified

Phoneme Recognition

Papers

Showing 26–50 of 104 papers

Title | Status | Hype
Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis | - | 0
Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes | Code | 1
Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition | - | 0
A Comparison of Speech Data Augmentation Methods Using S3PRL Toolkit | - | 0
Ensemble knowledge distillation of self-supervised speech models | - | 0
German Phoneme Recognition with Text-to-Phoneme Data Augmentation | - | 0
SAN: a robust end-to-end ASR model architecture | - | 0
Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models | - | 0
A Comparison of Transformer, Convolutional, and Recurrent Neural Networks on Phoneme Recognition | - | 0
Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers | Code | 0
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning | Code | 1
SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning | - | 0
Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models | Code | 0
Text-Aware End-to-end Mispronunciation Detection and Diagnosis | Code | 1
Speech Data Augmentation for Improving Phoneme Transcriptions of Aphasic Speech Using Wav2Vec 2.0 for the PSST Challenge | - | 0
Self-supervised Semantic-driven Phoneme Discovery for Zero-resource Speech Recognition | - | 0
Phoneme transcription of endangered languages: an evaluation of recent ASR architectures in the single speaker scenario | - | 0
STRATA: Word Boundaries & Phoneme Recognition From Continuous Urdu Speech using Transfer Learning, Attention, & Data Augmentation | - | 0
Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition | Code | 0
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment | Code | 1
Benchmarking Generative Latent Variable Models for Speech | Code | 0
Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments | Code | 0
A Deep Paradigm for Articulatory Speech Representation Learning via Neural Convolutive Sparse Matrix Factorization | - | 0
The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition | - | 0
A Novel End-to-End CAPT System for L2 Children Learners | - | 0
Page 2 of 5

No leaderboard results yet.