SOTAVerified

Phoneme Recognition

Papers

Showing 1-50 of 104 papers

| Title | Status | Hype |
| --- | --- | --- |
| Fine-Tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring | Code | 1 |
| Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes | Code | 1 |
| FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning | Code | 1 |
| Text-Aware End-to-end Mispronunciation Detection and Diagnosis | Code | 1 |
| Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment | Code | 1 |
| Word Error Rate Estimation Without ASR Output: e-WER2 | Code | 1 |
| WaveNet: A Generative Model for Raw Audio | Code | 1 |
| Attention-Based Models for Speech Recognition | Code | 1 |
| Using Neurogram Similarity Index Measure (NSIM) to Model Hearing Loss and Cochlear Neural Degeneration | | 0 |
| Speech-to-Text Translation with Phoneme-Augmented CoT: Enhancing Cross-Lingual Transfer in Low-Resource Scenarios | | 0 |
| Towards disentangling the contributions of articulation and acoustics in multimodal phoneme recognition | | 0 |
| Topological Deep Learning for Speech Data | | 0 |
| Self-Supervised Models for Phoneme Recognition: Applications in Children's Speech for Reading Learning | | 0 |
| SyntheticPop: Attacking Speaker Verification Systems With Synthetic VoicePops | | 0 |
| Improving Cross-Lingual Phonetic Representation of Low-Resource Languages Through Language Similarity Analysis | | 0 |
| A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework | | 0 |
| SLiCK: Exploiting Subsequences for Length-Constrained Keyword Spotting | | 0 |
| DeepSpeech models show Human-like Performance and Processing of Cochlear Implant Inputs | | 0 |
| An Adapter-Based Unified Model for Multiple Spoken Language Processing Tasks | | 0 |
| LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks | Code | 0 |
| TIPAA-SSL: Text Independent Phone-to-Audio Alignment based on Self-Supervised Learning and Knowledge Transfer | | 0 |
| More than words: Advancements and challenges in speech recognition for singing | | 0 |
| SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations | Code | 0 |
| Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems | | 0 |
| Analysis of Self-Supervised Speech Models on Children's Speech and Infant Vocalizations | | 0 |
| Segment Boundary Detection via Class Entropy Measurements in Connectionist Phoneme Recognition | | 0 |
| Optimizing Two-Pass Cross-Lingual Transfer Learning: Phoneme Recognition and Phoneme to Grapheme Translation | | 0 |
| Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones | | 0 |
| Reduce, Reuse, Recycle: Is Perturbed Data better than Other Language augmentation for Low Resource Self-Supervised Speech Models | | 0 |
| Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints | | 0 |
| Speech-dependent Modeling of Own Voice Transfer Characteristics for In-ear Microphones in Hearables | | 0 |
| L1-aware Multilingual Mispronunciation Detection Framework | | 0 |
| Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis | | 0 |
| Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition | | 0 |
| A Comparison of Speech Data Augmentation Methods Using S3PRL Toolkit | | 0 |
| Ensemble knowledge distillation of self-supervised speech models | | 0 |
| German Phoneme Recognition with Text-to-Phoneme Data Augmentation | | 0 |
| SAN: a robust end-to-end ASR model architecture | | 0 |
| Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models | | 0 |
| A Comparison of Transformer, Convolutional, and Recurrent Neural Networks on Phoneme Recognition | | 0 |
| Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers | Code | 0 |
| SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning | | 0 |
| Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models | Code | 0 |
| Speech Data Augmentation for Improving Phoneme Transcriptions of Aphasic Speech Using Wav2Vec 2.0 for the PSST Challenge | | 0 |
| Self-supervised Semantic-driven Phoneme Discovery for Zero-resource Speech Recognition | | 0 |
| Phoneme transcription of endangered languages: an evaluation of recent ASR architectures in the single speaker scenario | | 0 |
| STRATA: Word Boundaries & Phoneme Recognition From Continuous Urdu Speech using Transfer Learning, Attention, & Data Augmentation | | 0 |
| Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition | Code | 0 |
| Benchmarking Generative Latent Variable Models for Speech | Code | 0 |
| Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments | Code | 0 |
Page 1 of 3

No leaderboard results yet.