SOTAVerified

Phoneme Recognition

Papers

Showing 150 of 104 papers

TitleStatusHype
Using Neurogram Similarity Index Measure (NSIM) to Model Hearing Loss and Cochlear Neural Degeneration0
Speech-to-Text Translation with Phoneme-Augmented CoT: Enhancing Cross-Lingual Transfer in Low-Resource Scenarios0
Towards disentangling the contributions of articulation and acoustics in multimodal phoneme recognition0
Topological Deep Learning for Speech Data0
Self-Supervised Models for Phoneme Recognition: Applications in Children's Speech for Reading Learning0
SyntheticPop: Attacking Speaker Verification Systems With Synthetic VoicePops0
Improving Cross-Lingual Phonetic Representation of Low-Resource Languages Through Language Similarity Analysis0
A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework0
SLiCK: Exploiting Subsequences for Length-Constrained Keyword Spotting0
DeepSpeech models show Human-like Performance and Processing of Cochlear Implant Inputs0
An Adapter-Based Unified Model for Multiple Spoken Language Processing Tasks0
LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related TasksCode0
TIPAA-SSL: Text Independent Phone-to-Audio Alignment based on Self-Supervised Learning and Knowledge Transfer0
More than words: Advancements and challenges in speech recognition for singing0
SCORE: Self-supervised Correspondence Fine-tuning for Improved Content RepresentationsCode0
Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems0
Analysis of Self-Supervised Speech Models on Children's Speech and Infant Vocalizations0
Segment Boundary Detection via Class Entropy Measurements in Connectionist Phoneme Recognition0
Optimizing Two-Pass Cross-Lingual Transfer Learning: Phoneme Recognition and Phoneme to Grapheme Translation0
Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones0
Reduce, Reuse, Recycle: Is Perturbed Data better than Other Language augmentation for Low Resource Self-Supervised Speech Models0
Fine-Tuning Self-Supervised Learning Models for End-to-End Pronunciation ScoringCode1
Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints0
Speech-dependent Modeling of Own Voice Transfer Characteristics for In-ear Microphones in Hearables0
L1-aware Multilingual Mispronunciation Detection Framework0
Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis0
Allophant: Cross-lingual Phoneme Recognition with Articulatory AttributesCode1
Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition0
A Comparison of Speech Data Augmentation Methods Using S3PRL Toolkit0
Ensemble knowledge distillation of self-supervised speech models0
German Phoneme Recognition with Text-to-Phoneme Data Augmentation0
SAN: a robust end-to-end ASR model architecture0
Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models0
A Comparison of Transformer, Convolutional, and Recurrent Neural Networks on Phoneme Recognition0
Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State TransducersCode0
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised LearningCode1
SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning0
Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained modelsCode0
Text-Aware End-to-end Mispronunciation Detection and DiagnosisCode1
Speech Data Augmentation for Improving Phoneme Transcriptions of Aphasic Speech Using Wav2Vec 2.0 for the PSST Challenge0
Self-supervised Semantic-driven Phoneme Discovery for Zero-resource Speech Recognition0
Phoneme transcription of endangered languages: an evaluation of recent ASR architectures in the single speaker scenario0
STRATA: Word Boundaries & Phoneme Recognition From Continuous Urdu Speech using Transfer Learning, Attention, & Data Augmentation0
Deep Neural Convolutive Matrix Factorization for Articulatory Representation DecompositionCode0
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility AssessmentCode1
Benchmarking Generative Latent Variable Models for SpeechCode0
Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning EnvironmentsCode0
A Deep Paradigm for Articulatory Speech Representation Learning via Neural Convolutive Sparse Matrix Factorization0
The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition0
A Novel End-to-End CAPT System for L2 Children Learners0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.