SOTAVerified

Automatic Speech Recognition

Papers

Showing 26012650 of 3174 papers

TitleStatusHype
SapAugment: Learning A Sample Adaptive Policy for Data Augmentation0
Saving RNN Computations with a Neuron-Level Fuzzy Memoization Scheme0
CIF-T: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition0
Scalable and Accurate Self-supervised Multimodal Representation Learning without Aligned Video and Text Data0
Scalable Multi Corpora Neural Language Models for ASR0
SCaLa: Supervised Contrastive Learning for End-to-End Speech Recognition0
Scaling A Simple Approach to Zero-Shot Speech Recognition0
Scaling ASR Improves Zero and Few Shot Learning0
Scaling Up Deliberation for Multilingual ASR0
Scene-aware Far-field Automatic Speech Recognition0
SC-MoE: Switch Conformer Mixture of Experts for Unified Streaming and Non-streaming Code-Switching ASR0
SEAL: Speech Embedding Alignment Learning for Speech Large Language Model with Retrieval-Augmented Generation0
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge0
Security and Privacy Problems in Voice Assistant Applications: A Survey0
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition0
Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models0
Segment-Level Vectorized Beam Search Based on Partially Autoregressive Inference0
Self-Attention Aligner: A Latency-Control End-to-End Model for ASR Using Self-Attention Network and Chunk-Hopping0
Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-field Speech Recognition0
Self-consistent context aware conformer transducer for speech recognition0
Self-critical Sequence Training for Automatic Speech Recognition0
Self-Normalized Importance Sampling for Neural Language Modeling0
Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition0
Self-supervised Adaptive Pre-training of Multilingual Speech Models for Language and Dialect Identification0
Self-Supervised Learning-Based Source Separation for Meeting Data0
Self-supervised Learning with Speech Modulation Dropout0
Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations0
Self-supervised reinforcement learning for speaker localisation with the iCub humanoid robot0
Self-supervised representations in speech-based depression detection0
Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text0
Self-Supervised Speech Quality Assessment (S3QA): Leveraging Speech Foundation Models for a Scalable Speech Quality Metric0
Self-Supervised Speech Representation Learning: A Review0
SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions0
Semantic Data Augmentation for End-to-End Mandarin Speech Recognition0
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding0
Semantic Language Model for Tunisian Dialect0
Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction0
Semantic-WER: A Unified Metric for the Evaluation of ASR Transcript for End Usability0
SeMaScore : a new evaluation metric for automatic speech recognition tasks0
Semi-Autoregressive Streaming ASR With Label Context0
Semi-supervised acoustic modelling for five-lingual code-switched ASR using automatically-segmented soap opera speech0
Semi-supervised acoustic model training for speech with code-switching0
Semi-supervised ASR by End-to-end Self-training0
Semi-supervised Learning for Code-Switching ASR with Large Language Model Filter0
Semi-supervised Learning with Sparse Autoencoders in Phone Classification0
Semi-Supervised Speech Recognition via Graph-based Temporal Classification0
Sentence Boundary Augmentation For Neural Machine Translation Robustness0
Sentence segmentation of aphasic speech0
Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition0
SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation0
Show:102550
← PrevPage 53 of 64Next →

No leaderboard results yet.