SOTAVerified

Automatic Speech Recognition

Papers

Showing 31013125 of 3174 papers

TitleStatusHype
Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error CorrectionCode0
FastEmit: Low-latency Streaming ASR with Sequence-level Emission RegularizationCode0
Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2SeqCode0
Optimized Speculative Sampling for GPU Hardware AcceleratorsCode0
Boosting Cross-Domain Speech Recognition with Self-SupervisionCode0
Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition SystemsCode0
SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition SystemsCode0
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition SystemsCode0
Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation EvaluationCode0
Blank Collapse: Compressing CTC emission for the faster decodingCode0
Coupled Training of Sequence-to-Sequence Models for Accented Speech RecognitionCode0
A Theory of Unsupervised Speech RecognitionCode0
Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasksCode0
Audiovisual Speaker Tracking using Nonlinear Dynamical Systems with Dynamic Stream WeightsCode0
Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State TransducersCode0
MLS: A Large-Scale Multilingual Dataset for Speech ResearchCode0
Pansori: ASR Corpus Generation from Open Online Video ContentsCode0
When Is TTS Augmentation Through a Pivot Language Useful?Code0
FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech DataCode0
Analysis of EEG frequency bands for Envisioned Speech RecognitionCode0
AfriHuBERT: A self-supervised speech representation model for African languagesCode0
Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language UnderstandingCode0
Streaming Sequence Transduction through Dynamic CompressionCode0
Textless Speech-to-Speech Translation With Limited Parallel DataCode0
Audio Segmentation for Robust Real-Time Speech Recognition Based on Neural NetworksCode0
Show:102550
← PrevPage 125 of 127Next →

No leaderboard results yet.