SOTAVerified

Automatic Speech Recognition

Papers

Showing 30013025 of 3174 papers

TitleStatusHype
LIP-RTVE: An Audiovisual Database for Continuous Spanish in the WildCode0
Guiding Frame-Level CTC Alignments Using Self-knowledge DistillationCode0
Seq2seq for Automatic Paraphasia Detection in Aphasic SpeechCode0
Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech RecognitionCode0
End-to-End Learning of Speech 2D Feature-Trajectory for Prosthetic HandsCode0
Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion ModelsCode0
Sequence Labeling Approach to the Task of Sentence Boundary DetectionCode0
Splitformer: An improved early-exit architecture for automatic speech recognition on edge devicesCode0
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASRCode0
Spoken English Intelligibility Remediation with PocketSphinx Alignment and Feature Extraction Improves Substantially over the State of the ArtCode0
Two-stage Textual Knowledge Distillation for End-to-End Spoken Language UnderstandingCode0
Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of SpeechCode0
Realizing Petabyte Scale Acoustic ModelingCode0
LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic ContextCode0
Spoken Language Intent Detection using Confusion2VecCode0
Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with AphasiaCode0
ASR Benchmarking: Need for a More Representative Conversational DatasetCode0
End to End ASR System with Automatic Punctuation InsertionCode0
Detecting Adversarial Examples for Speech Recognition via Uncertainty QuantificationCode0
BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation SystemCode0
Deep Spiking Neural Networks for Large Vocabulary Automatic Speech RecognitionCode0
SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASRCode0
A Comparison of Adaptation Techniques and Recurrent Neural Network ArchitecturesCode0
ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language ModelsCode0
Textless Dependency Parsing by Labeled Sequence PredictionCode0
Show:102550
← PrevPage 121 of 127Next →

No leaderboard results yet.