SOTAVerified

Speech-to-Text

Papers

Showing 151175 of 403 papers

TitleStatusHype
Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture0
AudioPaLM: A Large Language Model That Can Speak and Listen0
Recent Advances in Direct Speech-to-text Translation0
Open Brain AI. Automatic Language Assessment0
Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding0
Towards End-to-end Speech-to-text SummarizationCode0
Improved Cross-Lingual Transfer Learning For Automatic Speech Translation0
Strategies for improving low resource speech to text translation relying on pre-trained ASR models0
STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions0
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training0
VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation0
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text TranslationCode1
Improving Metrics for Speech Translation0
DUB: Discrete Unit Back-translation for Speech TranslationCode1
Application-Agnostic Language Modeling for On-Device ASR0
A Whisper transformer for audio captioning trained with synthetic captions and transfer learningCode1
Back Translation for Speech-to-text Translation Without TranscriptsCode1
Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks0
Improving Autoregressive NLP Tasks via Modular Linearized Attention0
ESPnet-ST-v2: Multipurpose Spoken Language Translation ToolkitCode0
Enhancing Speech-to-Speech Translation with Multiple TTS Targets0
Natural Language Robot Programming: NLP integrated with autonomous robotic grasping0
Improving the previous state-of-the-art Frisian ASR by fine-tuning XLS-R0
wav2vec and its current potential to Automatic Speech Recognition in German for the usage in Digital History: A comparative assessment of available ASR-technologies for the use in cultural heritage contexts0
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages0
Show:102550
← PrevPage 7 of 17Next →

No leaderboard results yet.