Speech-to-Text

Papers

Showing 11–20 of 403 papers

Title	Date	Tasks	Status	Hype
Speech Model Pre-training for End-to-End Spoken Language Understanding	Apr 7, 2019	Speech-to-Text, Spoken Language Understanding	Code Available	2
Audio Jailbreak Attacks: Exposing Vulnerabilities in SpeechGPT in a White-Box Framework	May 24, 2025	Adversarial Attack, Speech Tokenization	Code Available	1
Enhancing Speech-to-Speech Dialogue Modeling with End-to-End Retrieval-Augmented Generation	Apr 27, 2025	RAG, Retrieval	Code Available	1
MEDIBENG WHISPER TINY: A FINE-TUNED CODE-SWITCHED BENGALI-ENGLISH TRANSLATOR FOR CLINICAL APPLICATIONS	Apr 25, 2025	Clinical Language Translation, Machine Translation	Code Available	1
DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities	Feb 16, 2025	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1
WhiSPA: Semantically and Psychologically Aligned Whisper with Self-Supervised Contrastive and Student-Teacher Learning	Jan 15, 2025	cross-modal alignment, Language Modeling	Code Available	1
Fine-tuning Whisper on Low-Resource Languages for Real-World Applications	Dec 20, 2024	Form, Sentence	Code Available	1
STTATTS: Unified Speech-To-Text And Text-To-Speech Model	Oct 24, 2024	Multi-Task Learning, speech-recognition	Code Available	1
Denial-of-Service Poisoning Attacks against Large Language Models	Oct 14, 2024	16k, Speech-to-Text	Code Available	1
OpenOmni: A Collaborative Open Source Tool for Building Future-Ready Multimodal Conversational Agents	Aug 6, 2024	Benchmarking, Retrieval-augmented Generation	Code Available	1

No leaderboard results yet.