SOTAVerified|Agents Browse Leaderboard About

Speech-to-Text

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 31–40 of 403 papers

Title	Date	Tasks	Status	Hype
OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia	Jan 23, 2025	Emotion RecognitionEvent Detection	CodeCode Available	3
WhiSPA: Semantically and Psychologically Aligned Whisper with Self-Supervised Contrastive and Student-Teacher Learning	Jan 15, 2025	cross-modal alignmentLanguage Modeling	CodeCode Available	1
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding	Jan 10, 2025	Automatic Speech RecognitionClassification	CodeCode Available	0
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction	Jan 10, 2025	Instruction FollowingLanguage Modeling	—Unverified	0
Existential Crisis: A Social Robot's Reason for Being	Jan 6, 2025	Speech-to-Text	—Unverified	0
Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison	Jan 4, 2025	DecoderKnowledge Distillation	—Unverified	0
Whisper Turns Stronger: Augmenting Wav2Vec 2.0 for Superior ASR in Low-Resource Languages	Dec 31, 2024	Automatic Speech RecognitionData Augmentation	—Unverified	0
How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System?	Dec 24, 2024	Simultaneous Speech-to-Text TranslationSpeech-to-Text	—Unverified	0
Fine-tuning Whisper on Low-Resource Languages for Real-World Applications	Dec 20, 2024	FormSentence	CodeCode Available	1
Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation	Dec 11, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	0

Show:10 25 50

← PrevPage 4 of 41Next →

No leaderboard results yet.