SOTAVerified

Speech-to-Text

Papers

Showing 76–100 of 403 papers

| Title | Status | Hype |
| --- | --- | --- |
| The Warmup Dilemma: How Learning Rate Strategies Impact Speech-to-Text Model Convergence | Code | 0 |
| Conversational Recommendation System using NLP and Sentiment Analysis | | 0 |
| Acquisition of high-quality images for camera calibration in robotics applications via speech prompts | | 0 |
| LinTO Audio and Textual Datasets to Train and Evaluate Automatic Speech Recognition in Tunisian Arabic Dialect | | 0 |
| Transformer-Based Named Entity Recognition for Automated Server Provisioning | Code | 0 |
| Improving Speech Recognition Accuracy Using Custom Language Models with the Vosk Toolkit | | 0 |
| AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation | | 0 |
| Focusing Robot Open-Ended Reinforcement Learning Through Users' Purposes | | 0 |
| Telephone Surveys Meet Conversational AI: Evaluating a LLM-Based Telephone Survey System at Scale | | 0 |
| Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision | | 0 |
| Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM | | 0 |
| Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation | | 0 |
| Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks | Code | 0 |
| SparQLe: Speech Queries to Text Translation Through LLMs | Code | 0 |
| Speech to Speech Translation with Translatotron: A State of the Art Review | | 0 |
| When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation | | 0 |
| Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Code | 0 |
| MinMo: A Multimodal Large Language Model for Seamless Voice Interaction | | 0 |
| Existential Crisis: A Social Robot's Reason for Being | | 0 |
| Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison | | 0 |
| Whisper Turns Stronger: Augmenting Wav2Vec 2.0 for Superior ASR in Low-Resource Languages | | 0 |
| How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System? | | 0 |
| Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation | Code | 0 |
| Representation Purification for End-to-End Speech Translation | | 0 |
| Leveraging Virtual Reality and AI Tutoring for Language Learning: A Case Study of a Virtual Campus Environment with OpenAI GPT Integration with Unity 3D | | 0 |
Page 4 of 17

No leaderboard results yet.