SOTAVerified

Speech-to-Text

Papers

Showing 151200 of 403 papers

TitleStatusHype
I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs0
Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages0
DARTS: Dialectal Arabic Transcription System0
CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR0
CTC Alignments Improve Autoregressive Translation0
A Voice Controlled E-Commerce Web Application0
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments0
A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research0
A combined approach to the analysis of speech conversations in a contact center domain0
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect0
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models0
Inductive biases, pretraining and fine-tuning jointly account for brain responses to speech0
Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing0
Incorporating Domain Knowledge To Improve Topic Segmentation Of Long MOOC Lecture Videos0
Improving the previous state-of-the-art Frisian ASR by fine-tuning XLS-R0
Cross-modal Contrastive Learning for Speech Translation0
Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions0
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task0
Improving Semi-supervised End-to-end Automatic Speech Recognition using CycleGAN and Inter-domain Losses0
Crossing the SSH Bridge with Interview Data0
Improving Metrics for Speech Translation0
Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model0
Improving Language and Modality Transfer in Translation by Character-level Modeling0
Automated Testing of AI Models0
Analyzing ASR pretraining for low-resource speech-to-text translation0
Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech0
Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation0
Improving RNN-Transducers with Acoustic LookAhead0
Improving Autoregressive NLP Tasks via Modular Linearized Attention0
Improving Speech Recognition Accuracy Using Custom Language Models with the Vosk Toolkit0
Improve Sinhala Speech Recognition Through e2e LF-MMI Model0
Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach0
CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving0
IMS-Speech: A Speech to Text Tool0
AudioPaLM: A Large Language Model That Can Speak and Listen0
Improved Cross-Lingual Transfer Learning For Automatic Speech Translation0
Impact of Microphone position Measurement Error on Multi Channel Distant Speech Recognition & Intelligibility0
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning0
Infusing Future Information into Monotonic Attention Through Language Models0
Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text0
Instance-Based Model Adaptation For Direct Speech Translation0
Interpreting Strategies Annotation in the WAW Corpus0
Investigating Decoder-only Large Language Models for Speech-to-text Translation0
Corpus Creation and Evaluation for Speech-to-Text and Speech Translation0
A low latency ASR-free end to end spoken language understanding system0
Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks0
How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not0
Conversational Recommendation System using NLP and Sentiment Analysis0
How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System?0
Contextualized Translation of Automatically Segmented Speech0
Show:102550
← PrevPage 4 of 9Next →

No leaderboard results yet.