SOTAVerified

Speech-to-Text

Papers

Showing 151200 of 403 papers

TitleStatusHype
Design of a novel Korean learning application for efficient pronunciation correction0
Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents0
BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge0
Deep Learning Based Natural Language Processing for End to End Speech Translation0
Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM0
An Experiment on Speech-to-Text Translation Systems for Manipuri to English on Low Resource Setting0
Adversarial Attacks against Neural Networks in Audio Domain: Exploiting Principal Components0
Deepfake audio as a data augmentation technique for training automatic speech to text transcription models0
DeepCruiser: Automated Guided Testing for Stateful Deep Learning Systems0
Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems0
Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning0
I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs0
Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages0
DARTS: Dialectal Arabic Transcription System0
CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR0
CTC Alignments Improve Autoregressive Translation0
A Voice Controlled E-Commerce Web Application0
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments0
A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research0
A combined approach to the analysis of speech conversations in a contact center domain0
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect0
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models0
Inductive biases, pretraining and fine-tuning jointly account for brain responses to speech0
Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing0
Incorporating Domain Knowledge To Improve Topic Segmentation Of Long MOOC Lecture Videos0
Improving the previous state-of-the-art Frisian ASR by fine-tuning XLS-R0
Cross-modal Contrastive Learning for Speech Translation0
Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions0
Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach0
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task0
Improving Speech Recognition Accuracy Using Custom Language Models with the Vosk Toolkit0
Improving Semi-supervised End-to-end Automatic Speech Recognition using CycleGAN and Inter-domain Losses0
Crossing the SSH Bridge with Interview Data0
IMS-Speech: A Speech to Text Tool0
Improving RNN-Transducers with Acoustic LookAhead0
Improving Metrics for Speech Translation0
Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model0
Improving Language and Modality Transfer in Translation by Character-level Modeling0
Automated Testing of AI Models0
Analyzing ASR pretraining for low-resource speech-to-text translation0
Instance-Based Model Adaptation For Direct Speech Translation0
Interpreting Strategies Annotation in the WAW Corpus0
Investigating Decoder-only Large Language Models for Speech-to-text Translation0
Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech0
Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation0
Improving Autoregressive NLP Tasks via Modular Linearized Attention0
Improve Sinhala Speech Recognition Through e2e LF-MMI Model0
CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving0
AudioPaLM: A Large Language Model That Can Speak and Listen0
Improved Cross-Lingual Transfer Learning For Automatic Speech Translation0
Show:102550
← PrevPage 4 of 9Next →

No leaderboard results yet.