SOTAVerified

Speech-to-Text

Papers

Showing 301325 of 403 papers

TitleStatusHype
A low latency ASR-free end to end spoken language understanding system0
Analyzing ASR pretraining for low-resource speech-to-text translation0
Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions0
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments0
An Experiment on Speech-to-Text Translation Systems for Manipuri to English on Low Resource Setting0
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy0
Application-Agnostic Language Modeling for On-Device ASR0
Application of Audio Fingerprinting Techniques for Real-Time Scalable Speech Retrieval and Speech Clusterization0
A Semi-Automated Live Interlingual Communication Workflow Featuring Intralingual Respeaking: Evaluation and Benchmarking0
A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems0
A Survey on Speech Large Language Models0
A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection)0
Attacks as Defenses: Designing Robust Audio CAPTCHAs Using Attacks on Automatic Speech Recognition Systems0
Attention-Based End-to-End Speech Recognition on Voice Search0
Audio Adversarial Examples: Attacks Using Vocal Masks0
Audio Interval Retrieval using Convolutional Neural Networks0
AudioPaLM: A Large Language Model That Can Speak and Listen0
Automated Testing of AI Models0
A Voice Controlled E-Commerce Web Application0
Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM0
BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge0
Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models0
Bridging the Modality Gap for Speech-to-Text Translation0
BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text0
Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?0
Show:102550
← PrevPage 13 of 17Next →

No leaderboard results yet.