SOTAVerified

Speech-to-Text

Papers

Showing 151200 of 403 papers

TitleStatusHype
Audio Adversarial Examples: Attacks Using Vocal Masks0
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers0
Comparison of SVD and factorized TDNN approaches for speech to text0
Acquisition of high-quality images for camera calibration in robotics applications via speech prompts0
Compact Speech Translation Models via Discrete Speech Units Pretraining0
Open Brain AI. Automatic Language Assessment0
Language Model Augmented Monotonic Attention for Simultaneous Translation0
Graph Neural Networks to Predict Customer Satisfaction Following Interactions with a Corporate Call Center0
Finetuning End-to-End Models for Estonian Conversational Spoken Language Translation0
Findings of the Third Workshop on Automatic Simultaneous Translation0
Hands-Free VR0
Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language0
Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks0
How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System?0
How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not0
Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks0
Findings of the Second Workshop on Automatic Simultaneous Translation0
Fast Labeling and Transcription with the Speechalyzer Toolkit0
Impact of Microphone position Measurement Error on Multi Channel Distant Speech Recognition & Intelligibility0
Improved Cross-Lingual Transfer Learning For Automatic Speech Translation0
Improve Sinhala Speech Recognition Through e2e LF-MMI Model0
Improving Autoregressive NLP Tasks via Modular Linearized Attention0
Attention-Based End-to-End Speech Recognition on Voice Search0
Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech0
CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders0
Extending RNN-T-based speech recognition systems with emotion and language classification0
Improving Metrics for Speech Translation0
Improving RNN-Transducers with Acoustic LookAhead0
AI-Powered Immersive Assistance for Interactive Task Execution in Industrial Environments0
Improving Speech Recognition Accuracy Using Custom Language Models with the Vosk Toolkit0
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task0
Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach0
Exploring Transfer Learning For End-to-End Spoken Language Understanding0
IMS-Speech: A Speech to Text Tool0
Exploration of End-to-End ASR for OpenSTT -- Russian Open Speech-to-Text Dataset0
Attacks as Defenses: Designing Robust Audio CAPTCHAs Using Attacks on Automatic Speech Recognition Systems0
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation0
Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks0
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models0
Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing0
LASER: Attention with Exponential Transformation0
Interpreting Strategies Annotation in the WAW Corpus0
Investigating Decoder-only Large Language Models for Speech-to-text Translation0
Existential Crisis: A Social Robot's Reason for Being0
Evaluation of real-time transcriptions using end-to-end ASR models0
Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages0
I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs0
CMU's IWSLT 2024 Simultaneous Speech Translation System0
Evaluating Voice Command Pipelines for Drone Control: From STT and LLM to Direct Classification and Siamese Networks0
Europarl-ST: A Multilingual Corpus For Speech Translation Of Parliamentary Debates0
Show:102550
← PrevPage 4 of 9Next →

No leaderboard results yet.